top | item 44886766

(no title)

zzsshh | 6 months ago

Does this finally enable dynamically generating subtitles for movies with AI?

discuss

order

jeroenhd|6 months ago

Docs say:

    If set, the transcription output will be sent to the specified file or URL
    (use one of the FFmpeg AVIO protocols); otherwise, the output will be logged as info messages.
    The output will also be set in the "lavfi.whisper.text" frame metadata.
    If the destination is a file and it already exists, it will be overwritten.

    @item format
    The destination format string; it could be "text" (only the transcribed text will be sent to the destination), "srt" (subtitle format) or "json".
    Default value: @code{"text"}
I don't know if this can embed the subtitles, but it does support generating accompanying srt files.

Of course, you could already do that by just manually calling whisper on files, but now you don't need to export parts or transformed media files to feed into whisper.

regularfry|6 months ago

If you have enough processing power. Without a GPU it's going to lag.

jeroenhd|6 months ago

In my experience, a small/tiny whisper model has pretty okay English decoding speed on something relatively modern even without GPU support. There's a bunch of latency in the process (because of technological limitations) but the optimised C++ version shouldn't pose too much of a problem unless you're running in power saving mode. Battery life may be a problem on older laptops, though.

KeplerBoy|6 months ago

Whisper is pretty fast.

diggan|6 months ago

Finally? I think VLC demo'd this a while ago at some conference where they had a table, if I remember correctly.

SSLy|6 months ago

VLC and ffmpeg are unrelated projects

mmmpetrichor|6 months ago

I've been waiting a while now for automatic translated subtitles in vlc. I thought it would be here by now. I'm probably underestimating the difficulty but I'm surprised some video player hasn't done it by now. (as far as I know).