top | item 45980640

Deep Dive into FFmpeg 8.0

40 points| dutzi | 3 months ago |rendi.dev

11 comments

pinter69|3 months ago

Author here, available for questions

Whisper will hallucinate on audio segments that don't have any speech. VAD mitigates that. Expect worse results without it, especially on non-English audio.

cranberryturkey|3 months ago

This is great I’ll have to give it a shot

trq01758|3 months ago

"Lenovo laptop with Nvidia RTX 4040" 4060?

mikece|3 months ago

Is the point that you only need one tool -- ffmpeg -- to both generate transcripts as well as embed those into a video as opposed to having multiple tools?

pinter69|3 months ago

This is a 3 part series, the first one discusses the new native whisper integration. And correct, for the first post - the point is to show that you can only use ffmpeg to transcribe and embed subtitles in a video

radicality|3 months ago

Do you know if it’s supported on Mac too, with whatever platform specific optimizations like running it on the gpu / with MPS ?

pinter69|3 months ago

You mean Vulkan? In the blog post there is reference to all vulkan supported platforms

If you mean ffmpeg build with whisper - from memory I didn't see ffmpeg-builds for mac, so you will probably need to compile yourself