top | item 45592072

(no title)

fl_rn_st | 4 months ago

"Smart, invisible regex" sounds like a lot of bs... could you give a more technical explanation?

Also the Whisper model doesn't really have a context window, it already segments the audio with a certain amount of overlap between the chunks, I really have a hard time understanding what you are trying to say here.

discuss

rezivor|4 months ago

Whisper will fail > 99%* (edit, most of the time) of the time at lengths over 90 minutes and fairly high over one hour.

saaaaaam|4 months ago

This is absolutely not my experience. I regularly (weekly at least) use whisper for 90-120 minutes pieces of content and only rarely have problems.

fl_rn_st|4 months ago

This is just plain wrong. I have my own Whisper App in the AppStore (on iOS, with very limited memory capacity) and there are no problems at all with longer Audio / Video files.

pmarreck|4 months ago

Can't really declare that without declaring which whisper model in particular you are referring to, as there are a number of them

gcr|4 months ago

I’ve used whisper-cop on 5-hour podcasts without problems.

Would also love to hear what you mean by “smart invisible regex,” sounds like AI slop to me.