top | item 34178300

Speaker diarization (labels) for OpenAI Whisper generated transcripts

44 points| ufarooqi | 3 years ago |ufarooqi.com

5 comments

order

algon33|3 years ago

I tried using this for a technical talk[1], and it got the amount of speakers wrong. Which is somewhat suprising to me, as I would have thought diarization tech would just worked by now.

[1]https://www.youtube.com/watch?v=5lFxURxbyEc&list=PLiayR7yJx8...

ufarooqi|3 years ago

I'm gonna give it a try with your video. If I may ask how many speakers are there in this video. (I have to go through all of it otherwise). From what I can see, we have a teacher who is speaking most of the times and then few laughs from students in the background.