They're all actually AI-powered, generally some form of real-time RNN trained to identify and isolate voice content from background noise or music.

rnnoise2 is an open-source model that does very well. There are also things like Waves Clarity VX and Nvidia Broadcast (Audio Effects SDK), as well as plenty of other solutions like Supertone Clear, Krisp, etc.
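
For a rough idea of the general shape of a real-time suppressor, here is a minimal sketch in plain Python. It is not how any of the products above are implemented. The class name `StreamingSuppressor`, its parameters, and the hand-written noise-floor tracker are all hypothetical stand-ins for a trained RNN. The only point is the structure: audio arrives frame by frame, some state is carried between frames, and each frame is scaled by a gain between "mostly mute" and "pass through".

```python
import math


def frame_rms(frame):
    """Root-mean-square level of one frame of samples."""
    return math.sqrt(sum(x * x for x in frame) / len(frame))


class StreamingSuppressor:
    """Hypothetical stand-in for a trained recurrent model.

    A real model would learn its state update and gains from data;
    here the "state" is just a running noise-floor estimate.
    """

    def __init__(self, floor_decay=0.95, min_gain=0.1):
        self.noise_floor = None         # state carried across frames
        self.floor_decay = floor_decay  # how slowly the floor rises
        self.min_gain = min_gain        # never attenuate below this

    def process(self, frame):
        level = frame_rms(frame)

        # Update the recurrent state: drop to quieter levels at once,
        # rise slowly so short bursts of speech do not raise the floor.
        if self.noise_floor is None or level < self.noise_floor:
            self.noise_floor = level
        else:
            self.noise_floor = (self.floor_decay * self.noise_floor
                                + (1.0 - self.floor_decay) * level)

        # Gain near min_gain when the frame sits at the noise floor,
        # approaching 1.0 when the frame is well above it.
        if level > 0.0:
            gain = max(self.min_gain, 1.0 - self.noise_floor / level)
        else:
            gain = self.min_gain

        return [gain * x for x in frame], gain
```

Feeding it a quiet frame followed by a loud one, via `StreamingSuppressor().process(frame)`, gives a low gain for the first and a gain close to 1 for the second. This sketch uses a single broadband gain per frame and only ever scales the samples it is given; an actual learned suppressor would make a much finer-grained decision than that.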

CursedUrn|1 year ago

Does that mean YouTube is AI-generating your voice to "add it back" after silencing that part of the video? Does it ever generate different words from what you actually said?