(no title)
nonoesp | 1 year ago
They require training data longer than 15 seconds, which could lead the out out to resemble more the actual voice.
I've seen weird behaviors where the AI voice forces a British accent to pronounce certain words which I don't have.
Descript also uses voice synthesis to regenerate edited portions of conversations with a noticeable cut to smoothen the transition, which is pretty useful.
No comments yet.