(no title)
8s2ngy
|
6 months ago
I've been using Kokoro TTS with the CLI app, audiblez, mentioned in the "Similar Projects" section of the README. The model is fast and delivers impressive quality for its small size. Some issues I have faced, however, are:
a) It doesn't distinguish periods at the end of sentences from the dots in abbreviations such as "Mr." or "Mrs." The result is an awkward pause between "Mr." and the name.
b) It doesn't handle ellipses well.
c) Words are pronounced the same way regardless of context.
beboplifa|6 months ago
fudged71|6 months ago
rkagerer|6 months ago
hombre_fatal|6 months ago
The difference is that even weak LLMs are good at magically doing this, so I wonder what the problem is for the TTS mentioned above.