top | item 28074555

(no title)

mazoza | 4 years ago

I know the old speech team continues as Coqui https://github.com/coqui-ai/

discuss

About their TTS system: "These models provide speech synthesis with ~0.12 real-time factor on a GPU and ~1.02 on a CPU." The quality of the samples is really impressive but, wow, but isn't this computationally too expensive for many applications?

nyanpasu64|4 years ago

>If, for example, it takes 8 hours of computation time to process a recording of duration 2 hours, the real time factor is 4. When the real time factor is 1, the processing is done in real time. It is a hardware-dependent value.

I think real-time factors smaller than 1 are faster than real-time (not slower) and use less than 100% of a resource's computational power to keep up.

mazoza|4 years ago

I means it is faster than real time almost 10x

So it is the contrary