top | item 23490372

(no title)

josinalvo | 5 years ago

I think this is backwards... This is a corpus to train speech to text, not text to speech, right?

discuss

order

joshribakoff|5 years ago

It's a corpus designed to capture the full breadth of combinatorial nuances of human speech in a general sense.

reubenmorais|5 years ago

No, it is not. For one, it's a corpus of read speech, which means it does not capture well the characteristics of conversational human speech – hesitation, disfluencies, different tones and registers, etc. LibriSpeech has a paper explaining the design of the corpus, all you need to read is the first sentence of the abstract to know what it is supposed to capture:

This paper introduces a new corpus of read English speech, suitable for training and evaluating speech recognition systems.

http://www.danielpovey.com/files/2015_icassp_librispeech.pdf