top | item 22793842 (no title) pheme1 | 5 years ago As someone who thinker with Text to Speech (TTS), I can say this apply to TTS as well. Good model such as Tacotron2 rarely scale beyond clean ( good text and speech alignmen ) large ( > 12 hours ) datasets. discuss order hn newest No comments yet.
No comments yet.