(no title)
juthen
|
1 year ago
Speech.
The speech-to-text pipeline is inherent in us. The convertion model relies on our education and cultural factors.
The models can transcribe speech and do this conversion for new data generation. Have 10 mics at a public square and you'll have an infinite dataset (not a very smart one, necessarily...).
No comments yet.