(no title)
umlx | 1 year ago
I would like to improve accuracy by preserving context, but I haven't found a good way to do this at the moment.
If we are talking about the accuracy of the transcription, it is very good if you use a large model. At least the accuracy of whisper is far superior to Youtube's subtitle generation!
No comments yet.