Thanks, much appreciated for the clarification. I clearly overlooked that, which now it's pointed out seems entirely obvious, my bad. Only took negative karma for it to click, haha.
Ironically, the other link I posted at the same is actually speech to text. You want something like VOSK if you're looking for local machine transcription:
As for quality, I think its models are, IDK, maybe around the level that Youtube automatic captions were two or three years ago? So well over 90% accurate, and servicable for getting something to search for or clean up, but expect it to get a word wrong every now and then.
hgyjnbdet|1 year ago
Intralexical|1 year ago
https://news.ycombinator.com/item?id=40027675
As for quality, I think its models are, IDK, maybe around the level that Youtube automatic captions were two or three years ago? So well over 90% accurate, and servicable for getting something to search for or clean up, but expect it to get a word wrong every now and then.