top | item 29729313

(no title)

maxkrieger | 4 years ago

EDIT: If you feel strongly about this and think it's possible please send in PRs. Thanks!

Unfortunately you then lose ability to play back the original audio https://stackoverflow.com/questions/2319735/voice-recognitio.... This is a major usability tradeoff IMO, though I'm willing to be swayed to add an option.

I suspect Google's limiting this because they don't want devices "freeloading" their cloud transcription service, since most phones can't do it on-device.

discuss

order

junon|4 years ago

Can you not buffer the audio and then send the buffer to the transcription service, allowing you to keep the original?

If this were all on-device I'd use this in a heartbeat. I'd even pay for it. I worry about privacy though - I appreciate you went with Azure instead of Google, however!

verdverm|4 years ago

You really think Azure and Google Cloud have that much difference? If you use GCP, you can select whether they can also use the data for training, per request. Does Azure have that option or clarity?

smusamashah|4 years ago

Not just Pixel 5 and above, you can transcribe in real time with any android phone using Google Recorder app. Which essentially means that it doesn't need special processor (as they marketed for Pixel 6) or the cloud services to transcribe in real time.

App probably loads a model for offline use, I don't know if you could somehow use that app as an API or something.

Same goes for Chrome. You can see live transcript of any audio in Chrome Desktop without internet. That transcript is readonly and cannot be copy pasted even.

petargyurov|4 years ago

The link you posted is a 7 year old question; is the information still relevant? Surely copying the original audio is enough of a workaround?

Not a mobile dev so could be wrong.