Have you considered using Apple's on-device foundation models as a fallback or alternative to the remote APIs? There is of course Apple's SpeechTranscriber, which is likely faster and more power-efficient than Whisper, and also the permissiveContentTransformations guardrail, which is specifically designed for your use case.