(no title)
Leftium | 21 hours ago
This is an example python app wrapped in a (macOS) native shell using Electrobun: https://github.com/blackboardsh/audio-tts
Can you report how well Voxtral Realtime compares to the other currently supported streaming models? https://rift-transcription.vercel.app/local-setup
- Subjectively I've found Web Speech API feels the best (accuracy/latency), followed by moonshine medium
OpenAI Realtime WS API is on the roadmap, so I might be able to compare via RIFT in the future...
No comments yet.