(no title)
PhilippGille | 1 month ago
> Is anyone doing true end-to-end speech models locally (streaming audio out), or is the SOTA still “streaming ASR + LLM + streaming TTS” glued together?
Your setup is the latter, not the former.
PhilippGille | 1 month ago
> Is anyone doing true end-to-end speech models locally (streaming audio out), or is the SOTA still “streaming ASR + LLM + streaming TTS” glued together?
Your setup is the latter, not the former.
No comments yet.