(no title)
kwindla | 2 months ago
The integrated developer experience is much better on Vapi, etc.
The goal of the Pipecat project is to provide state of the art building blocks if you want to control every part of the multimodal, realtime agent processing flow and tech stack. There are thousands of companies with Pipecat voice agents deployed at scale in production, including some of the world's largest e-commerce, financial services, and healthtech companies. The Smart Turn model benchmarks better than any of the proprietary turn detection models. Companies like Modal have great info about how to build agents with sub-second voice-to-voice latency.[1] Most of the next-generation video avatar companies are building on Pipecat.[2] NVIDIA built the ACE Controller robot operating system on Pipecat.[3]
[1] https://modal.com/blog/low-latency-voice-bot - [2] https://lemonslice.com/ = [3] https://github.com/NVIDIA/ace-controller/
nextworddev|2 months ago
I just want to provide: - business logic - tools - configuration metadata (e.g. which voice to use)
I don't like Vapi due to 1) extensive GUI driven experience, 2) cost
ldenoue|2 months ago
Or PipeCat Cloud / LiveKit cloud (I think they charge 1 cent per minute?)