davidz | 1 year ago

Currently it does: all audio is sent to the model.

However, we are working on turn detection within the framework, so you won't have to send silence to the model when the user isn't talking. It's a fairly straightforward path to cutting the cost by ~50%.

rukuu001|1 year ago

Working on this for an internal tool - detecting no speech has been a PITA so far. Interested to see how you go with this.

pj_mukh|1 year ago

Can I currently put a VAD module in the pipeline and only send audio when there is an active conversation? Feel like just that would solve the problem?
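
For illustration, here is a minimal sketch of that kind of gate. It assumes a plain energy-threshold VAD over frames of 16-bit PCM samples, which is a stand-in technique and not the framework's API. The names `gate_speech`, `threshold`, and `hangover` are hypothetical, and the default values are guesses that would need tuning per microphone.

```python
import math
from typing import Iterable, Iterator, List

Frame = List[int]  # one frame of 16-bit PCM samples (assumed format)


def rms(frame: Frame) -> float:
    """Root-mean-square amplitude of a frame; 0.0 for an empty frame."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def gate_speech(
    frames: Iterable[Frame],
    threshold: float = 500.0,  # hypothetical energy threshold
    hangover: int = 3,         # trailing quiet frames to keep after speech
) -> Iterator[Frame]:
    """Yield only frames that look like speech, plus a short tail.

    A frame counts as speech when its RMS is at or above `threshold`.
    After speech ends, up to `hangover` quiet frames are still passed
    through so word endings are not clipped.
    """
    quiet_run = hangover  # start in the silent state
    for frame in frames:
        if rms(frame) >= threshold:
            quiet_run = 0
            yield frame
        elif quiet_run < hangover:
            quiet_run += 1
            yield frame
        # otherwise: sustained silence, drop the frame


# Usage: 5 silent frames, 2 loud frames, 10 silent frames.
silence = [0] * 160
loud = [1000] * 160
stream = [silence] * 5 + [loud] * 2 + [silence] * 10
kept = list(gate_speech(stream))
print(f"sent {len(kept)} of {len(stream)} frames")
```

A pure energy threshold misfires on background noise, which may be part of why detecting "no speech" is painful in practice; a trained VAD model would replace the `rms(frame) >= threshold` check while the gating loop stays the same.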