(no title)
monroewalker | 1 year ago
The demo interactions are recorded, which is mentioned in their disclaimer under the demo UI. What isn't mentioned though is that they include past conversations in the context for the model on future interactions. It was pretty surprising to be greeted with something like "welcome back" and the model being able to reference what was said in previous interactions. The full disclaimer on the page for the demo is:
" 1. Microphone permission is required. 2. Calls are recorded for quality review but not used for ML training and are deleted within 30 days. 3. By using this demo, you are agreeing to our "
edit: Actually this has been posted quite a few times already and had good visibility a couple days ago: - https://news.ycombinator.com/item?id=43200400 Others: https://hn.algolia.com/?q=sesame.com
hn_user82179|1 year ago
jofzar|1 year ago
Edit: well I asked the "male" model to speak more like an Australian and yep, getting way more uncanny. If it had an Australian accent I think it would mess with me more
igleria|1 year ago
huijzer|1 year ago
I'm surprised by the lack of attention that Gemini 2.0 with native audio output got. They have a demo at https://youtu.be/qE673AY-WEI, which I think is really good too. The main problem with Google's model is that this audio output is not supported by the API, but you can try it at https://aistudio.google.com.
In general, text to speech is pretty good nowadays I think. For example, this is a little math video that I made a few days ago: https://www.youtube.com/watch?v=G1mvLrCfjFM with the (old) Google text to speech API. Honestly, I think the narration is better than I personally could have done. It's calm, well pronounced, and sounds relatively enthusiastic.
moralestapia|1 year ago
That's not a demo, that's a video. Anyone can make something like that in an afternoon with a couple friends and a microphone.
Also, Google is known for putting out fake "demos", remember the Google Duplex scam?
smusamashah|1 year ago
anon373839|1 year ago
Mistletoe|1 year ago
https://youtube.com/watch?v=C6ufImch00g
micw|1 year ago
ekianjo|1 year ago
znpy|1 year ago
Sounds (pun intended) reasonable.