Am I the only one that really dislikes these overly perky voices? And the one they used for the demo was GOD AWFUL. If this is what the future sounds like… idk. I was hoping for a calm, neutral and not eager assistant with a little class. A little Jarvis maybe? Something calm and easy on the ears.
These all sounded to me like people auditioning for a role in film and television or something.
I'm surprised that you can't configure the pace or pitch of the voices. The Web Speech API, around for a long time, has that capability. I'd also love to have the option of Australian and British accents.
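For anyone who hasn't used it, here is a rough sketch of what that looks like with the Web Speech API. `rate`, `pitch` and `lang` are real properties of `SpeechSynthesisUtterance`; the helper names (`makeSettings`, `speak`) and the defaults are just my own illustration, and whether an Australian or British voice is actually available depends on the browser and OS.

```typescript
// Hypothetical helpers around the browser's Web Speech API.
// Per the spec, rate is limited to 0.1 to 10 and pitch to 0 to 2.
interface VoiceSettings {
  rate: number;
  pitch: number;
  lang: string;
}

function clamp(value: number, min: number, max: number): number {
  return Math.min(max, Math.max(min, value));
}

// Build settings, clamping to the ranges the API accepts.
function makeSettings(rate = 1, pitch = 1, lang = "en-GB"): VoiceSettings {
  return { rate: clamp(rate, 0.1, 10), pitch: clamp(pitch, 0, 2), lang };
}

// Speak the text if speech synthesis is available; returns false otherwise
// (for example when run outside a browser).
function speak(text: string, settings: VoiceSettings): boolean {
  const g = globalThis as any;
  if (!g.speechSynthesis || !g.SpeechSynthesisUtterance) return false;
  const utterance = new g.SpeechSynthesisUtterance(text);
  utterance.rate = settings.rate;
  utterance.pitch = settings.pitch;
  utterance.lang = settings.lang;
  g.speechSynthesis.speak(utterance);
  return true;
}
```

Something like `speak("Good evening.", makeSettings(0.85, 0.9, "en-AU"))` would then ask for a slightly slower, slightly lower Australian English voice, if one is installed.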
This is contextual, the model seems to match the user vibe and the general situation. Take a look at the "Customer service proof of concept" video added to the announcement post[1] - there are two AIs talking using very different voices.
I had Sky answer in Dutch, and hearing Dutch with an American accent is funny but a little jarring. The frustrating part of being a Flemish Dutch speaker is that Dutch itself is already poorly served by AI tools (it’s actually impressive ChatGPT can do it at all) and additionally the Flemish accent and dialects are not supported at all in pretty much anything, so all Dutch text to speech sounds foreign.
I hope eventually they can pick up language, dialect and accent from YouTube content or something like that, because voice casting will never scale to the thousands of variations they need to have to support a global audience. Instead of picking a voice from a handful of choices it should do a Q&A where it learns your locale and intonation preferences and generates a tailored voice on the fly.
Still, I am impressed with the quality of the voices they have and with how well they manage Dutch. There is just still a lot of room for growth.
As a British person, I find all the current OpenAI voices have overly strong (American) accents, which are way too perky/enthusiastic for my ears. It would be great if they could offer a more neutral accent, or even one British accent? (Example: Voice 5 from pi.ai is excellent, IMHO)
Some British accents, in particular outside of London, can be almost unintelligible to general English speakers.
There is nothing intrinsically neutral about the British accent, it's more a matter of diction from the speaker. I would also not consider the OpenAI accents as neutral American, feels more like "Young Californian adult woman" accent to me.
Berlin famously cast a gender neutral voice for public transport announcements. It would be nice if OpenAI had done that too. At least for one of the options. Computers better not have genders imo
> Berlin famously cast a gender neutral voice for public transport announcements.
The person in question is transsexual. You're saying that regular men and women aren't neutral but transsexuals are?
> Computers better not have genders imo
This doesn't make any sense. Gender is an intrinsic part of many languages which cannot be ripped out. It's impossible to talk in some languages without picking a way of speaking which is either masculine or feminine.
Fake, americanized, subtly sexualized. I'm pretty sad that they went for the lowest common denominator approach.
I'm not opposed to computers using emotion at all, mind you. But I don't like that arguably the AI company furthest ahead is choosing this gaudy Hollywood approach to marketing.
I doubt the net result is support. Expanding creative community - yes. But support - not quite; established creators will likely suffer because of OpenAI, most of them already having a harder time.
Sky has an uncomfortably flirty tone to me. If they're going to go that route, at least include an uncomfortably flirty male voice too. (Or admit Sky is there more for the enjoyment of straight male users than anything else)
> We believe that AI voices should not deliberately mimic a celebrity's distinctive voice—Sky’s voice is not an imitation of Scarlett Johansson but belongs to a different professional actress using her own natural speaking voice. To protect their privacy, we cannot share the names of our voice talents.
I love how they have to go out of their way to explain that, rather than train a model on Scarlett Johansson's voice, they came about her voice honestly, by just finding someone whose natural voice sounded the most like Scarlett Johansson... as, clearly, no one is questioning that this voice was chosen to mimic Scarlett Johansson.
Pretty sure that’s another joking reference to the movie Her rather than actual concern about similarity. If that were the case, pointing it out would attract lawsuits.
From their Discord server:
> @ everyone We’ve heard questions about how we chose the voices in ChatGPT, especially Sky. We are working to pause the use of Sky while we address them. Read more about how we chose these voices [link above, but no further details]
Wow, so they actually got bullied into removing the voice? What a world. I actually liked that voice or at least didn't find any issue with it, they are all perky assistant type voices, but why not? Adding extra choices is fine if people want it, but removing the voice due to loud minority effect is just sad.
Would it be possible to use prompting to change aspects of the voice output? Will the voice respond angrily if you ask it to play an angry character in a play?
> 400 people did some unpaid work for us, only 5 of which eventually got paid, after 5 months of not knowing will they or won't they.
Fair enough, that's how things work for other voice acting jobs as well, I just doubt it usually takes almost half a year. I wonder how many of the shortlisted 14 have just given up somewhere along the way.
datahack|1 year ago
DeathArrow|1 year ago
vlasky|1 year ago
iLoveOncall|1 year ago
So if I'm asking it a question in a neutral tone it will answer in the same way.
exitb|1 year ago
[1] https://openai.com/index/hello-gpt-4o/
genericacct|1 year ago
jimsimmons|1 year ago
jareklupinski|1 year ago
LeoPanthera|1 year ago
Joeri|1 year ago
nacnud|1 year ago
glimshe|1 year ago
msoad|1 year ago
Spiwux|1 year ago
arianvanp|1 year ago
If you're interested in her voice
xdennis|1 year ago
avereveard|1 year ago
AnonymousPlanet|1 year ago
GaggiX|1 year ago
iLoveOncall|1 year ago
iamflimflam1|1 year ago
isoprophlex|1 year ago
riffraff|1 year ago
https://www.youtube.com/shorts/51ucQ4s7Crc
thih9|1 year ago
gaymenexisttoo|1 year ago
saurik|1 year ago
Havoc|1 year ago
defrost|1 year ago
th0ma5|1 year ago
easymodex|1 year ago
omarfarooq|1 year ago
prolyxis|1 year ago
iLoveOncall|1 year ago
jcunado109|1 year ago
[deleted]
BrilliantHunter|1 year ago
[deleted]
input_sh|1 year ago