(no title)
ryanhecht | 1 year ago
I feel like this naming scheme is growing a little tired. o1 is for general knowledge reasoning, o3-mini replaces o1-mini but might be more specialized than o1 for certain technical domains...the "o" in "4o" is for "omni" (referring to its multimodality) but the reasoning models start with "o" ...but they can't use "o2" for trademark reasons so they skip straight to "o3" ...the word salad is getting really hard to follow!
zamadatix|1 year ago
The non-reasoning line was also pretty straightforward. Newer base models get a larger prefix number and some postfixes like 'o' were added to signal specific features in each model variant. Great!
Where things went of the rails was specifically when they decided to also name the reasoning models with an 'o' for separate reasons but now as the prefix at the same time as starting a separate linear sequence but now as the postfix. I wonder if we'll end up with both a 4o and o4...
lolinder|1 year ago
The perplexing thing is that someone has to have said that, right? It has to have been brought up in some meeting when they were brainstorming names that if you have 4o and o1 with the intention of incrementing o1 you'll eventually end up with an o4.
Where they really went off the rails was not just bailing when they realized they couldn't use o2. In that moment they had the chance to just make o1 a one-off weird name and go down a different path for its final branding.
OpenAI just struggles with names in general, though. ChatGPT was a terrible name picked by engineers for a product that wasn't supposed to become wildly successful, and they haven't really improved at it since.
unsupp0rted|1 year ago
You can like Sama or dislike him, but he knows how to market a product. Maybe this is a bad call on his part, but it is a call.
thorum|1 year ago
FridgeSeal|1 year ago
This naming scheme is a dumpster fire. Every other comment is trying to untangle what the actual hierarchy of model performance is.
mrbungie|1 year ago
The naming is bad. Other people already said it you can "google" stuff, you can "deepseek" something, but to "chatgpt" sounds weird.
The model naming is even weirder, like, did they really avoid o2 because of oxigen?
kingnothing|1 year ago
If the model is for technical stuff, then call it the technical model. How is anyone supposed to know what these model names mean?
The only page of theirs attempting to explain this is a total disaster. https://platform.openai.com/docs/models
TeMPOraL|1 year ago
Normies don't have to know - ChatGPT app focuses UX around capabilities and automatically picks the appropriate model for capabilities requested; you can see which model you're using and change it, but don't need to.
As for the techies and self-proclaimed "AI experts" - OpenAI is the leader in the field, and one of the most well-known and talked about tech companies in history. Whether to use, praise or criticize, this group of users is motivated to figure it out on their own.
It's the privilege of fashionable companies. They could name the next model ↂ-↊↋, and it'll take all of five minutes for everyone in tech (and everyone on LinkedIn) to learn how to type in the right Unicode characters.
EDIT: Originally I wrote \Omega-↊↋, but apparently HN's Unicode filter extends to Greek alphabet now? 'dang?
n2d4|1 year ago
Who said this is not intentional? It seems to work well given that people are hyped every time there's a release, no matter how big the actual improvements are — I'm pretty sure "o3-mini" works better for that purpose than "GPT 4.1.3"
golly_ned|1 year ago
ninetyninenine|1 year ago
rowanG077|1 year ago
unknown|1 year ago
[deleted]
ryanhecht|1 year ago
...I like "DALL·E" and "Whisper" as names a lot, though, FWIW :p
koakuma-chan|1 year ago
golol|1 year ago
fourseventy|1 year ago
Someone1234|1 year ago
observationist|1 year ago
OpenAI is so scattered they haven't even bothered using their own state of the art AI to come up with a coherent naming convention? C'mon, get your shit together.
TeMPOraL|1 year ago
As for self-limiting usage by non-power users, they're already doing that: ChatGPT app automatically picks a model depending on what capabilities you invoke. While they provide a limited ability to see and switch the model in use, they're clearly expecting regular users not to care, and design their app around that.
golol|1 year ago
layer8|1 year ago
jtwaleson|1 year ago