top | item 45982813

(no title)

Reubend | 3 months ago

OpenAI likes to time their announcements alongside major competitor announcements to suck up some of the hype. (See for instance the announcement of GPT-4o a single day before Google's IO conference)

They were probably sitting on this for a while. That makes me think this is a fairly incremental update for Codex.

discuss

order

Palmik|3 months ago

GPT 5.1 / Codex already beats Gemini 3 on SWE Bench Verified and Terminal Bench and this pushes the gap further. Seems like a decent improvement.

skhameneh|3 months ago

There’s been community commentary that many of the GPT models are a tad overfitted WRT benchmarks. Benchmarks are not representative of end user experiences. That’s not to say the benchmarks aren’t useful at all, but are only useful as a subjective indicator.

knowriju|3 months ago

Would it be fair to compare a generic model with a model finetuned for coding?

bugglebeetle|3 months ago

That’s how the game is played. We should be grateful for all the competition that is driving these improvements, not whinging about the realities of what companies have to do to contest each other’s position.

johnecheck|3 months ago

It's funny, this release comes right after the Gemini 3 release that coincided with day 1 of Microsoft's Ignite conference.

criemen|3 months ago

Anthropic released the Opus 4.1 (basically, a new Opus 4 checkpoint) right around the big GPT-5 release date too, if I remember correctly. At this point, anything goes to stay relevant.

johnwheeler|3 months ago

Gemini is eating their lunch, and OpenAI knows it.

echelon|3 months ago

Google can rest on its enormous cash flows. OpenAI is going to have to fight like a dog to continue.

It's as easy as Google "placing ads" for the "search term" "ChatGPT" for them to bleed off users. They own every pane of glass and the "URL bar" is now a "search product" that Google owns.

I do not envy folks with OpenAI golden handcuffs.

This might ultimately only be a game that Google can win.

OpenAI better hope its users install its software, native apps, and browsers. Otherwise Google stands in the way and can intrude at any point.

peab|3 months ago

it's really getting old