
kerpal | 5 months ago

Claude/Anthropic is more focused on productivity (Coding, Spreadsheets, Reports). ChatGPT seems more focused on general-purpose LLM use (Research, Cooking, Writing, Image Generation).

Makes sense that MS would partner with Anthropic since their tool-use for productivity (Claude Code) seems superior. I personally rarely code with ChatGPT, almost strictly Claude.

dmurray|5 months ago

Some people might be surprised that MS would pick the product with the best technological fit rather than the one they already have a deep business and financial relationship with.

Surely Microsoft's expertise these days is in cross-selling passable but not best-in-class products to enterprises who already pay for Microsoft products.

It says something about how they view the AI coding market, or perhaps the level of the gap between Anthropic and OpenAI here, that they've gone the other way.

dijit|5 months ago

They are right to be surprised.

Why is Azure popular? Not on its own merits; it's because there is a pre-existing relationship with Microsoft.

Why is Teams the most widely used chat tool? Certainly not because it's good. It is, again, pre-existing business relationships.

Seems odd for a company that survives (perhaps even thrives) on these kinds of intertwined business reasons to, themselves, understand that they should go for merit instead.

thewebguyd|5 months ago

> It says something about how they view the AI coding market

I think Microsoft views models as a commodity and they'd rather lean into their strengths as a tool maker, so this is Microsoft putting themselves into a position to make tools around/for any AI/LLM model, not just ones they have a partnership with.

Honestly I think this sort of agnosticism around AI will work out well for them.

pnathan|5 months ago

I've been happy with Anthropic models. I also have been using the Google models more, with decent results. As a rule of thumb, the Copilot/OpenAI models don't seem to be as good, though I can't explain exactly why.

Overall, I think Google has a better breadth of knowledge encoded, but Anthropic gets work done better.

_fat_santa|5 months ago

This has been largely my experience as well. Claude does way better with coding while ChatGPT does better with general questions.

bobbylarrybobby|5 months ago

The new gpt-codex-* models are giving Claude Code a serious run for its money IMO. If OpenAI can figure out the Codex CLI UI (better permissions, more back and forth before executing) then I think they will have the better agentic coder.

mnky9800n|5 months ago

I like Perplexity's deep research model, which is based on DeepSeek, I think. I use that for most kinds of writing, discussion, research, etc., where I need some kind of feedback. Claude seems to go crazy sometimes when you ask it to do the same task. Whereas for coding, Claude Code is obviously better than everything else under the sun.

SparkyMcUnicorn|5 months ago

I decided to give Perplexity another try a few days ago, and it still seems to hallucinate things. Given the exact same tasks/prompts, both Claude and ChatGPT got the facts correct.

jakderrida|5 months ago

I'd argue that Anthropic still has a hard edge on creativity for things like emulating people's comments.

I've fed my past Reddit comments (with the comments they were responding to) into several models and asked them to duplicate the style. Claude has always been the only one that comes even close to original responses that even I think would be exactly my response, wording and all.

GPT or Gemini will just borrow snippets from the example text and smoosh them together to make semi-coherent points. Scratch that. They're coherent, but they're just unmistakably not from me.

m_mueller|5 months ago

GPT-5 is pretty decent nowadays, but Claude 4 Sonnet is superior in most cases. GPT beats it in cost and usable context window when something quite complex comes up that needs to be planned top-down.

boredtofears|5 months ago

What I find interesting is how much opinions vary on this. Open a different thread and people will seem to have consensus on GPT or Gemini being superior.

Even the benchmarks don’t seem all that helpful.

CharlieIsAHero|5 months ago

What do you mean by usable context window? Sonnet 4 is 968k and GPT-5 is 368k. Are you saying the context window on Sonnet is useless?

airstrike|5 months ago

Are there any open models that compete with Claude in its tool use capabilities for complex tasks?

Feels like an area where we could use more competition...