I dare say I'm more familiar with the capabilities of the leading models than certain big tech CEOs are, at least judging by their publicly communicated opinions.
I use the 4o very often in my work and it mostly sucks. Sometimes it’s very good, sometimes it has nice knowledge that was faster to find from it than a search engine. Mostly it spouts out unhelpful noise (for my problems).
I’m sure if you need to make a to-do list in react it’s like magic (until the app gets complicated). In real world use, not so much.
(Also I have often code reviewed PRs from people who are heavy users and surprise surprise - their output is trash and very prone to bugs or being out of spec.)
I also think 4o sucks, but have you tried DeepSeek R1 (free on their website)? I thought it night and day between 4o and o3-mini on the following topics:
- reverse engineering: when fed assembly (or decomp or mock impl), it's been consistently been able to figure out what the function actually does/why it's there from a high-level perspective. Whereas ChatGPT merely states the obvious
- very technical C++ questions: DSR1 gives much more detailed answers, with bullet points and examples. Much better writing style. Slightly prone to hallucinations, but not that much
- any controversial topic: ChatGPT models are trained to avoid these because of its "safety" training
ChatGPT is a bit better (and faster) at writing simple code and doing some math faster, but that's it.
(obviously, common sense about what to share and not to share with these chatbots still apply, etc.)
kubb|1 year ago
anon22981|1 year ago
I’m sure if you need to make a to-do list in react it’s like magic (until the app gets complicated). In real world use, not so much.
(Also I have often code reviewed PRs from people who are heavy users and surprise surprise - their output is trash and very prone to bugs or being out of spec.)
TuxSH|1 year ago
- reverse engineering: when fed assembly (or decomp or mock impl), it's been consistently been able to figure out what the function actually does/why it's there from a high-level perspective. Whereas ChatGPT merely states the obvious
- very technical C++ questions: DSR1 gives much more detailed answers, with bullet points and examples. Much better writing style. Slightly prone to hallucinations, but not that much
- any controversial topic: ChatGPT models are trained to avoid these because of its "safety" training
ChatGPT is a bit better (and faster) at writing simple code and doing some math faster, but that's it.
(obviously, common sense about what to share and not to share with these chatbots still apply, etc.)