gnulinux|4 months ago
* Creative writing: Gemini is the unmatched winner here by a huge margin. I would personally go so far as to say Gemini 2.5 Pro is the only borderline kinda-sorta usable model for creative writing if you squint your eyes. I use it to critique my creative writing (poetry, short stories) and no other model understands nuance as well as Gemini. Of course, all models are still pretty much terrible at this, especially at writing poetry.
* Complex reasoning (e.g. undergrad/grad level math): Gemini is the best here imho by a tiny margin. Claude Opus 4.1 and Sonnet 4.5 are pretty close but imho Gemini 2.5 gives correct answers more predictably. My bias is algebra stuff: I usually ask about commutative algebra, linear algebra, category theory, group theory, algebraic geometry, algebraic topology etc.
On the other hand, Gemini is significantly worse than Claude and GPT-5 when it comes to agentic behavior, such as searching a huge codebase to answer an open-ended question and write a refactor. Its tool calling seems buggy and doesn't work consistently in Copilot/Cursor.
Overall, I still think Gemini 2.5 Pro is the smartest model, but of course you need to use different models for different tasks.
jjmarr|4 months ago
* extinctions in amber,
* suicidal solecisms (solecism means a grammatically incorrect phrase),
* cliffs of broken glass windows,
* rot beneath the flowers,
While it made up a bunch of words like "acendless" or "slickborn", and it sounds like a hallucinatory oracle in the throes of a drug-induced trance channeling tongues from another world, I ended up with some good raw material.
mreid|4 months ago
I always found this one a little poignant:
futureshock|4 months ago
It feels like you could create a cool workflow from low temperature creative association models feeding large numbers of tokens into higher temperature critical reasoning models and finishing with grammatical editing models. The slickborns will make the final judgement.
oscaracso|4 months ago
dash2|4 months ago
gnulinux|4 months ago
SoftTalker|4 months ago
New band name.
xnx|4 months ago
gniv|4 months ago
jbmilgrom|4 months ago
Wow
sinak|4 months ago
computerthings|4 months ago
[deleted]
bogtog|4 months ago
The other big use-case I like Gemini for is summarizing papers or teaching me scholarly subjects. Gemini's more verbose than GPT-5, which feels nice for these cases. GPT-5 strikes me as terrible at this, and I'd also put Claude ahead of GPT-5 in terms of explaining things in a clear way (though maybe GPT-5 could come closer to what I expect with some good prompting).
dingnuts|4 months ago
no, wait, that analogy isn't even right. it's like going to watch a marathon and then claiming you ran in it.
dktp|4 months ago
It doesn't perform nearly as well as Claude or even Codex for my programming tasks though
hodgehog11|4 months ago
versteegen|4 months ago
greggh|4 months ago
https://eqbench.com/creative_writing.html
tonyhart7|4 months ago
While Anthropic has always been about coding, there were a lot of complaints at OpenAI's GPT-5 launch because the general-use model was nerfed heavily in exchange for a better coding model.
Google is maybe the last one that has a good general-use model (?)
delaminator|4 months ago
coffeeaddict1|4 months ago
typpilol|4 months ago
Weird considering I've been hearing how they have way more compute than anyone
BoorishBears|4 months ago
DeepSeek is not in the running