top | item 45031587

(no title)

robots0only | 6 months ago

Claude is extremely poor at vision when compared to Gemini and ChatGPT. i think anthropic severely overfit their evals to coding/text etc. use cases. maybe naively adding browser use would work, but I am a bit skeptical.

discuss

order

bdangubic|6 months ago

I have a completely different experience. Pasting a screenshot into CC is my de-facto go-to that more often than not leads to CC understanding what needs to be done etc…

user453|6 months ago

Is it overfitting if it makes them the best at those tasks?