top | item 44652328

(no title)

ashwindharne | 7 months ago

Gemini models seem to be much less predictable than Claude -- I used them initially on my Excel 'agent' b/c of the large context windows (spreadsheets are a lot of tokens) but Gemini (2.5 Pro AND Flash) would go rogue pretty regularly. It might start dumping the input sheet contents into the output formatted oddly, output unrelated XML tags that I didn't ask for, etc.

As soon as I switched to Anthropic models I saw a step-change in reliability. Changing tool definitions/system prompts actually has the intended effect more often than not, and it almost never goes completely off the rails in the same way.

discuss

order

No comments yet.