top | item 42892906

(no title)

cyounkins | 1 year ago

I switched an agent from Sonnet V2 to o3-mini (default medium mode) and got strangely poor results: only calling 1 tool at a time despite being asked to call multiple, not actually doing any work, and reporting that it did things it didn't

discuss

order

No comments yet.