quadrature | 7 months ago
I've found that it hallucinates tool use for tools that aren't available and then gets very confident about the results.
nusl | 6 months ago
Kinda just got stuck in a self-confident loop that time. Other times the output is just far worse than Claude's for similar use cases, whereas a couple of months back it was stronger, at least in my subjective experience.