

herrvogel- | 3 months ago

What you describe could also come down to a difference in hallucination rate [0]. Opus 4.5 leads on this measure, while Gemini 3 Pro performs quite badly on it compared with its results on other benchmarks.

[0] https://artificialanalysis.ai/?omniscience=omniscience-hallu...
