Isn't it the opposite? From the link: "Scores range from -100 to 100, where 0 means as many correct as incorrect answers, and negative scores mean more incorrect than correct."
Gemini 3 Flash scored +13 in the test, more correct answers than incorrect.
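To make the sign convention concrete, here is a minimal sketch of a score that behaves the way the quoted description says. The formula, the function name `net_score`, and the treatment of abstentions as neutral are assumptions chosen to match the description, not the benchmark's published method, and the counts in the usage lines are made up for illustration.

```python
def net_score(correct: int, incorrect: int, abstained: int = 0) -> float:
    """Hypothetical net score in [-100, 100].

    Assumption: each correct answer counts +1, each incorrect answer -1,
    and an abstention counts 0, normalised by the number of questions.
    """
    total = correct + incorrect + abstained
    if total == 0:
        raise ValueError("need at least one question")
    return 100.0 * (correct - incorrect) / total


# Illustrative counts only, not real benchmark data.
print(net_score(50, 50))       # as many correct as incorrect
print(net_score(45, 32, 23))   # more correct than incorrect, so positive
print(net_score(0, 100))       # everything wrong
```

Under this assumed formula a positive score only says that correct answers outnumber incorrect ones; it says nothing by itself about how often the model guesses wrong instead of abstaining.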
joecarpenter|2 months ago
sabareesh|2 months ago
nemonemo|2 months ago
andai|2 months ago
Edit: Huh... It does score highest in "Omniscience", but it also scores very high in Hallucination Rate (where a higher score is worse)...
sabareesh|2 months ago