mnicky | 19 days ago
So they could have paid a price in “model welfare” and released an LLM very eager to deliver.
It also shows in the AA-Omniscience Hallucination Rate benchmark, where Gemini scores 88%, the worst among frontier models.