Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

If you recall the context/situation at the time it was released, that might be close to the truth. Google desperately needed to show competency in improving Gemini capabilities, and other considerations could have been assigned lower priority.

So they could have paid a price in “model welfare” and released an LLM very eager to deliver.

It also shows in AA-Omniscience Hallucination Rate benchmark where Gemini has 88%, the worst from frontier models.

 help



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: