Hacker News

This just isn't accurate. On the overwhelming majority of real-world tasks (>90%), 3.5 Sonnet beats 4o. FWIW, I've spoken with a friend at OpenAI, and they fully agree in private.

