Hacker News

Presumably because GLM 4.5 or Qwen3 comparisons would clobber them in eval scores.


You can check the same evals OpenAI used for those models.

Hint: unclobbered


And they don't require KYC crap to predict the next token.



