Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

Is there still evidence that more compute = better model?


Yes. Plenty of evidence.

The DeepSeek R1 model people are freaking out about, runs better with more compute because it's a chain of thoughts model.




Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: