Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

As someone with a basement rig of 6x 3090s, not really. It's quite slow, as with that many params (685B) it's offloading basically all of it into system RAM. I limit myself to models with <144B params, then it's quite an enjoyable experience. GLM 4.5 Air has been great in particular




Did you find it better than GPT-OSS 120B? The public rankings are contradictory.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: