Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

For quantization, very big impact for small models, can drop at much as 10% on AIME. Our model does best on bfloat16 ;)

Come checkout our repo at: https://github.com/agentica-project/deepscaler



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: