Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

"Just" is the wrong way to put it.

Pretraining teaches LLMs everything. SFT and RL is about putting that "everything" into useful configurations and gluing it together so that it works better.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: