Languages, sizes, and degrees of open-ness. | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		refulgentis on Feb 2, 2024 \| parent \| context \| favorite \| on: OLMo: Accelerating the Science of Language Models ... Languages, sizes, and degrees of open-ness.

chuckhend on Feb 2, 2024 [–]

There's some more commentary on their open-ness in this blog too https://www.interconnects.ai/p/olmo

dwagnerkc on Feb 2, 2024 | [–]

That post also very helpfully links to another paper they published alongside the OLMo paper just on the dataset.

Dolma: an Open Corpus of Three Trillion Tokens for Language Model Pretraining Research

https://arxiv.org/abs/2402.00159

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact