Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
ACCount37
16 days ago
|
parent
|
context
|
favorite
| on:
EuroLLM: LLM made in Europe built to support all 2...
It is true. Datasets are somewhat cleaned, but only somewhat. When you have terabytes worth of text, there's only so much cleaning you can do economically.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: