Hacker Newsnew | past | comments | ask | show | jobs | submitlogin

To be fair, in my experience LLM-s struggle to some extent with nearly every language that isn't English. Compared to many others, English is a very simple language and it has the largest body of material to learn from to boot. The more nuanced a language, the more LLM-s spout garbage.




They do pretty good Russian, even to the point of very elaborate speech styles etc.

Russian-speaking web was pretty strong once. There was a lot of educated people in ex-USSR who were apt with tech, knew Russian well and were willing to use it.

That's very much in the past now. But it'll linger in the training data for a while.


It's funny because I'm pretty sure modern LLMs wouldn't exist if Alexandra Elbakyan didn't come up with the concept of a shadow library.



Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: