Hacker Newsnew | past | comments | ask | show | jobs | submit | jn5's commentslogin

I am pretty sure they do, this data is just too valuable. At least meta admitted using a dataset called "books3" which contains ~200k pirated ebooks for llama 1 and 2 [1]. Anna's archive provides datasets for LLM training, but who knows who they are working with..

I also wonder if google is using their own dataset from books.google.com .

[1] https://torrentfreak.com/meta-admits-use-of-pirated-book-dat...


Related: Royal Navy says quantum navigation test a success https://news.ycombinator.com/item?id=36222625

Instead of gps, submarines can use maps of the gravitational constant for navigation


If you lose something, you always find it in the last place you search... because why would you keep searching after you found it


It took me an embarrassingly long time to realize that it was a joke when people said "It's always the last place you look." Like well into my teens. But ever since I figured it out, I always look at least one more place after finding something.


I think that there are many people who don't recognize it as a joke and pass it on as great wisdom. I was also pretty old when I realized the tautology.


If you're disorganized, you'll search in random places until you find it. Joke applies.

But if you are organized, you'll start with the most likely place and progress to increasingly less likely places. When you find it, there's no surprise, and no one gets much of a chuckle over your efforts.

If you don't understand the problem space, then saying "That's the last place I would have looked!" is an expression of exasperation about your lack of knowledge.


A tautology can still provide an insight by recognizing that two things are really just the same. In that sense I didn’t perceive it as a joke, even though there’s of course some humor attached to it.


Last is semantically ambiguous here, you’re not wrong, could also mean “last place (most unobvious) you would think to look” as well


Because it it were obvious, you had already found it.


Do you find it again?


Well, at least you were a teen when you realized it. In my case it was 1min ago in this very HN thread...

I always assumed it was kind of a 'life's a bitch' aphorism


Australian parody news site the chaser just announced [1] that they are going to put up a paywall to avoid having their content scraped for AI training. They feel like chatGPT already "is a more competent writer of satire than most of the people we’ve worked with". I guess they are too late

[1] https://chaser.com.au/uncategorized/why-the-chaser-is-going-...


Maybe using ultrasonic testing during the printing process, while the inner parts are still accessible


You can probably build a kind of acoustic signature of the whole at various different points in the boundary and check each build is sufficiently close to that signature.


I haven't seen anything like it in Europe either. Helsinki's library is incredible. It's also architecturally one of the most prominent buildings and right next to the parliament. It doesn't have a lot of books, though (100k)

https://www.oodihelsinki.fi/en/services/


This was developed by HN user riskable and recently mentioned here:

The obsessive pleasures of mechanical-keyboard tinkerers - https://news.ycombinator.com/item?id=32623908


Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact

Search: