I think it would be absolutely incredible to peruse old blogs and websites from ...

antonvs · on July 1, 2019

Someone's going to have to pay for hosting that data, though. The domain name is only a small part of the picture.

harshitaneja · on July 1, 2019

That's why we have wayback machine. Or we use another way to archive it.

comex · on July 1, 2019

The Wayback Machine is great, but it's basically a hack. Archiving shouldn't depend on a single centralized entity occasionally crawling the web and saving chunks of it to its archive (but only what it finds during the crawl, and excluding content with large file sizes, such as videos).

It ought to be built into the architecture of the Web, decentralized, immediate, and (at least for small file sizes) on by default. Oh, and censorship-resistant. Even for large file sizes, I think there ought to be some very easy-to-use mechanism to donate either hard disk space or money to publicly archive content of your choice.

Those are lofty goals, of course, but the current web has is quite vulnerable to bitrot as it is, and there's no guarantee the Internet Archive will continue to operate indefinitely.

dragonwriter · on July 1, 2019

> Archiving shouldn't depend on a single centralized entity

It doesn't. It's decentralized, with lots of archivists, it's just not federated.

comex · on July 2, 2019

Are there any other Web archives with scope comparable to the Wayback Machine? I have not heard of any. I guess there may be private archives which are not publicly known or accessible.

extra88 · on July 1, 2019

As the Wayback Machine currently operates, the present owner of a domain name can make the archives go away.