Kind of stagnant, but I liked the IDEA of what was being done with the Beaker Br...

pfraze · on April 18, 2020

We decided to take a year to go heads down and rework a lot of stuff, so we did get publicly stagnant but we're fulltime on the next version and should release a public beta soon. I tweet a lot about our progress if you can sort through my sillier tweets (@pfrazee).

Past year has had a lot of improvements

- Protocol moved to a hole-punching DHT for peer discovery (hyperswarm).

- Protocol now scales # of writes and # of files much better. We were able to put a Wikipedia export, which is millions of files in 2 flat dirs, into a single "drive" and get good read performance. This performance bump came from an indexing structure that's built into every log entry (hypertrie).

- Protocol now supports "mounting" which is a way to basically symlink drives onto each other. Good composition tool, esp useful for mounting deps in a /vendor directory.

- Browser now has a builtin editor that splits the screen for live editing. Feels similar to hackmd.

- Browser added a bash-like terminal for working with the protocol's filespace. It's glued to the current page so you can drive around the web using `cd`.

- Browser added a basic identity system. Every user has a profile site that's created automatically and maintains an address book of other users.

- We built out application tooling a fair amount. It's fairly easy to build multi-user apps now, where previously it was a bit of rocket surgery.

Some of the year was spent prototyping ideas and throwing them away as well. A bit inefficient, but helped us learn.

grng3r · on April 18, 2020

I've been reading about dat and doing some basic projects for about a year in my spare time, stagnating a bit lately as am finishing my degree. As I remember there was some talk about Python and Rust implementations at the time, is that still under development as well? I am not as enthusiastic with Js but really like the protocol and the idea behind it.... Thanks

pfraze · on April 18, 2020

Yeah I do know there's a Rust impl that people are working on

koeng · on April 18, 2020

I’ve been thinking of deploying dat for Nanopore DNA sequencing, but recall something about dat not being great at handling that kind of data.

Does the newest version of dat handle large files well (10gb)? Does it handle tons of files nested in a few directories well? Are there any issues I should know about there?

What is the command line support like for multi-writer?

Do you have any metrics for how much Dat is currently being used?

Thanks!

pfraze · on April 18, 2020

> Does the newest version of dat handle large files well (10gb)?

Large files work fine but currently any change to a file rewrites it in its entirety. That will mean history will be large until the GC kicks in, and any file that's modified has to be redownloaded in its entirety.

The team spent a fair amount of time looking at a solution to partial file-updates which works like inodes. They ultimately decided it was too difficult to pull off for now.

> Does it handle tons of files nested in a few directories well?

Yep, no issues there

> What is the command line support like for multi-writer?

We're still deciding on how to handle multi-writer. It's a priority for us after the upcoming stable release.

> Do you have any metrics for how much Dat is currently being used?

Nothing concrete atm. If I had to guess, it'd be no more than 1k.

karissa · on April 18, 2020

> Does the newest version of dat handle large files well (10gb)?

I'd be curious if anyone wanted to try it ;) https://github.com/mafintosh/hyperdrive/blob/master/index.js...

> What is the command line support like for multi-writer?

There is an experimental multiwriter CLI using hyperdrive and kappa-db (github.com/kappa-db)

https://cobox.cloud

ianopolous · on April 19, 2020

We've solved efficient partial file updates in Peergos, which is built on ipfs. Happy to talk you through our data structures if you're interested. The key ones are cryptree and Merkle-champs.

ignoramous · on April 19, 2020

Thanks. Has anyone by any chance benchmarked hypertrie against u/marknadal's [0] gun.js?

Speaking of which, the gun.js network hosts the internet-archive (archive.org) [1] which is likely to be bigger than the English Wikipedia (?). Cloudflare runs an IPFS gateway. Are there organizations of similar size dat is looking to onboard or has onboard-ed?

---

A few questions:

1a. Re: hole-punching: Do you support clients on LTE (or, networks behind CG-NAT)? Is hole-punching just for the DHT connections (which seems to operate over UDP4) or for Feed (hypercore) connections, too (which default to TCP?).

1b. Are there plans to use WebRTC (like WebTorrent) instead of TCP or UDP sockets?

1c. In case of TURN relays, how long do you intend to support traffic through your servers? It might get cost-prohibitive at some point. Are (will) TURN relays (be) deployed worldwide to combat latencies?

2. Are hyperswarm and hyperdrive different example usages for hypercore or are they complementary? If so, how are those projects related, as in, is hyperdrive built on top of hyperswarm?

3. The project is spread across disparate GitHub accounts-- Some top-level packages in mafintosh's account and some in hyperswarm's, for instance. Is this intentional?

4. I saw references to merkle-trees, flat-trees, kademlia, HAMT, bitfields... is there a documentation on why these data structures are used? I can guess kademlia is for the DHT and merkle-trees for some form of entropy management. Are such implementation details documented (looking for something similar to gun.js [2] or redis [3] docs)?

5a. Can you please point out a few major differences to gun.js and ProtocolLab's IPFS?

5b. Conversely, what are some use-cases for which dat really outshines other such protocols?

[0] https://news.ycombinator.com/item?id=15818856

[1] https://news.ycombinator.com/item?id=17685682

[2] https://gun.eco/docs/RAD

[3] https://news.ycombinator.com/item?id=15384396

m1sta_ · on April 19, 2020

Once a read key becomes public, it's always public right? I appreciate that arguably that's always the case.

karissa · on April 18, 2020

They are working on a new version, it's almost done. It's going to be a lot better! https://github.com/beakerbrowser/beaker/pull/1435

Taek · on April 18, 2020

Skynet is a new protocol that has been working on something like this. You can see some of the webapps built on Skynet here: https://skynethub.io/_B19P18DEI4Y_37wg6yXtfulEq2U8AcIO_lWM7s...

Main project website: https://siasky.net

faizshah · on April 18, 2020

For me one of the main positives of dat is there isn’t an attached cryptocurrency.

Taek · on April 19, 2020

Attaching a cryptocurrency to Sia allows us to draw a distinction between producers and consumers.

In typical p2p networks, you are expected to be both in roughly equal proportions, which means your network gets polluted by a lot of low-quality users who are just trying to do their part.

Sia has the consumers pay the producers, and has a marketplace mechanism that selects the highest performing producers to perform the jobs and receive the revenue.

This allows the network as a whole to be much, much more efficient.

theamk · on April 19, 2020

Wait, consumers pay the producers? As in, “you need to pay to download files”?

anaganisk · on April 19, 2020

Doesn't not paying make us end up rebuilding the internet again with ads and tracking? Free is not always an option.

theamk · on April 19, 2020

Doesn't paying encourage useless sites highly optimized to grab user attention, low quality content, shock sites, cloned sites, site bloat (if payment is per KB), and so on?

That said, I think there's definitely a space for paid internet. I was just surprised because siasky homepage says "Build a Free Internet" - perhaps it should say "Build a pay per view internet" instead?

Taek · on April 19, 2020

Free as in freedom, not free as in beer.

I would argue that the current model is more friendly to attention grabbing content (clickbait, etc) because advertising has the limitation that all views are worth the same value. In a pay-as-you-go Internet, users can whitelist high quality content sources as being okay to charge 10x or 100x what you'd typical accept to view a webpage. This would incentive content creators to build a brand and reputation that makes users comfortable putting them on the 'high quality' list, so that their content can see a massive revenue multiple relative to the number of eyeballs.

theamk · on April 19, 2020

This may work, but the problem is, that'll encourage internet made of content creators whose goal is to get a revenue out of the websites.

There are certainly a lot of those, but they are not nearly the whole internet. A lot of interesting sites would not want to be on the "pay as you go model" at all!

For example, we are on the news.ycombinator.com in the thread discussing datprotocol.com, and you are pointing me to siasky.net. Right now, the top 5 HN pages are adecentralizedworld.com, handsonscala.com, arxiv.org, a16z.net, and torproject.com. None of those websites make money from webpage ads. None of them are likely to move to pay-as-you-go internet -- because they care about their audience and not website revenue.

I suspect that Sia's decentralized pay-as-you-go world will be much worse (productivity-wise and information-wise) than the current internet -- all the interesting technical/science blogs and docs would be missing; while buzzfeed clones will be plentiful. There might be occasional high-quality journalism website, but most of those are getting paywalls anyway, so will they justify all new protocol?

Taek · on April 19, 2020

It's perfectly possible to not ask for a fee on your websites. Plenty of websites will continue to exist for free even though they could ask for money.

Also, the internet is already something that users have to pay for. Users pay their ISP every month for access. We envision a world where this utility payment extends to cover the content creators in addition to the infrastructure providers.

rakoo · on April 19, 2020

As far as I understand Sia is functionally equivalent to S3: You're not getting content the "producer" wants, you're getting exactly the content you, as a consumer, want.

koeng · on April 18, 2020

Is there anyway to pin skynet files using storage rather than Sia, or a way to download skynet files using the command line (independent of a curl request from a portal)?

Taek · on April 19, 2020

For your first question: technically yes, but it's not really the way it's meant to be done. If you want to pin something with storage, you can become a host on the Sia network and then make that content available through your host. But this also means the content is subject to going offline if you go offline (unless someone has re-pinned it to other hosts), and Skynet is really meant to be this high reliability platform where you don't need to worry about particular users going offline.

If you want to download skynet files using the command line, you can run 'siad' in portal mode as a daemon, and then use 'siac' to query the daemon. 'siac skynet download [skylink] [destination]' is the command to perform the download.

Doing it this way means you gain direct decentralized access to Skynet instead of needing to funnel through a portal.

koeng · on April 19, 2020

Is there a way to be a host and then pin other people's content that I like? I'd love to be a host that pins public content without an explicit financial transaction.

karissa · on April 18, 2020

I don't see a github repository or any link to the open source code there. Is it open source?

soupbowl · on April 18, 2020

I found this on their website: https://github.com/NebulousLabs/skynet-webportal

__ka · on April 18, 2020

The Cliqz browser implements the Dat protocol. Here's a writeup on the implementation: https://0x65.dev/blog/2020-03-02/implementing-the-dat-protoc...

Disclosure: I work at Cliqz.

rakoo · on April 19, 2020

Amazing ! Dat is probably one of the most interesting technologies out there, it's good to see more people step in, especially if it's a growing company.

Does that mean that Cliqz will start indexing dat sites ? The way I see it, the biggest issue of decentralized anything is always discoverability. If Cliqz is interested in it, it could have a leverage in making dat more widespread.

Dat even helps you with the indexing, because it's the sites themselves that tell you when they are updated and when they are updated, it's like an indexer's dream come true. And seeing how dat operates, Cliqz could even be a peer in the swarm of the sites it indexes to give them more chance to be reached (at the cost of Cliqz having a view on who visits what, which seems contrary to your DNA if I understand correctly)

anchpop · on April 18, 2020

You say you like the idea, do you have a problem with the implementation?