Great article, and some great links from it too. The list of models "following the letter but not the spirit" at https://docs.google.com/spreadsheets/u/1/d/e/2PACX-1vRPiprOa... has some hilarious and creative examples that one would hope never make it into production:
> AI trained to classify skin lesions as potentially cancerous learns that lesions photographed next to a ruler are more likely to be malignant.
> Agent pauses the game indefinitely to avoid losing
> A robotic arm trained to slide a block to a target position on a table achieves the goal by moving the table itself.
> Evolved player makes invalid moves far away in the board, causing opponent players to run out of memory and crash
> Genetic algorithm for image classification evolves timing attack to infer image labels based on hard drive storage location
> Deep learning model to detect pneumonia in chest x-rays works out which x-ray machine was used to take the picture; that, in turn, is predictive of whether the image contains signs of pneumonia, because certain x-ray machines (and hospital sites) are used for sicker patients.
> Creatures bred for speed grow really tall and generate high velocities by falling over
> Neural nets evolved to classify edible and poisonous mushrooms took advantage of the data being presented in alternating order, and didn't actually learn any features of the input images
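That last mushroom example generalizes: any learner whose state happens to correlate with presentation order can exploit it. A minimal sketch of the failure mode in Python (everything here is hypothetical, just to make it concrete):

    import random

    random.seed(0)

    # Fake dataset: features are pure noise, labels strictly alternate.
    data = [([random.random() for _ in range(4)], i % 2) for i in range(100)]

    class ParityClassifier:
        """Ignores the features entirely; predicts from how many samples it has seen."""
        def __init__(self):
            self.count = 0
        def predict(self, features):
            label = self.count % 2
            self.count += 1
            return label

    clf = ParityClassifier()
    print(sum(clf.predict(x) == y for x, y in data) / len(data))  # 1.0, perfect

    # Shuffle the presentation order and the shortcut collapses to chance.
    random.shuffle(data)
    clf = ParityClassifier()
    print(sum(clf.predict(x) == y for x, y in data) / len(data))  # ~0.5

Shuffling per epoch is the standard guard: the parity "model" drops to chance the moment order stops carrying label information.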
> Evolved player makes invalid moves far away in the board, causing opponent players to run out of memory and crash
Well, this sounds like speedrunning. People have found arbitrary code execution vulnerabilities in SNES games and used them to jump straight to the credits (which counts as completing the game) in less than a minute: https://www.youtube.com/watch?v=Jf9i7MjViCE
In this case it’s just choosing an option that involves very large numbers because it’s learned that its opponent can’t handle large numbers. There’s no code injection.
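A sketch of how that might work without any injection, assuming (hypothetically) an opponent engine that naively sizes its internal board from the coordinates it's handed:

    # Hypothetical opponent engine that rebuilds its board just big enough
    # to hold every move it has seen. A far-away move blows up the allocation.
    def naive_opponent_update(moves):
        size = max(max(x, y) for x, y in moves) + 1
        board = [[0] * size for _ in range(size)]  # O(size^2) memory
        for x, y in moves:
            board[x][y] = 1
        return board

    naive_opponent_update([(2, 3), (4, 1)])  # fine: a 5x5 board
    # naive_opponent_update([(10**9, 0)])    # tries to allocate ~10^18 cells: MemoryError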
The SMB3 ACE is one of the most technically interesting glitches. The usual skips and saves are much more mundane.
My point here is that there is a similarity between (some) human players and some AI players. Even the discussion of whether exploiting a glitch counts as 'winning' looks very similar.
Seeing this work by Schmidhuber suddenly reminded me of this song: https://twitter.com/i/status/1155091710281580548
It's a sort of tongue-in-cheek celebration of his famous eagerness when discussing his work.
Aren't a lot of these a kind of overfitting to the data?
The learning agent discovers something that is true of the training set, but does not generalize to other examples of the problem outside of the training set.
The article attributes much of the problem to data set construction, because it is very tricky to design data sets without accidental biases, biases the learner can use to correctly categorize the examples even though they have nothing to do with the actual problem you want solved. The traditional techniques for avoiding overfitting, like holding out part of the training data, don't do any good if the entire data set is unrepresentative of the real world in some systematic way.
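To make that concrete with the ruler example from the list above (a toy setup, all numbers made up): a model that latches onto a spurious feature present across the whole collection will ace a held-out split and still fail in the clinic.

    import random

    random.seed(1)

    # In the collected data, rulers happen to co-occur with malignancy.
    def make_collected_sample():
        malignant = random.random() < 0.5
        return {"ruler": malignant}, malignant

    # In deployment, rulers carry no signal.
    def make_clinic_sample():
        malignant = random.random() < 0.5
        return {"ruler": random.random() < 0.5}, malignant

    dataset = [make_collected_sample() for _ in range(1000)]
    train, heldout = dataset[:800], dataset[800:]

    # "Model" that learned the shortcut: predict malignant iff a ruler is present.
    predict = lambda x: x["ruler"]

    acc = lambda split: sum(predict(x) == y for x, y in split) / len(split)
    print("train accuracy:", acc(train))      # 1.0
    print("held-out accuracy:", acc(heldout))  # 1.0, the split shares the bias
    print("clinic accuracy:", acc([make_clinic_sample() for _ in range(1000)]))  # ~0.5

The held-out split can't catch the shortcut because it was drawn from the same biased collection; only data gathered under deployment conditions exposes it.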