I trained a model to play a novel video game using only screenshots and a score using RL and I discovered how not to lose
I trained a model to play a novel video game using only screenshots and a score using RL and I discovered how not to lose