The great AI Wizard Siraj Raval explains Move 37, reinforcement learning, and the future of human work in this video.
If you’re a reader of this blog, then you know that I have mentioned Alpha Go Zero before on a few occasions. However, I think this video by the incomparable Siraj Raval explains it best. Watch this video to get a technical overview of its neural components.
In case you didn’t already know, DeepMind’s AlphaGo Zero algorithm beat the best Go player in the world by training entirely by self-play. It played against itself repeatedly, getting better over time with no human gameplay input. AlphaGo Zero was a remarkable moment in AI history, a moment that will always be remembered.