AlphaGo Zero: Learning from scratch | DeepMind
We introduce AlphaGo Zero, the latest evolution of AlphaGo, the first computer program to defeat a world champion at the ancient Chinese game of Go. Zero is even more powerful and is arguably the strongest Go player in history. Previous versions of AlphaGo initially trained on thousands of human amateur and professional games to learn how to play Go. AlphaGo Zero skips this step and learns to play simply by playing games against itself, starting…
In just three days AlphaGo Zero was able to train itself from scratch and acquire literally thousands of years of human Go knowledge simply by playing itself. The only input it had was what it does to the positions of the black and white pieces on the board.
It then played the original AlphaGo and AlphaGo Zero beat it 100 to 0. This is despite the fact the original AlphaGo had the benefit of learning from literally thousands of previously played Go games, including those played by human amateurs and professionals.
Just show that the old data from human games made the machine inferior.