This document summarizes research on AlphaGo, an AI system that can defeat human professional players at the game of Go. Key points:
1. AlphaGo uses deep neural networks: a policy network to select moves and a value network to evaluate board positions. The networks are trained by a combination of supervised learning from human expert games and reinforcement learning from games of self-play.
2. Without any lookahead search, the neural networks alone play Go at a strong amateur level. When the networks are combined with a new Monte Carlo tree search algorithm, AlphaGo defeats other Go programs and the European Go champion.
3. This is the first time a computer program has defeated a human professional in the full game of Go, a feat previously thought to be at least
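
How a policy network and a value network might plug into a tree search can be illustrated with a minimal sketch. It is not AlphaGo's implementation: the game is a toy version of Nim (take 1 to 3 stones, whoever takes the last stone wins) rather than Go, `policy_stub` and `value_stub` are hypothetical hand-written stand-ins for trained networks, and the PUCT-style selection rule and the constant `c_puct` are assumptions chosen for illustration.

```python
import math


class Node:
    """Search statistics for the move that led into this node."""

    def __init__(self, prior):
        self.prior = prior      # prior probability from the policy stub
        self.visits = 0         # number of simulations through this node
        self.value_sum = 0.0    # total value, seen by the player who moved here
        self.children = {}      # move -> Node

    def q(self):
        return self.value_sum / self.visits if self.visits else 0.0


def legal_moves(stones):
    return [m for m in (1, 2, 3) if m <= stones]


def policy_stub(stones):
    # Hypothetical stand-in for a policy network: uniform over legal moves.
    moves = legal_moves(stones)
    return {m: 1.0 / len(moves) for m in moves}


def value_stub(stones):
    # Hypothetical stand-in for a value network: value for the player to move.
    # In this toy game a multiple of 4 stones (including 0) is a lost position.
    return -1.0 if stones % 4 == 0 else 1.0


def puct_score(child, total_visits, c_puct):
    # Mean value plus an exploration bonus that is scaled by the prior
    # and shrinks as the child accumulates visits.
    explore = c_puct * child.prior * math.sqrt(total_visits + 1) / (1 + child.visits)
    return child.q() + explore


def search(root_stones, simulations=200, c_puct=1.5):
    root = Node(prior=1.0)
    for _ in range(simulations):
        node, stones, path = root, root_stones, [root]
        # Selection: follow the highest-scoring child until reaching a leaf.
        while node.children:
            total = sum(ch.visits for ch in node.children.values())
            move, node = max(
                node.children.items(),
                key=lambda kv: puct_score(kv[1], total, c_puct),
            )
            stones -= move
            path.append(node)
        # Expansion: attach children with policy priors (terminal nodes stay leaves).
        if stones > 0:
            for move, prior in policy_stub(stones).items():
                node.children[move] = Node(prior)
        # Evaluation: score the leaf for the player to move there.
        value = value_stub(stones)
        # Backup: flip the sign at each step, because each node stores value
        # from the perspective of the player who moved into it.
        for visited in reversed(path):
            value = -value
            visited.visits += 1
            visited.value_sum += value
    # Play the most-visited move at the root.
    return max(root.children.items(), key=lambda kv: kv[1].visits)[0]


if __name__ == "__main__":
    print(search(5))
```

In this sketch the policy stub narrows which moves the search tries first, while the value stub replaces a full playout to the end of the game. From 5 stones, the search should settle on taking 1 stone, which leaves the opponent a multiple of 4.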