Deep Learning and the Game of Go

Code and other material for the book "Deep Learning and the Game of Go"

Deep Learning and the Game of Go

This repository is first and foremost a comprehensive machine learning framework for the game of Go, focussing on deep learning techniques. What you’ll find here is a library that builds up from the game-play basics to very advanced techniques. In particular, you find code for early approaches in game AI, intermediate techniques using deep learning, to implementations of AlphaGo and AlphaGo Zero - all presented in one common framework. You can install this library with pip and follow the examples in the code folder.

pip install dlgo

On the other hand, this repository at the same time contains Code, and sample chapters for the book “Deep Learning and the Game of Go” (Manning), available for early access here, which ties into the library and teaches its components bit by biy. If you’re following the code samples from the book, check out the branches for individual chapters.

Note for contributors: To ensure the book stays in sync, consider requesting changes and submitting pull requests against the improvements branch, instead of master (which we keep reserved for bug fixes etc.).

dl_go_cover

Playable demos

The book is all about getting you started to create your own bots. To make the experience more fun and interactive, we built and deployed several bots showcasing the techniques of the respective chapter. So far you can play:

Chapter 4 A tiny tree search bot on a 5x5 board here.
Chapter 7 A full 19x19 bot powered by a deep neural network trained to predict human moves here
Chapter 9 A bot playing on a 9x9 board, using policy gradients, here

These demos will be available in the liveBook version of the book as well.

Toward deep learning: a machine learning introduction
Go as a machine learning problem
Implementing your first Go bot
Playing games with tree search
Getting started with neural networks
Designing a neural network for Go data
Learning from data: a deep learning bot
Deploying bots in the wild
Enter deep reinforcement learning
Reinforcement learning with policy gradients
Reinforcement learning with value methods
Reinforcement learning with actor-critic methods
AlphaGo: Combining approaches
AlphaGoZero and AlphaZero: Combining approaches

Appendices

A. Mathematical foundations with Python
B. The backpropagation algorithm
C. Go programs and servers
D. Training and deploying bots using Amazon Web Services
E. Submitting a bot to the Online Go Server (OGS)

chapter_succession

Welcome

When AlphaGo hit the news in early 2016, we were extremely excited about this groundbreaking advancement in computer Go. At the time, it was largely conjectured that human-level artificial intelligence for the game of Go was at least 10 years in the future. We followed the games meticulously and didn’t shy away from waking up early or staying up late to watch the broadcasted games live. Indeed, we had good company — millions of people around the globe were captivated by the games against Fan Hui, Lee Sedol, and later, Ke Jie and others.

Shortly after the emergence of AlphaGo, we picked up work on a little open source library we coined BetaGo to see if we could implement some of the core mechanisms running AlphaGo ourselves. The idea of BetaGo was to bring AlphaGo to regular developers. While we were realistic enough to accept that we didn’t have the resources (time, computing power, or intelligence) to compete with Deepmind’s incredible achievement, it has been a lot of fun to create our own Go bot. Since then, we’ve had the privilege to speak about computer Go at quite a few occasions.

We strongly believe that the principles underpinning AlphaGo can be taught to a general software engineering audience in a practical manner. Enjoyment and understanding of Go comes from playing it and experimenting with it. It can be argued that the same holds true for machine learning, or any other discipline for that matter. In this book, we hope to use the game of Go as a gateway to the exciting world of deep learning. We start with the classical AI principles for board games. Right from the start, you’ll have a working Go AI that you can play against—although it will be very weak at first. Then we’ll introduce some techniques from the world of deep learning and reinforcement learning. As you learn each technique, you can incorporate it into your Go AI and watch it improve.

If you share some of our enthusiasm for either Go or machine learning, hopefully both, at the end of this book, we’ve done our job. If, on top of that, you know how to build and ship a Go bot and run your own experiments, many other interesting artificial intelligence applications will be accessible to you as well. Enjoy the ride!

Deep Learning and the Game of Go

Playable demos

Table of Contents

Welcome