alphago-lite

A lightweight implementation of an AlphaGo-style training pipeline, based on the book Deep Learning and the Game of Go and the paper Mastering the Game of Go with Deep Neural Networks and Tree Search.

implementation

implementation overview

The project roughly follows the pipeline described in Deep Learning and the Game of Go.

1. supervised learning

Train a policy network to imitate human moves from professional Go games.
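
As a rough illustration of what imitating human moves means, the sketch below treats move prediction as classification: the policy outputs a probability for every board point and is trained with a cross-entropy loss against the move the human played. Everything here is an assumption made for illustration and is not taken from this repository: the 3x3 toy board, the linear softmax model standing in for a neural network, the synthetic "expert" data, and all function names.

```python
import math
import random

BOARD_SIZE = 3  # toy board for illustration; real Go uses 19x19
NUM_POINTS = BOARD_SIZE * BOARD_SIZE


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def policy(weights, features):
    """Linear stand-in for a policy network: one logit per board point."""
    logits = [sum(w * f for w, f in zip(row, features)) for row in weights]
    return softmax(logits)


def sgd_step(weights, features, move, lr=0.5):
    """One cross-entropy gradient step towards the expert move; returns the loss."""
    probs = policy(weights, features)
    for k, row in enumerate(weights):
        # Gradient of cross-entropy w.r.t. logit k is (p_k - y_k).
        grad_logit = probs[k] - (1.0 if k == move else 0.0)
        for j, f in enumerate(features):
            row[j] -= lr * grad_logit * f
    return -math.log(probs[move])


def toy_examples():
    """Synthetic data: a single stone at point i, 'expert' replies at point i + 1."""
    examples = []
    for i in range(NUM_POINTS):
        features = [1.0 if j == i else 0.0 for j in range(NUM_POINTS)]
        examples.append((features, (i + 1) % NUM_POINTS))
    return examples


def train(examples, epochs=200, seed=0):
    """Fit the linear policy to (features, move) pairs with plain SGD."""
    rng = random.Random(seed)
    examples = list(examples)  # copy so the caller's list is not reordered
    weights = [[0.0] * NUM_POINTS for _ in range(NUM_POINTS)]
    for _ in range(epochs):
        rng.shuffle(examples)
        for features, move in examples:
            sgd_step(weights, features, move)
    return weights
```

Calling `train(toy_examples())` returns weights for which `policy(weights, features)` puts most of its probability on the expert reply for each toy position. A real pipeline would replace the one-hot features with encoded board planes and the linear model with a convolutional network.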

2. policy improvement

Improve the policy using reinforcement learning through self-play.
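
One common way to improve a policy through self-play is a REINFORCE-style policy gradient update, which the AlphaGo paper uses for its reinforcement learning stage. The sketch below shows that update on a deliberately trivial game rather than Go. The game (both players pick a number, higher wins), the tabular softmax policy, and every name are assumptions for illustration; the training scripts in this project may use a different method.

```python
import math
import random


def softmax(logits):
    """Numerically stable softmax over a list of floats."""
    m = max(logits)
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def sample(probs, rng):
    """Draw an index according to the given probabilities."""
    r = rng.random()
    acc = 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i
    return len(probs) - 1  # guard against floating-point round-off


def reinforce_update(logits, action, outcome, lr=0.1):
    """Shift log-probability towards `action` if outcome is +1, away if -1."""
    probs = softmax(logits)
    for k in range(len(logits)):
        indicator = 1.0 if k == action else 0.0
        # d/d logit_k of log pi(action) is (indicator - p_k).
        logits[k] += lr * outcome * (indicator - probs[k])


def self_play(num_games=2000, num_actions=3, seed=0):
    """Toy self-play: both sides sample from the same policy, higher action wins."""
    rng = random.Random(seed)
    logits = [0.0] * num_actions
    for _ in range(num_games):
        probs = softmax(logits)
        a = sample(probs, rng)
        b = sample(probs, rng)
        if a == b:
            continue  # draw: no learning signal
        z = 1.0 if a > b else -1.0
        reinforce_update(logits, a, z)   # first player's move and outcome
        reinforce_update(logits, b, -z)  # second player's move and outcome
    return softmax(logits)
```

In this toy game the highest action never loses, so the returned distribution concentrates on it over time. In Go the same update is applied to every move of a finished game, with the final win or loss as the outcome.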

3. evaluation

Evaluate trained agents by running automated matches between models.
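
A minimal sketch of such a match runner is shown below. It alternates colours between games so that neither agent benefits from always moving first, and reports the first agent's win rate. The `play_game(black, white)` callable and its return convention are hypothetical placeholders, not this project's interface.

```python
def evaluate(agent_a, agent_b, play_game, num_games=100):
    """Return agent_a's win rate over num_games, alternating colours.

    play_game(black, white) is a hypothetical callable that plays one game
    and returns 1 if black wins or -1 if white wins.
    """
    wins_a = 0
    for i in range(num_games):
        if i % 2 == 0:
            # agent_a plays black in even-numbered games
            if play_game(agent_a, agent_b) == 1:
                wins_a += 1
        else:
            # agent_a plays white in odd-numbered games
            if play_game(agent_b, agent_a) == -1:
                wins_a += 1
    return wins_a / num_games
```

For example, with agents represented as plain strength numbers and a stand-in game where the stronger side always wins, `evaluate(2, 1, lambda black, white: 1 if black > white else -1, num_games=10)` gives agent A a win rate of 1.0, while a game that black always wins gives 0.5 because of the colour alternation.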

dataset

This project uses the Computer Go Dataset:

https://github.com/yenw/computer-go-dataset.git

references

Pumperla and Ferguson, Deep Learning and the Game of Go (Manning, 2019)
Silver et al., Mastering the Game of Go with Deep Neural Networks and Tree Search, Nature 529 (2016) https://www.nature.com/articles/nature16961
