alphaGo

This is a student project recreating (fast) rollout policy network of DeepMinds AlphaGo.

aims

We tried to recreate two different networks used in the original Architecture of AlphaGo. The main goal of both networks is to predict the move played by a professional player from a given position. The objective of the first network is to predict moves as fast as possible, this network is called the rollout policy network. The second networks aim is to predict moves as accurate as possible.

data

The dataset we used containes Human Go games from KGS Go Server from games between 2001 and 2007, which resultet in roughly 2 Million board positions. For loading and preprocessing we used the kgsgo-dataset-preprocessor Furthermore we expandend the given representations of the board by features like neighboring positions of the last move.

results

The best accuracy we achieved was 39,1%, using a network of six Convolutional Layers with ReLU Activations followed by a single softmax layer. AlphaGo achieved an accuracy of 57.0% using a lot more features and a bigger network. Our best fast Softmax network achieved an accuracy of 17.1% compared to AlphaGo's 24.2%.
For more details see our project presentation.

Name		Name	Last commit message	Last commit date
Latest commit History 48 Commits
networks		networks
README.md		README.md
convolutional.py		convolutional.py
histories.txt		histories.txt
project_presentation.pdf		project_presentation.pdf
softmax.py		softmax.py
utilities.py		utilities.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

alphaGo

aims

data

results

About

Releases

Packages

Contributors 2

Languages

yypdyy95/alphaGo

Folders and files

Latest commit

History

Repository files navigation

alphaGo

aims

data

results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages