Using Deep Q-Network to Learn How To Play

Machine Learning Helicopter

Overview

Installation Dependencies:

Python 3
TensorFlow
pygame
OpenCV-Python

I have created a requirements text file to install the modules. However this only works with Python 3, you have to do extra steps to make it work with Python 2.

How to Run?

git clone [email protected]:SalvadorRamon/HelicopterMachineLearning.git
cd HelicopterMachineLearning
python deep_q_network.py

Deep Q-Network Algorithm

The pseudo-code for the Deep Q Learning algorithm, as given in [1], can be found below:

Initialize replay memory D to size N
Initialize action-value function Q with random weights
for episode = 1, M do
    Initialize state s_1
    for t = 1, T do
        With probability ϵ select random action a_t
        otherwise select a_t=max_a  Q(s_t,a; θ_i)
        Execute action a_t in emulator and observe r_t and s_(t+1)
        Store transition (s_t,a_t,r_t,s_(t+1)) in D
        Sample a minibatch of transitions (s_j,a_j,r_j,s_(j+1)) from D
        Set y_j:=
            r_j for terminal s_(j+1)
            r_j+γ*max_(a^' )  Q(s_(j+1),a'; θ_i) for non-terminal s_(j+1)
        Perform a gradient step on (y_j-Q(s_j,a_j; θ_i))^2 with respect to θ
    end for
end for

FAQ

Modify deep_q_network.py's parameter as follow:

OBSERVE = 10000
EXPLORE = 3000000
FINAL_EPSILON = 0.0001
INITIAL_EPSILON = 0.1

References

[1] Mnih Volodymyr, Koray Kavukcuoglu, David Silver, Andrei A. Rusu, Joel Veness, Marc G. Bellemare, Alex Graves, Martin Riedmiller, Andreas K. Fidjeland, Georg Ostrovski, Stig Petersen, Charles Beattie, Amir Sadik, Ioannis Antonoglou, Helen King, Dharshan Kumaran, Daan Wierstra, Shane Legg, and Demis Hassabis. Human-level Control through Deep Reinforcement Learning. Nature, 529-33, 2015.

[2] Volodymyr Mnih, Koray Kavukcuoglu, David Silver, Alex Graves, Ioannis Antonoglou, Daan Wierstra, and Martin Riedmiller. Playing Atari with Deep Reinforcement Learning. NIPS, Deep Learning workshop

[3] Kevin Chen. Deep Reinforcement Learning for Flappy Bird Report | Youtube result

Disclaimer

This work is highly based on the following repos:

[sourabhv/FlapPyBird] (https://github.com/sourabhv/FlapPyBird)
asrivat1/DeepLearningVideoGames

Name		Name	Last commit message	Last commit date
Latest commit History 21 Commits
.idea		.idea
assets		assets
images		images
logs_ChopperAttack		logs_ChopperAttack
saved_networks		saved_networks
.DS_Store		.DS_Store
ChopperAttack.py		ChopperAttack.py
README.md		README.md
deep_q_network.py		deep_q_network.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Using Deep Q-Network to Learn How To Play

Machine Learning Helicopter

Overview

Installation Dependencies:

How to Run?

Deep Q-Network Algorithm

FAQ

References

Disclaimer

About

Releases

Packages

Languages

SalvadorRamon/HelicopterMachineLearning

Folders and files

Latest commit

History

Repository files navigation

Using Deep Q-Network to Learn How To Play

Machine Learning Helicopter

Overview

Installation Dependencies:

How to Run?

Deep Q-Network Algorithm

FAQ

References

Disclaimer

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages