Project 3: Collaboration and Competition - Tennis

Introduction

Given two agents that control rockets to bounce a ball over a net. The goal is that the agents must bounce ball between one another while not dropping or sending ball out of bounds. The agent receives a reward of +0.1 if it hits the ball over the net. It receives a reward of -0.1 if the ball hits the ground or sends out of bounds.

The observation space consists of 8 variables: the position of the ball and racket and the velocity of the ball and rocket. There are two continuous actions are available for each agent: (1) movement toward or away from the net, (2) jumping.

The environment is considered solved, when the agents get in average at least +0.5 reward over 100 episodes.

Set up

Download the environment that matches your operating system:
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
Place the file in the DRLND (https://github.com/udacity/deep-reinforcement-learning), in the p3_collab-compet/ folder, and unzip (or decompress) the file.

Getting started

Open Tennis.ipynb

Train the agent

Run the cells from 1. to 5. After the training a checkpoint.pth file will be created containing all trained weights.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
README.md		README.md
Report.pdf		Report.pdf
Tennis.app.zip		Tennis.app.zip
Tennis.ipynb		Tennis.ipynb
checkpoint_actor.pth		checkpoint_actor.pth
checkpoint_critic.pth		checkpoint_critic.pth

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project 3: Collaboration and Competition - Tennis

Introduction

Set up

Getting started

Train the agent

About

Releases

Packages

Languages

anushmanukyan/Udacity-Project3

Folders and files

Latest commit

History

Repository files navigation

Project 3: Collaboration and Competition - Tennis

Introduction

Set up

Getting started

Train the agent

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages