Welcome to the Deep Convolutional Q-Learning for Pac-Man project! This project aims to develop an intelligent Pac-Man player using deep reinforcement learning techniques. The goal is to enable Pac-Man to learn optimal strategies for navigating the maze, avoiding ghosts, and maximizing its score by eating pellets and power-ups.
This project uses a Deep Q-Network (DQN) with convolutional layers, referred to here as Deep Convolutional Q-Learning, to train an AI agent to play Pac-Man. The DQN algorithm combines Q-learning with a deep neural network that approximates the Q-values, which represent the expected cumulative (discounted) future reward of taking a given action in a given state. The convolutional layers let the model learn useful features directly from the visual input of the game environment.
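To make the idea of a Q-value concrete, here is a minimal sketch of the one-step Q-learning target that a DQN is typically trained to match. The function names and the `gamma` default are illustrative assumptions, not taken from this project's script.

```python
def td_target(reward, next_q_values, done, gamma=0.99):
    """One-step Q-learning target: r + gamma * max_a' Q(s', a').

    `next_q_values` holds the (target-network) Q-value estimates for every
    action in the next state. If the episode ended, there is no future
    reward, so the target is just the immediate reward.
    """
    if done:
        return reward
    return reward + gamma * max(next_q_values)


def td_error(q_value, target):
    """Difference between the target and the current estimate Q(s, a)."""
    return target - q_value


# Hypothetical numbers: Pac-Man eats a pellet worth 10 points, and the best
# estimated value among the actions available in the next state is 4.0.
target = td_target(reward=10.0, next_q_values=[1.0, 4.0, 2.5], done=False, gamma=0.9)
error = td_error(q_value=12.0, target=target)
```

Training reduces this error (usually via a squared or Huber loss) across many sampled transitions.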
To run this project, you'll need Python and several dependencies. Follow the steps below to set up your environment:
Clone the repository:
git clone https://github.com/yourusername/deep-convolutional-q-learning-for-pac-man.git
cd deep-convolutional-q-learning-for-pac-man
Create and activate a virtual environment (optional but recommended):
python -m venv venv
source venv/bin/activate  # On Windows, use `venv\Scripts\activate`
Install the required dependencies:
pip install -r requirements.txt
To train the Pac-Man agent, run the following command:
python deep_convolutional_q_learning_for_pac_man.py
This will start the training process, where the agent interacts with the Pac-Man environment and learns over time. The training parameters, such as the number of episodes, learning rate, and discount factor, can be adjusted in the script.
The DQN algorithm works as follows:
- Initialization: Initialize the replay memory and the Q-network with random weights.
- Experience Replay: Store the agent's experiences (state, action, reward, next state) in the replay memory.
- Mini-Batch Training: Randomly sample a mini-batch of experiences from the replay memory and use them to train the Q-network.
- Target Network: Periodically update a separate target network to stabilize training.
- Action Selection: Use an epsilon-greedy policy to balance exploration and exploitation.
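The steps above can be sketched with the standard library alone. This is an illustration under stated assumptions, not the code in `utils/replay_memory.py`: the class and function names, the extra `done` flag in each transition, and the step-based target update rule are all hypothetical.

```python
import random
from collections import deque, namedtuple

# `done` is an added assumption; the text lists only the first four fields.
Transition = namedtuple("Transition", ["state", "action", "reward", "next_state", "done"])


class ReplayMemory:
    """Fixed-size buffer; the oldest experiences are evicted first."""

    def __init__(self, capacity):
        self.buffer = deque(maxlen=capacity)

    def push(self, state, action, reward, next_state, done):
        self.buffer.append(Transition(state, action, reward, next_state, done))

    def sample(self, batch_size):
        # Uniform random sampling breaks the correlation between
        # consecutive frames of the same episode.
        return random.sample(self.buffer, batch_size)

    def __len__(self):
        return len(self.buffer)


def select_action(q_values, epsilon):
    """Epsilon-greedy: explore with probability epsilon, otherwise exploit."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))
    return max(range(len(q_values)), key=q_values.__getitem__)


def should_sync_target(step, sync_every):
    """True on the steps where the target network copies the Q-network weights."""
    return step % sync_every == 0


# Usage with placeholder integer states and four possible moves.
memory = ReplayMemory(capacity=1000)
for t in range(50):
    memory.push(t, t % 4, 1.0, t + 1, False)
batch = memory.sample(8)
greedy_action = select_action([0.1, 0.7, 0.2, 0.0], epsilon=0.0)
```

With `epsilon=0.0` the policy is purely greedy; in practice epsilon usually starts near 1.0 and is decayed as training progresses.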
The convolutional layers in the Q-network process the game frames to extract useful features, which are then used to predict the Q-values for each possible action.
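As a rough illustration of how convolutional layers shrink a frame into a feature vector, the sketch below computes the spatial output size of each layer. The 84x84 input and the three layer settings are assumptions borrowed from the widely used DQN Atari architecture; the network defined in `models/dqn_model.py` may use different values.

```python
def conv_output_size(size, kernel, stride, padding=0):
    """Spatial size after one convolution along a single dimension."""
    return (size + 2 * padding - kernel) // stride + 1


# Assumed (kernel, stride) pairs and final channel count, not project values.
layers = [(8, 4), (4, 2), (3, 1)]
final_channels = 64

size = 84  # assumed height and width of the preprocessed frame
sizes = []
for kernel, stride in layers:
    size = conv_output_size(size, kernel, stride)
    sizes.append(size)

# Number of features passed to the fully connected layers that
# output one Q-value per action.
flattened = final_channels * size * size
```

Under these assumptions the feature maps shrink from 84 to 20, 9, and then 7 pixels per side, giving 64 * 7 * 7 = 3136 features.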
├── assets/
│ └── ... (game assets and sprites)
├── models/
│ └── dqn_model.py (Q-network definition)
├── utils/
│ └── replay_memory.py (Experience replay memory implementation)
├── deep_convolutional_q_learning_for_pac_man.py (Main script)
├── requirements.txt (Dependency list)
└── README.md (Project documentation)
After training for a sufficient number of episodes, the agent should be able to play Pac-Man with improved strategies. The performance of the agent can be evaluated based on its average score, survival time, and ability to avoid ghosts while collecting pellets.
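One simple way to track these metrics is to log the score and survival time of each evaluation episode and summarize them. The helper below is a hypothetical sketch; its name, inputs, and the sample numbers are illustrative and do not come from the project.

```python
from statistics import mean


def summarize_episodes(scores, steps_survived):
    """Aggregate per-episode scores and survival times into summary metrics."""
    return {
        "episodes": len(scores),
        "avg_score": mean(scores),
        "best_score": max(scores),
        "avg_survival_steps": mean(steps_survived),
    }


# Made-up example values for three evaluation episodes.
summary = summarize_episodes(scores=[100, 200, 300], steps_survived=[50, 70, 90])
```

Comparing these summaries across training checkpoints shows whether the agent is actually improving.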