
pong_model

The dumbest game you might ever play

Installation

  1. Create a virtual environment
    python -m venv .venv

  2. Activate the environment
    source .venv/bin/activate

  3. If installing for AMD GPU training/inference

    1. pip install -r requirements-rocm.txt
    2. For the MI100 GPU,
      1. If using flash attention, clone the flash-attention repo
        git clone https://github.com/Dao-AILab/flash-attention.git dependencies/flash-attention
      2. Navigate to the flash-attention directory
        cd dependencies/flash-attention
      3. Modify the setup.py to include gfx908 in supported archs
      4. Install using ROCm environment
        export GPU_ARCHS=gfx908 && rocm-python setup.py install
  4. If installing for Nvidia GPU

    1. pip install flash-attn --no-build-isolation
  5. Install all other dependencies
    pip install -r requirements.txt
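
Putting the Nvidia-only path together (steps 1, 2, 4 and 5 above), the sequence looks like:

python -m venv .venv
source .venv/bin/activate
pip install flash-attn --no-build-isolation
pip install -r requirements.txt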

Model configuration

To adjust model parameters, update model_configuration.py

If no GPU is available, be sure to set the device to cpu
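
As a rough sketch, the device setting might look like the following inside model_configuration.py (only device is mentioned above; the other names here are illustrative placeholders, so check the file for the actual parameters):

# Hypothetical sketch -- only `device` is referenced in this README;
# the remaining names are placeholders, not the real parameters.
device = "cpu"       # use "cpu" when no GPU is available
hidden_size = 128    # placeholder: example model size parameter
num_layers = 2       # placeholder: example depth parameter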

Training

Run the training script to generate a model
python trainer.py

By default, RNNModel is trained. Provide the --model_type CLI arg to train a different model type.

Run python trainer.py -h to see all options for training.

Some model types such as TransformerModel use multiple processes while training.
To prevent consuming all CPU cores, you can set OMP_NUM_THREADS=4 to limit the number of threads.
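
For example, combining the environment variable with the --model_type argument from above (the exact value accepted by --model_type may differ; run python trainer.py -h to check):
OMP_NUM_THREADS=4 python trainer.py --model_type TransformerModel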

Test the model

Run the main script with the desired generator

  1. exact - generates states computed mathematically
  2. fuzzy - generates states using a model trained on states generated from the engine

e.g.
python main.py --generator_type exact

Run python main.py -h to see all options for running the main script.

Some model types such as TransformerModel use multiple processes while running.
To prevent consuming all CPU cores, you can set OMP_NUM_THREADS=2 to limit the number of threads.
OMP_NUM_THREADS=2 python main.py

The --model_path argument must be provided. This path is either an MLflow runs path or a relative path to a local file.

  1. MLflow path example: 'runs:/000fc0c95642447899b50e9104b7f6a0/model_e44'
  2. local path example: 'artifacts/000fc0c95642447899b50e9104b7f6a0/model_e44'

Loading a model from MLflow will cache the model in the artifacts directory.

python main.py --device cpu --model_path "artifacts/48f737882a6b47c18981801e6f85b3f0/model_e59"
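
The same pattern should work with an MLflow runs path (reusing the example run ID from above); the model will then be cached under the artifacts directory:
python main.py --device cpu --generator_type fuzzy --model_path "runs:/000fc0c95642447899b50e9104b7f6a0/model_e44"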

Todo

  • Capture metrics for model performance during training
  • incorporate MLflow for tracking progress
  • parameterize the model variant via CLI (and other runtime args)
  • Include bounding box collisions in the input data
  • separate paddle control and scoring from ball engine
  • enable user control of paddles
  • introduce variability to paddle movements in the generator
  • consider resetting ALL states to zero when ball resets so states prior to scoring don't affect ball behavior
  • limit ball vector to certain degrees
  • provide extreme negative feedback the further the ball goes out of bounds during training
  • provide extreme negative feedback for the ball moving slowly or not at all
  • try out a couple different model architectures to see which might start to provide usable results
  • predict score (continuous integer mse) and hits as well (binary state cross-entropy)
  • make sure all inputs to the model are standardized
    • currently position information is between 0 and 1 whereas velocity is between -1 and 1
  • create separate training configuration file
    • include options to adjust generated paddle velocities to control even data generation
  • introduce variability configuration setting for models to produce more unpredictable output
  • Let the model control scaling factor of the game for extra glitchy experience
  • Consider how to make the model control an arbitrary number of balls...
    • multiple model instances?
    • Another model to determine how many instances should be provided?
  • Create common loop for fuzzy and exact engines (mainly paddle control)
    • Not a common loop, but the setup of paddles has been encapsulated by factories
  • Train the model on resetting the game when best of X reached
