A reinforcement learning project built for the second CM50270 coursework at the University of Bath.
This project trains agents (like PPO, DQN, and others) to play Atari Boxing using the Gym ALE environment.
Windows users:
The `multi-agent-ale-py` package does not compile on native Windows.
You must use Windows Subsystem for Linux (WSL) to run this project.
Once WSL is installed, follow the WSL/Linux instructions below.
```
git clone https://github.com/YOUR-TEAM-NAME/rl-boxing.git
cd rl-boxing
python3 -m venv venv
```
```
# For Mac/Linux/WSL:
source venv/bin/activate
pip install -r render_requirements.txt

# WSL/Linux system dependencies:
sudo apt update
sudo apt install cmake swig zlib1g-dev libboost-all-dev \
    libsdl2-dev libsdl2-image-dev \
    python3-dev build-essential
pip install -r og_requirements.txt

# Then add this to ~/.bashrc or ~/.zshrc:
export DISPLAY=:0.0

# Install the Python dependencies and the Atari ROMs:
pip install -r (your requirements txt file name).txt
AutoROM --accept-license

# Run a match between the two random agents:
python main_rando.py

# Train the agents:
python training/train_ppo.py
```
```
python training/train_dqn.py

# Run the PPO vs DQN match:
python main.py
```

Agents:
- `RandoAgent1`: random agent with equal action probabilities
- `RandoAgent2`: slower/weaker random agent (for testing)
- `PolicyAgent`: PPO agent (trained via `train_ppo.py`)
- `DQNAgent`: DQN agent (trained via `train_dqn.py`)
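For orientation, a uniform-random agent like `RandoAgent1` needs no learning machinery at all: it ignores the observation and samples one of Boxing's 18 discrete actions each step. A minimal sketch (the class name and `act` method here are assumptions, not the repo's actual interface):

```python
import numpy as np

class RandoAgent1:
    """Illustrative sketch of a uniform-random agent (interface assumed).

    Atari Boxing's full action space has 18 discrete actions; act()
    picks one uniformly at random, independent of the observation.
    """

    def __init__(self, n_actions=18, seed=None):
        self.n_actions = n_actions
        self.rng = np.random.default_rng(seed)

    def act(self, observation):
        # The observation is ignored: every action is equally likely.
        return int(self.rng.integers(self.n_actions))

agent = RandoAgent1(seed=0)
actions = [agent.act(None) for _ in range(200)]
```

A "slower/weaker" variant such as `RandoAgent2` could bias this distribution toward the no-op action, which is useful as a sanity-check opponent during training.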
```
rl-multiagent-boxing/
├── agents/
│   ├── __init__.py
│   ├── dqn_agent.py
│   ├── policy_agent.py
│   ├── rando_agent1.py
│   └── rando_agent2.py
├── models/
│   ├── ppo_model.h5
│   └── dqn_model. (keras or h5)
├── training/
│   ├── train_ppo.py
│   └── train_dqn.py
├── main.py                          # PPO vs DQN match
├── main_rando.py                    # RandoAgent1 vs RandoAgent2
├── training_requirements_draft.txt  # A sample file for training on HEX
├── rendering_requirements.txt       # For local rendering
├── og_rendering_requirements.txt    # For WSL local rendering
└── README.md
```
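The match scripts (`main.py`, `main_rando.py`) presumably step both agents through the environment together and tally rewards. A sketch of such a loop against a stand-in environment (the `DummyBoxingEnv` class, player names, and agent API are illustrative assumptions, not the repo's or ALE's actual interfaces):

```python
import random

class DummyBoxingEnv:
    """Stand-in for the real two-player ALE Boxing environment (illustrative only)."""

    def __init__(self, episode_len=50):
        self.episode_len = episode_len
        self.t = 0

    def reset(self):
        self.t = 0
        return {"first_0": None, "second_0": None}  # dummy observations

    def step(self, actions):
        # Advance one timestep; hand back dummy observations and rewards.
        self.t += 1
        obs = {"first_0": None, "second_0": None}
        rewards = {name: random.choice([-1, 0, 1]) for name in actions}
        done = self.t >= self.episode_len
        return obs, rewards, done

class RandomAgent:
    def act(self, obs):
        return random.randrange(18)  # Boxing has 18 discrete actions

def play_match(env, agents):
    """Run one episode and return each agent's total reward."""
    obs = env.reset()
    totals = {name: 0 for name in agents}
    done = False
    while not done:
        actions = {name: agent.act(obs[name]) for name, agent in agents.items()}
        obs, rewards, done = env.step(actions)
        for name, r in rewards.items():
            totals[name] += r
    return totals

totals = play_match(DummyBoxingEnv(), {"first_0": RandomAgent(), "second_0": RandomAgent()})
```

Swapping `RandomAgent` for the trained `PolicyAgent` and `DQNAgent` gives the PPO-vs-DQN matchup that `main.py` runs.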
University of Bath — CM50270 Reinforcement Learning Coursework 2