Mario Reinforcement Learning

This project aims to train an artificial intelligence (AI) agent to achieve proficiency in playing Super Mario Bros. through the application of reinforcement learning techniques. Specifically, it utilizes the Proximal Policy Optimization (PPO) algorithm from Stable Baselines 3.

The AI agent learns to navigate the iconic game environment, surmount obstacles, and strategically collect rewards such as coins and power-ups. This project not only demonstrates the capabilities of modern reinforcement learning algorithms but also explores their application in mastering complex, real-time video game scenarios.

Training

It total it took around 10 hours of training on a local Nvidia RTX 4070 Ti Super (16GB) for 2.5 million timesteps.

Installation

To install the required dependencies, you can use pip with the provided requirements.txt file.

Clone the repository:

git clone https://github.com/strcoder4007/Mario-Reinforcement-Learning.git
cd Mario-Reinforcement-Learning

Install the required dependencies:
```
pip install -r requirements.txt
```
This command will install all necessary packages including gym, gym-retro, and stable-baselines3.
Import the ROM: The ROM for Super Mario Bros. can be found in the repo itself, use the following command to import it into gym-retro:
```
python -m retro.import
```

Pre-trained Model

Checkpoint for a trained Mario: https://drive.google.com/file/d/1RRwhSMUrpBBRyAsfHLPGt1rlYFoiuus2/view?usp=sharing

Usage

After installing the dependencies, you can train the agent by running:
```
python train.py
```
This command will start the training process using the PPO algorithm. After/during training the best model will be saved in /tmp/ directory.
Monitor training progress and visualize results using Tensorboard visualizations. Logs are stored in the board folder. Use it by running:
```
tensorboard --logdir "board"
```
You can watch the agent playing the game using the best_model.zip in /tmp/ directory by running:
```
python run.py
```

Credits

Stable Baselines 3: For providing efficient implementations of reinforcement learning algorithms.
OpenAI Gym and gym-retro: For providing the environments to train and test the AI agent.
Nintendo: For creating the classic game Super Mario Bros.

Name		Name	Last commit message	Last commit date
Latest commit History 13 Commits
board		board
images		images
.gitignore		.gitignore
README.md		README.md
RandomAgent.py		RandomAgent.py
Super Mario Bros. (World).nes		Super Mario Bros. (World).nes
mario_ppo.gif		mario_ppo.gif
requirements.txt		requirements.txt
run.py		run.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Mario Reinforcement Learning

Training

Installation

Pre-trained Model

Usage

Credits

About

Releases

Languages

strcoder4007/Mario-Reinforcement-Learning

Folders and files

Latest commit

History

Repository files navigation

Mario Reinforcement Learning

Training

Installation

Pre-trained Model

Usage

Credits

About

Topics

Resources

Stars

Watchers

Forks

Releases

Languages