Convolutional Auto-Encoder with A3C for Super Mario Bros.

The objective of this project is to test the generalization capabilities of reinforcement learning on unseen states. The baseline A3C is efficient but can only be trained on individual levels; therefore, the goal is to use pretrained Convolutional Auto-Encoder to replace part of A3C model to improve generalization capabilities and even outperform A3C training performance.

The A3C implementation was inspired by vietnguyen91 github

Design

Input from gym --> Component 1 --> Component 2 --> Output

Component 1

Encoder Conv Layer 1 (weights frozen)
Encoder Conv Layer 2 (weights frozen)

Component 2

conv layer 1
conv layer 2
LSTM
Actor & Critic

Process:

Train Convolutiona Auto-Encoder (CAE) and save model and weights
Design CAE with A3C

Requirements

gym==0.10.9
torchvision==0.4.0
scikit_image==0.15.0
matplotlib==3.1.1
numpy==1.17.0
torch==1.2.0
opencv_python==4.1.0.25
gym_super_mario_bros==7.2.3
nes_py==8.1.1
skimage==0.0
tensorboardX==1.8
You need a GPU (CUDA) to run the project, the project will not work efficiently on CPU
It also only on Python3

Train

CAE trained models already provided, more information to follow soon

to run training process of Mario

run python3 main_learn.py

to test project

run python3 main_test.py

to run convolutional auto encoder

run python3 convolutional_autoencoder.py

to run sequential auto encoder

run python3 sequential_autoencoder.py

Results

Coming soon

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
src		src
MarioData.py		MarioData.py
allitems.csv		allitems.csv
convolutional_autoencoder.py		convolutional_autoencoder.py
main_learn.py		main_learn.py
main_test.py		main_test.py
readme.md		readme.md
sequential_autoencoder.py		sequential_autoencoder.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Convolutional Auto-Encoder with A3C for Super Mario Bros.

Design

Process:

Requirements

Train

Results

About

Releases

Packages

Contributors 2

Languages

nkanu17/mario-reinforcement_learning-convolutional-auto-encoder-with-a3c

Folders and files

Latest commit

History

Repository files navigation

Convolutional Auto-Encoder with A3C for Super Mario Bros.

Design

Process:

Requirements

Train

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages