GitHub - sadighian/baselines: Deep Reinforcement Learning research experiment using OpenAI Baselines'

This Repository

This repository contains the source code used for a research project in EPITA's Computer Science Master's program, to which the goal was to evaluate the best performing neural network architecture, given a neuron constraint (i.e., resource budget).

The project heavily relies on OpenAI's Baselines with only a few minor changes for the purpose of our experiment.

The Experiment

Abstract

We explore and evaluate the performance of different neural network architectures with a fixed neuron budget for deep reinforcement learning. We use a Double Dueling DQN algorithm with prioritized experience replay for the Atari game Space Invaders. We also use the default hyperparameters and frame stacking.

Experiment Overview

We train our 3 agents for 10 million frames (or steps in the environment), which takes approximately 16 hours per agent on our testing computer. The three agents consist of: first, a baseline with a 4-layer MLP, second, a shallow network with a 2-layer MLP, and third, a deep network with a 8-layer MLP. We use the game's high score as our metric to evaluate each agent (i.e., experiment).

The Results

The agent with the deep 8-layer MLP performed the best with an average score of 250, but the outcome is relatively similar across all three experiments (baseline and shallow networks achieved average scores of 242 and 247 respectively).

To read more about the experiment, refer to DDQN Research Report 2019-02-14.pdf.

Citing Our Research

@misc{Deep Reinforcement Learning,
  author = {Jonathan Sadighian, Nadia Sjöstedt, Rhea Moubarak},
  title = {Different Neural Network Depths for Deep Reinforcement Learning},
  year = {2019},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/sadighian/baselines}},
}

Name		Name	Last commit message	Last commit date
Latest commit History 299 Commits
baselines		baselines
data		data
docs/viz		docs/viz
.benchmark_pattern		.benchmark_pattern
.gitignore		.gitignore
.travis.yml		.travis.yml
DDQN Research Report 2019-02-14.pdf		DDQN Research Report 2019-02-14.pdf
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
benchmarks_atari10M.htm		benchmarks_atari10M.htm
benchmarks_mujoco1M.htm		benchmarks_mujoco1M.htm
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

This Repository

The Experiment

Citing Our Research

About

Uh oh!

Releases

Packages

Languages

License

sadighian/baselines

Folders and files

Latest commit

History

Repository files navigation

This Repository

The Experiment

Citing Our Research

About

Topics

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages