TRAINING VISION-BASED AGENT WITH THE ACTOR CRITIC MODEL IN AN ONLINE ENVIRONMENT

A agent for Slither.io, which is an online game,using actor-critic algorithms. To conquer the uncontrollable difficulty degree, dis-converge or fall into local minima, we propose four methods and greatly improve the performance.
For more detail, please see our report:Report

Requirements

Usage

We don't upload our model, so please train before test.

Train: python train_AC.py
Test: python play_AC.py

There might have some bug or unstable since it's our experiment code.

Name		Name	Last commit message	Last commit date
Latest commit History 20 Commits
AC.py		AC.py
README.md		README.md
TRAINING-VISION-BASED-AGENT-WITH-THE-ACTOR-CRITIC-MODEL-IN-AN-ONLINE-ENVIRONMENT.pdf		TRAINING-VISION-BASED-AGENT-WITH-THE-ACTOR-CRITIC-MODEL-IN-AN-ONLINE-ENVIRONMENT.pdf
env.py		env.py
model.py		model.py
play_AC.py		play_AC.py
train_AC.py		train_AC.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

TRAINING VISION-BASED AGENT WITH THE ACTOR CRITIC MODEL IN AN ONLINE ENVIRONMENT

Requirements

Usage

Demo

About

Releases

Packages

Languages

codingbaobao/SlitherIO_Actor-Critic

Folders and files

Latest commit

History

Repository files navigation

TRAINING VISION-BASED AGENT WITH THE ACTOR CRITIC MODEL IN AN ONLINE ENVIRONMENT

Requirements

Usage

Demo

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages