Policy Iteration - Reinforcement Learning


Src: UC Berkeley 2017 Deep RL Bootcamp, Lecture 1 slides

Task at Hand

The task is to maximize the reward in a grid world where an agent can move in 4 directions: North, South, East and West. Movement is noisy: there is a 20% chance that the agent deviates to the left or right of the requested action, with both deviations equally likely (10% each).


Src: UC Berkeley 2017 Deep RL Bootcamp, Lecture 1 slides
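As a rough illustration of the task described above, the following is a minimal sketch of policy iteration on a small grid with the same movement noise. It is not the code in policyiter or environment. The 3x4 layout, the wall, the +1 and -1 terminal cells, the zero living reward, the discount of 0.9, and all function names are assumptions chosen for the example.

```python
# Illustrative sketch only: layout, rewards and discount are assumptions,
# not values taken from this repository.
ACTIONS = {"N": (-1, 0), "E": (0, 1), "S": (1, 0), "W": (0, -1)}
ORDER = ["N", "E", "S", "W"]  # clockwise, so list neighbours are left/right turns

ROWS, COLS = 3, 4
WALLS = {(1, 1)}
TERMINALS = {(0, 3): 1.0, (1, 3): -1.0}  # terminal cell -> reward on entering it
GAMMA = 0.9
NOISE = 0.2  # total slip probability, split equally between left and right

# Non-terminal, non-wall cells are the states the agent can act in.
STATES = [(r, c) for r in range(ROWS) for c in range(COLS)
          if (r, c) not in WALLS and (r, c) not in TERMINALS]


def outcomes(action):
    """Return [(probability, actual_action)] for a requested action."""
    i = ORDER.index(action)
    left, right = ORDER[(i - 1) % 4], ORDER[(i + 1) % 4]
    return [(1 - NOISE, action), (NOISE / 2, left), (NOISE / 2, right)]


def move(state, action):
    """Deterministic move; hitting a wall or the border leaves the agent in place."""
    r, c = state[0] + ACTIONS[action][0], state[1] + ACTIONS[action][1]
    if 0 <= r < ROWS and 0 <= c < COLS and (r, c) not in WALLS:
        return (r, c)
    return state


def q_value(state, action, values):
    """Expected return of requesting `action` in `state`, then following `values`."""
    total = 0.0
    for prob, actual in outcomes(action):
        nxt = move(state, actual)
        reward = TERMINALS.get(nxt, 0.0)
        # Terminal cells are absent from `values`, so their future value is 0.
        total += prob * (reward + GAMMA * values.get(nxt, 0.0))
    return total


def policy_iteration(eval_tol=1e-10, improve_tol=1e-6):
    policy = {s: "N" for s in STATES}
    values = {s: 0.0 for s in STATES}
    while True:
        # Policy evaluation: repeat the Bellman expectation backup until stable.
        while True:
            delta = 0.0
            for s in STATES:
                new_v = q_value(s, policy[s], values)
                delta = max(delta, abs(new_v - values[s]))
                values[s] = new_v
            if delta < eval_tol:
                break
        # Policy improvement: switch to a greedy action only if it is clearly better.
        stable = True
        for s in STATES:
            best = max(ORDER, key=lambda a: q_value(s, a, values))
            if q_value(s, best, values) > q_value(s, policy[s], values) + improve_tol:
                policy[s] = best
                stable = False
        if stable:
            return policy, values


if __name__ == "__main__":
    policy, values = policy_iteration()
    for r in range(ROWS):
        print(" ".join(policy.get((r, c), "#" if (r, c) in WALLS else "T")
                       for c in range(COLS)))
```

The improvement threshold is deliberately larger than the evaluation tolerance, so that small numerical errors in the evaluated values cannot cause the policy to flip back and forth between near-equal actions.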

Usage

Modify main.json to suit your needs; the key names are self-explanatory. Then run python main.py.

You can also create your own <user-defined>.json file with every parameter defined, and then run python main.py --json_path <user-defined>.json
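For readers curious how a --json_path option like the one above is typically wired up, here is a minimal sketch using Python's standard argparse and json modules. It is not taken from main.py: the load_config helper and the choice of main.json as the default path are assumptions, and no key names are assumed for the config itself.

```python
import argparse
import json


def load_config(argv=None):
    """Parse --json_path (hypothetical default: main.json) and return the parsed JSON."""
    parser = argparse.ArgumentParser(description="Load a JSON config file.")
    parser.add_argument("--json_path", default="main.json",
                        help="path to the JSON file holding the parameters")
    args = parser.parse_args(argv)
    with open(args.json_path) as f:
        return json.load(f)
```

Calling load_config() with no arguments reads the command line, so python main.py would fall back to the default file while python main.py --json_path <user-defined>.json would read the user's file.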


piyush2896/Policy-Iteration
