diff --git a/README.md b/README.md
index 35b0edc..c95c4c4 100644
--- a/README.md
+++ b/README.md
@@ -63,65 +63,65 @@ Codes (it'll send you to appropriate folder):
 Experimental results (figures and tables on this page):
-* RL Q-Learning Environment-1. Experimental results
-* Q-learning algorithm resulted chart for the environment-1
-* Final Q-table with values from the final shortest route for environment-1
-* RL Q-Learning Environment-2. Experimental results
-* Q-learning algorithm resulted chart for the environment-2
-* Final Q-table with values from the final shortest route for environment-1
-* RL Q-Learning Environment-3. Experimental results
-* Comparison analysis of Q-Learning and Sarsa algorithms
+* [RL Q-Learning Environment-1. Experimental results](#rl-q-learning-environment-1-experimental-results)
+* [Q-learning algorithm resulted chart for the environment-1](#q-learning-algorithm-resulted-chart-for-the-environment-1)
+* [Final Q-table with values from the final shortest route for environment-1](#final-q-table-with-values-from-the-final-shortest-route-for-environment-1)
+* [RL Q-Learning Environment-2. Experimental results](#rl-q-learning-environment-2-experimental-results)
+* [Q-learning algorithm resulted chart for the environment-2](#q-learning-algorithm-resulted-chart-for-the-environment-2)
+* [Final Q-table with values from the final shortest route for environment-2](#final-q-table-with-values-from-the-final-shortest-route-for-environment-2)
+* [RL Q-Learning Environment-3. Experimental results](#rl-q-learning-environment-3-experimental-results)
+* [Comparison analysis of Q-Learning and Sarsa algorithms](#comparison-analysis-of-q-learning-and-sarsa-algorithms)
-### RL Q-Learning Environment-1. Experimental results
+### RL Q-Learning Environment-1. Experimental results
 Environment-1 with mobile robot, goal and obstacles
 ![RL_Q-Learning_E-1](images/Environment-1.png)
-### Q-learning algorithm resulted chart for the environment-1
+### Q-learning algorithm resulted chart for the environment-1
 Represents the number of episodes versus the number of steps, and the number of episodes versus the cost, for each episode
 ![RL_Q-Learning_C-1](images/Charts-1.png)
-### Final Q-table with values from the final shortest route for environment-1
+### Final Q-table with values from the final shortest route for environment-1
 ![RL_Q-Learning_T-1](images/Q-Table-E-1.png)
 Looking at the values in the table, we can see the decision for the next action made by the agent (mobile robot). Once the Q-table is filled with knowledge, the final sequence of actions to reach the goal is the following: *down-right-down-down-down-right-down-right-down-right-down-down-right-right-up-up.*
 During the experiment with the Q-learning algorithm, the shortest route found to reach the goal in environment-1 consists of 16 steps, and the longest route found consists of 185 steps.
-### RL Q-Learning Environment-2. Experimental results
+### RL Q-Learning Environment-2. Experimental results
 Bigger environment-2 with more obstacles
 ![RL_Q-Learning_E-2](images/Environment-2.png)
-### Q-learning algorithm resulted chart for the environment-2
+### Q-learning algorithm resulted chart for the environment-2
 Represents the number of episodes versus the number of steps, and the number of episodes versus the cost, for each episode
 ![RL_Q-Learning_C-2](images/Charts-2.png)
-### Final Q-table with values from the final shortest route for environment-1
+### Final Q-table with values from the final shortest route for environment-2
 ![RL_Q-Learning_T-2](images/Q-Table-E-2.png)
-### RL Q-Learning Environment-3. Experimental results
+### RL Q-Learning Environment-3. Experimental results
 Super complex environment-3 with a lot of obstacles
 ![RL_Q-Learning_E-3](images/Environment-3.png)
-### Comparison analysis of Q-Learning and Sarsa algorithms
+### Comparison analysis of Q-learning and Sarsa algorithms
 ![RQ-learning_via_Sarsa](images/Q-learning_via_Sarsa.png)
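
The environment-1 section above reads the final route directly off the filled Q-table by always taking the highest-valued action in the current state. Below is a minimal Python sketch of that idea on a toy grid with a dictionary-style Q-table; the names and layout are illustrative assumptions, not the repository's actual code.

```python
import numpy as np

# Toy 4x4 grid; the repository's environments are larger, but the idea of
# reading the route off a filled Q-table is the same.
N = 4
ACTIONS = ['up', 'down', 'left', 'right']
MOVES = {'up': (-1, 0), 'down': (1, 0), 'left': (0, -1), 'right': (0, 1)}

# Hypothetical filled Q-table: one vector of action values per state (row, col).
# This toy table simply prefers 'right' until the last column, then 'down'.
q_table = {}
for r in range(N):
    for c in range(N):
        values = np.zeros(len(ACTIONS))
        values[ACTIONS.index('right' if c < N - 1 else 'down')] = 1.0
        q_table[(r, c)] = values

def extract_route(q_table, start, goal, max_steps=100):
    """Follow the greedy (highest-valued) action in each state until the goal."""
    state, route = start, []
    for _ in range(max_steps):
        if state == goal:
            break
        action = ACTIONS[int(np.argmax(q_table[state]))]  # greedy action choice
        route.append(action)
        dr, dc = MOVES[action]
        state = (state[0] + dr, state[1] + dc)
    return route

print(extract_route(q_table, start=(0, 0), goal=(N - 1, N - 1)))
# -> ['right', 'right', 'right', 'down', 'down', 'down']
```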
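
The Q-learning vs. Sarsa comparison in the last section comes down to the update target: Q-learning bootstraps from the best action value of the next state (off-policy), while Sarsa bootstraps from the action actually chosen next (on-policy). A short illustrative sketch of the two update rules, with a dict-of-dicts Q-table; all names are assumptions, not the repository's code.

```python
# alpha = learning rate, gamma = discount factor.

def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    # Off-policy: bootstrap from the best action value in the next state.
    td_target = r + gamma * max(Q[s_next].values())
    Q[s][a] += alpha * (td_target - Q[s][a])

def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    # On-policy: bootstrap from the action actually chosen in the next state.
    td_target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (td_target - Q[s][a])

# Tiny usage example with a two-state Q-table.
Q = {s: {a: 0.0 for a in ('up', 'down', 'left', 'right')} for s in ('s0', 's1')}
q_learning_update(Q, 's0', 'right', r=-1.0, s_next='s1')
sarsa_update(Q, 's0', 'right', r=-1.0, s_next='s1', a_next='down')
```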