Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
3 changes: 3 additions & 0 deletions reinforcement_learning/data/readme.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,3 @@
The data being used is stored in the following format :-
"state,action,reward,next_state,done" based on the Q-learning strategy after learning from experiences played in the episodes.
The output weights are stored in the solution folder named "cartpole_model".
14 changes: 14 additions & 0 deletions reinforcement_learning/exercise/readme.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,14 @@
# Exercise goal
- Learn the concept of Reinforcement learning by solving the cartpole player problem with the use of OpenAI gym.

# Data
- Using the cartpole player from OpenAI gym. [Link](https://gym.openai.com/docs/)

# Task
- This exercise helps in learning reinforcement learning using 3 approaches.
- First, using random playing strategy multiple times and saving parameters for the best average episode.
- Second, playing multiple times and updating the weights which leads to best optimum time.
- Third, using Deep Q-Learning strategy to optimize the parameters, left and right actions per se for various frames.

# Prerequisites
- Must have experience with basic Deep Learning using Tensorflow.
Loading