Lecture Reinforcement Learning
Lecture Reinforcement Learning
Lecture 5.1
Reinforcement Learning
The Edge AI and Robotics Teaching Kit is licensed by NVIDIA and UMBC under the
Creative Commons Attribution-NonCommercial 4.0 International License.
2
Topics
3
Learning Objectives - Reinforcement Learning
4
Reinforcement Learning Concepts
5
Concepts
• Environment- attributes
• Agents
• State/Actions
• Learning – policies, functions,
models
• Objective
• Rewards
6
© D . Poole and A. Mackworth 2019 Artificial Intelligence: Foundations of Computational Agents
Reinforcement Learning
Agent Environment
7
Reinforcement Learning
8
© D . Poole and A. Mackworth 2019 Artificial Intelligence: Foundations of Computational Agents
Reinforcement Learning Loop
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
9
9780135172384)
Copyright © 2020 Pearson Education, Inc. All rights reserved.
Rewards and Values
Figure 1.4 Rewards r and values V(s) for each state s in a simple grid-world
environment. The value of a state is calculated from the rewards using
Equation 1.10 with = 0.9 while using a policy that always takes the
shortest path to the goal state with r = +1.
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
10 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Approaches
11
Reinforcement Learning Approaches
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
Copyright © 2020 Pearson Education, Inc. All rights reserved.
12
Neural Networks Leveraged for RL
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
Copyright © 2020 Pearson Education, Inc. All rights reserved.
13
States
14
Simple Environment
Figure 3.1 Simple environment: five states, two actions per state
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
9780135172384)
15 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Simple Environment
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng
(ISBN-13: 9780135172384)
16 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Simple Environment - Learning
Figure 3.3 Learning the Q*(s, a) for the simple environment from Figure 3.1
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
17 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Simple Environment – Optimal Values
Figure 3.4 Optimal Q-values for the simple environment from Figure 3.1, = 0
(left), = 1 (right)
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
9780135172384)
18 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Processing of Data
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
19 9780135172384)
Reinforcement Learning - GPU
20
Environment
21
Gym Openai
Gym (openai.com)
22
OpenAI Gym - Cartpole
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
23 Copyright © 2020 Pearson Education, Inc. All rights reserved.
OpenAI Gym - Cartpole
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
Copyright © 2020 Pearson Education, Inc. All rights reserved.
24
OpenAI Gym - LunarLander
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13: 9780135172384)
25 Copyright © 2020 Pearson Education, Inc. All rights reserved.
OpenAI Gym - Environments
From Foundations of Deep Reinforcement Learning: Theory and Practice in Python by Laura Graesser and Wah Loon Keng (ISBN-13:
9780135172384)
26 Copyright © 2020 Pearson Education, Inc. All rights reserved.
Additional Information
27
Thank You
Edge AI and Robotics Teaching Kit