Robotics
Robotics
Performance standard
Critic Sensors
Learning from Observations
feedback
Environment
changes
Chapter 18, Sections 1–3 Learning Performance
element element
knowledge
learning
goals
experiments
Problem
generator
Agent Effectors
x x
x x
x x
F T WaitEstimate?
Bar? T F T T Raining?
No Yes No Yes
F T F T
Patrons?
Learning agent = performance element + learning element
None Some Full Learning method depends on type of performance element, available
F T Hungry?
feedback, type of component to be improved, and its representation
Yes No
For supervised learning, the aim is to find a simple hypothesis
Type? F that is approximately consistent with training examples
French Italian Thai Burger Decision tree learning using information gain
T F Fri/Sat? T
No Yes Learning performance = prediction accuracy measured on test set
F T
Substantially simpler than “true” tree—a more complex hypothesis isn’t jus-
tified by small amount of data