Robotics

1. The document discusses learning agents and their components. The design of the learning element depends on the type of performance element, the functional component being learned, how that component is represented, and the type of feedback available.
2. Inductive learning (labelled "a.k.a. Science" in the slides) means learning a function from examples: finding a hypothesis that approximates the target function. A key challenge is keeping the hypothesis consistent with the examples while keeping it simple.
3. Example pairings of performance elements and learned components are given, such as a neural network serving as the percept-action function of a simple reflex agent.

Uploaded by

api-19801502

Learning from Observations

Chapter 18, Sections 1–3

Outline

♦ Learning agents
♦ Inductive learning
♦ Decision tree learning
♦ Measuring learning performance

Learning

Learning is essential for unknown environments,
i.e., when designer lacks omniscience

Learning is useful as a system construction method,
i.e., expose the agent to reality rather than trying to write it down

Learning modifies the agent’s decision mechanisms to improve performance

Learning agents

[Figure: learning agent diagram. Boxes: Performance standard, Critic, Sensors, Learning element, Performance element, Problem generator, Effectors, Agent, Environment. Arrow labels: feedback, changes, knowledge, learning goals, experiments.]

Learning element

Design of learning element is dictated by
♦ what type of performance element is used
♦ which functional component is to be learned
♦ how that functional component is represented
♦ what kind of feedback is available

Example scenarios:

Performance element    Component            Representation             Feedback
Alpha-beta search      Eval. fn.            Weighted linear function   Win/loss
Logical agent          Transition model     Successor-state axioms     Outcome
Utility-based agent    Transition model     Dynamic Bayes net          Outcome
Simple reflex agent    Percept-action fn    Neural net                 Correct action

Supervised learning: correct answers for each instance

Reinforcement learning: occasional rewards

Inductive learning (a.k.a. Science)

Simplest form: learn a function from examples (tabula rasa)

f is the target function

An example is a pair x, f(x), e.g., a tic-tac-toe board position paired with the value +1

Problem: find a(n) hypothesis h
such that h ≈ f
given a training set of examples

(This is a highly simplified model of real learning:
– Ignores prior knowledge
– Assumes a deterministic, observable “environment”
– Assumes examples are given
– Assumes that the agent wants to learn f. Why?)

Inductive learning method

Construct/adjust h to agree with f on training set
(h is consistent if it agrees with f on all examples)

E.g., curve fitting:

[Figure: plots of f(x) against x for the curve-fitting example, repeated over several slides]

Ockham’s razor: maximize a combination of consistency and simplicity

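
The consistency test and the preference for simplicity can be illustrated with a small Python sketch. The training set, the candidate hypotheses, and the helper names `is_consistent` and `simplest_consistent` are all assumptions made for this illustration and do not come from the slides. Complexity is approximated here simply by the order in which the candidates are listed.

```python
from typing import Callable, List, Optional, Tuple

Example = Tuple[float, float]  # a pair (x, f(x))


def is_consistent(h: Callable[[float], float], examples: List[Example],
                  tol: float = 1e-9) -> bool:
    """h is consistent if it agrees with f on every training example."""
    return all(abs(h(x) - y) <= tol for x, y in examples)


# Hypothetical training set, sampled from f(x) = x**2 (unknown to the learner).
training_set = [(-1.0, 1.0), (0.0, 0.0), (1.0, 1.0), (2.0, 4.0)]

# Hypothetical candidate hypotheses, listed from simplest to most complex.
candidates = [
    ("constant", lambda x: 1.0),
    ("linear", lambda x: x),
    ("quadratic", lambda x: x * x),
    # Also passes through all four points, but differs from x**2 elsewhere.
    ("quartic", lambda x: x * x + (x + 1) * x * (x - 1) * (x - 2)),
]


def simplest_consistent(hypotheses, examples) -> Optional[str]:
    """Return the name of the first (simplest) consistent hypothesis, if any."""
    for name, h in hypotheses:
        if is_consistent(h, examples):
            return name
    return None


print(simplest_consistent(candidates, training_set))  # quadratic
```

Both the quadratic and the quartic agree with every training example, so consistency alone cannot choose between them; the simplicity preference picks the quadratic.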

Attribute-based representations

Examples described by attribute values (Boolean, discrete, continuous, etc.)
E.g., situations where I will/won’t wait for a table:

Example  Alt  Bar  Fri  Hun  Pat   Price  Rain  Res  Type     Est    WillWait (target)
X1       T    F    F    T    Some  $$$    F     T    French   0–10   T
X2       T    F    F    T    Full  $      F     F    Thai     30–60  F
X3       F    T    F    F    Some  $      F     F    Burger   0–10   T
X4       T    F    T    T    Full  $      F     F    Thai     10–30  T
X5       T    F    T    F    Full  $$$    F     T    French   >60    F
X6       F    T    F    T    Some  $$     T     T    Italian  0–10   T
X7       F    T    F    F    None  $      T     F    Burger   0–10   F
X8       F    F    F    T    Some  $$     T     T    Thai     0–10   T
X9       F    T    T    F    Full  $      T     F    Burger   >60    F
X10      T    T    T    T    Full  $$$    F     T    Italian  10–30  F
X11      F    F    F    F    None  $      F     F    Thai     0–10   F
X12      T    T    T    T    Full  $      F     F    Burger   30–60  T

Classification of examples is positive (T) or negative (F)

Decision trees

One possible representation for hypotheses
E.g., here is the “true” tree for deciding whether to wait:

Patrons?
  None: F
  Some: T
  Full: WaitEstimate?
    >60: F
    30–60: Alternate?
      No: Reservation?
        No: Bar?
          No: F
          Yes: T
        Yes: T
      Yes: Fri/Sat?
        No: F
        Yes: T
    10–30: Hungry?
      No: T
      Yes: Alternate?
        No: T
        Yes: Raining?
          No: F
          Yes: T
    0–10: T

Expressiveness

Decision trees can express any function of the input attributes.
E.g., for Boolean functions, truth table row → path to leaf:

A  B  A xor B
F  F  F
F  T  T
T  F  T
T  T  F

A?
  F: B?
    F: F
    T: T
  T: B?
    F: T
    T: F

Trivially, there is a consistent decision tree for any training set
w/ one path to leaf for each example (unless f nondeterministic in x)
but it probably won’t generalize to new examples

Prefer to find more compact decision trees
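
The "truth table row → path to leaf" construction can be sketched in Python as follows. The tree representation (a Boolean leaf, or a tuple of attribute name and branch dictionary) and the function names are assumptions chosen for this illustration, not something the slides prescribe.

```python
from itertools import product

# A tree is either a leaf (bool) or a tuple (attribute, {value: subtree}).
XOR_TREE = ("A", {
    False: ("B", {False: False, True: True}),
    True: ("B", {False: True, True: False}),
})


def classify(tree, example):
    """Follow the path selected by the example's attribute values to a leaf."""
    while not isinstance(tree, bool):
        attribute, branches = tree
        tree = branches[example[attribute]]
    return tree


def tree_from_truth_table(attributes, table):
    """Build the trivial tree with one root-to-leaf path per truth-table row.

    `table` maps a tuple of Boolean attribute values to the output value.
    """
    if not attributes:
        return table[()]
    first, rest = attributes[0], attributes[1:]
    return (first, {
        value: tree_from_truth_table(
            rest,
            {row[1:]: out for row, out in table.items() if row[0] == value},
        )
        for value in (False, True)
    })


xor_table = {(a, b): a != b for a, b in product((False, True), repeat=2)}
print(tree_from_truth_table(("A", "B"), xor_table) == XOR_TREE)  # True
```

A tree built this way has one leaf per row, so it grows exponentially with the number of attributes, which is one reason to prefer more compact trees.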

Hypothesis spaces

How many distinct decision trees with n Boolean attributes??
= number of Boolean functions
= number of distinct truth tables with $2^n$ rows = $2^{2^n}$

E.g., with 6 Boolean attributes, there are 18,446,744,073,709,551,616 trees

How many purely conjunctive hypotheses (e.g., Hungry ∧ ¬Rain)??
Each attribute can be in (positive), in (negative), or out
⇒ $3^n$ distinct conjunctive hypotheses

More expressive hypothesis space
– increases chance that target function can be expressed
– increases number of hypotheses consistent w/ training set
⇒ may get worse predictions
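
The two counts above are easy to check numerically. The following Python sketch uses illustrative function names, and the brute-force enumeration is only an added sanity check for small n.

```python
from itertools import product


def num_boolean_functions(n: int) -> int:
    """Number of distinct truth tables with 2**n rows, i.e. 2**(2**n)."""
    return 2 ** (2 ** n)


def num_conjunctive_hypotheses(n: int) -> int:
    """Each attribute is in (positive), in (negative), or out: 3**n."""
    return 3 ** n


def enumerate_truth_tables(n: int) -> int:
    """Brute-force count of output columns for a truth table with 2**n rows."""
    return sum(1 for _ in product((False, True), repeat=2 ** n))


print(f"{num_boolean_functions(6):,}")   # 18,446,744,073,709,551,616
print(num_conjunctive_hypotheses(6))     # 729
print(enumerate_truth_tables(2))         # 16
```

For 6 attributes the conjunctive space has only 729 hypotheses against roughly 1.8 × 10^19 Boolean functions, which makes the trade-off between expressiveness and the number of consistent hypotheses concrete.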

Decision tree learning

Aim: find a small tree consistent with the training examples

Idea: (recursively) choose “most significant” attribute as root of (sub)tree

function DTL(examples, attributes, default) returns a decision tree
    if examples is empty then return default
    else if all examples have the same classification then return the classification
    else if attributes is empty then return Mode(examples)
    else
        best ← Choose-Attribute(attributes, examples)
        tree ← a new decision tree with root test best
        for each value v_i of best do
            examples_i ← {elements of examples with best = v_i}
            subtree ← DTL(examples_i, attributes − best, Mode(examples))
            add a branch to tree with label v_i and subtree subtree
        return tree
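
A minimal Python sketch of the DTL pseudocode, run on the 12 restaurant examples, might look like the following. Several things here are assumptions for illustration: Choose-Attribute is taken to pick the attribute with the lowest expected remaining entropy (the criterion the slides develop under "Information"), trees are represented as `(attribute, {value: subtree})` tuples, attribute domains are read off the data, and all helper names are made up for this sketch.

```python
import math
from collections import Counter

ATTRS = ["Alt", "Bar", "Fri", "Hun", "Pat", "Price", "Rain", "Res", "Type", "Est"]
ROWS = """
T F F T Some $$$ F T French  0-10  T
T F F T Full $   F F Thai    30-60 F
F T F F Some $   F F Burger  0-10  T
T F T T Full $   F F Thai    10-30 T
T F T F Full $$$ F T French  >60   F
F T F T Some $$  T T Italian 0-10  T
F T F F None $   T F Burger  0-10  F
F F F T Some $$  T T Thai    0-10  T
F T T F Full $   T F Burger  >60   F
T T T T Full $$$ F T Italian 10-30 F
F F F F None $   F F Thai    0-10  F
T T T T Full $   F F Burger  30-60 T
"""
# Each example is (attribute dict, WillWait label).
EXAMPLES = [(dict(zip(ATTRS, f[:-1])), f[-1])
            for f in (line.split() for line in ROWS.strip().splitlines())]
# Assumed domain of each attribute: the values that occur in the data.
VALUES = {a: sorted({x[a] for x, _ in EXAMPLES}) for a in ATTRS}


def mode(examples):
    """Most common classification among the examples."""
    return Counter(label for _, label in examples).most_common(1)[0][0]


def entropy(examples):
    """Entropy in bits of the label distribution of the examples."""
    counts = Counter(label for _, label in examples)
    return -sum(c / len(examples) * math.log2(c / len(examples))
                for c in counts.values())


def split(examples, attribute):
    """Group examples by their value for the given attribute."""
    groups = {}
    for example in examples:
        groups.setdefault(example[0][attribute], []).append(example)
    return groups


def remainder(examples, attribute):
    """Expected bits still needed per example after testing the attribute."""
    return sum(len(g) / len(examples) * entropy(g)
               for g in split(examples, attribute).values())


def choose_attribute(attributes, examples):
    """Assumed criterion: minimise the remaining information needed."""
    return min(attributes, key=lambda a: remainder(examples, a))


def dtl(examples, attributes, default):
    if not examples:
        return default
    labels = {label for _, label in examples}
    if len(labels) == 1:
        return labels.pop()
    if not attributes:
        return mode(examples)
    best = choose_attribute(attributes, examples)
    groups = split(examples, best)
    rest = [a for a in attributes if a != best]
    return (best, {v: dtl(groups.get(v, []), rest, mode(examples))
                   for v in VALUES[best]})


def classify(tree, x):
    """Walk from the root to a leaf using the attribute values of x."""
    while isinstance(tree, tuple):
        attribute, branches = tree
        tree = branches[x[attribute]]
    return tree


tree = dtl(EXAMPLES, ATTRS, default="F")
print(tree[0])  # Pat
```

With this criterion the root test is Patrons, and the resulting tree classifies all 12 training examples correctly. Ties between attributes are broken by list order in this sketch, so the lower levels of the tree depend on that arbitrary choice.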

Choosing an attribute

Idea: a good attribute splits the examples into subsets that are (ideally) “all positive” or “all negative”

[Figure: the examples split by Patrons? (None, Some, Full) and by Type? (French, Italian, Thai, Burger)]

Patrons? is a better choice: it gives information about the classification
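
One way to make "gives information about the classification" concrete is the expected number of bits still needed after a split, the entropy-based measure defined on the Information slides. The sketch below computes it for Patrons? and Type?. The per-branch (positive, negative) counts were tallied from the 12 restaurant examples, and the function names are illustrative.

```python
import math


def entropy(probabilities):
    """H(<P1, ..., Pn>) = sum of -Pi log2 Pi, taking 0 log 0 as 0."""
    return sum(-p * math.log2(p) for p in probabilities if p > 0)


def remainder(branches):
    """Expected bits per example after a split; branches = [(p_i, n_i), ...]."""
    total = sum(p + n for p, n in branches)
    return sum((p + n) / total * entropy([p / (p + n), n / (p + n)])
               for p, n in branches)


# (positive, negative) counts per branch, tallied from the 12 examples.
patrons = [(0, 2), (4, 0), (2, 4)]            # None, Some, Full
type_ = [(1, 1), (1, 1), (2, 2), (2, 2)]      # French, Italian, Thai, Burger

print(round(entropy([0.5, 0.5]), 3))   # 1.0 bit needed before any split
print(round(remainder(patrons), 3))    # 0.459
print(round(remainder(type_), 3))      # 1.0
```

Splitting on Type? leaves every branch evenly mixed, so a full bit is still needed, whereas Patrons? leaves only the Full branch undecided.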

Information

Information answers questions

The more clueless I am about the answer initially, the more information is contained in the answer

Scale: 1 bit = answer to Boolean question with prior $\langle 0.5, 0.5 \rangle$

Information in an answer when prior is $\langle P_1, \ldots, P_n \rangle$ is

$H(\langle P_1, \ldots, P_n \rangle) = \sum_{i=1}^{n} -P_i \log_2 P_i$

(also called entropy of the prior)

Information contd.

Suppose we have p positive and n negative examples at the root
⇒ $H(\langle p/(p+n),\, n/(p+n) \rangle)$ bits needed to classify a new example
E.g., for 12 restaurant examples, p = n = 6 so we need 1 bit

An attribute splits the examples E into subsets $E_i$, each of which (we hope) needs less information to complete the classification

Let $E_i$ have $p_i$ positive and $n_i$ negative examples
⇒ $H(\langle p_i/(p_i+n_i),\, n_i/(p_i+n_i) \rangle)$ bits needed to classify a new example
⇒ expected number of bits per example over all branches is

$\sum_i \frac{p_i + n_i}{p + n}\, H(\langle p_i/(p_i+n_i),\, n_i/(p_i+n_i) \rangle)$

For Patrons?, this is 0.459 bits, for Type this is (still) 1 bit
⇒ choose the attribute that minimizes the remaining information needed

Example contd.

Decision tree learned from the 12 examples:

Patrons?
  None: F
  Some: T
  Full: Hungry?
    Yes: Type?
      French: T
      Italian: F
      Thai: Fri/Sat?
        No: F
        Yes: T
      Burger: T
    No: F

Substantially simpler than “true” tree: a more complex hypothesis isn’t justified by small amount of data

Performance measurement

How do we know that h ≈ f? (Hume’s Problem of Induction)

1) Use theorems of computational/statistical learning theory
2) Try h on a new test set of examples
   (use same distribution over example space as training set)

Learning curve = % correct on test set as a function of training set size

[Figure: learning curve, % correct on test set (axis ticks 0.4 to 0.9) against training set size (0 to 100)]

Performance measurement contd.

Learning curve depends on
– realizable (can express target function) vs. non-realizable
  non-realizability can be due to missing attributes
  or restricted hypothesis class (e.g., thresholded linear function)
– redundant expressiveness (e.g., loads of irrelevant attributes)

[Figure: % correct against # of examples, with curves labelled realizable, redundant, and nonrealizable]

Summary

Learning needed for unknown environments, lazy designers

Learning agent = performance element + learning element

Learning method depends on type of performance element, available feedback, type of component to be improved, and its representation

For supervised learning, the aim is to find a simple hypothesis that is approximately consistent with training examples

Decision tree learning using information gain

Learning performance = prediction accuracy measured on test set
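
Measuring prediction accuracy on a test set, and tabulating it against training set size to obtain a learning curve, can be sketched in Python as below. Everything concrete here is an assumption for illustration: the target function, the fixed shuffle used to separate training and test examples, and the nearest-neighbour learner, which merely stands in for any learning algorithm (such as decision tree learning).

```python
def accuracy(h, test_set):
    """Fraction of test examples (x, y) on which hypothesis h predicts y."""
    return sum(h(x) == y for x, y in test_set) / len(test_set)


def learning_curve(learner, training_set, test_set, sizes):
    """Test-set accuracy after training on the first m examples, for each m."""
    return [(m, accuracy(learner(training_set[:m]), test_set)) for m in sizes]


def nearest_neighbour_learner(examples):
    """Hypothetical stand-in learner: predict the label of the closest x."""
    def h(x):
        return min(examples, key=lambda ex: abs(ex[0] - x))[1]
    return h


def f(x):
    """Hypothetical target function, unknown to the learner."""
    return x >= 50


# A fixed shuffle of 0..99, so training and test examples cover the same range
# but never overlap.
xs = [(37 * i) % 100 for i in range(100)]
training_set = [(x, f(x)) for x in xs[:60]]
test_set = [(x, f(x)) for x in xs[60:]]

for m, acc in learning_curve(nearest_neighbour_learner, training_set,
                             test_set, [1, 5, 10, 20, 40, 60]):
    print(f"{m:3d} training examples: {acc:.0%} correct on test set")
```

Keeping the test examples disjoint from the training examples matters: accuracy measured on examples the learner has already seen says little about whether h ≈ f.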
