Decision Making Under Uncertainty
Uncertainty
VARIOUS STEPS
Utility-Based Agent
[Diagram: the agent perceives the environment through sensors and acts on it through actuators.]
Non-deterministic vs. Probabilistic Uncertainty
Non-deterministic model: an action's outcome is some state in the set {a, b, c}; choose the decision that is best for the worst case (~ adversarial search).
Probabilistic model: an action's outcome is a state drawn from {a(pa), b(pb), c(pc)} with known probabilities; choose the decision that maximizes the expected utility value.
Expected Utility
Random variable X with n values x1,…,xn
and distribution (p1,…,pn)
E.g.: X is the state reached after doing
an action A under uncertainty
Function U of X
E.g., U is the utility of a state
The expected utility of A is
EU[A] = Σi=1,…,n P(xi | A) U(xi)
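As a quick illustration, here is a minimal Python sketch (not from the slides) of this formula; the outcome list uses the numbers of the one-action example that follows.

```python
# Expected utility of an action: EU[A] = sum_i P(xi | A) * U(xi)
def expected_utility(outcomes):
    """outcomes: list of (probability, utility) pairs for the states
    the action can lead to."""
    return sum(p * u for p, u in outcomes)

# Numbers from the one-state/one-action example below:
print(expected_utility([(0.2, 100), (0.7, 50), (0.1, 70)]))  # 62.0
```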
One State/One Action Example
From s0, a single action A1 leads to s1 with probability 0.2 (U = 100), to s2 with probability 0.7 (U = 50), and to s3 with probability 0.1 (U = 70), so EU(A1) = 0.2·100 + 0.7·50 + 0.1·70 = 62.
One State/Two Actions Example
From s0, action A1 leads to s1 (0.2, U = 100), s2 (0.7, U = 50), or s3 (0.1, U = 70); action A2 leads to s2 (0.2, U = 50) or s4 (0.8, U = 80).
• EU(A1) = 0.2·100 + 0.7·50 + 0.1·70 = 62
• EU(A2) = 0.2·50 + 0.8·80 = 74
• EU(s0) = max{EU(A1), EU(A2)} = 74
Introducing Action Costs
Same example, but each action now has a cost, shown as -5 on A1 and -25 on A2.
• EU(A1) = 62 – 5 = 57
• EU(A2) = 74 – 25 = 49
• EU(s0) = max{EU(A1), EU(A2)} = 57
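A small Python sketch of this action-selection step, using the transition probabilities and utilities of the example above (the A2 outcomes are as reconstructed there):

```python
# Pick the action with maximum expected utility, net of its cost.
def best_action(actions):
    """actions: dict mapping name -> (cost, [(probability, utility), ...])."""
    def net_eu(cost, outcomes):
        return sum(p * u for p, u in outcomes) - cost
    return max(actions, key=lambda name: net_eu(*actions[name]))

actions = {
    "A1": (5,  [(0.2, 100), (0.7, 50), (0.1, 70)]),  # EU = 62 - 5  = 57
    "A2": (25, [(0.2, 50),  (0.8, 80)]),             # EU = 74 - 25 = 49
}
print(best_action(actions))  # A1
```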
MEU Principle
A rational agent should choose the action that maximizes its expected utility.
This is the basis of the field of decision theory.
MEU is a normative criterion for the rational choice of action.
Not quite…
Must have a complete model of:
Actions
Utilities
States
Even with a complete model, the computation is generally intractable.
In fact, a truly rational agent also takes the utility of reasoning itself into account (bounded rationality).
Nevertheless, great progress has been made in this area recently, and we can now solve much more complex decision-theoretic problems than ever before.
We’ll look at
Decision Theoretic Planning
Simple decision making (ch. 16)
Sequential decision making (ch. 17)
Decision Networks
Extend BNs to handle actions and
utilities
Also called Influence diagrams
Make use of BN inference
Can do Value of Information
calculations
Decision Networks cont.
Chance nodes: random variables, as in
BNs
Decision nodes: actions that decision
maker can take
Utility/value nodes: the utility of the
outcome state.
R&N example
Umbrella Network
[Decision network: decision node Take Umbrella (take/don't take); chance nodes rain, with P(rain) = 0.4, and umbrella, with P(umb | take) = 1.0 and P(~umb | ~take) = 1.0; utility node happiness, which depends on umbrella and rain.]
Umbrella Network (cont.)
Utilities: U(~umb, ~rain) = 100, U(~umb, rain) = -100, U(umb, ~rain) = 0, U(umb, rain) = -25.
#1: Fill in P(umb, rain | take) for the four combinations of umb and rain.
#2: Compute EU(take).
Umbrella Network (cont.)
Same network and utilities as above.
#1: Fill in P(umb, rain | ~take) for the four combinations of umb and rain.
#2: Compute EU(~take).
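A short Python sketch of exercises #1 and #2 above. Because P(umb | take) = 1.0 and P(~umb | ~take) = 1.0, the umbrella variable is fully determined by the decision, so the joint P(umb, rain | action) reduces to P(rain):

```python
# EU(take) and EU(~take) in the umbrella network.
P_RAIN = 0.4
U = {  # utility indexed by (umb, rain)
    (0, 0): 100, (0, 1): -100,
    (1, 0): 0,   (1, 1): -25,
}

def eu(take):
    umb = 1 if take else 0  # umbrella is determined by the decision
    return sum((P_RAIN if rain else 1 - P_RAIN) * U[(umb, rain)]
               for rain in (0, 1))

print(eu(take=True))   # 0.6*0 + 0.4*(-25)    = -10.0
print(eu(take=False))  # 0.6*100 + 0.4*(-100) = 20.0
```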
Value of Information (VOI)
Suppose the agent's current knowledge is E. The value of the current best action α is
EU(α | E) = maxA Σi U(Resulti(A)) P(Resulti(A) | E, Do(A))
[The umbrella network is extended with a chance node forecast, a child of rain; the rest of the network and the utilities are as before.]
R F P(F | R)
0 0 0.8
0 1 0.2
1 0 0.3
1 1 0.7
VOI
VOI(forecast) = P(rainy) · EU(αrainy | rainy) + P(~rainy) · EU(α~rainy | ~rainy) – EU(α)
where αF is the best action given forecast F, and α is the best action with no forecast.
P(F = rainy) = 0.4
Umbrella Network (cont.)
Posterior probability of rain given the forecast:
F R P(R | F)
0 0 0.8
0 1 0.2
1 0 0.3
1 1 0.7
[Network and utilities as before.]
Fill in the four joint distributions P(umb, rain | take, rainy), P(umb, rain | take, ~rainy), P(umb, rain | ~take, rainy), and P(umb, rain | ~take, ~rainy), then use them to compute the expected utilities needed for VOI(forecast).
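A Python sketch of the complete VOI calculation, using only the numbers above. The intermediate expected utilities are not given on the slides; they follow from the stated probabilities and utilities:

```python
# VOI(forecast) for the umbrella network.
P_FORECAST_RAINY = 0.4
P_RAIN_GIVEN_F = {1: 0.7, 0: 0.2}   # P(rain | F=rainy), P(rain | F=~rainy)
U = {(0, 0): 100, (0, 1): -100, (1, 0): 0, (1, 1): -25}

def eu(take, p_rain):
    umb = 1 if take else 0
    return (1 - p_rain) * U[(umb, 0)] + p_rain * U[(umb, 1)]

def best_eu(p_rain):
    return max(eu(True, p_rain), eu(False, p_rain))

eu_no_info = best_eu(0.4)                  # 20.0  (best: don't take)
eu_rainy   = best_eu(P_RAIN_GIVEN_F[1])    # -17.5 (best: take)
eu_sunny   = best_eu(P_RAIN_GIVEN_F[0])    # 60.0  (best: don't take)

voi = (P_FORECAST_RAINY * eu_rainy
       + (1 - P_FORECAST_RAINY) * eu_sunny
       - eu_no_info)
print(voi)  # 9.0
```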
Sequence of Actions
[4×3 grid world; the robot is at [3,2].]
• Planned sequence of actions: (U, R)
• U is executed
Histories
[Because actions have uncertain outcomes, executing (U, R) from [3,2] can produce several histories; after both steps the robot may be in [3,1], [3,2], [3,3], [4,1], [4,2], or [4,3].]
• Planned sequence of actions: (U, R)
• U has been executed
• R is executed
P([4,3] | (U,R).[3,2]) = P([4,3] | R.[3,3]) × P([3,3] | U.[3,2]) + P([4,3] | R.[4,2]) × P([4,2] | U.[3,2])
P([4,3] | R.[3,3]) = 0.8    P([3,3] | U.[3,2]) = 0.8
P([4,3] | R.[4,2]) = 0.1    P([4,2] | U.[3,2]) = 0.1
P([4,3] | (U,R).[3,2]) = 0.8 × 0.8 + 0.1 × 0.1 = 0.65
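The same calculation as a tiny Python sketch, marginalizing over the state reached after the first action (only the two intermediate states with a nonzero contribution are listed, as on the slide):

```python
# P([4,3] | (U,R).[3,2]) = sum over intermediate states s' of
#   P(s' | U.[3,2]) * P([4,3] | R.s')
P_after_U   = {(3, 3): 0.8, (4, 2): 0.1}   # P(s' | U.[3,2])
P_goal_by_R = {(3, 3): 0.8, (4, 2): 0.1}   # P([4,3] | R.s')

p = sum(P_after_U[s] * P_goal_by_R[s] for s in P_after_U)
print(round(p, 2))  # 0.65
```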
Utility Function
[4×3 grid world: the square [4,3] has utility +1 and the square [4,2] has utility -1. The histories produced by (U, R) end in one of [3,1], [3,2], [3,3], [4,1], [4,2], [4,3].]
Repeat:
  s ← sensed state
  If s is terminal then exit
  a ← π(s)
  Perform a
Optimal Policy
[4×3 grid world with +1 at [4,3] and -1 at [4,2].]
• A policy is a complete mapping from states to actions.
• The optimal policy π* is the one that always yields a history with maximal expected utility.
This problem is called a Markov Decision Problem (MDP).
Reward
Additive Utility
History H = (s0, s1, …, sn)
The utility of H is additive iff:
U(s0, s1, …, sn) = R(0) + U(s1, …, sn) = Σi R(i)
Robot navigation example:
R(n) = +1 if sn = [4,3]
R(n) = -1 if sn = [4,2]
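A tiny Python sketch of additive utility: the utility of a history is the sum of per-step rewards. Only the two terminal rewards are stated above; every other state is given reward 0 here purely for illustration.

```python
# Utility of a history as a sum of rewards: U(H) = sum_i R(i).
def reward(state):
    return {(4, 3): +1, (4, 2): -1}.get(state, 0)  # 0 elsewhere (assumption)

def history_utility(history):
    return sum(reward(s) for s in history)

print(history_utility([(3, 2), (3, 3), (4, 3)]))  # 1
```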
First-step analysis
U(i) = R(i) + maxa Σk P(k | a.i) U(k)
For t = 0, 1, 2, …, do:
  Ut+1(i) ← R(i) + maxa Σk P(k | a.i) Ut(k)
[4×3 grid world with +1 at [4,3] and -1 at [4,2].]
Value Iteration
Initialize the utility of each non-terminal state si to 0. (Note the importance of the terminal states.)
For t = 0, 1, 2, …, do:
  Ut+1(i) ← R(i) + maxa Σk P(k | a.i) Ut(k)
[Converged utilities for the 4×3 grid world (columns 1–4):
  row 3: 0.812  0.868  0.918  +1
  row 2: 0.762  (blocked square)  0.660  -1
  row 1: 0.705  0.655  0.611  0.388
Plot of Ut for several states, e.g. Ut([3,1]), against t = 0…30.]
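A runnable Python sketch of value iteration for this grid world. The transition model (0.8 in the intended direction, 0.1 to each side, consistent with the 0.8/0.1 probabilities used earlier), the blocked square at [2,2], and the step reward R(i) = -0.04 for non-terminal states are the textbook (R&N) assumptions that reproduce the utilities shown above; they are not spelled out on the slide itself.

```python
# Value iteration sketch for the 4x3 grid world (assumed parameters noted above).
GAMMA = 1.0
R_STEP = -0.04                        # reward of non-terminal states (assumption)
TERMINALS = {(4, 3): +1.0, (4, 2): -1.0}
OBSTACLE = (2, 2)                     # blocked square (assumption)
STATES = [(x, y) for x in range(1, 5) for y in range(1, 4) if (x, y) != OBSTACLE]
ACTIONS = {"U": (0, 1), "D": (0, -1), "L": (-1, 0), "R": (1, 0)}
SIDEWAYS = {"U": ("L", "R"), "D": ("L", "R"), "L": ("U", "D"), "R": ("U", "D")}

def move(s, a):
    """Cell reached by heading in direction a from s; bumping into a wall or
    the blocked square leaves the robot where it is."""
    nxt = (s[0] + ACTIONS[a][0], s[1] + ACTIONS[a][1])
    return nxt if nxt in STATES else s

def transitions(s, a):
    """(probability, next state) pairs: 0.8 intended, 0.1 each perpendicular."""
    left, right = SIDEWAYS[a]
    return [(0.8, move(s, a)), (0.1, move(s, left)), (0.1, move(s, right))]

def value_iteration(iterations=100):
    U = {s: 0.0 for s in STATES}
    U.update(TERMINALS)               # terminal utilities stay fixed at +1 / -1
    for _ in range(iterations):
        new_U = dict(U)
        for s in STATES:
            if s in TERMINALS:
                continue
            new_U[s] = R_STEP + GAMMA * max(
                sum(p * U[k] for p, k in transitions(s, a)) for a in ACTIONS)
        U = new_U
    return U

U = value_iteration()
print(round(U[(3, 3)], 3), round(U[(3, 1)], 3))  # approx. 0.918 0.611
```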
Policy Iteration
POMDP (Partially Observable Markov Decision Problem)
• A sensing operation returns a probability distribution over possible states, rather than a single observed state.