SlideShare a Scribd company logo
1 of 19
CMSC 671
Fall 2003
Class #26 – Wednesday, November 26
Russell & Norvig 16.1 – 16.5
Some material borrowed from Jean-Claude Latombe and
Daphne Koller by way of Lise Getoor
Topics
• Decision making under uncertainty
– Expected utility
– Utility theory and rationality
– Utility functions
– Multiattribute utility functions
– Preference structures
– Decision networks
– Value of information
Non-deterministic vs. Probabilistic
Uncertainty
?
b
a c
{a,b,c}
 decision that is
best for worst case
?
b
a c
{a(pa),b(pb),c(pc)}
 decision that maximizes
expected utility value
Non-deterministic model Probabilistic model
~ Adversarial search
Expected Utility
• Random variable X with n values x1,…,xn and distribution
(p1,…,pn)
– X is the outcome of performing action A (i.e., the state reached after A
is taken)
• Function U of X
– U is a mapping from states to numerical utilities (values)
• The expected utility of performing action A is
EU[A] = Si=1,…,n p(xi|A)U(xi)
Utility of each outcome
Probability of each outcome
s0
s3
s2
s1
A1
0.2 0.7 0.1
100 50 70
U(S0) = 100 x 0.2 + 50 x 0.7 + 70 x 0.1
= 20 + 35 + 7
= 62
One State/One Action Example
s0
s3
s2
s1
A1
0.2 0.7 0.1
100 50 70
A2
s4
0.2 0.8
80
• U1(S0) = 62
• U2(S0) = 74
• U(S0) = max{U1(S0),U2(S0)}
= 74
One State/Two Actions Example
MEU Principle
• Decision theory: A rational agent should choose the action
that maximizes the agent’s expected utility
• Maximizing expected utility (MEU) is a normative criterion
for rational choices of actions
• ….too bad it’s intractable, even if we could represent utility
functions and probabilistic outcomes perfectly…
Not quite…
• Must have complete model of:
– Actions
– Utilities
– States
• Even if you have a complete model, will be computationally
intractable
• In fact, a truly rational agent takes into account the utility of reasoning
as well: This is called bounded rationality
• Nevertheless, great progress has been made in this area recently, and we
are able to solve much more complex decision theoretic problems than
ever before
Comparing outcomes
• Which is better:
A = Being rich and sunbathing where it’s warm
B = Being rich and sunbathing where it’s cool
C = Being poor and sunbathing where it’s warm
D = Being poor and sunbathing where it’s cool
• Multiattribute utility theory
– A clearly dominates B: A > B. A > C. C > D. A > D. What about B vs. C?
– Simplest case: Additive value function (just add the individual attribute
utilities)
• Lottery: General model for assessing the relative preference between
outcomes
L =[p1, C1 ; p2, C2 ; … ; pn, Cn]
is a lottery with possible outcomes C1…Cn that occur with probabilities
p1…pn
• Which is better:
[.5, A ; .5, B] You go to Ocean City and it might be warm
[.6, C, .4, D] You go to Bermuda and it will probably be warm
Axioms of utility theory
(properties of utility functions)
• Orderability: Given two states, one or the other should be
better, or the agent can not care.
• Transitivity: If A is better than B and B is better than C,
then A is better than C.
• Continuity: If A is better than B and B is better than C, then
you can “interpolate” A and C with some probability p to
get an outcome that’s equally desirable to B:
–  p: [p, A ; 1-p, C] == B
Axioms of utility theory, cont.
• Substitutability: If you are indifferent between A and B,
then you can substitute A for B in any lottery and the results
will be the same.
• Monotonicity: If A is better than B, then a lottery that
differs only in assigning higher probability to A is better:
– p ≥q ↔ [p, A; 1-p, B] ≥ [q, A; 1-q, B]
• Decomposability: Compound lotteries can be reduced
using the laws of probability (“no fun in gambling” – you
don’t care whether you gamble once or twice, as long as the
results are the same)
Some notes on utility
• Money <> utility
– Empirically, money typically converts logarithmically to utility
• Attitudes towards risk vary:
• Ordinal utility function: Gives relative rankings, but not
numerical utilities
Risk-averse Risk-seeking
“Money can’t buy happiness”
Ascetic
Risk-neutral
Decision networks
• Extend Bayes nets to handle actions and utilities
– a.k.a. influence diagrams
• Make use of Bayes net inference
• Useful application: Value of Information
Decision network representation
• Chance nodes: random variables, as in Bayes nets
• Decision nodes: actions that decision maker can
take
• Utility/value nodes: the utility of the outcome state.
R&N example
Evaluating decision networks
• Set the evidence variables for the current state.
• For each possible value of the decision node (assume just one):
– Set the decision node to that value.
– Calculate the posterior probabilities for the parent nodes of the utility node,
using BN inference.
– Calculate the resulting utility for the action.
• Return the action with the highest utility.
Exercise: Umbrella network
Weather
Forecast
Umbrella
Happiness
take/don’t take
f w p(f|w)
sunny rain 0.3
rainy rain 0.7
sunny no rain 0.8
rainy no rain 0.2
P(rain) = 0.4
U(lug, rain) = -25
U(lug, ~rain) = 0
U(~lug, rain) = -100
U(~lug, ~rain) = 100
Lug umbrella
P(lug|take) = 1.0
P(~lug|~take)=1.0
Value of Perfect Information (VPI)
• How much is it worth to observe (with certainty) a random variable X?
• Suppose the agent’s current knowledge is E. The value of the current best
action  is:
EU(α | E) = maxA ∑i U(Resulti(A)) p(Resulti(A) | E, Do(A))
• The value of the new best action after observing the value of X is:
EU(α’ | E,X) = maxA ∑i U(Resulti(A)) p(Resulti(A) | E, X, Do(A))
• …But we don’t know the value of X yet, so we have to sum over its possible
values
• The value of perfect information for X is therefore:
VPI(X) = ( ∑k p(xk | E) EU(αxk | xk, E)) – EU (α | E)
Probability of
each value of X
Expected utility
of the best action
given that value of X
Expected utility
of the best action
if we don’t know X
(i.e., currently)
VPI exercise: Umbrella network
Weather
Forecast
Umbrella
Happiness
take/don’t take
f w p(f|w)
sunny rain 0.3
rainy rain 0.7
sunny no rain 0.8
rainy no rain 0.2
P(rain) = 0.4
U(lug, rain) = -25
U(lug, ~rain) = 0
U(~lug, rain) = -100
U(~lug, ~rain) = 100
Lug umbrella
P(lug|take) = 1.0
P(~lug|~take)=1.0
What’s the value of knowing the weather forecast before leaving home?

More Related Content

Similar to c26_dtheory.ppt

Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...
Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...
Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...
Numenta
 
streamingalgo88585858585858585pppppp.pptx
streamingalgo88585858585858585pppppp.pptxstreamingalgo88585858585858585pppppp.pptx
streamingalgo88585858585858585pppppp.pptx
GopiNathVelivela
 
Global Optimization with Descending Region Algorithm
Global Optimization with Descending Region AlgorithmGlobal Optimization with Descending Region Algorithm
Global Optimization with Descending Region Algorithm
Loc Nguyen
 

Similar to c26_dtheory.ppt (20)

Tudelft stramien 16_9_on_optimization
Tudelft stramien 16_9_on_optimizationTudelft stramien 16_9_on_optimization
Tudelft stramien 16_9_on_optimization
 
Adauctions
AdauctionsAdauctions
Adauctions
 
Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...
Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...
Brains@Bay Meetup: The Increasing Role of Sensorimotor Experience in Artifici...
 
Ml ppt at
Ml ppt atMl ppt at
Ml ppt at
 
Course 2 on risk management and credit risk.pdf
Course 2 on risk management and credit risk.pdfCourse 2 on risk management and credit risk.pdf
Course 2 on risk management and credit risk.pdf
 
streamingalgo88585858585858585pppppp.pptx
streamingalgo88585858585858585pppppp.pptxstreamingalgo88585858585858585pppppp.pptx
streamingalgo88585858585858585pppppp.pptx
 
Global Optimization with Descending Region Algorithm
Global Optimization with Descending Region AlgorithmGlobal Optimization with Descending Region Algorithm
Global Optimization with Descending Region Algorithm
 
Session 4 .pdf
Session 4 .pdfSession 4 .pdf
Session 4 .pdf
 
nber_slides.pdf
nber_slides.pdfnber_slides.pdf
nber_slides.pdf
 
Presentation socg
Presentation socgPresentation socg
Presentation socg
 
Uncertainty
UncertaintyUncertainty
Uncertainty
 
Lec12-Probability (1).ppt
Lec12-Probability (1).pptLec12-Probability (1).ppt
Lec12-Probability (1).ppt
 
Lec12-Probability.ppt
Lec12-Probability.pptLec12-Probability.ppt
Lec12-Probability.ppt
 
Lec12-Probability.ppt
Lec12-Probability.pptLec12-Probability.ppt
Lec12-Probability.ppt
 
Lec12-Probability.ppt
Lec12-Probability.pptLec12-Probability.ppt
Lec12-Probability.ppt
 
Ch13 slides
Ch13 slidesCh13 slides
Ch13 slides
 
Introduction to Supervised ML Concepts and Algorithms
Introduction to Supervised ML Concepts and AlgorithmsIntroduction to Supervised ML Concepts and Algorithms
Introduction to Supervised ML Concepts and Algorithms
 
Unit 4
Unit 4Unit 4
Unit 4
 
Unit 4
Unit 4Unit 4
Unit 4
 
Introduction to Big Data Science
Introduction to Big Data ScienceIntroduction to Big Data Science
Introduction to Big Data Science
 

More from Prabin Pandit (20)

Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
Periodic Report.pptx
Periodic Report.pptxPeriodic Report.pptx
Periodic Report.pptx
 
Discussion.pptx
Discussion.pptxDiscussion.pptx
Discussion.pptx
 
Presentation1.pptx
Presentation1.pptxPresentation1.pptx
Presentation1.pptx
 
Scientific name.pptx
Scientific name.pptxScientific name.pptx
Scientific name.pptx
 
Challenges.pptx
Challenges.pptxChallenges.pptx
Challenges.pptx
 
Unit-10.pptx
Unit-10.pptxUnit-10.pptx
Unit-10.pptx
 
Unit-8.pptx
Unit-8.pptxUnit-8.pptx
Unit-8.pptx
 
upload.pptx
upload.pptxupload.pptx
upload.pptx
 
Today presentation birat veneer.pptx
Today presentation birat veneer.pptxToday presentation birat veneer.pptx
Today presentation birat veneer.pptx
 
Lecture3-formation2022.pptx
Lecture3-formation2022.pptxLecture3-formation2022.pptx
Lecture3-formation2022.pptx
 
relation with 11,2022.pptx
 relation with 11,2022.pptx relation with 11,2022.pptx
relation with 11,2022.pptx
 
Lecture6-Plan.pptx
Lecture6-Plan.pptxLecture6-Plan.pptx
Lecture6-Plan.pptx
 
Itta ToR.pptx
Itta ToR.pptxItta ToR.pptx
Itta ToR.pptx
 
unit-6.pptx
unit-6.pptxunit-6.pptx
unit-6.pptx
 
5 to on.pptx
5 to on.pptx5 to on.pptx
5 to on.pptx
 
Unit-11.pptx
Unit-11.pptxUnit-11.pptx
Unit-11.pptx
 
Wood properties.pptx
Wood properties.pptxWood properties.pptx
Wood properties.pptx
 
Chapter 1.pptx
Chapter 1.pptxChapter 1.pptx
Chapter 1.pptx
 
Unit 5 Marketing of Forest Products.pptx
Unit 5 Marketing of Forest Products.pptxUnit 5 Marketing of Forest Products.pptx
Unit 5 Marketing of Forest Products.pptx
 

Recently uploaded

Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
ZurliaSoop
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
QucHHunhnh
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
heathfieldcps1
 

Recently uploaded (20)

Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
Explore beautiful and ugly buildings. Mathematics helps us create beautiful d...
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
Jual Obat Aborsi Hongkong ( Asli No.1 ) 085657271886 Obat Penggugur Kandungan...
 
Holdier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdfHoldier Curriculum Vitae (April 2024).pdf
Holdier Curriculum Vitae (April 2024).pdf
 
Unit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptxUnit-IV; Professional Sales Representative (PSR).pptx
Unit-IV; Professional Sales Representative (PSR).pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
1029-Danh muc Sach Giao Khoa khoi 6.pdf
1029-Danh muc Sach Giao Khoa khoi  6.pdf1029-Danh muc Sach Giao Khoa khoi  6.pdf
1029-Danh muc Sach Giao Khoa khoi 6.pdf
 
Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024Mehran University Newsletter Vol-X, Issue-I, 2024
Mehran University Newsletter Vol-X, Issue-I, 2024
 
How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17How to Give a Domain for a Field in Odoo 17
How to Give a Domain for a Field in Odoo 17
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
The basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptxThe basics of sentences session 3pptx.pptx
The basics of sentences session 3pptx.pptx
 
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...Kodo Millet  PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
Kodo Millet PPT made by Ghanshyam bairwa college of Agriculture kumher bhara...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17How to Create and Manage Wizard in Odoo 17
How to Create and Manage Wizard in Odoo 17
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Unit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptxUnit-V; Pricing (Pharma Marketing Management).pptx
Unit-V; Pricing (Pharma Marketing Management).pptx
 
Single or Multiple melodic lines structure
Single or Multiple melodic lines structureSingle or Multiple melodic lines structure
Single or Multiple melodic lines structure
 

c26_dtheory.ppt

  • 1. CMSC 671 Fall 2003 Class #26 – Wednesday, November 26 Russell & Norvig 16.1 – 16.5 Some material borrowed from Jean-Claude Latombe and Daphne Koller by way of Lise Getoor
  • 2. Topics • Decision making under uncertainty – Expected utility – Utility theory and rationality – Utility functions – Multiattribute utility functions – Preference structures – Decision networks – Value of information
  • 3. Non-deterministic vs. Probabilistic Uncertainty ? b a c {a,b,c}  decision that is best for worst case ? b a c {a(pa),b(pb),c(pc)}  decision that maximizes expected utility value Non-deterministic model Probabilistic model ~ Adversarial search
  • 4. Expected Utility • Random variable X with n values x1,…,xn and distribution (p1,…,pn) – X is the outcome of performing action A (i.e., the state reached after A is taken) • Function U of X – U is a mapping from states to numerical utilities (values) • The expected utility of performing action A is EU[A] = Si=1,…,n p(xi|A)U(xi) Utility of each outcome Probability of each outcome
  • 5. s0 s3 s2 s1 A1 0.2 0.7 0.1 100 50 70 U(S0) = 100 x 0.2 + 50 x 0.7 + 70 x 0.1 = 20 + 35 + 7 = 62 One State/One Action Example
  • 6. s0 s3 s2 s1 A1 0.2 0.7 0.1 100 50 70 A2 s4 0.2 0.8 80 • U1(S0) = 62 • U2(S0) = 74 • U(S0) = max{U1(S0),U2(S0)} = 74 One State/Two Actions Example
  • 7. MEU Principle • Decision theory: A rational agent should choose the action that maximizes the agent’s expected utility • Maximizing expected utility (MEU) is a normative criterion for rational choices of actions • ….too bad it’s intractable, even if we could represent utility functions and probabilistic outcomes perfectly…
  • 8. Not quite… • Must have complete model of: – Actions – Utilities – States • Even if you have a complete model, will be computationally intractable • In fact, a truly rational agent takes into account the utility of reasoning as well: This is called bounded rationality • Nevertheless, great progress has been made in this area recently, and we are able to solve much more complex decision theoretic problems than ever before
  • 9. Comparing outcomes • Which is better: A = Being rich and sunbathing where it’s warm B = Being rich and sunbathing where it’s cool C = Being poor and sunbathing where it’s warm D = Being poor and sunbathing where it’s cool • Multiattribute utility theory – A clearly dominates B: A > B. A > C. C > D. A > D. What about B vs. C? – Simplest case: Additive value function (just add the individual attribute utilities) • Lottery: General model for assessing the relative preference between outcomes L =[p1, C1 ; p2, C2 ; … ; pn, Cn] is a lottery with possible outcomes C1…Cn that occur with probabilities p1…pn • Which is better: [.5, A ; .5, B] You go to Ocean City and it might be warm [.6, C, .4, D] You go to Bermuda and it will probably be warm
  • 10. Axioms of utility theory (properties of utility functions) • Orderability: Given two states, one or the other should be better, or the agent can not care. • Transitivity: If A is better than B and B is better than C, then A is better than C. • Continuity: If A is better than B and B is better than C, then you can “interpolate” A and C with some probability p to get an outcome that’s equally desirable to B: –  p: [p, A ; 1-p, C] == B
  • 11. Axioms of utility theory, cont. • Substitutability: If you are indifferent between A and B, then you can substitute A for B in any lottery and the results will be the same. • Monotonicity: If A is better than B, then a lottery that differs only in assigning higher probability to A is better: – p ≥q ↔ [p, A; 1-p, B] ≥ [q, A; 1-q, B] • Decomposability: Compound lotteries can be reduced using the laws of probability (“no fun in gambling” – you don’t care whether you gamble once or twice, as long as the results are the same)
  • 12. Some notes on utility • Money <> utility – Empirically, money typically converts logarithmically to utility • Attitudes towards risk vary: • Ordinal utility function: Gives relative rankings, but not numerical utilities Risk-averse Risk-seeking “Money can’t buy happiness” Ascetic Risk-neutral
  • 13. Decision networks • Extend Bayes nets to handle actions and utilities – a.k.a. influence diagrams • Make use of Bayes net inference • Useful application: Value of Information
  • 14. Decision network representation • Chance nodes: random variables, as in Bayes nets • Decision nodes: actions that decision maker can take • Utility/value nodes: the utility of the outcome state.
  • 16. Evaluating decision networks • Set the evidence variables for the current state. • For each possible value of the decision node (assume just one): – Set the decision node to that value. – Calculate the posterior probabilities for the parent nodes of the utility node, using BN inference. – Calculate the resulting utility for the action. • Return the action with the highest utility.
  • 17. Exercise: Umbrella network Weather Forecast Umbrella Happiness take/don’t take f w p(f|w) sunny rain 0.3 rainy rain 0.7 sunny no rain 0.8 rainy no rain 0.2 P(rain) = 0.4 U(lug, rain) = -25 U(lug, ~rain) = 0 U(~lug, rain) = -100 U(~lug, ~rain) = 100 Lug umbrella P(lug|take) = 1.0 P(~lug|~take)=1.0
  • 18. Value of Perfect Information (VPI) • How much is it worth to observe (with certainty) a random variable X? • Suppose the agent’s current knowledge is E. The value of the current best action  is: EU(α | E) = maxA ∑i U(Resulti(A)) p(Resulti(A) | E, Do(A)) • The value of the new best action after observing the value of X is: EU(α’ | E,X) = maxA ∑i U(Resulti(A)) p(Resulti(A) | E, X, Do(A)) • …But we don’t know the value of X yet, so we have to sum over its possible values • The value of perfect information for X is therefore: VPI(X) = ( ∑k p(xk | E) EU(αxk | xk, E)) – EU (α | E) Probability of each value of X Expected utility of the best action given that value of X Expected utility of the best action if we don’t know X (i.e., currently)
  • 19. VPI exercise: Umbrella network Weather Forecast Umbrella Happiness take/don’t take f w p(f|w) sunny rain 0.3 rainy rain 0.7 sunny no rain 0.8 rainy no rain 0.2 P(rain) = 0.4 U(lug, rain) = -25 U(lug, ~rain) = 0 U(~lug, rain) = -100 U(~lug, ~rain) = 100 Lug umbrella P(lug|take) = 1.0 P(~lug|~take)=1.0 What’s the value of knowing the weather forecast before leaving home?