Lecture I-II: Motivation and Decision Theory - Markus M. MÄobius
Lecture I-II: Motivation and Decision Theory - Markus M. MÄobius
Markus M. Möbius
February 15, 2005
How should you play this game? A naive guess would be that other players
choose randomly a strategy. In that case the mean in the game would be
around 50 and you should choose 33. But you realize that other players make
the same calculation - so nobody should say a number higher than 33. That
means that you should not name a number greater than 22 - and so on. The
winning number was 13. That means that people did this iteration about 3
times. But in fact, the stated numbers were all over the place - ranging from
0 to 40. That means that different students had different estimates of what
their fellow students would do.
• In this game, the winner has to correctly guess how often his fellow
players iterate. Assuming infinite iterations would be consistent but
those who bid 0 typically lose badly. Guessing higher numbers can
mean two things: (a) the player does not understand strategic behavior
or (b) the player understands strategic behavior but has low confidence
1
in the ability of other players to understand that this is a strategic
game. Interestingly, most people knew at least one other person in the
class (hence there was at least some degree of what a game theorist
would call common knowledge of rationality).1
Experiment 2 I am going to auction off $20. Each of your can bid secretly
on the book and the highest bidder wins the auction. However, all of you have
to pay your bid regardless of whether you win or lose.
In this game there is no optimal single bid for all players. You can check
that for all cases where each player i bids some fixed bid bi at least one of
the players will regret her decision and try to reverse it - we say that there
is no pure strategy Nash equilibrium in this game. Consider for example the
case where all player bid 55 Dollars. Then some player should bid 55 and 5
cents. No equilibrium!
There is an equilibrium if we allow players to randomize. You can check
that with two players who pick random numbers between 0 and 60 with equal
probability no player would want to change her pick - all picks will give her
zero profit in expectation.
The more players there are, the more the bid distribution is skewed to-
wards 0 (check)! We will formally discuss mixed strategy Nash equilibria in
a few lectures time.
2
Interaction: What one individual player does directly affects at least
one other player in the group. Otherwise the game is simple a series of
independent decision problems.
Strategic: Individual players account for this interdependence.
Rational: While accounting for this interdependence each player chooses
her best action. This condition can be weakened and we can assume that
agents are boundedly rational. Behavioral economics analyzes decision prob-
lems in which agents behave boundedly rational. Evolutionary game theory
is game theory with boundedly rational agents.
1. Trade: Levels of imports, exports, prices depend not only on your own
tariffs but also on tariffs of other countries.
3. IO: Price depends not only on your output but also on the output of
your competitor (market structure ...).
3
A decision problem (A, ¹) consists of a finite set of outcomes A = {a1 , a2 , .., an }
and a preference relation ¹. The expression a ¹ b should be interpreted as
”b is at least as good as a”. We expect the preference relation to fulfill two
simple axioms:
Both axioms ensure that all choices can be ordered in a single chain without
gaps (axiom 1) and without cycles (axiom 2).
Although the preference relation is the basic primitive of any decision
problem (and generally observable) it is much easier to work with a consistent
utility function u : A → < because we only have to remember n real numbers
{u1 , u2 , .., un }.
Theorem 1 Assume the set of outcomes is finite. Then there exists a utility
function u which is consistent.
Proof: The proof is very simple. Simple collect all equivalent outcomes in
equivalence classes. There are finitely many of those equivalence classes
since there are only finitely many outcomes. Then we can order these
equivalence classes in a strictly increasing chain due to completeness
and transitivity.
Note that the utility function is not unique. In fact, any monotonic
transformation of a consistent utility function gives another utility function
which is also consistent.
We can now define what a rational decision maker is.
4
Remark 1 When there are infinitely many choices we want to make sure
that there is a continuous utility function. This requires one more axiom
which makes sure that preferences are continuous. For that purpose, one has
to define topology on the set of outcomes. We won’t deal with that since we
won’t gain much insight from it.
5
Definition 6 The compound lottery L̃ is expressed as L̃ = {(L1 , q1 ) , (L2 , 1 − q1 )}.
With probability q1 the simple lottery L1 is chosen and with probability 1 − q1
the simple lottery L2 is chosen.
Axiom 5 Archimedian. For any outcomes a < b < c there is some lottery
L = {(a, α) , (c, 1 − α)} such that the agent is indifferent between L and b.
6
Proof: First of all we find the best and worst outcome b and w (possible be-
cause there are only finitely many outcomes). Because of the Archime-
dian axiom we can find a number αa for each outcome a such that L =
{(b, α) , (w, 1 − α)}. We can define a utility function over each outcome
a such that u (a) = α. Using the monotonicity axiom it can be shown
that this number is unique. For each lottery we can now calculate its ex-
pected utility. It remains to be shown that this expected utility function
is consistent with the original preferences. So take two lotteries L1 and
L2 such that L1 ¹ L2 . We can write L1 = {(a1 , p1 ) , (a2 , p2 ) , .. (an , pn )}
and L2 = {(a1 , q1 ) , (a2 , q2 ) , .. (an , qn )}. Now replace each outcome
ai by the above Pn lotteries. The compound Pn lottery can be rewritten
as L1 = {(b,P i=1 pi u (ai )) , (w, 1
P−n i=1 pi u (ai ))}. Similarly, we get
L2 = {(b, Pni=1 qi u (ai )) , (w,
Pn 1 − i=1 qi u (ai ))}. From this we can de-
n
duce that i=1 pi u (ai ) ≤ i=1 qi u (ai ), e.g. EU (L1 ) ≤ EU (L2 ). Oth-
erwise, by the monotonicity axiom we could deduce that L1 Â L2 which
is a contradiction. QED
From now on all payoffs in our course will be assumed to represent vNM
utility values. The expected payoff will be the expected utility.
4.1 Puzzles
EUT forms the basis of modern micro economics. Despite its success there
are important behavioral inconsistencies related to it. Some of those we are
going to discuss briefly before we turn our attention to proper games.
7
• Lottery B2 is a 72 percent chance to win 4000.
8
Table 1: McNeil, Pauker and Tversky (1988)
Survival Mortality Both
Radiation Surgery Radiation Surgery Radiation Surgery
immediate 100 90 0 10
1 year 77 68 23 32
5 year 22 34 78 66
US 16 84 50 50 44 56
Israeli 20 80 45 56 34 66
Pair 1: 600 people are struck with a disease that could kill. Vaccine 1
will save 400 lives for sure while the second one will either save no one (1/3)
or will save everyone (with probability 2/3).
Pair 2: 600 people are struck with a disease that could kill. Vaccine 1
will kill 200 people for sure while the second one implies a 2/3 chance that
no one will die and a 1/3 chance that everyone will die.
Note that both situations are identical because save is equal to not kill.
However, people tend to be risk averse in saving lives and risk loving if it is
phrased in terms of losses (kills).
Preference reversals have real effects and do not just appear in cute ex-
amples. McNeil, Pauker and Tversky (1988) asked American doctors and
Israeli medical students about how they would choose between two cancer
treatments (surgery and radiation) - they presented one group with survival
statistics, a second group with mortality statistics and a third group with
both. Table 1 sums up their choices.