The Probit Model: Alexander Spermann University of Freiburg University of Freiburg Sose 2009
The Probit Model: Alexander Spermann University of Freiburg University of Freiburg Sose 2009
Alexander Spermann
University of Freiburg
SoSe 2009
Course outline
2
Notation and statistical foundations
1. y i = β 1 + β 2 x i 2 + K + β k x ik + ε i G ujarati
y t = β 0 + β 1 x t 1 + K + β k x tk + u t W ooldridge
2. M atrix
Y = Xβ +ε
Y = x 'β + ε
^ ^
Y = X β+u
y i = x i' β + ε i
x i' β x i' β
β0 β1
(1 x1 x2 ) β1 (1 x2 x3 ) β 2
β β
2 3
3
Notation and statistical foundations – Vectors
Column vector: a1
a
a = 2
nx1 M
a
n
Transposed (row vector): a ' = [ a1 a2 K an ]
1 xn
Inner product: b1
b
a 'b = [ a1 a2 K an ] 2 = ∑ ai bi
M
b
n
4
Notation and statistical foundations – density function
1 ( x−µ )
2
−
1
φ ( x) =
2 σ 2
e
σ 2π
Example: Standard normal distribution:
N(0,1), µ = 0, σ = 1
x2
1 −
φ ( x) = e 2
2π
µ =0
5
Notation and statistical foundations – distibutions
(1 + e )x 2 3
Exponential distribution:
1 e −θ , x ≥ 0
x
f ( x ) = 0θ, x ≤ 0 , θ > 0, µ = θ , σ 2 = θ 2
Poisson distribution:
e −θ θ x
f ( x) = , µ = θ ,σ 2 = θ
x!
6
Notation and statistical foundations – CDF
7
Notation and statistical foundations – logarithms
Rule I: y = xz
log y = log x + log z
log y = n log x
Rule III: y = a xb
log y = log a + b log x
8
Introduction to the Probit model – binary variables
1
y=
0
OLS
(linear)
1 x x x x
xxxx x x x
0
9
Introduction to the Probit model – latent variables
10
Introduction to the Probit model – latent variables
11
Introduction to the Probit model – CDF
Example:
1 CDF = Φ( z )
0,8
0,5
0,2
z = xi' β 1-0,2=0,8
−z 0 z
12
Introduction to the Probit model – CDF Probit vs. Logit
z = xi' β z = xi' β
13
Introduction to the Probit model – PDF Probit vs. Logit
14
Introduction to the Probit model – The ML principle
Joint density:
∏ F ( x β ) [1 − F ( x β ) ]
(1 − y i )
f ( y | x, β ) = '
i
yi '
i
i
= ∏i
Fi y i (1 − Fi ) 1− y i
ln L = ∑ yi ln Fi + (1 − yi ) ln(1 − Fi )
i
15
Introduction to the Probit model – The ML principle
∂ ln L yi f i (1 − yi )(− f i )
= ∑ + xi
∂β i Fi 1 − Fi
yi − Fi
= ∑ f i xi
i Fi (1 − Fi )
=0
16
Introduction to the Probit model – Example
f ( x1 , x2 , K, x10 | θ ) = ∏ f ( xi ,θ ) = 10
=
∏x!
i =1 207,36
i
i =1
Which value of θ makes occurance of the observed sample
most probable?
17
Introduction to the Probit model – Example
L(θ | x)
L(θ | x)
ln L(θ | x) d 2 ln L(θ ) 20
=− 2
dθ 2
θ
⇒ Maximum
2 θ
18
Application
Source: Spector, L. and M. Mazzeo, Probit Analysis and Economic Education. In:
Journal of Economic Education, 11, 1980, pp.37-44
19
Application – Variables
Grade
Dependent variable. Indicates whether a student improved
his grades after the new teaching method PSI had been
introduced (0 = no, 1 = yes).
PSI
Indicates if a student attended courses that used the new
method (0 = no, 1 = yes).
GPA
Average grade of the student
TUCE
Score of an intermediate test which shows previous
knowledge of a topic. 20
Application – Estimation
21
Application – Discussion
22
Coefficients and marginal effects
23
Coefficients and marginal effects
24
Coefficients and marginal effects – Computation
25
Coefficients and marginal effects – Computation
27
Coefficients and marginal effects – Interpretation
28
Coefficients and marginal effects – Interpretation
Variable Estimated marginal effect Interpretation
GPA 0.364 If the average grade of a
student goes up by an
infinitesimal amount,
the probability for the
variable grade taking
the value one rises by
36.4 %.
TUCE 0.011 Analog to GPA,with an
increase of 1.1%.
PSI 0.374 If the dummy variable
changes from zero to
one, the probability for
the variable grade
taking the value one
rises by 37.4 ppts.
29
Coefficients and marginal effects – Significance
Significance of a coefficient: test of the hypothesis whether
a parameter is significantly different from zero.
The decision problem is similar to the t-test, wheras the
probit test statistic follows a standard normal distribution.
The z-value is equal to the estimated parameter divided by
its standard error.
Stata computes a p-value which shows directly the
significance of a parameter:
31
Coefficients and marginal effects
32
Coefficients and marginal effects
Sample mean
33
Goodness of fit
34
Goodness of fit
35
Goodness of fit
36
Hypothesis tests
37
Hypothesis tests
38