Examples of Continuous Probability Distributions:: The Normal and Standard Normal
Examples of Continuous Probability Distributions:: The Normal and Standard Normal
probability distributions:
The normal and standard normal
Changing increases or
decreases the spread.
f ( x)
Note constants:
=3.14159
e=2.71828
1 x 2
(
)
2
1
2
1 x 2
(
)
e 2 dx 1
E(X)= = x
Var(X)=2
1
2
1 x 2
(
)
2
e
dx
x2
1 x 2
(
)
2
Standard Deviation(X)=
dx) 2
68-95-99.7 Rule
68% of
the data
68-95-99.7 Rule
in Math terms
1
2
1
2
1
2
1 x 2
(
)
e 2 dx
.68
1 x 2
(
)
e 2 dx
.95
1 x 2
(
)
e 2 dx
.997
112.3
127.8
143.3
25
20
P
e
r
c
e
n
t
15
10
0
80
90
100
110
120
POUNDS
130
140
150
160
96.8
127.8
158.8
25
20
P
e
r
c
e
n
t
15
10
0
80
90
100
110
120
POUNDS
130
140
150
160
81.3
127.8
174.3
25
20
P
e
r
c
e
n
t
15
10
0
80
90
100
110
120
POUNDS
130
140
150
160
Example
Example
BUT
What if you wanted to know the math SAT
score corresponding to the 90th percentile
(=90% of students are lower)?
P(XQ) = .90
1 x 500 2
)
50
dx
(
1
e 2
(50) 2
200
.90
1
p( Z )
e
(1) 2
1 Z 0 2
(
)
2 1
e
2
1
( Z )2
2
100
0
200
2.0
X
Z
( = 100, = 50)
( = 0, = 1)
Example
P( X 575)
(50)
200
1.5
1 x 500 2
(
)
e 2 50 dx
1
2
1
Z2
e 2 dz
Yikes!
But to look up Z= 1.5 in standard normal chart (or enter
into SAS) no problem! = .9332
Practice problem
a.
b.
Answer
a.
Answer
b. What is the chance of obtaining a birth
weight of 120 or lighter?
120 109
Z
.85
13
From the chart or SAS Z of .85 corresponds to a left tail area of:
P(Z.85) = .8023= 80.23%
Z=1.51
Z=1.51
Area is 93.45%
And if you wanted to go the other direction (i.e., from the area to the Z
score (called the so-called Probit function
data _null_;
The probit(p) function gives you the
theZValue=probit(.93);
Z-value that corresponds to a left-tail
area of p (here .93) from a standard
put theZValue;
normal curve. The probit function is also
run;
known as the inverse standard normal
1.4757910282
function.
SD = 6.8
Range = 0 to 24
(= 3.5 )
SD = 1.8
Range = 2 to 9
(~ 4 )
SD = 2.5
Range = 0 to 12
(~ 5 )
SD = :55
Range = 5:30 to 9:00
(~4 )
0.3
13.9
0.3 13.9
3.6
7.2
1.8
9.0
10
5.9
8.4
10.9
7:59
7:04+/- 0:55 =
6:09 7:59
8:54
7:04+/- 2*0:55
=
5:14 8:54
9:49
7:04+/- 2*0:55
=
4:19 9:49
)
n 1
where is the probit function, which gives the Z value
that correspond s to a particular left - tail area
Right-Skewed!
(concave up)
Neither right-skewed
or left-skewed, but
big gap at 6.
Right-Skewed!
(concave up)
Closest to a
straight line
Results:
Coffee: Strong evidence of non-normality
(p<.01)
Writing love: Moderate evidence of nonnormality (p=.01)
Exercise: Weak to no evidence of nonnormality (p>.10)
Wakeup time: No evidence of non-normality
(p>.25)
6 7
Normal approximation to
binomial
.27
6 7
2 (4.8) 2.8
Z
2
1.39
1.39
P(Z<2)=.022
A little off, but in the right ballpark we could also use the value
to the left of 1.5 (as we really wanted to know less than but not
including 2; called the continuity correction)
2.37
1.39
1.39
P(Z-2.37) =.0069
Practice problem
1. You are performing a cohort study. If the probability
of developing disease in the exposed group is .25 for
the study duration, then if you sample (randomly)
500 exposed people, Whats the probability that at
most 120 people develop the disease?
Answer
By hand (yikes!):
P(X120) = P(X=0) + P(X=1) + P(X=2) + P(X=3) + P(X=4)+.+ P(X=120)=
500
120
380
(.25) (.75)
120
500
2
498
(.25) (.75)
2
500
1
499
(.25) (.75)
1
OR Use SAS:
data _null_;
Cohort=cdf('binomial', 120, .25, 500);
put Cohort;
run;
0.323504227
120 125
.52
9.68
P(Z<-.52)= .3015
500
0
500
(.25) (.75)
0
Proportions
x np
x np(1 p)
2
Differs by
a factor of
n.
x np(1 p)
For proportion:
p p
np(1 p ) p (1 p )
p
2
n
n
p (1 p )
p
n
2
Differs
by a
factor
of n.