5.1 Inferential Statistics-Estimation
5.1 Inferential Statistics-Estimation
−
¢ The probability distribu7ons of sta7s7cs such as are called
X
sampling distribu7ons since they would only be observed if we
took many samples from the popula7on
µ
School of Public Health
Standard devia7on vs. standard error
– Interval es7ma7on
• using a sample sta7s7c to es7mate a popula7on parameter by making
allowance for sample varia7on (error)
Sample mean, X µ
Sample variance, S2 σ2
Sample propor7on, p P or π
Sample Odds Ra7o, OŔ OR
Sample Rela7ve Risk, RŔ RR
Sample correla7on coefficient, r ρ
2. Prac&cal interpreta&on:
• When sampling is from a normally distributed
popula7on with known standard devia7on, we
are 100% (1-α) [e.g., 95%] confident that the
single computed interval contains the unknown
popula7on parameter.
1) When the σ is known and the sample size is large or small, the C.I. has
the form:
2) When σ is unknown, and the sample size is small, the C.I. has the form:
Patients assigned randomly to receive a RBS level of study subjects before and
vaccine or placebo. Is the rate of the
disease the same in both groups, or did after breakfast.
the vaccine prevent disease?
SchoolMay
of 12,
Public
2016 Health 45
Remark…..
• You can construct a 100(1-α)% confidence interval for a paired experiment using
sd
d ± tα / 2
n
• Once you have designed the experiment by pairing, you MUST analyze it as a paired
experiment. If the experiment is not designed as a paired experiment in advance, do not use
this procedure.
• The interpretation of the CI of the mean difference of paired measurements depends on these
assumptions:
a. Your pair of subjects are randomly selected from the population of pairs or at least are
representative of the populations.
b. The overall population of pairs, the difference is distributed in a Gaussian manner.
c. The two measurements are before/after measurements on one subject or are measurements on
two subjects matched before the data were collected.
d. All subjects come from the same population, and each subject (if before/after) or each pair of
matched subjects has been selected independently of the others.
14.30 204.49
α
69.90 56.40 Hence,α = 1% = 0.01 ⇒ = 0.005
13.50 182.25 2
74.50 61.50 ⇒ tcritical = t α (n − 1) = t0.005 (10 − 1) = 3.250
13.00 169
2
67.80 57.20
10.60 112.36 ⇒∈= t α (n − 1)sd = (3.250)(0.663 ) = 2.15
2
70.30 58.10
12.20 148.84
67.00 56.20
⎪d − ∈= 11.32 - 2.15 = 9.17
⎧
⇒⎨
⎩d + ∈= 11.32 + 2.15 = 13.47
10.80 116.64 ⎪
68.50 58.40
10.10 102.01
Therefore, the 99% confidence interval
62.40 55.50
6.90 47.61 for µ1-µ2 is (9.17, 13.47)
Sum Σdi=113.2 Σdi2=12824.24
SchoolMay
of 12,
Public
2016 Health 48
Confidence Interval for a Population proportion (P)
(0.67,0.92)
This means that if the true proportion of surviving infants was
any less than 67%, there is less than 2.5% chance of observing
such a large proportion just by chance. It also means that if the
true proportion were any greater than 92%, the chance observing
such a small proportion just by chance is less than 2.5%.
Exercises: Do for the infants born at 22 weeks gestation
School of Public Health 51
CI for difference between two popula5on Propor5ons
ˆ (1 − P
P ˆ) ˆ (1 − P
P ˆ )
ˆ −P
(P ˆ )±Z 1 1
+ 2 2
1 2 α
1−
2 n1 n2