Estimation and Hypothesis Testing
The number of degrees of freedom given by the formula is always rounded down to
the nearest integer.
Estimate of the Standard Deviation of $\bar{x}_1 - \bar{x}_2$
$$s_{\bar{x}_1 - \bar{x}_2} = \sqrt{\frac{s_1^2}{n_1} + \frac{s_2^2}{n_2}}$$
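Interval Estimation of $\mu_1 - \mu_2$
The $(1 - \alpha)100\%$ confidence interval for $\mu_1 - \mu_2$ is given by
$$(\bar{x}_1 - \bar{x}_2) \pm t \, s_{\bar{x}_1 - \bar{x}_2}$$
where the value of $t$ is read from the t distribution table for the given confidence level and the degrees of freedom.
The test statistic $t$ for $\bar{x}_1 - \bar{x}_2$ is
$$t = \frac{(\bar{x}_1 - \bar{x}_2) - (\mu_1 - \mu_2)}{s_{\bar{x}_1 - \bar{x}_2}}$$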
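The quantities above can be sketched in a few lines of Python. This is a minimal illustration, not the course's own code: the sample figures are hypothetical, the function names are invented for the example, and the degrees of freedom formula is assumed to be the usual Welch-Satterthwaite approximation, rounded down. The t critical value is passed in by hand because it comes from a t table.

```python
import math

def welch_summary(s1, n1, s2, n2):
    """Return (standard error, degrees of freedom rounded down)."""
    v1 = s1 ** 2 / n1
    v2 = s2 ** 2 / n2
    se = math.sqrt(v1 + v2)
    # Assumed df formula: Welch-Satterthwaite approximation.
    df = (v1 + v2) ** 2 / (v1 ** 2 / (n1 - 1) + v2 ** 2 / (n2 - 1))
    return se, math.floor(df)

def welch_interval(xbar1, xbar2, se, t_crit):
    """Confidence interval (xbar1 - xbar2) +/- t * se."""
    diff = xbar1 - xbar2
    margin = t_crit * se
    return diff - margin, diff + margin

# Hypothetical samples: s1 = 3 with n1 = 9, s2 = 4 with n2 = 16.
se, df = welch_summary(s1=3.0, n1=9, s2=4.0, n2=16)

# t table value for a 95% interval with df = 20 (area 0.025 in each tail).
lower, upper = welch_interval(50.0, 47.0, se, t_crit=2.086)
print(f"se = {se:.4f}, df = {df}, interval = ({lower:.3f}, {upper:.3f})")
```

Keeping the critical value as an argument mirrors the table lookup done by hand and avoids depending on a library outside the standard one.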
Example
Assuming that the two populations are normally distributed with unequal and
unknown population standard deviations, construct a 98% confidence interval
for 𝜇1 − 𝜇2 for the following:
Solution
Exercise
A manufacturing company is interested in buying one of two machines. The company
tested the two machines for production purposes. The first machine was run for 15 days
and produced an average of 111 items per day with a standard deviation of 10 items.
The second machine was run for 18 days and produced an average of 118 items per day
with a standard deviation of 8 items. Assume that the production per day for each
machine is normally distributed and that the standard deviations of the daily
productions of the two populations are unequal.
(a) Make a 95% confidence interval for the difference between the two population
means.
(b) Using the 1% significance level, can you conclude that the mean number of items
produced per day by the first machine is lower than that of the second machine?
Inferences about the difference between
two population proportions for large and
independent samples
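Mean, Standard Deviation, and Sampling Distribution of $\hat{p}_1 - \hat{p}_2$
The sampling distribution of $\hat{p}_1 - \hat{p}_2$ is (approximately) normal with:
The mean of $\hat{p}_1 - \hat{p}_2$, denoted by $\mu_{\hat{p}_1 - \hat{p}_2}$, is given by
$$\mu_{\hat{p}_1 - \hat{p}_2} = p_1 - p_2$$
The standard deviation of $\hat{p}_1 - \hat{p}_2$, denoted by $\sigma_{\hat{p}_1 - \hat{p}_2}$, is given by
$$\sigma_{\hat{p}_1 - \hat{p}_2} = \sqrt{\frac{p_1 q_1}{n_1} + \frac{p_2 q_2}{n_2}} = \sqrt{\frac{p_1(1 - p_1)}{n_1} + \frac{p_2(1 - p_2)}{n_2}}$$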
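A small simulation can make the mean and standard deviation formulas more concrete. The sketch below is only an illustration under assumed values ($p_1 = 0.5$, $p_2 = 0.4$, $n_1 = n_2 = 100$, all hypothetical); the function names are invented for the example. It draws repeated pairs of samples and compares the simulated mean and spread of $\hat{p}_1 - \hat{p}_2$ with the theoretical values.

```python
import math
import random
import statistics

def sd_of_difference(p1, n1, p2, n2):
    """Theoretical standard deviation of p_hat1 - p_hat2."""
    return math.sqrt(p1 * (1 - p1) / n1 + p2 * (1 - p2) / n2)

def simulate_differences(p1, n1, p2, n2, reps, seed=0):
    """Draw `reps` pairs of independent samples; return p_hat1 - p_hat2 for each."""
    rng = random.Random(seed)
    diffs = []
    for _ in range(reps):
        x1 = sum(rng.random() < p1 for _ in range(n1))  # successes in sample 1
        x2 = sum(rng.random() < p2 for _ in range(n2))  # successes in sample 2
        diffs.append(x1 / n1 - x2 / n2)
    return diffs

# Hypothetical population proportions and sample sizes.
p1, n1, p2, n2 = 0.5, 100, 0.4, 100

theory_mean = p1 - p2
theory_sd = sd_of_difference(p1, n1, p2, n2)

diffs = simulate_differences(p1, n1, p2, n2, reps=2000)
sim_mean = statistics.fmean(diffs)
sim_sd = statistics.pstdev(diffs)

print(f"theory: mean = {theory_mean:.4f}, sd = {theory_sd:.4f}")
print(f"simulated: mean = {sim_mean:.4f}, sd = {sim_sd:.4f}")
```

With 2000 repetitions the simulated values should land close to the theoretical ones, though they will not match exactly.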
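Interval Estimation of $p_1 - p_2$
The $(1 - \alpha)100\%$ confidence interval for $p_1 - p_2$ is given by
Known $\sigma_{\hat{p}_1 - \hat{p}_2}$: $(\hat{p}_1 - \hat{p}_2) \pm z_{\alpha/2} \, \sigma_{\hat{p}_1 - \hat{p}_2}$
Unknown $\sigma_{\hat{p}_1 - \hat{p}_2}$: $(\hat{p}_1 - \hat{p}_2) \pm z_{\alpha/2} \, s_{\hat{p}_1 - \hat{p}_2}$, where $s_{\hat{p}_1 - \hat{p}_2} = \sqrt{\frac{\hat{p}_1 \hat{q}_1}{n_1} + \frac{\hat{p}_2 \hat{q}_2}{n_2}}$
The sampling distribution of $\hat{p}_1 - \hat{p}_2$ is approximately normal provided that
both samples are large.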
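Hypothesis Test about $p_1 - p_2$
The test statistic $z$ for $\hat{p}_1 - \hat{p}_2$ is given by:
$$z = \frac{(\hat{p}_1 - \hat{p}_2) - (p_1 - p_2)}{s_{\hat{p}_1 - \hat{p}_2}}$$
where, under $H_0: p_1 = p_2$, the standard deviation is estimated with the pooled sample proportion $\bar{p}$:
$$s_{\hat{p}_1 - \hat{p}_2} = \sqrt{\bar{p}\,\bar{q}\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}, \qquad \bar{p} = \frac{\hat{p}_1 n_1 + \hat{p}_2 n_2}{n_1 + n_2}, \qquad \bar{q} = 1 - \bar{p}$$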
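Hypotheses and rejection regions:
Right-tailed test: $H_0: p_1 = p_2$, $H_1: p_1 > p_2$; reject $H_0$ if $z > z_{\alpha}$
Left-tailed test: $H_0: p_1 = p_2$, $H_1: p_1 < p_2$; reject $H_0$ if $z < -z_{\alpha}$
Two-tailed test: $H_0: p_1 = p_2$, $H_1: p_1 \neq p_2$; reject $H_0$ if $z < -z_{\alpha/2}$ or $z > z_{\alpha/2}$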
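The test procedure can be sketched in Python as follows. This is an illustrative sketch under stated assumptions: the sample proportions and sizes are hypothetical, the function names and the labels "greater", "less", and "two-sided" are invented for the example, and the critical values come from the standard library's `statistics.NormalDist` rather than a printed normal table.

```python
import math
from statistics import NormalDist

def two_proportion_z(p_hat1, n1, p_hat2, n2):
    """z statistic for H0: p1 = p2, using the pooled sample proportion."""
    p_bar = (p_hat1 * n1 + p_hat2 * n2) / (n1 + n2)
    q_bar = 1 - p_bar
    se = math.sqrt(p_bar * q_bar * (1 / n1 + 1 / n2))
    # Under H0 the hypothesized difference p1 - p2 is 0.
    return (p_hat1 - p_hat2) / se

def reject_h0(z, alpha, alternative):
    """Apply the rejection region for the chosen alternative hypothesis."""
    std_normal = NormalDist()
    if alternative == "greater":      # H1: p1 > p2
        return z > std_normal.inv_cdf(1 - alpha)
    if alternative == "less":         # H1: p1 < p2
        return z < -std_normal.inv_cdf(1 - alpha)
    if alternative == "two-sided":    # H1: p1 != p2
        return abs(z) > std_normal.inv_cdf(1 - alpha / 2)
    raise ValueError(f"unknown alternative: {alternative}")

# Hypothetical sample results.
z = two_proportion_z(p_hat1=0.60, n1=200, p_hat2=0.50, n2=200)

print(f"z = {z:.4f}")
print("reject at 5%, right-tailed:", reject_h0(z, 0.05, "greater"))
print("reject at 1%, two-tailed:", reject_h0(z, 0.01, "two-sided"))
```

Separating the statistic from the decision rule keeps the same $z$ value reusable across the three alternatives.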
Example
Construct a 97% confidence interval for 𝑝1 − 𝑝2 for the following:
$n_1 = 600$, $\hat{p}_1 = 0.02$, $n_2 = 700$, $\hat{p}_2 = 0.0186$
The point estimate of $p_1 - p_2$ is $\hat{p}_1 - \hat{p}_2 = \frac{1}{700} \approx 0.0014$
Solution(b)
(b) Construct a 94% confidence interval for the difference in the proportions of
all luggage lost by airline companies P and Q.
Solution:
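$$s_{\hat{p}_1 - \hat{p}_2} = \sqrt{\frac{\hat{p}_1 \hat{q}_1}{n_1} + \frac{\hat{p}_2 \hat{q}_2}{n_2}} = \sqrt{\frac{(0.02)(0.98)}{600} + \frac{(0.0186)(0.9814)}{700}} \approx 0.00766$$
From the normal table with $\alpha/2 = 0.03$, the z-value is 1.88.
Thus, the 94% confidence interval for $p_1 - p_2$ is
$$(\hat{p}_1 - \hat{p}_2) \pm z_{\alpha/2} \, s_{\hat{p}_1 - \hat{p}_2} = 0.0014 \pm 1.88(0.00766) = (-0.0130, 0.0158)$$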
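As a cross-check on the arithmetic, the same interval can be computed in Python from the sample values given in the example. This is a sketch rather than part of the original solution; the variable names are chosen for the illustration, and the z-value is taken from `statistics.NormalDist` instead of the normal table. Carrying full precision through gives a standard error of about 0.00766 and endpoints near −0.0130 and 0.0158; hand calculations that round intermediate values can differ in the last decimal place.

```python
import math
from statistics import NormalDist

# Sample values from the example.
n1, p_hat1 = 600, 0.02
n2, p_hat2 = 700, 0.0186
confidence = 0.94

# Estimated standard deviation of p_hat1 - p_hat2 (unpooled, for interval estimation).
se = math.sqrt(p_hat1 * (1 - p_hat1) / n1 + p_hat2 * (1 - p_hat2) / n2)

# z value leaving alpha/2 = 0.03 in the upper tail.
z = NormalDist().inv_cdf(1 - (1 - confidence) / 2)

diff = p_hat1 - p_hat2
lower, upper = diff - z * se, diff + z * se

print(f"se = {se:.5f}, z = {z:.3f}")
print(f"94% interval: ({lower:.4f}, {upper:.4f})")
```

The interval contains 0, so at this confidence level the data do not rule out equal proportions.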
Solution(c)
$n_1 = 600$, $\hat{p}_1 = 0.02$, $n_2 = 700$, $\hat{p}_2 = 0.0186$
(c) Testing at the 3% significance level, can you conclude that the proportions of all luggage lost
by airline companies P and Q are different?
Solution:
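Step 3: Test statistic
$$z = \frac{(\hat{p}_1 - \hat{p}_2) - (p_1 - p_2)}{s_{\hat{p}_1 - \hat{p}_2}}, \qquad s_{\hat{p}_1 - \hat{p}_2} = \sqrt{\bar{p}\,\bar{q}\left(\frac{1}{n_1} + \frac{1}{n_2}\right)}, \qquad \bar{p} = \frac{\hat{p}_1 n_1 + \hat{p}_2 n_2}{n_1 + n_2}$$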