Hypothesis Testing
Hypothesis Testing
Hypothesis Testing
S tep 1 : S tate n u ll an d altern ate h yp oth eses
H 0 : 20.
•
The Alternate hypothesis
is that the statement is
not true, that is,
H1 : 20
• It is important to remember that no matter how
the problem is stated; the null hypothesis will
always contain the equal sign. The equality will
never appear in the alternative hypothesis.
• Step 2: Select a level of significance (or
Risk):
• LEVEL OF SIGNIFICANCE: The probability of
rejecting the null hypothesis when it is true.
• The level of significance is designated by , the
Greek letter alpha. A decision is made to use
the .05 level (often stated as 5% level), the .01
level, and the .10 level. Traditionally, the .05 level
is selected for consumer research projects;
the .01 level is for quality assurance and .10 for
political polling.
•
• There are two types of errors in rejecting or
accepting the null hypothesis:
• TYPE I ERROR: Rejecting the null hypothesis, H0,
when it is true.
The probability of committing another type of error,
called a type II error, is designated by the Greek
letter .
• TYPE II ERROR: Accepting the null hypothesis
when it is false.
• The following table summarizes the
decisions the researcher could make and
the possible consequences:
Researcher
Null Hypothesis
Accepts H0 Rejects H0
X
z
/ n
The z value is based on the sampling
distribution of X .
X
t
s/ n
With n – 1 degrees of freedom, where:
X is the mean of the sample
• is the hypothesized population mean
• s is the standard deviation of the
sample
• n is the number of observation in the
sample
EXAMPLE: Page#379
• An Insurance Company reports that the mean cost
to process a claim is $60. A comparison showed
this amount to be larger than most other insurance
companies, so they instituted cost-cutting
measures. The Insurance Company selected a
random sample of 26 recent claims. The mean cost
per claim was $56.42 and the standard deviation
was $10. At the .01 significance level, is it
reasonable to conclude that the mean cost to
process a claim is now less than $60?
• Can they conclude that the cost-cutting measures
were effective? Or should they conclude that the
difference between the sample mean ($57) and the
population mean ($60) is due to chance? Use
the .01 significance level. P-value?
Solution
Step1:We state the Null and Alternate
hypotheses:
H0: ≥ 60
H1: < 60
Step2: Select the level of significance.
= .01
Step3: Select the test statistic. Since σ is not
known, t test statistic is to be used.
Step4: Formulate the Decision Rule.
If tcomp. ≤ -2.485, H0 will be rejected, otherwise
H0 will be accepted.
Chart to be provided. Df=26-1=25
p .70 .60
Z 2.89
(1 ) .60(1 .60)
n 200
Decision:
Since Zcomp (2.89) falls in the rejection region, H0
is rejected and H1 is accepted, that is, π>.60.
Yes, at the .01 significance level, she can
conclude that more than 60 percent of the
accounts are in arrears for more than three
months.
The test is statistically significant.
p- value in hypothesis testing
• The p-value provides additional insight into the
decision. It gives additional information on the
strength of the rejection. That is, how confident are
we in rejecting the null hypothesis?
• The p-value compares with the
significance level.
• If the p-value is smaller than the
significance level, H0 is rejected.
• If it is larger than the significance level,
H0 is not rejected.
• Determining the p-value not only results in a
decision regarding H0 but it gives us additional
insight into the strength of the decision. A very
small p-value, such as .0001, indicates that
there is little likelihood the H0 is true. On the
other hand a p-value of .2033 means that H0 is
not rejected, and there is little likelihood that it is
false.
• How do we compute the p-value? [BOARD].
How to compute p-value?
X1 X 2
Z
12 22
n1 n2
Step4: Formulate the Decision Rule.
If Zcomp. >2.33, H0 will be rejected, otherwise
H0 will be accepted.
Chart to be provided.
X1 X 2 5.5 5.3
Z 3.12
12 22 (.40) 2 (.30) 2
n1 n2 50 100
Decision
P – value????
Two sample Tests about proportions
• Assumptions are the same as one sample test.
Example: Keya Cosmetics recently developed a
new fragrance that they plan to market under the
name “Heavenly” . A number of market studies
indicate that Heavenly has very good market
potential. The sales department is particularly
interested in whether there is a difference in
the proportions of younger and older women
who would purchase Heavenly if it were
marketed. There are two independent
populations, a population consisting of the
younger women and a population consisting of
the older women.Each sampled woman will be
asked to smell Heavenly and indicate whether
she likes the fragrance well enough to purchase
a bottle.
A random sample of 100 young women
revealed 20 liked the Heavenly fragrance
well enough to purchase it. Similarly, a
sample of 200 older women revealed 100
liked the fragrance well enough to make
purchase.
DEPENDENT SAMPLES
d
is the mean of the difference between the paired
or related observations
sd
is the standard deviation of the differences
between the paired or related
observations.
• n is the number of paired observations.
Example:
• A loan giving organization wishes to
compare the two companies they use to
appraise the value of residential homes.
The Organization selected a sample of 10
residential properties and scheduled both
firms for an appraisal. The results,
reported in $000, are:
Home 1 2 3 4 5 6 7 8 9 10
Co. A 235 210 231 242 205 230 231 210 225 249
Co. B 228 205 219 240 198 223 227 215 222 245
Home Co. A Co. B Difference, d d2
1 235 228 7 49
2 210 205 5 25
3 231 219 12 144
4 242 240 2 4
5 205 198 7 49
6 230 223 7 49
7 231 227 4 16
8 210 215 -5 25
9 225 222 3 9
10 249 245 4 16
Total 46 386
• At the .05 significance level, can we conclude there
is a difference in the mean appraised values of the
homes?
• SOLUTION: Done on the board.
• To find the p-value, we use Appendix F and the
section for a two-tailed test. Move along the row
with 9 degrees of freedom and find the values of t
that are closest to our calculated value. For a .01
significance level, the value of t is 3.250. The
computed value is larger than this value, but
smaller than the value of 4.781 corresponding to
the .001 significance level. Hence, the p-value is
less than .01.
Comparing Populations with Small
Samples
• Where:
• X 1 is the mean of the first sample.
• X 2 is the mean of the second sample.
• n1 is the number of observations in first
sample.
• n 2 is the number of observations in
second sample.
2
• s p is the pooled estimate of the
population variance.
• The number of degrees of freedom in the test is
the total number of items sampled minus the
total number of samples. Because there are two
samples, there are n1 + n2 - 2 degrees of
freedom.
Example: Page #417
• The production manager, at a manufacturer of
wheelchairs, wants to compare the number of
defective wheelchairs produced on the day
shift with the number on the afternoon shift. A
sample of the production from 6 day shifts
and 8 afternoon shifts revealed the following
number of defects.
Day 5 8 7 6 9 7
Afternoon 8 10 7 11 9 12 14 9
At the .05 significance level, is there a
difference in the mean number of defects
per shift?
Solution: