1.3 Type I Error Type II Error and Power PDF
1.3 Type I Error Type II Error and Power PDF
Objectives
Be able to define Type I error, Type II error, and power
Be able to identify Type I error and Type II error in a given scenario
Know what factors affect error rates and power
Be able to compute Type I error, Type II error, and/or power in a given
hypothesis testing scenario
When we conduct a hypothesis test, it leads us to one of two decisions based on the
evidence of our sample: we either reject H0 or fail to reject H0. However, a hypothesis
test is not foolproof—our decision might be incorrect!
1. A Type I error occurs when we reject H0 when H0 is actually true. The probability of
committing a Type I error is defined by the significance level, or α, and thus is
usually denoted as α. The probability expression of a Type I error is:
2. A Type II error occurs when we fail to reject H0 when H0 is actually false. The
probability of committing a Type II error is usually denoted as β. The probability
expression of a Type II error is:
Conversely, a hypothesis test can result in one of two correct decisions as well:
2. We can fail to reject H0 when H0 is actually true. The probability of failing to reject H0
when H0 is true is computed as 1 – α with a probability expression of:
Type I Error, Type II Error, and Power – 2
The relationships amongst these correct decisions and decision errors can sometimes
be best seen by a table:
Truth
H0 True H0 False
Reject H0
Decision from
hypothesis
test
Fail to
Reject H0
Example
Most store-bought pregnancy tests produce a binary outcome: they indicate either that a
woman is pregnant or indicate that they are not pregnant. Consider the following
hypotheses:
H0: a woman taking the test is pregnant
Ha: a woman taking the test is not pregnant
Example
In a Aesop’s fable “The Boy Who Cried Wolf,” a shepherd boy repeatedly runs to nearby
village to claim that there was a wolf attacking his flock when in fact there was no wolf at
all. Suppose the villagers were all statisticians. What type of error would they say the
boy was committing when doing this? (Assume H0: no wolf present)
Type I Error, Type II Error, and Power – 3
Significance level (α) – recall that α is the probability of rejecting H0 when H0 is true.
Increasing α…
Effect size – the effect size is the difference between the true parameter size and the
size specified in the null hypothesis.
Note: To calculate these probabilities, we need to know the true value of the parameter in the
population. Since this is often not something that we actually know, most questions involving
error and power calculations rely on us assuming a true value for the population parameter.
Note that if the question asks for the Type II error probability (β), you can follow the
steps above to find power, and then just take 1 – power to obtain β.
Example
A drug company that manufactures a sleeping aid drug claims that more than 70% of
the people that use their drug report an improvement in their sleep quality (compared to
before they were taking the drug). Suppose a competing company wishes to test this
claim by sampling 200 individuals who take the sleeping aid drug and asking them
whether or not they experience an improvement in their sleep quality while on the drug.
2. If the test were to be carried out using α = 0.025, state the decision rule in terms of
the sample proportion, p̂.
3. Suppose that in the sample of 200 individuals, 155 people claimed that they had an
improvement in their sleep quality when taking the drug. Based on this, what can be
said about the null hypothesis?
Type I Error, Type II Error, and Power – 6
4. Using the test outlined above, what would be the probability of concluding that
p = 0.70 if, in fact, p = 0.80? Find this probability by hand and using Minitab.
In Minitab…
Stat Power and Sample Size 1 Proportion…
Sample sizes: 300.
Comparison proportions: .
Power values: .
Hypothesized proportion: .
Click Options…
Alternative hypotheses: ≠.0
Significance level: ≠.0.
5. What is the power of the test outlined above if, in fact, p = 0.80?
Type I Error, Type II Error, and Power – 7
Example
Employees at multiple levels of a large company are stating that they are receiving, on
average, fewer annual paid vacation days than the national average. Suppose it is
known that the national average of annual paid vacation days in 2014 is known to be
7.5 days with a standard deviation of 1.6 days. To assess the validity of the employees’
claims, the company randomly samples 55 employees and finds that the average
number of annual vacation days for this sample is 7.22 days.
2. Find the p-value of this hypothesis test by hand and using Minitab.
In Minitab:
Graph Probability Distribution Plot View Probability
Distribution tab Shaded Area tab
Distribution: normal . Define Shaded Area by: X Value
Mean: 0. Select:
Std. Deviation:.1 X value:.
5. The employees are basing their claim on the national average of annual paid
vacation days in 2014. Suppose, however, that this average has now decreased to
7.41 days in 2018. Find the power of the test outlined above by hand and using
Minitab.
In Minitab…
Stat Power and Sample Size 1-Sample Z…
Sample sizes: 300.
Differences: .
Power values: .
Standard Deviation: .
Click Options…
Alternative hypotheses: ≠.0
Significance level: 9≠.0.
Type I Error, Type II Error, and Power – 9
Example
Consider the following hypotheses about a proportion, p, in a certain population:
H0: p = 0.50
Ha: p ≠ 0.50
Suppose the decision rule for a test of H0 is given as:
“Reject H0 if p̂ < 0.408 or if p̂ > 0.592.”
1. A sample of 80 individuals is taken to carry out the test outlined above. What is
the probability of making a Type I error? Find this probability with the help of Minitab.
In Minitab:
Graph Probability Distribution Plot View Probability
Distribution tab Shaded Area tab
Distribution: normal . Define Shaded Area by: X value
Mean: 0. Select:
Std. Deviation:.1 X value:.
Type I Error, Type II Error, and Power – 10
2. If p = 0.44, what is the probability of concluding that p = 0.5 using the test outlined
above (with a sample size of n = 80)? Find this probability by hand and using Minitab.
In Minitab…
Stat Power and Sample Size 1 Proportion…
Sample sizes: 300.
Comparison proportions: .
Power values: .
Hypothesized proportion: .
Click Options…
Alternative hypotheses: ≠.0
Significance level: ≠.0.
Type I Error, Type II Error, and Power – 11
Example
The American Heart Association (AMA) recommends that adults should aim to get an
average of at least 150 minutes of moderate exercise per week to maintain
cardiovascular health. An employer at a moderately-sized company wonders if his
employees are achieving this minimum. He takes a random sample of 40 employees
and finds their average weekly minutes of moderate exercise to be 133 minutes.
Suppose it is known that the standard deviation in the population is 57.9 minutes.
2. Suppose the test were carried out with a Type I error probability of 0.008. State the
decision rule in terms of the standard normal distribution and in terms of x̅.
3. Find the power of the test outlined above if μ = 145, μ = 140, and μ = 130. Use
Minitab to find the power values.
In Minitab…
Stat Power and Sample Size 1-Sample Z…
Sample sizes: 300.
Differences: .
Power values: .
Standard Deviation: .
Click Options…
Alternative hypotheses: ≠.0
Significance level: 9.
Click Graph…
□ Display power curve ≠.