Introduction To Hypothesis Testing, Power Analysis and Sample Size Calculations
and
$$\operatorname{Var}\{\bar{x}\} = \operatorname{Var}\left\{\frac{\sum_{i=1}^{n} X_i}{n}\right\} = \frac{1}{n^2}\, n\sigma^2 = \frac{\sigma^2}{n}.$$
If $X_i \sim N\{\mu, \sigma^2\}$, then we know from earlier results that $\bar{x} \sim N\{\mu, \sigma^2/n\}$.
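As a quick illustration (not part of the original notes), a short simulation can confirm that the variance of the sample mean shrinks like σ²/n; the distribution parameters below are arbitrary choices:

```python
import random
import statistics

# Sketch (parameters are arbitrary): simulate many samples of size n from
# N(mu, sigma^2) and check that Var{x_bar} is close to sigma^2 / n.
random.seed(42)
mu, sigma, n, reps = 10.0, 2.0, 25, 20000

means = [
    statistics.fmean(random.gauss(mu, sigma) for _ in range(n))
    for _ in range(reps)
]

var_of_mean = statistics.pvariance(means)
print(var_of_mean)  # close to sigma**2 / n = 0.16
```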
Additionally, even if the data do not come from a normal distribution
$$\lim_{n\to\infty} P\left\{\frac{\sqrt{n}\,(\bar{x} - \mu)}{\sigma} \le x\right\} = \Phi(x).$$
Hence, even if our data are not normal, for a large enough sample size,
we can calculate probabilities for x̄ by applying the Central Limit Theorem,
and our answers will be close enough.
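A small simulation sketch (with an arbitrary choice of skewed, exponential data) illustrates the Central Limit Theorem at work:

```python
import math
import random
import statistics

# Sketch: for skewed Exp(1) data (mean 1, sd 1), the standardized sample mean
# sqrt(n) * (x_bar - mu) / sigma is approximately standard normal for large n.
random.seed(0)
mu = sigma = 1.0
n, reps = 200, 20000

z = [
    math.sqrt(n) * (statistics.fmean(random.expovariate(1.0) for _ in range(n)) - mu) / sigma
    for _ in range(reps)
]

frac_below_1 = sum(zi <= 1.0 for zi in z) / reps
print(frac_below_1)  # close to Phi(1) ≈ 0.8413
```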
2. Hypothesis Testing
Hypothesis testing is a formal statistical procedure that attempts to answer
the question “Is the observed sample consistent with a given hypothesis?”
This boils down to calculating the probability of the sample given the hy-
pothesis, P {X|H}. To set up the procedure, scientists propose what is called
a null hypothesis. The null hypothesis is usually of the form: these data
were generated by strictly random processes, with no underlying mechanism.
The null hypothesis is always the opposite of the hypothesis that we are
actually interested in. The scientist then sets up a hypothesis test to compare
the null hypothesis to the mechanistic alternative consistent with their
scientific theory.
Example 1 Examples of null hypotheses are:
1. no difference in the response of patients to a drug versus a placebo,
The null and alternative hypotheses should be specified before the test is
conducted and before the data are observed. The investigators also need to
specify a value for P {X|H0 } at which they will reject H0 . The idea is that if
the data are quite unlikely under the null hypothesis, then we conclude that
they are inconsistent with the null, and hence accept the alternative. Notice
that the null and the alternative are mutually exclusive and exhaustive–that
is, one or the other must be true, but it’s impossible that both are.
The probability that we reject the null when it is true is denoted α and is
called the size (or significance level) of the test. Its complement, 1 − α, is
called the confidence level of the test, though sometimes you will see these
terms used interchangeably.
Note that we reject H0 whenever P {X|H0 } falls at or below α. Hence,
α = P {we reject H0 when H0 is true}, also called the probability of a Type I
error.
The probability of a Type II error is given by P {we fail to reject H0 when
H0 is false} = β.
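These error rates can be checked by simulation. The sketch below assumes a one-sided z-test with known σ; the specific numbers (µ0 = 5, σ = 3, n = 5) anticipate the emissions example that follows:

```python
import math
import random
import statistics

# Sketch: under H0 (data from N(mu0, sigma^2)), a one-sided z-test of size
# alpha should reject about alpha of the time. Parameters anticipate the
# emissions example (mu0 = 5, sigma = 3, n = 5).
nd = statistics.NormalDist()
mu0, sigma, n, alpha, reps = 5.0, 3.0, 5, 0.05, 20000
c = mu0 + nd.inv_cdf(1 - alpha) * sigma / math.sqrt(n)  # rejection cutoff for x_bar

random.seed(1)
rejections = sum(
    statistics.fmean(random.gauss(mu0, sigma) for _ in range(n)) > c
    for _ in range(reps)
)
print(rejections / reps)  # close to alpha = 0.05
```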
The values under the normal curve that are equal or more extreme than
our test statistic constitute the rejection region.
Let’s begin with an example. Say that regulators desire a high certainty
that emissions are below 5 parts per billion for a particular contaminant, and
the regulatory limit is 8 parts per billion. They may conduct the following
test:
H0 : µ < 5 ppb
versus
Ha : µ ≥ 5 ppb
At what value will we reject H0 ? Say we would like to reject the null
hypothesis at the 95% confidence level. This means we wish to fix the
probability of falsely rejecting H0 (Type I error) at no greater than 5%. Here,
under H0 , µ can be fixed at µ = 5 without altering the size
of the test. Now we need to find the rejection region, i.e. the value of x̄ at
which we can reject H0 at 95% confidence.
With σ = 3 and n = 5, we need to find a c such that
$$P\left\{\bar{x} > c \mid H_0\right\} = P\left\{\frac{\bar{x} - 5}{3/\sqrt{5}} > \frac{c - 5}{3/\sqrt{5}}\right\} = P\left\{z > \frac{c - 5}{3/\sqrt{5}}\right\} = 0.05. \tag{2.2}$$
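A sketch of the cutoff calculation, using Python's `statistics.NormalDist` for the normal quantile:

```python
import math
from statistics import NormalDist

# Sketch: with mu0 = 5, sigma = 3, n = 5 and alpha = 0.05, solve
# (c - mu0) / (sigma / sqrt(n)) = z_{0.95} for the cutoff c.
z95 = NormalDist().inv_cdf(0.95)   # ≈ 1.6449
c = 5 + z95 * 3 / math.sqrt(5)
print(round(c, 2))                 # ≈ 7.21, so reject H0 when x_bar > 7.2
```

So the managers reject H0 whenever the sample mean exceeds roughly 7.2 ppb, the cutoff that reappears in the power calculations of the next section.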
3. Power Calculations
Let’s continue with our example. In the event that the managers fail to reject
H0 , that is, they conclude that there is insufficient evidence that emissions
are above 5 ppb, they and their stakeholders may want to ask the question:
“Was there sufficient information in our sample (i.e., is the lack of evidence
due to insufficient sample size) to have detected a difference of 3 ppb?” Hence,
they need to calculate the power of the test when µ = 8 ppb.
The power of a test is defined as the probability that we correctly reject
the null hypothesis, given that a particular alternative is true. Power can
also be defined as 1 − β, the complement of the Type II error probability.
We need
$$P\left\{\bar{x} > 7.2 \mid \mu = 8\right\} = P\left\{\frac{\bar{x} - 8}{3/\sqrt{5}} > \frac{7.2 - 8}{3/\sqrt{5}}\right\} = P\left\{z > -0.5963\right\} = 0.7245. \tag{3.2}$$
The power of this test at the specified alternative is then 0.7245. Alterna-
tively, we can say that the probability of type II error, or the probability that
we failed to reject the null when the true mean was 8 is 1 − 0.7245 = 0.2755.
We can conduct a full power analysis by plotting the power at a wide variety
of alternatives, or distances from µ0 , assuming that the standard deviation
remains constant across all concentrations.
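The power curve just described can be computed directly; the sketch below fixes the cutoff at 7.2 and σ = 3, n = 5 as in the example:

```python
import math
from statistics import NormalDist

# Sketch: power of the test "reject when x_bar > 7.2" at alternatives mu_a,
# holding sigma = 3 and n = 5 fixed as in the example.
nd = NormalDist()
se = 3 / math.sqrt(5)

def power(mu_a):
    # P{x_bar > 7.2 | mu = mu_a} = P{z > (7.2 - mu_a) / se}
    return 1 - nd.cdf((7.2 - mu_a) / se)

for mu_a in (6, 8, 10, 12, 14):
    print(mu_a, round(power(mu_a), 4))  # power(8) ≈ 0.7245, as in (3.2)
```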
[Figure: power curve for the test, plotted against alternative means from 6 to 14 ppb.]
4. Sample Size Calculations
Often, a failure to detect a difference reflects nothing
more than our inability to design a decent experiment. If we have any reason-
able estimate of the variability and a scientifically justifiable and interesting
alternative, or even a range of alternatives, we can estimate beforehand
whether or not the experiment is worth doing given the limitations on our
time and budget.
Say we would like to set the probability of a type I error at no greater
than 5% and of a type II error at no greater than 10%, what sample size
would we need for the test shown above? We saw that we rejected H0 at
$$\bar{x} \ge z_{1-\alpha}\left(\frac{\sigma}{\sqrt{n}}\right) + \mu_0.$$
Now consider the desired power. We need to repeat the same process as
we did above for the α level, but this time solving for c using the z value for
the corresponding power.
$$\bar{x} \ge z_{\beta}\left(\frac{\sigma}{\sqrt{n}}\right) + \mu_a.$$
Now recall that zβ = −z1−β . Setting the two expressions for x̄ equal to
one another we have
$$z_{1-\alpha}\left(\frac{\sigma}{\sqrt{n}}\right) + \mu_0 = -z_{1-\beta}\left(\frac{\sigma}{\sqrt{n}}\right) + \mu_a.$$
Solving for $n$ gives
$$n = \left(\frac{(z_{1-\alpha} + z_{1-\beta})\,\sigma}{\mu_a - \mu_0}\right)^2.$$
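Under the stated error rates (α = 0.05, β = 0.10) and the emissions example (σ = 3, a 3 ppb difference), the required sample size can be computed as a sketch:

```python
import math
from statistics import NormalDist

# Sketch: n = ((z_{1-alpha} + z_{1-beta}) * sigma / (mu_a - mu0))**2,
# rounded up to the next whole observation.
def sample_size(mu0, mu_a, sigma, alpha=0.05, beta=0.10):
    nd = NormalDist()
    z = nd.inv_cdf(1 - alpha) + nd.inv_cdf(1 - beta)
    return math.ceil((z * sigma / (mu_a - mu0)) ** 2)

print(sample_size(5, 8, 3))  # 9 observations for the emissions example
```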
Of course, often we will have no preliminary data from which to estimate
a standard deviation. In this case, we must use a conservative “best guess”
for the variance. In practice, we may also not know exactly what is a sci-
entifically meaningful alternative. However, as practitioners of science we
should be working to move our community towards more careful planning of
experiments and more careful thinking about our questions before we begin
the experiment.