Sample Size Calculation
Sample Size Calculation
Abstract
Every clinical trial should be planned. This plan should include the objective of trial, primary and secondary end-
point, method of collecting data, sample to be included, sample size with scientific justification, method of handling
data, statistical methods and assumptions. This plan is termed as clinical trial protocol. One of the key aspects of this
protocol is sample size estimation. The aim of this article is to discuss how important sample size estimation is for a
clinical trial, and also to understand the effects of sample size over- estimation or under-estimation on outcome of a
trial. Also an attempt is made to understand importance of minimum sample to detect a clinically important difference.
This article is also an attempt to provide inputs on different parameters that impact sample size and basic rules for
these parameters with the help of some simple examples.
Key Words: Sample size, statistically significant, clinically significant, Type I error, Type II error, power,
Type I error = (Reject H0 / H0 is true). of a CT. The description of primary study end point should
Probability of Type I error is called as level of significance cover whether it is discrete or continuous or time-to-event.
and is denoted as . Sample size is estimated differently for each of these end
points. Sample size is adjusted if primary end point involves
Type II Error: multiple comparision.
Consider we are testing paracetamol against placebo to 4) Expected Response Test vs. Control
evaluate if paracetamol is better in curing subject’s suffering The information about expected response is usually
from fever as compared to placebo. (It is expected that the obtained from previous trials done on the test drug. If this
effect of paracetamol is better than placebo). Let us try to information is not available, it could be obtained from previous
built a statistical hypothesis around this published literature. The important information required is:
Null Hypothesis (H0): paracetamol is equal to placebo a) response expected with test drug (mean expected score
Alternate Hypothesis (H1): paracetamol is better than / proportion of subjects achieving success.)
placebo b) response expected with control drug (mean expected
Let us try to evaluate the error that can happen. score / proportion of subjects achieving success.)
Error 2: If analysis concludes that paracetamol is equal to c) variability (standard deviation.)
placebo, we accept H0. Knowing that paracetamol is better
than placebo (H1) we are making an error here by accepting 5) Clinical Important / Meaningful Difference / Margin
H0. This is called as Type II error. Statistically it is defined as This is one of most critical and one of most challenging
Type II error = (Accept H0 / H1 is true). parameters. The challenge here is to define a difference
Probability of type II error is denoted as . between test and reference which can be considered clinically
In above case if analysis concludes that paracetamol is meaningful. To put it in plain English, this is the difference
better than placebo, we reject H0, which would be correct which will make your family doctor to give you new medication
decision. Probability of such a decision taking place is called over and above the existing gold standard which he is
as “Power”. prescribing for last 10 – 20 years. This threshold figure is at
Power = Probability (Reject H0 / H1 is true) which is actually 1- . times not easily available and should be decided based on
We can tabulate type I and type II error as clinical judgment.
Decision Taken \ Actual Fact H0 is True H1 is True
Reject H0 Type I error No error 6) Level of Significance
Accept H0 No error Type II error This is typically assumed as 5% or lesser. 1, 2.Type I error is
inversely proportional to sample size.
Following are the basic considerations for estimating
sample size for a CT 7) Power
As per guideline.1, 2 power should not be less that 80%.
1) Study Design Type II error is directly proportional to sample size.
In CT, different statistical study designs are available to
achieve objectives. Typical designs that may be employed 8) Drop-out rate
are parallel group design, crossover design etc. The sample size estimation formula will provide number
For sample size estimation study design should be of evaluable subjects required for achieving desired statistical
explicitly defined in the objective of the trial. A cross over significance for a given hypothesis. However in practice we
design will have different approach and formula for sample may need to enroll more subjects to account for potential
size estimation as compared to parallel group design. dropouts.
If n is the sample size required as per formula and if d is
2) Hypothesis Testing the dropout rate then adjusted sample size N1 is obtained as
This is another critical parameter needed for sample size N1 = n/ (1-d).
estimation, which describes aim of a CT. The aim can be
equality, non-inferiority, superiority or equivalence. 9) Unequal Treatment Allocation
Equality and equivalence trials are two-sided trials where For some of clinical trials it is ethically desirable to have
as non-inferiority and superiority trials are one-sided trials. more subjects in one arm compared to the other arm. For
Superiority or non-inferiority trials can be conducted only example, for placebo-controlled trials with very ill subjects it
if there is prior information available about the test drug on a is unethical to assign equal subjects to each arm. In such
specific end point. cases sample size is adjusted as below.
If n is sample size required as per formula and n1 is the
3) Primary Study End Point sample size for test and n2 is sample size for placebo and
The sample size calculation depends on primary end point n1 / n2 = k, then
68 PICR April-June 2010 Vol 1 Issue 2
Sample Size Estimation