Sample Size Calculation for Proportions

The document discusses methods for determining sample size and sampling techniques, particularly focusing on simple random sampling and stratified random sampling. It provides formulas for calculating sample sizes based on population proportions and confidence intervals, as well as the importance of stratification in improving survey reliability. Additionally, it outlines various approaches to estimating population parameters for effective sample size determination.

Uploaded by

Tigist G

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

46 views5 pages

Sample Size Calculation for Proportions

Uploaded by

Tigist G

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

is within 10% of the true of the average retail price in the city.

A SRS will be taken from available list

of all outlets. Another survey from the same population showed an average price of $7.00 for 20
items with a standard deviation of $1.4. Assuming 99.7% confidence interval, determine the sample
size.
Solution: N=2500, s=1.4 s2=(1.4)2
s 2 (1.4) 2
CV 2 ( y )    0.04, Z=3 for 99.7%,
y2 72
2
Z 2 CV y (3) 2 (0.04) no 36
no    36   0.0144 <5%
 2
(0.1) 2
N 2500
Therefore, no=n=36 which is a good approximation for the sample. But if you calculate for n, you will
no 36 36
get that n  n   35.5  36
no 36 1  0.0144
1 1
N 2500

Chapter 3 Sampling Proportion (for categorical data)

Definitions: In some cases the nature of the survey may require recording of the attributes, which can be
expressed qualitatively. The qualitative information can be quantified by counting the attribute
characteristics. These characteristics could be of various forms, such as living in urban or rural, being a male
or female, married or unmarried, literate or illiterate, adults between 18 and 45 years or adults over 45 years,
etc.
Therefore, the main interest for such attributes could be to estimate the total number of units and the
proportion of units in the population possessing some characteristics. Attributes can be changed in to
quantifiable information by allocating the score “1” or “0”, while measurable variables can also be changed
in to attributes by categorizing the population in to different groups. It is worth presenting the special simple
form that the variance of a proportion takes when the design is simple random sampling.
Notation
Consider a population in which each member is classified as either having or not having a specified attribute,
a two category population.
Population
N= Total number of units in the population
A= total number of units in specified category (in C)
P= A/N is population proportion, i.e the proportion (percentage) of the entire population that has a specified
value.
Q= 1-P is proportion of units not in C
Sample:
Page | 22
n= total number of members in the sample
a= the total number of members sampled that have the specified attribute,
p= a/n sample proportion, i.e. the proportion (percentage) of a sample from the population that has the
specified attribute.
q= 1-p proportion of sample members not in C.

Variance and Standard Errors of the Estimates

For any unit in the population or in the sample, we define an observation (variable) yi as follows to facilitate
counting.
1, if the is in C
yi  
0, if the Unit is not in C
N

N Y i
A NPQ
For population, Y   Yi  A , Y  i 1
  P and S 2  and for sample
i 1 N N N 1
n

n y i
a npq
y   yi  a , y  i 1

 p and s 2  (Verify).
i 1 n n n 1
Similar to a continuous case, a sample proportion, p, is also a random variable that depends on what
members of the population are included in that sample.
Theorem 5: The sample proportion, p=a/n is an unbiased estimate of the population proportion P=A/N, i.e..
Prove this theorem.
Theorem 6: The variance of the sample proportion or percentage (p) is given by
PQ  N  n 
Var ( p)  E ( p  P)    . Prove this theorem.
n  N 1 
Corollary: i) The estimated total number of units in class C, is an unbiased estimate of A.
ii) The variance of the estimated total number of units in class C, is
N 2 PQ  N  n 
Var ( Aˆ )   
n  N 1 
Estimation of the Standard error from the sample
Theorem 7: An unbiased estimate of the sample variance will be Var ( p)  pq  N  n   pq 1  f 
n 1 N  n 1
If N is large relative to n, the finite population correction (1-f) is negligible and the variance of p is
pq
Var( p)  (verify)
n 1
Corollary: The sample variance of estimated total number of members in specified category, is given by
N ( N  n)
Var ( Aˆ )  pq. In each case we can get the standard error by taking the square root of the variances.
n 1
Example: See Cochran 3rd edition page 52
For the proportion estimate the confidence limits can be obtained by: for large sample size and substitute
S.E(p) by s.e(p) to get the confidence interval . A slight improvement can be achieved by applying
continuity correction for normal approximation to binomial, i.e .
Relative Error
Page | 23
Statistical measures such as standard deviation and the standard error appear in the units of measurement of
variables. Such measurement units may cause difficulties in making some comparisons. Relative measures,
such as coefficients of variation, can be used to overcome the problems.
Sy sy
The element coefficient of variation is can be expressed as CV ( y )  and estimated by cv( y )  . For
Y y
S .E ( y ) s.e( y )
the mean the coefficient of variation is given by CV ( y )  and estimated by cv( y )  . For the
Y y
ˆ Nse( y ) se( y )
total , the coefficient of variation is given by CV (Yˆ )  S .E (Y ) .and estimated by cv(Yˆ )   ,
E (Yˆ ) Ny y
which is the same as the coefficient of variation of the mean.
PQ( N  n)
n( N  1) Q ( N  n)
For proportion (p), we can write the coefficient of variation as CV ( p)   ,
P nP( N  1)

Q
which is approximately equal to if finite population correction (1-f) is ignored. Its estimate is given as:
nP
pq( N  n)
s.e( p ) (n  1) N q
cv( p )    (1  f ) .
p p (n  1) p
S .E (ˆ)
Generally, the coefficient of variation of an estimator is given by CV (ˆ)  and its square is known
E (ˆ)
Var (ˆ)
as rel-variance, i.e, CV 2 (ˆ) 
 
.
E (ˆ)
2

Sample Size Determination

For Categorical Data
The sample size required for estimation population proportion (P) can be obtained in a similar way and have
similar form to those shown above for the mean. Assume that the proportion estimate p is normally
distributed with absolute margin of error or relative error , the sample size n can be calculated by
Z 2 PQ d 2
n ( Verify this).
1  1 N   Z 2 PQ Nd 2
no
If we put n0  Z 2 PQ d 2 , then we get n  . For large population size (N) we have the
1  1 N   n0 N
n0
sample size n  , and we can approximate n by n0 as we have done for the mean.
1  n0 N
Using the relative error () and the relation, we set n0  Z 2 Q P 2
In practice the population parameters ( must be estimated and the other factors usually set by the investigator
(researcher). The relation shows the following summary points.
 The smaller we make , the greater will be sample size n.
 If the degree of confidence () increases, certainly the sample size increases.
 Since population parameters are unknown, calculate n0 by using the sample estimates. That is
Page | 24
Z 2 s y 
2
Z 2 cv 2 ( y )
n0  Z pq d
2 2
or n0  
2 2
How do we get estimates of the population parameters in order to use these estimates in sample size
determination? In actual practice, there are four possible ways of estimating the parameters.
 By taking simple random sample of size n1, small preliminary sample, from which and the required n
will be obtained. This method gives the most reliable estimates, but slows up the completion of the
survey and because of this it is not often used.
 By using the results of pilot survey: To design efficiently a large sample in an unknown field, a pilot
study may be conducted prior to the survey to gain information for designing the survey which also
serves many other proposes.
 By using previous surveys results: we should search for data from previous/past surveys of similar
variables and make use of it after adjusting for time changes.
 By guesswork about the nature of population: these requires educated guesses or the services of
experts such as survey statisticians, supported by specialists in the subject matter concerned who may
construct a model of the population distribution, its shape, and its probable limits, and deduce from
it.

Reading Assignment: Read Cochran 3rd ed, chapter 4, section 4.7, page 78-81.
Example.

2. A teacher training institutes are interested in estimating the proportion (P) of teachers who consider
system to be more suitable as compared to the 3-term system of education. A SRS of n=120 teachers is
taken from a total N=1200 teachers, without replacement. Some of the teachers are in favor of two
semesters while others are not and it is found that 72 teachers are in favor of semester system.
i) Estimate the proportion P along with the standard error of your estimate.
ii) Calculate the 95% confidence interval for P.
iii) Do you think the sample size 120 is sufficient if the tolerable error could be 0.08? If not, how many
more units should be included in the sample?

Solution:
n=120, a=72, N=1200,
i) P=a/n=72/120=0.6
ii) 95% confidence limits

Therefore the proportion of teachers in the institutes favoring semester system is likely to be between
51% and 68%. Estimate of total number of teachers who are in favor of two-semester system is

Z 2 pq (1.96) 2  0.6  0.4 no 144

iii) no    144,   0.12 >5%
d2 (0.08) 2 N 1200
Therefore n can be estimated as

Page | 25
144
n
no
n
144   128.57  129
no 144 1.12
1 1
N 1200
Therefore 120 is not sufficient for achieving the given precision meaning 9 more teachers need to be
selected.

Chapter 4: Stratified Random Sampling

4.1 Definition:

Stratified Sampling is a technique, which involves the division or stratification of a population by

partition the sampling frame in to non-overlapping and relatively homogeneous groups called strata. The
selection of samples can be performed independently in each of those strata.

Stratified random Sampling is a sampling plan in which a population is divided in to L mutually

exclusive and exhaustive strata, and a simple random sample of nh elements is taken separately and
independently within each stratum. Let N1, N2, ------, NL represent the number of sampling units within
each stratum, and n1, n2, …….nL represent the number of randomly selected units within each stratum.
Then the total number of possible stratified random samples is equal to

 N1   N 2  N  N
      .......  L    
 n1   n2   nL   n 

Stratified random sampling, in particular involves dividing the population in to strata, and then selecting
simple random samples from each of strata. Stratification variables may be geographic (region, province,
rural/urban, zone) or non-geographic (income, age, sex, size of employees, etc). it should be kept in mind
that stratification is limited only to those items of information, which are available on the frame.

4.2 The purpose of stratified Sampling

Stratified sampling is used in certain types of surveys because it combines the conceptual simplicity of
simple random sampling with potentially significant gains in reliability. Basically there are four major
reasons for resorting to stratification:

Page | 26

Sampling Proportions and Estimation Techniques
No ratings yet
Sampling Proportions and Estimation Techniques
5 pages
Sampling Proportions and Variance Analysis
No ratings yet
Sampling Proportions and Variance Analysis
31 pages
Simple Random Sampling Explained
No ratings yet
Simple Random Sampling Explained
27 pages
Confidence Intervals and Hypothesis Testing
No ratings yet
Confidence Intervals and Hypothesis Testing
21 pages
Simple Random Sampling Explained
No ratings yet
Simple Random Sampling Explained
10 pages
Simple Random Sampling Explained
No ratings yet
Simple Random Sampling Explained
51 pages
Estimation Techniques in Statistics
No ratings yet
Estimation Techniques in Statistics
65 pages
Sample Size Estimation Techniques
No ratings yet
Sample Size Estimation Techniques
5 pages
95% Confidence Interval for Proportions
No ratings yet
95% Confidence Interval for Proportions
7 pages
Stat Chapter 4
No ratings yet
Stat Chapter 4
19 pages
CAPE Applied Mathematics Study Guide
No ratings yet
CAPE Applied Mathematics Study Guide
6 pages
Sampling Distributions & Confidence Intervals
No ratings yet
Sampling Distributions & Confidence Intervals
10 pages
Simple Random Sampling for Proportions
No ratings yet
Simple Random Sampling for Proportions
9 pages
Sampling Theory for Proportions
No ratings yet
Sampling Theory for Proportions
8 pages
Statistical Estimations Explained
No ratings yet
Statistical Estimations Explained
10 pages
Point vs. Interval Estimation in Statistics
No ratings yet
Point vs. Interval Estimation in Statistics
28 pages
Inference on Population Mean & Proportion
No ratings yet
Inference on Population Mean & Proportion
19 pages
Impact of Sample Size on Proportion Error
No ratings yet
Impact of Sample Size on Proportion Error
43 pages
Estimation Methods in Statistics
No ratings yet
Estimation Methods in Statistics
17 pages
Understanding Point Estimation
No ratings yet
Understanding Point Estimation
17 pages
Statistical Inference and Estimation Techniques
100% (1)
Statistical Inference and Estimation Techniques
33 pages
Sampling Distribution of Proportions
No ratings yet
Sampling Distribution of Proportions
17 pages
Statistical Formulas for Data Analysis
No ratings yet
Statistical Formulas for Data Analysis
10 pages
Confidence Intervals Explained
No ratings yet
Confidence Intervals Explained
13 pages
Statistical Estimation Methods Explained
No ratings yet
Statistical Estimation Methods Explained
20 pages
Statistical Estimation Techniques Explained
No ratings yet
Statistical Estimation Techniques Explained
130 pages
CRE Equations and Formulas Print Out
100% (2)
CRE Equations and Formulas Print Out
30 pages
Ratio Estimators in Sampling Techniques
No ratings yet
Ratio Estimators in Sampling Techniques
8 pages
Statistical Estimation
No ratings yet
Statistical Estimation
28 pages
Statistical Inference: Estimation & Testing
No ratings yet
Statistical Inference: Estimation & Testing
39 pages
Central Limit Theorem Explained
No ratings yet
Central Limit Theorem Explained
23 pages
Sampling Theory Overview and Techniques
100% (2)
Sampling Theory Overview and Techniques
36 pages
Estimating Population Proportions and Confidence Intervals
No ratings yet
Estimating Population Proportions and Confidence Intervals
17 pages
Simple Random Sampling Explained
No ratings yet
Simple Random Sampling Explained
23 pages
Estimation
No ratings yet
Estimation
41 pages
Key Formulas in Introductory Statistics
No ratings yet
Key Formulas in Introductory Statistics
2 pages
Estimation Methods in Statistics
No ratings yet
Estimation Methods in Statistics
33 pages
Confidence Intervals in Estimation
No ratings yet
Confidence Intervals in Estimation
85 pages
Sample Size Estimation Techniques
No ratings yet
Sample Size Estimation Techniques
8 pages
Large vs Small Sample Estimation
No ratings yet
Large vs Small Sample Estimation
44 pages
Sample Estimation and Confidence Intervals
No ratings yet
Sample Estimation and Confidence Intervals
22 pages
Statistical Estimation and Sampling Distributions
No ratings yet
Statistical Estimation and Sampling Distributions
48 pages
Statistical Estimation Techniques Explained
No ratings yet
Statistical Estimation Techniques Explained
66 pages
Cochran's Sample Size Formula Explained
No ratings yet
Cochran's Sample Size Formula Explained
19 pages
CQE Exam Equation Overview
No ratings yet
CQE Exam Equation Overview
15 pages
Inferential Statistics and Estimation Techniques
No ratings yet
Inferential Statistics and Estimation Techniques
84 pages
Estimation and Hypothesis Testing Overview
No ratings yet
Estimation and Hypothesis Testing Overview
92 pages
Confidence Interval Estimation Guide
100% (1)
Confidence Interval Estimation Guide
31 pages
Inferential Statistics
No ratings yet
Inferential Statistics
73 pages
Chapter 2 Organizing and Summarizing Data
No ratings yet
Chapter 2 Organizing and Summarizing Data
8 pages
Understanding Probabilistic Sampling
No ratings yet
Understanding Probabilistic Sampling
39 pages
Data Analysis and Hypothesis Testing Guide
No ratings yet
Data Analysis and Hypothesis Testing Guide
21 pages
Estimating Population Parameters and CI
No ratings yet
Estimating Population Parameters and CI
19 pages
Statistical Estimations in Business
No ratings yet
Statistical Estimations in Business
14 pages
SM Lec-2
No ratings yet
SM Lec-2
6 pages
Principles of Sampling in Research
No ratings yet
Principles of Sampling in Research
20 pages
Sample Size Determination in Statistics
No ratings yet
Sample Size Determination in Statistics
17 pages
Estimation and Hypothesis Testing Guide
No ratings yet
Estimation and Hypothesis Testing Guide
118 pages
95% Confidence Interval for Mean Estimation
No ratings yet
95% Confidence Interval for Mean Estimation
2 pages
Simple Random Sampling Techniques
No ratings yet
Simple Random Sampling Techniques
13 pages
Understanding Sample Survey Methods
No ratings yet
Understanding Sample Survey Methods
9 pages
Stratified Random Sampling Explained
No ratings yet
Stratified Random Sampling Explained
13 pages
Understanding Statistics: Concepts & Methods
No ratings yet
Understanding Statistics: Concepts & Methods
18 pages
Population Inference & Proportion Methods
No ratings yet
Population Inference & Proportion Methods
37 pages
Comparing Two Population Means
No ratings yet
Comparing Two Population Means
33 pages
Understanding Variance and Its Applications
No ratings yet
Understanding Variance and Its Applications
8 pages
Swinging Fire Door Closers
No ratings yet
Swinging Fire Door Closers
1 page
Section 50 of Indian Evidence Act Explained
No ratings yet
Section 50 of Indian Evidence Act Explained
78 pages
Navigating Toxic Achievement Culture
No ratings yet
Navigating Toxic Achievement Culture
7 pages
Managing Conflicting Values in Education
No ratings yet
Managing Conflicting Values in Education
4 pages
NP 065 ST Lawrence Pilot Edition 14 2003
No ratings yet
NP 065 ST Lawrence Pilot Edition 14 2003
438 pages
Tenor Sax 1: (S o Lo - Ad Lib. or As Wri Tten
No ratings yet
Tenor Sax 1: (S o Lo - Ad Lib. or As Wri Tten
1 page
Group Therapy Basics for Professionals
No ratings yet
Group Therapy Basics for Professionals
3 pages
Overview of the European Union
No ratings yet
Overview of the European Union
27 pages
Backrooms Level 3: Electrical Station
No ratings yet
Backrooms Level 3: Electrical Station
1 page
COMM 871 Training Methods
No ratings yet
COMM 871 Training Methods
129 pages
MCQs on Cash Flow Statement
100% (1)
MCQs on Cash Flow Statement
6 pages
Accounting Problems on Asset Depreciation
No ratings yet
Accounting Problems on Asset Depreciation
7 pages
Metacube Graduate Engineer Trainee Offer
No ratings yet
Metacube Graduate Engineer Trainee Offer
2 pages
Jung's Son of the Earth in The Red Book
No ratings yet
Jung's Son of the Earth in The Red Book
17 pages
Overview of India's Constitution Parts
No ratings yet
Overview of India's Constitution Parts
8 pages
Consumer Behavior in Marketing Insights
No ratings yet
Consumer Behavior in Marketing Insights
7 pages
TDS on Salary: Key Reasons and Sections
No ratings yet
TDS on Salary: Key Reasons and Sections
4 pages
2016 Laos Compensation & Resettlement Decree
No ratings yet
2016 Laos Compensation & Resettlement Decree
16 pages
Features of Mauryan Administration
No ratings yet
Features of Mauryan Administration
15 pages
H&M Quality Management Strategies
No ratings yet
H&M Quality Management Strategies
6 pages
Transportation Management MCQ PDF
No ratings yet
Transportation Management MCQ PDF
16 pages
Understanding the Synoptic Problem
No ratings yet
Understanding the Synoptic Problem
17 pages
Public Service Job Vacancies Announcement
No ratings yet
Public Service Job Vacancies Announcement
3 pages
Ethical Hacking Overview Guide
No ratings yet
Ethical Hacking Overview Guide
3 pages
Searchsearch: What Is Scribd? Books Audiobooks Magazines Podcasts Sheet Music Documents Snapshots
No ratings yet
Searchsearch: What Is Scribd? Books Audiobooks Magazines Podcasts Sheet Music Documents Snapshots
4 pages
Ethics and Evil at Abu Ghraib
No ratings yet
Ethics and Evil at Abu Ghraib
5 pages
Investment Management Assessment Guide
No ratings yet
Investment Management Assessment Guide
2 pages
SK Resolution for Financial Assistance
No ratings yet
SK Resolution for Financial Assistance
2 pages
Configure TD-W8960N LAN as WAN Port
No ratings yet
Configure TD-W8960N LAN as WAN Port
5 pages
Remboursement Always : 34% à 65% en France
No ratings yet
Remboursement Always : 34% à 65% en France
5 pages