Statistical-Sampling Presentation Slide
Statistical-Sampling Presentation Slide
A PRESENTATION BY:
GAURAV KUMAR PRAJAPAT
SR. AUDIT OFFICER
O/O AG (AUDIT-II), RAJASTHAN
JAIPUR
What is sampling?
Sample
Population
Why we need sampling in Audit?
Making an audit assertion with absolute certainty would be vastly
expensive.
There would always be some risk that audit fails to discover all
material errors, even when 100% of the transactions are audited.
Recognizing this, the auditor defines an audit risk that he is willing to
accept
Sampling in audit means testing less than 100% of the items in the
population for some characteristic and then drawing a conclusion
about that characteristic for the entire population.
What is ‘Statistical Sampling’?
Sampling error is the error that arises in a data collection process as a result of
taking a sample from a population rather than using the whole population
The sampling error for a given sample is unknown but when the sampling is
random, for some estimates (for example, sample mean, sample proportion)
theoretical methods may be used to measure the extent of the variation caused
by sampling error
Non-sampling error is the error that arises in a data collection process as a
result of factors other than taking a sample.
Non-sampling errors have the potential to cause bias in polls, surveys or
samples.
Sampling errors Vs Non-sampling errors
ITEMS SAMPLING ERROR NON-SAMPLING ERROR
Meaning Sampling error is a type of error, occurs An error occurs due to sources
due to the sample selected does not other than sampling, while
perfectly represents the population. conducting survey activities is
known as non sampling error.
Cause Deviation between sample mean and Deficiency and analysis of data
population mean
Type Random Random or Non-random
Occurs Only when sample is selected. Both in sample and census.
Sample size Possibility of error reduced with the It has nothing to do with the
increase in sample size. sample size.
Important factors in drawing a sample
Sampling Design
Sampling Frame
Sampling Method
Size of the Population
Sample Size
Required level of Confidence
Sample Design
Principle:
In this method, first unit is drawn by random numbers; thereafter, every k th (k =
N/n is sampling interval; ‘N’ is the population size and ‘n’ the sample size)
unit is drawn. It gives equal chance of selection to each unit;.
Procedure
1. Prepare a list of all elements in the study population (N)
2. Decide the sample size (n)
3. Determine the sampling interval ‘k’ as the integer nearest to N/n
4. Have the random start by choosing an integer ‘r’ between 1 and k.
5. Select every kth unit starting with the unit corresponding to the number ‘r’.
Systematic Sampling contd.
30
Works well only if the complete and up-to-date frame is available and if
the units are randomly arranged in the frame; for this reason the units are
arranged in some order say alphabetical or in increasing/decreasing order
of value before selecting a sample.
3. Stratified sampling
32
Principle
Classify population into homogeneous subgroups (strata)
Stratification may be done on the basis of income, age, rural-urban,
Revenue-Capital, Treasuries, major heads, etc.
Draw sample (not necessarily equal) from each strata
Combine results of all strata
Advantages
More precise if variable to be studied/audited is associated with strata e.g.
in MUS [Monetary Unit Sampling], variable is value of voucher which is
related to strata so it is likely to give better results than SRS
All subgroups represented, allowing separate conclusion about each of
them; say separate conclusion for each District/ treasury
Administrative convenience
3. Stratified sampling (contd..)
Disadvantages 33
(i) To select BPL households for a social audit; divide the population of BPL into
three categories (strata) say top 25%, Middle 50% and Bottom 25% and select
separate samples from 3 categories/strata.
(ii) Dividing contracts into value ranges and then selecting separate samples from
each value range; say 100% contracts from highest value range, 50% contracts
from next highest and so on.
(iii)Monetary Unit Sampling (MUS) is also a case of Stratified Sampling where the
population (vouchers) is divided into strata based on values.
Allocation of Sample size in Stratified sampling
34
Proportional Allocation: ni = (n/N)*Ni
where ni is size of sample from ith strata, Ni is population of ith strata; n is
sample size and N is the population size
Optimum allocation ni’s are chosen so as to
(a) Maximise the precision for fixed sample size n;
(b) Maximise the precision for fixed cost
(c) Minimise the total cost for fixed desired precision
Disproportionate Allocation
No. of items selected from a stratum is independent of its size.
A large sample would be required from a stratum if
1. Stratum size Ni is large.
2. Stratum variability Si (Variance or Std. Dev.) is large.
Exercise: Proportional Allocation 35
Number of Vouchers coming from 3 treasuries are 300, 200 and
500 respectively. Draw a Proportional Stratified sample of size 60
using the random number table.
Solution:
Here N1 = 300, N2 = 200 and N3 = 500; N = 1000
using ni = (n/N)*Ni; i = 1, 2, 3
n1 = (60/1000)* 300 = 18, n2 = (60/1000)* 200 = 12
and n3 = (60/1000)* 500 = 30.
Thus a sample of 18, 12 and 30 vouchers will be selected from
these three strata using random number table.
4. Cluster Sampling
36
The population is divided into non-overlapping groups known as
Clusters.
Clusters are commonly formed on the basis of geographical
/administrative/political boundaries, e.g. GPs, Blocks, Departments may
act as clusters.
Procedure
List all the clusters/groups of sampling units of the study population
Select Random Sample of clusters
Survey all or proportion of sampling units of selected clusters
For e.g. selecting some Districts from a state and auditing all the units
(say PHCs) under them leaving out the units of other Districts; finally
drawing conclusion about the entire state.
Cluster Sampling (contd..) 37
Advantages
3 C 5 155
4 D 20 175
5 E 10 185
The scatter diagram graphs pairs of numerical data, with one variable on each
axis, to look for a relationship between them. If the variables are correlated, the
points will fall along a line or curve. The better the correlation, the tighter the
points will hug the line.
https://www.ncbi.nlm.nih.gov/pmc/articles/PMC374386/#:~:text=The%20most%
20commonly%20used%20techniques,the%20form%20of%20an%20equation
.
IDEA Sampling Capabilities
IDEA offers five sampling methods together with the ability to calculate sample
sizes based on parameters entered and to evaluate the results of sampling tests.
Sampling methods include systematic, random, stratified random, monetary unit
and classical variables.
Attribute Planning and Evaluation can be used to calculate sample sizes,
confidence levels, and error limits.
Thank you for giving this opportunity to interact with
you and feel free to contact me in case of any doubt
regarding sampling
Mobile 9461588507