STAT 101 Chapter 2 PPT
STAT 101 Chapter 2 PPT
2 / 35
Outline
3 / 35
2. Data Collection
Properties of Numbers
Identity - property of a number that enables a person to distinguish one number
from the other and are used for classification purposes only.
Order - refers to the way the numbers are arranged in a sequence.
Additivity - is the property that allows us to add numbers of equal scales.
4 / 35
2. Data Collection
Levels of Measurement
1 Nominal Scale - possess only the property of identity (e.g. religion, gender)
2 Ordinal Scale - possess the properties of both identity and order (e.g. education
level, customer satisfaction rating).
3 Interval Scale - possess the properties of identity, order, and equality of scale but
do not have the property of absolute zero (e.g. temperature).
4 Ratio Scale - possess all the properties of identity, order, equality of scale, and
absolute zero (e.g. height, weight).
5 / 35
2. Data Collection
6 / 35
2. Data Collection
1 IQ scores
2 Education Level
3 Income
4 Ethnicity
5 Nationality
6 Allowance of a student (in pesos)
7 Marital status
8 Job Ranking
7 / 35
2. Data Collection
1 IQ scores - Interval
2 Education Level - Ordinal
3 Income - Ratio
4 Ethnicity - Nominal
5 Nationality - Nominal
6 Allowance of a student (in pesos) - Ratio
7 Marital status - Nominal
8 Job Ranking - Ordinal
8 / 35
2. Data Collection
2.2 Methods of Data Collection
10 / 35
2. Data Collection
2.3 Sampling Methods
11 / 35
2. Data Collection
Probability Sampling
Probability sampling is based on the principle that every element in the
population has a non-zero chance of being chosen.
Non-probability Sampling
Non-probability sampling methods do not use any randomization
mechanism in identifying the sampling units included in the sample.
Rather, it allows the researcher to choose the units in the sample
subjectively.
12 / 35
2. Data Collection
Probability Sampling
1 Simple Random Sampling (SRS)
• Simple Random Sampling With Replacement (SRSWR)
• Simple Random Sampling Without Replacement (SRSWOR)
2 Systematic Random Sampling
3 Stratified Random Sampling
4 Cluster Sampling
5 Multistage Sampling
Non-probability Sampling
1 Haphazard or Convenience Sampling
2 Judgment or Purposive Sampling
3 Quota Sampling
4 Snowball Sampling
13 / 35
2. Data Collection
14 / 35
2. Data Collection
Simple Random Sampling
Simple random sampling can be done with replacement (with repetition) or
without replacement (without repetition).
SRSWR
In SRSWR, the n elements in the sample need not be distinct, that is, an
element can be selected more than once to be a part of the sample.
SRSWOR
In SRSWOR, all of the n elements in the sample must be distinct from
each other.
Probability of Inclusion
The probability that an element will be included in the selected sample is
2
N −1 n
1− (in SRSWR) and (in SRSWOR).
N N
15 / 35
2. Data Collection
16 / 35
2. Data Collection
17 / 35
2. Data Collection
Randomization Mechanism
1 The Lottery Method - drawing n slips of paper (balls) from an opaque bowl (any
nontransparent container) containing N slips of paper.
2 Generating random numbers using a calculator - using the syntax RanInt#(1, N)
where N is the population size.
3 Generating random numbers using Microsoft Excel - using the syntax
= RANDARRAY (n, 1, 1, n, TRUE ) for SRSWR and
= INDEX (UNIQUE (RANDARRAY (n2 , 1, 1, n, TRUE )), SEQUENCE (n)) for
SRSWOR
4 The use of Table of Random Numbers - a computer-generated random numbers.
18 / 35
2. Data Collection
SRS Illustration:
19 / 35
2. Data Collection
Systematic Random Sampling
21 / 35
2. Data Collection
3 Generate the other 11 students:
r + 1k = 15 + 1(3) = 18
r + 2k = 15 + 2(3) = 21
r + 3k = 15 + 3(3) = 24
r + 4k = 15 + 4(3) = 27
r + 5k = 15 + 5(3) = 30
r + 6k = 15 + 6(3) = 33
r + 7k = 15 + 7(3) = 36
r + 8k = 15 + 8(3) = 39
r + 9k = 15 + 9(3) = 42
r + 10k = 15 + 10(3) = 45
r + 11k = 15 + 11(3) = 48
r + 12k = 15 + 12(3) = 52 = 2
Therefore, the students included in the sample survey are the 15th, 18th, 21st,
24th, 27th, 30th, 33rd, 36th, 39th, 42nd, 45th, 48th, and 2nd student in the list.
22 / 35
2. Data Collection
23 / 35
2. Data Collection
24 / 35
2. Data Collection
25 / 35
2. Data Collection
Cluster Sampling
Cluster Sampling
Cluster sampling is a probability sampling method wherein we divide the
population into nonoverlapping groups or clusters consisting of one ore
more elements, and then select a sample of clusters. The sample will
consist of all the elements in the selected clusters.
26 / 35
2. Data Collection
27 / 35
2. Data Collection
Caloocan Navotas
Las Piñas Parañaque
Makati Pasay
Malabon Pasig
Mandaluyong Quezon City
Manila San Juan
Marikina Taguig
Muntinlupa Valenzuela
28 / 35
2. Data Collection
Multistage Sampling
Multistage Sampling
Multistage Sampling is an extension of one-stage cluster sampling.
29 / 35
2. Data Collection
Non-probability Sampling
does not make use of any randomization mechanism
subjectively chooses the units as part of the sample
30 / 35
2. Data Collection
Non-probability Sampling
Quota Sampling
Quota sampling is like stratified random sampling but without
randomization.
31 / 35
2. Data Collection
Non-probability Sampling
Snow-ball Sampling
In snowball sampling, a few initial samples are taken by SRS. The
sample is then expanded through referrals. Sometimes, referrals are done
through social networks, so this method is also called network sampling.
32 / 35
2. Data Collection
Non-probability Sampling
33 / 35
Group Activity 2
Instruction: Search on the internet for any data from a reliable source and
identify the data collection method and sampling method they used.
Determine also the levels of measurement on each variable included or
defined in the data.
34 / 35
-END