Research-Methodolgy and IPR - Module-3
Research-Methodolgy and IPR - Module-3
Design of Sampling:
Introduction:
When we decide to study a population, most often we are unable to look at all the individuals
in the population. Real life issues like the lack of time, limited amount of money and
inconvenience to the individuals under study disallow us to include the entire population in our
study. Instead, we choose a sample from the population, which reflects its structure and nature.
We want our results to be reliable and dependable, and for those reasons our sample must
represent the entire population. Choosing a right sample is a critical matter.
A sampling design is a definite plan for obtaining a sample from a given population. It refers
to the technique or the procedure the researcher would adopt in selecting items for the sample.
Sample design may as well lay down the number of items to be included in the sample i.e., the
size of the sample. Sample design is determined before data are collected. There are many
sample designs from which a researcher can choose. Some designs are relatively more precise
and easier to apply than others.
The sampling design is a fundamental part of data collection for scientifically based decision
making. A well-developed sampling design plays a critical role in ensuring that data are
sufficient to draw the conclusions needed.
Sampling error:
Sampling error means the difference between the mean values of the sample and the mean
values of the entire population, so it only happens when you’re working with representative
samples. It’s the inevitable gap between your sample and the true population value.
the whole population will never be perfectly represented by a sample because the population
is larger and more complete. In this sense, sampling error occurs whenever you’re sampling.
It’s not a human error, and it can’t be completely avoided.
Non-sampling error:
Non-sampling errors can happen even when you are not sampling. i.e. they need to be avoided
whether you are working with a representative sample (such as with a national survey) or doing
total enumeration of your entire population (such as when you are carrying out employee
experience surveys with your workforce).
Non-sampling errors occur when there are problems with the sampling method, or the way the
survey is designed or carried out.
Reasons for Sampling Errors:
Faulty selection of sampling method.
Faulty demarcation of sampling units.
Variability of the population which has different characteristics.
Substituting one sample for other sample due to difficulties in collecting the sample.
Simple or unrestricted
Judgement sampling
random sampling
Stratified random
Quota sampling
sampling
Example: The company has 800 female employees and 200 male employees. You want
to ensure that the sample reflects the gender balance of the company, so you sort the
population into two strata based on gender. Then you use random sampling on each
group, selecting 80 women and 20 men, which gives you a representative sample of
100 people.
c) Systematic sampling: Researchers use the systematic sampling method to choose the
sample members of a population at regular intervals. It requires selecting a starting
point for the sample and sample size determination that can be repeated at regular
intervals. This type of sampling method has a predefined range; hence, this sampling
technique is the least time-consuming.
Example: All employees of the company are listed in alphabetical order. From the first
10 numbers, you randomly select a starting point: number 6. From number 6 onwards,
every 10th person on the list is selected (6, 16, 26, 36, and so on), and you end up with
a sample of 100 people.
d) Cluster sampling: Cluster sampling is a method where the researchers divide the entire
population into sections or clusters representing a population. Clusters are identified
and included in a sample based on demographic parameters like age, sex, location, etc.
This makes it very simple for a survey creator to derive effective inferences from the
feedback.
For example, suppose the Indian government wishes to evaluate the number of
immigrants living in India. In that case, they can divide it into clusters based on states
such as West Bengal, Bihar, Odissa, Punjab etc. This way of conducting a survey will
be more effective as the results will be organized into states and provide insightful
immigration data.
b. Interviews: Interviews involve direct interaction between the researcher and the respondent.
They can be conducted in person, over the phone, or through video conferencing. Interviews
can be structured (with predefined questions), semi-structured (allowing flexibility), or
unstructured (more conversational).
c. Observations: Researchers observe and record behaviors, actions, or events in their natural
setting. This method is useful for gathering data on human behavior, interactions, or
phenomena without direct intervention.
e. Focus Groups: Focus groups bring together a small group of individuals who discuss
specific topics in a moderated setting. This method helps in understanding opinions,
perceptions, and experiences shared by the participants.
Survey Experiment
Surveys are descriptive Experimental
Require larger samples Need smaller samples
Systematic and logical method of answering
Describe a phenomenon or state of affairs
the question
Experiment data is related to laboratory
Survey is field research
research
Hypothesis can be built from survey Hypothesis testing is done
Survey is on census or sample Experiment is on a sample
Purpose of survey is for correlational Purpose of experiment data is to establish
analysis causal relationship between variables
Secondary data collection involves using existing data collected by someone else for a purpose
different from the original intent. Researchers analyze and interpret this data to extract relevant
information. Secondary data can be obtained from the following sources.
b. Online Databases: Numerous online databases provide access to a wide range of secondary
data, such as research articles, statistical information, economic data, and social surveys.
e. Past Research Studies: Previous research studies and their findings can serve as valuable
secondary data sources. Researchers can review and analyze the data to gain insights or build
upon existing knowledge.
A case study is a detailed study (research) of a specific subject, such as a person, group, place,
event, organization, or phenomenon. Case studies are commonly used in social, educational,
clinical, and business research.
For example, a case study of labors may be taken up to study the impact of noise and vibration
on their physical and mental health.
Sources of data in case study method: Case studies tend to focus on qualitative data using
methods such as interviews, observations, and analysis of primary and secondary sources
(e.g., newspaper articles, photographs, official records). Sometimes a case study will also
collect quantitative data.
In our daily life we come across many situations where in we witness measurement of some
quantity (volume of petrol or edible oil, weight of rice, sugar, area of a space etc.,) or sometimes
we are part of measurement process. Measurement can be qualitative such as the quality of a
painting, quality of a song or the personality of people or it can be quantitative where we
quantify the properties of physical objects by its mass, volume, density etc. Thus, by
measurement we mean the process of assigning numbers to objects or observations, the level
of measurement being a function of the rules under which the numbers are assigned.
It is easy to measure in respect of objects or concepts that are quantifiable, because
measurement tools are standardized. The measurement obtained is highly accurate. However,
if the concept of abstract are qualitative form, the measurement becomes quite difficult because
the measurement tools are not standardized and the results are relatively less accurate.
In Statistics, the variables or numbers are defined and categorized using different scales of
measurements. Each level of measurement scale has specific properties that determine the
various use of statistical analysis. The most widely used classification of measurement scales
are:
1. Nominal Scale
2. Ordinal Scale
3. Interval Scale
4. Ratio Scale
1. Nominal Scale:
A nominal scale is the 1st level of measurement scale in which the numbers serve as “tags” or
“labels” to classify or identify the objects. A nominal scale usually deals with the non-numeric
variables or the numbers that do not have any value. Following are the characteristics of
nominal scale.
A nominal scale variable is classified into two or more categories. In this measurement
mechanism, the answer should fall into either of the classes.
It is qualitative. The numbers are used here to identify the objects.
The numbers do not define the object characteristics. The only permissible aspect of
numbers in the nominal scale is “counting.”
Example:
Gender, marital status, religion, race, hair color, country, etc are examples of Nominal Scale.
2. Ordinal Scale
The ordinal scale is the 2nd level of measurement that reports the ordering and ranking of data
without establishing the degree of variation between them. Ordinal represents the “order.”
Ordinal data is known as qualitative data or categorical data. It can be grouped, named and also
ranked. The characteristics of Ordinal Scale are:
The interval scale is quantitative as it can quantify the difference between the values.
It allows calculating the mean and median of the variables.
To understand the difference between the variables, you can subtract the values between
the variables.
The interval scale is the preferred scale in Statistics as it helps to assign any numerical
values to arbitrary assessment such as feelings, calendar types, etc.
Examples of Interval Scale:
Time: When measured throughout the day, time passes is an excellent example of interval data.
Because they are equidistant and quantifiable, the numerals on a clock are on an interval scale.
For example, the difference between 3 and 4 o’clock is the same as the difference between 4
and 5 o’clock.
Likert Scale: A Likert scale is a rating scale that quantitatively assesses opinions, attitudes, or
behaviors. It is made up of 4 or more questions that measure a single attitude or trait when
response scores are combined
Net Promoter Score(NPS): A net promoter score is a method of using a single survey question
to gauge customer satisfaction with a product. Businesses can send out this question— “On a
scale of 0 to 10, how likely are you to recommend [our product or company]?”—at various
stages of the customer’s journey.
Suppose the question is “How likely are you to recommend us?”
Customers can respond with a score of 0 to 10.
Based on their scores, you’ll place users in one of the three following categories:
Passives (scores of 7 or 8)
These users are satisfied with your product but not enthusiastic enough to promote it.
Detractors (scores of 0 to 6)
These are unhappy customers unlikely to buy from you again. They may even discourage others
from your product.
4. Ratio Scale:
The ratio scale is the 4th level of measurement scale, which is quantitative. It is a type of
variable measurement scale. It allows researchers to compare the differences or intervals. The
ratio scale has a unique feature i.e. it possesses the character of the origin or zero points.
A ratio scale is the most informative scale as it tends to tell about the order and number of the
object between the values of the scale. The most common examples of this scale are height,
money, age, weight etc. With respect to market research, the common examples that are
observed are sales, price, number of customers, market share etc.
Ratio scales encompass four distinct levels of measurement:
Example:
An example of a ratio scale is:
What is your weight in Kgs?
You can measure the size and magnitude You can measure the size and
Measurement of values using a consistent unit of magnitude of values with a
measurement. defined unit of measurement.