Unit 4 Hypothesis Formulation and Sampling: Structure
Structure
4.0 Introduction
4.1 Objectives
4.2 Meaning and Characteristics of Hypothesis
4.3 Formulation of Hypothesis
4.4 Possible Difficulties in Formulation of a Good Hypothesis
4.5 Types of Hypotheses
4.5.1 Null Hypothesis
4.5.2 Alternative Hypothesis
4.6 Errors in Testing a Hypothesis
4.7 Importance of Hypothesis Formulation
4.8 Sampling
4.8.1 Definition of Sampling
4.8.2 Sampling Terminology
4.8.3 Purpose of Sampling
4.9 Sampling Methods
4.9.1 Non Probability Sampling
4.9.2 Probability Sampling
4.10 Importance of Sampling
4.11 Let Us Sum Up
4.12 Unit End Questions
4.13 Glossary
4.14 Suggested Readings and References
4.0 INTRODUCTION
All empirical sciences are characterised by two inter-related concepts, namely
(a) the context of discovery (getting an idea) and (b) the context of
justification (testing and results). Hypotheses are the mechanism and container
of knowledge moving from the unknown to the known. These elements form the
techniques and testing ground for scientific discovery. A hypothesis is a tentative
explanation of, and a potential answer to, a problem. It gives direction to the
study and helps the researcher interpret data. In this unit, you will be familiarised
with the term hypothesis and its characteristics. This is followed by the
formulation of hypotheses and the types of hypotheses. Errors in hypothesis
testing are also highlighted.
Further, in order to test a hypothesis, the researcher rarely collects data on the
entire population, owing to the high cost and the dynamic nature of the individuals
in the population. Instead, researchers collect data from a subset of individuals
(a sample) and make inferences about the entire population. This leads us to what
we should know about the population and the sample. The researcher therefore
plans a sample design and uses various methods of sampling. This unit will
acquaint you with the meaning of sampling and the basic terminology used in
sampling design. It then moves to the purpose of sampling. Finally, various
probability and non-probability sampling methods are described, along with
their advantages and disadvantages.
4.1 OBJECTIVES
After reading this unit, you will be able to:
• Define and describe hypothesis and its characteristics;
• Explain the formulation of hypothesis;
• Enumerate the possible difficulties in formulating a hypothesis;
• Explain the types of hypotheses;
• Identify errors in hypothesis testing;
• Define sampling;
• Explain the purpose of sampling; and
• Analyse various probability and non-probability sampling methods.
By stating a specific hypothesis, the researcher narrows the focus of the data
collection effort and is able to design a data collection procedure which is aimed
at testing the plausibility of the hypothesis as a possible statement of the
relationship between the terms of the research problem.
It is, therefore, always useful to have a clear idea and vision of the hypothesis.
This is essential for the research question that the researcher intends to verify,
as the hypothesis will direct the study and greatly help in the interpretation of
the results.
Hypotheses play a key role in formulating and guiding any study. They are
generally derived from earlier research findings, existing theories, and personal
observations and experience. For instance, suppose you are interested in knowing
the effect of reward on learning. You have analysed the past research and found
that the two variables are positively related. You need to convert this idea into a
testable statement. At this point you may develop the following hypothesis:
Those who are rewarded will require fewer trials to learn the lesson
than those who are not rewarded.
A researcher should consider certain points while formulating a hypothesis:
i) The expected relationship or difference between the variables.
ii) The operational definition of each variable.
iii) Hypotheses are formulated following the review of literature.
The literature leads a researcher to expect a certain relationship.
A hypothesis is a statement that is assumed to be true for the purpose of testing
its validity.
Introduction to Research Methods in Psychology
4.5 TYPES OF HYPOTHESES
As explained earlier, any assumption that you seek to validate through
investigation is called a hypothesis. Hence, theoretically, there should be only
one type of hypothesis on the basis of the investigation, that is, the research
hypothesis. However, because of the conventions in scientific enquiry and the
wording used in the construction of hypotheses, they can be classified into several
types, such as universal hypotheses, existential hypotheses, conceptual hypotheses,
etc. Broadly, there are two categories of hypothesis:
i) Null hypothesis
ii) Alternative hypothesis
Researchers usually cannot make direct observations of every individual in the
population under study. Instead, they collect data from a subset of individuals
(a sample) and use those observations to make inferences about the entire
population.
Sampling unit: Each individual or case that becomes the basis for selecting a
sample is called a sampling unit or sampling element.
Sampling frame: The list of people from which the sample is taken. It should be
comprehensive, complete and up-to-date. Examples of sampling frames: Electoral
Register; Postcode Address File; telephone book.
Self Assessment Questions (Fill in the blanks)
1) Any identifiable and well specified group of individuals is known as
.............................................
2) The list of all the units of the population is called ............................
3) The purpose of sampling is to derive the desired information about the
population at the minimum ..................... and maximum ....................
4) The way the researcher selects the sample is known as .....................
5) ........................... is the miniature picture of the entire group.
Answers: (1) population, (2) sampling frame, (3) cost, reliability,
(4) sampling design, (5) sample.
For example, an investigator may include the students of class X in the research
plan because the class teacher happens to be his/her friend. This illustrates
accidental or convenience sampling.
Quota sampling ensures that some differences are in the sample. In haphazard
sampling, all those interviewed might be of the same age, sex, or background.
But once the quota sampler fixes the categories and the number of cases in each
category, he or she uses haphazard or convenience sampling. Nothing prevents
the researcher from selecting people who act friendly or who want to be
interviewed. Quota sampling is not appropriate when the interviewers choose
whom they like (within the above criteria) and may therefore select those who
are easiest to interview, so sampling bias can take place. Because a random
method is not used, it is impossible to estimate the accuracy of the sample.
Despite these limitations, quota sampling is a popular non-probability method
of sampling because it enables the researcher to introduce a few controls into
the research plan, and it is more convenient and less costly than many other
methods of sampling.
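The procedure described above (fix the categories and counts, then fill them by convenience) can be sketched roughly as follows. This is only an illustration: the function name `quota_sample`, the field name `sex` and the list of passers-by are all hypothetical and do not come from the unit.

```python
from collections import Counter

def quota_sample(people, quotas, key):
    """Fill each quota category with the first people encountered.

    people: list of dicts, in the order the interviewer meets them
    quotas: dict mapping category -> number of cases wanted
    key:    name of the field that holds each person's category
    """
    counts = Counter()
    chosen = []
    for person in people:
        category = person[key]
        # Accept the person only if their category still has room.
        if counts[category] < quotas.get(category, 0):
            chosen.append(person)
            counts[category] += 1
        if all(counts[c] >= n for c, n in quotas.items()):
            break  # every quota is filled
    return chosen

# Hypothetical passers-by, listed in the order they were approached.
passers_by = [
    {"name": "P1", "sex": "F"}, {"name": "P2", "sex": "F"},
    {"name": "P3", "sex": "F"}, {"name": "P4", "sex": "M"},
    {"name": "P5", "sex": "F"}, {"name": "P6", "sex": "M"},
]
sample = quota_sample(passers_by, {"F": 2, "M": 2}, key="sex")
print([p["name"] for p in sample])  # ['P1', 'P2', 'P4', 'P6']
```

Note that no random selection takes place inside a category: whoever is met first is taken, which is why the accuracy of such a sample cannot be estimated.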
For studying attitudes toward any national issue, a sample of journalists, teachers
and legislators may be taken as an example of purposive sampling because they
can more reasonably be expected to represent the correct attitude than other
classes of people residing in the country.
Purposive sampling is somewhat less costly, more readily accessible and more
convenient, and it selects only those individuals that are relevant to the research
design.
v) Systematic sampling
Systematic sampling is another non-probability sampling plan, though
the label ‘systematic’ is somewhat misleading in the sense that all probability
sampling methods are also systematic sampling methods. Because of this, it often
seems that systematic sampling should be included as one category of
probability sampling, but in reality this is not the case.
Despite these advantages, systematic sampling ignores all persons between every
ninth element chosen. Hence it is not a probability sampling plan. In systematic
sampling, sampling error may also arise if the list is arranged in
a particular order.
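As a minimal sketch of the ordering problem mentioned above, the following takes every ninth entry from a list that happens to repeat with the same period. The roster and its labels ("S" for an ordinary student, "M" for a class monitor) are hypothetical and chosen only to make the effect visible.

```python
def systematic_sample(items, k, start=0):
    """Return every k-th item of the list, beginning at index `start`."""
    return items[start::k]

# Hypothetical roster arranged in a repeating order:
# eight ordinary students ("S") followed by one class monitor ("M").
roster = (["S"] * 8 + ["M"]) * 4      # 36 entries

picked = systematic_sample(roster, k=9, start=8)
print(picked)                                   # ['M', 'M', 'M', 'M']
print(systematic_sample(roster, k=9, start=0))  # ['S', 'S', 'S', 'S']
```

Because the sampling interval matches the pattern in the list, the sample contains either only monitors or no monitors at all, depending on the starting point.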
Activity
Make a list of some research studies where some of the non-probability
methods could be used. Also justify the choice of the particular sampling method
you have selected for each study.
A blindfolded person may then be asked to pick up one slip. Here, the probability
of each slip being selected is 1/40. Suppose that after selecting the slip and
noting the name written on it, he returns it to the box. In this case, the
probability of the second slip being selected is again 1/40. But if he does not
return the first slip to the box, the probability of the second slip becomes 1/39.
When an element of the population is returned to the population after being
selected, it is called sampling with replacement and when it is not returned, it is
called sampling without replacement.
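The arithmetic of the slip example can be checked with a short sketch. The box size of 40 is inferred from the probability 1/40 quoted above, and the function name `draw_probability` is hypothetical.

```python
from fractions import Fraction

def draw_probability(population_size, draws_already_made, replace):
    """Probability that any one slip still in the box is picked on the next draw."""
    if replace:
        remaining = population_size                    # box is full again
    else:
        remaining = population_size - draws_already_made
    return Fraction(1, remaining)

N = 40  # slips in the box (assumed from the 1/40 in the example)
print(draw_probability(N, 0, replace=True))    # first draw: 1/40
print(draw_probability(N, 1, replace=True))    # second draw, slip returned: 1/40
print(draw_probability(N, 1, replace=False))   # second draw, slip kept out: 1/39
```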
Thus a random sample may be defined as one drawn in such a way that all possible
samples of a fixed size have an equal probability of being selected.
Advantages of simple random sampling are:
1) Each person has the same chance as any other of being selected in the sample.
2) Simple random sampling serves as a foundation against which other methods
are sometimes evaluated.
3) It is most suitable where the population is relatively small and where the
sampling frame is complete and up-to-date.
4) As the sample size increases, it becomes more representative of the universe.
5) This method is the least costly, and its accuracy is easily assessed.
Despite these advantages, some of the disadvantages are:
1) A complete and up-to-date catalogue of the universe is necessary.
2) A large sample size is required to establish reliability.
3) When the geographical dispersion is wide, studying the sample items
involves greater cost and more time.
4) Unskilled and untrained investigators may produce wrong results.
Activity
In a class of 140 students, select a simple random sample of 20 students
using the with-replacement technique. Also state the probability of each of
the 140 students being included in the sample.
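One possible way to approach this activity on a computer is sketched below. The roll numbers 1 to 140 and the fixed seed are assumptions made only so that the sketch is repeatable; a box of slips or a table of random numbers would serve equally well.

```python
import random

N, n = 140, 20
roll_numbers = list(range(1, N + 1))     # hypothetical roll numbers 1..140

rng = random.Random(4)                   # fixed seed, only for repeatability
sample = rng.choices(roll_numbers, k=n)  # choices() draws WITH replacement

p_each_draw = 1 / N                      # chance a given student is picked on one draw
p_at_least_once = 1 - (1 - 1 / N) ** n   # chance of appearing at least once in 20 draws
print(sample)
print(round(p_each_draw, 4), round(p_at_least_once, 4))
```

With replacement, the chance on every single draw stays at 1/140, and the chance that a particular student appears at least once among the 20 draws works out to roughly 0.13. The same student may appear more than once in the sample.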
Having divided the population into two or more strata, which are considered to
be internally homogeneous, a simple random sample of the desired size is
taken from each stratum. Thus, in stratified random sampling, the
stratification of the population is the first requirement.
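The two steps described above (stratify first, then draw a simple random sample within each stratum) can be sketched as follows. The rural/urban population, the sample sizes and the function name `stratified_sample` are hypothetical and serve only as an illustration.

```python
import random

def stratified_sample(population, stratum_of, n_per_stratum, rng):
    """Split the population into strata, then draw a simple random sample from each."""
    strata = {}
    for unit in population:
        strata.setdefault(stratum_of(unit), []).append(unit)
    sample = {}
    for name, units in strata.items():
        # sample() draws without replacement inside the stratum
        sample[name] = rng.sample(units, n_per_stratum[name])
    return sample

# Hypothetical population: 60 rural and 40 urban respondents.
population = [("rural", i) for i in range(60)] + [("urban", i) for i in range(40)]
rng = random.Random(0)
result = stratified_sample(population, lambda u: u[0], {"rural": 6, "urban": 4}, rng)
print({name: len(units) for name, units in result.items()})  # {'rural': 6, 'urban': 4}
```

Here the sample sizes are proportional to the stratum sizes (10 per cent of each), which is one common choice; other allocations are possible.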
There can be many reasons for stratification in a population.
Two of them are:
1) Stratification tends to increase the precision in estimating the attributes of
the whole population.
2) Stratification gives some convenience in sampling. When the population is
divided into several units, a person or group of persons may be deputed to
supervise the sampling survey in each unit.
Advantages of stratified random sampling are:
1) Stratified sampling is more representative of the population because the
formation of strata and the random selection of items from each stratum make
it hard to exclude any stratum of the universe, which increases the sample’s
representativeness of the population or universe.
2) It is more precise and avoids bias to a great extent.
3) It saves time and cost of data collection since the sample size can be smaller
with this method.
Despite these advantages, some of the disadvantages of stratified sampling are:
1) Improper stratification may cause wrong results.
2) Greater geographical concentration may result in heavy cost and more time.
3) Trained investigators are required for stratification.
iii) Cluster sampling
Cluster sampling is a type of random sampling that uses multiple stages and is
often used to cover wide geographic areas: aggregated units (clusters) are
randomly selected, and then samples are drawn from the selected clusters.
For example, suppose the investigator wants to survey some aspect of 3rd grade
elementary school children. First, a random sample of states from the country
would be selected. Next, within each selected state, a random selection of a
certain number of districts would be made. Then, within each selected district, a
random selection of a certain number of elementary schools would be made. Finally,
within each selected elementary school, a certain number of children would be
randomly selected. Because each level is randomly sampled, the final sample is
random. However, the selection of samples is done in different stages. This is
also called multi-stage sampling.
This sampling method is more flexible than the other methods. Sub-division into
second-stage units need be carried out only for those units selected in the first
stage. Despite these merits, this sampling method is less accurate than a
single-stage sample containing the same number of units.
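The state, district, school and child example can be sketched as a small recursive routine. The nested frame, the numbers chosen at each stage and the function name `multistage_sample` are hypothetical; the point is only that lower levels are opened up for the units actually selected at the stage above.

```python
import random

def multistage_sample(tree, sizes, rng):
    """Randomly pick units level by level; only picked units are opened further.

    tree:  nested dicts ending in a list of final units (children)
    sizes: how many units to pick at each level, top level first
    """
    if not sizes:
        return []
    if isinstance(tree, list):                   # final stage: pick children
        return rng.sample(tree, sizes[0])
    picked = rng.sample(sorted(tree), sizes[0])  # pick states, districts or schools
    out = []
    for name in picked:
        out.extend(multistage_sample(tree[name], sizes[1:], rng))
    return out

# Hypothetical frame: 3 states x 2 districts x 2 schools x 5 children.
frame = {
    f"state{s}": {
        f"district{d}": {
            f"school{k}": [f"s{s}d{d}k{k}c{c}" for c in range(5)]
            for k in range(2)
        }
        for d in range(2)
    }
    for s in range(3)
}
rng = random.Random(1)
# 2 states, then 1 district per state, 1 school per district, 3 children per school.
children = multistage_sample(frame, [2, 1, 1, 3], rng)
print(len(children))  # 6
```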
Self Assessment Questions
1) Non-probability sampling is one in which there is a way of assessing the
probability of the element or group of elements of the population being
included in the sample. T/F
2) Simple random sampling is the core technique and attaches equal
probability to each unit of the population to be selected. T/F
3) The cluster sampling method is sometimes known as the multi-stage sampling
method. T/F
4) Snowball technique is a probability sampling method. T/F
5) Stratified sampling is more representative for the population than other
methods. T/F
Answer: (1) F, (2) T, (3) T, (4) F, (5) T.
The three main advantages of sampling are that the cost is lower, data collection
is faster, and, since the data set is smaller, it is possible to ensure homogeneity
and to improve the accuracy and quality of the data (Ader, Mellenbergh & Hand, 2008).
Researchers rarely survey the entire population for two reasons: the cost is too
high, and the population is dynamic in that the individuals making up the
population may change over time. Sampling methods are of two types, i.e.
non-probability and probability sampling methods. Probability sampling methods
are those in which some probability is attached to each unit of the population
being included in the sample, and such samples are more representative. Three
probability sampling methods are discussed: simple random sampling, stratified
random sampling and cluster / multi-stage sampling. The non-probability sampling
methods discussed are convenience sampling, quota sampling, purposive sampling,
snowball sampling and systematic sampling. These methods are also used, but
the samples they yield lack representative character.
4.13 GLOSSARY
Hypothesis : A tentative and testable statement of a potential
relationship between two or more variables.
Null hypothesis : The hypothesis that is of no scientific interest;
sometimes the hypothesis of no difference.
Alternative hypothesis : Statistical term for the research hypothesis; it
specifies the values that the researcher believes
to hold true.
Population : It is the aggregate from which a sample is drawn.
In statistics, it refers to any specified collection of
objects, people, organisations, etc.
Population size : It is the total number of units present in the
population.
Sampling units : They are members of the population.
Sampling frame : It is the list of all the units of population.
Sampling design : It is a definite plan for obtaining a sample from a
given population.
Sample size : It is the total number of units in the sample.
Simple random sample : It is a sample in which each unit of the population
has an equal chance of being selected in the
sample.