Marketing Research-B360. Notes.
Marketing Research-B360. Notes.
SCHOOL OF BUSINESS
(LECTURE NOTES)
ACKNOWLEDGEMENT
These notes are based on and developed from materials in the following texts:
RESEARCH
Marketing decisions are made with some objectives in mind. Many marketing objectives
derive from corporate objectives. A typical marketing objective might be to increase
market share, which might be derived from a corporate objective of growth. Assuming
that our marketing manager‟s objective is to increase market share, he/she must then turn
to the task of identifying the various ways of attaining increasing market share. Possible
alternatives might be to offer his products at a lower price than his competitors, to
improve the quality of his product or to carry out increased advertising of his product. He
then will attach some benefits or costs to each of these alternatives. He is then faced with
the dilemma as to which options to go for. This dilemma is rooted in his ignorance or
uncertainty about the efficacy of any of the alternatives. His selection among the choices
will constitute a decision, and it will be based on some rationalization ad judgment as to
which of the options is optimal. Implicit in decision making is that the decision maker
makes an informed decision given the uncertainty associated with each alternative course
of action.
The McCarthy and Perreault model, illustrated in figure 1, holds that the customer is at
the centre of marketing activity. The first challenge a marketing manager faces, however,
is that
Figure 1: task of marketing management
Political/ Legal
Price
Competitive/
Customer
Cultural/ Social Place Product Technological
Promotion
Economic
a customer‟s needs and wants are not precisely known. Logically, if marketing effort is to
be effective, it must be targeted at known needs and wants. Much decision making in
marketing is consequently devoted to the task of identifying customer‟s needs and wants.
The role marketing research is to provide marketing managers with information that
enables them make informed decisions in identifying more accurately the needs and
wants of their customers or prospects. Much research is devoted to determining
customers‟ needs/ wants. Such research has to do with opportunity identification: is there
a need out there in the market which is not satisfied? Is there a better way of doing what
we are doing? What competitive advantage do other firms have? Will our product be
acceptable in the market place?
Another area of uncertainty in need identification relates to consumer preferences. While
needs are basic and easily predictable, wants are much more difficulty to ascertain
because they are conditioned by an individual‟s preference. Preference, in turn, depend
on a number of factors, such as, income, taste, background, social standing etc. how each
of these factors, or a combination of them, influence individual choice is not a matter that
can be easily ascertained. Managers need information that will help understand not only
what buyers need but how they go about exercising their choice. Thus a miller of maize
does not make the kind of decisions that a manager of the restaurant has to make, because
the demand for maize meal relates to satisfying a need which is largely homogenous and
one which is satisfied all too often in the same way. In contrast, a patronage of the
restaurant is relatively more complicated: the habit of eating at the restaurant is not as
widely spread as eating maize meal. In our tradition a Zambians, eating at a restaurant
may be alien and is probably an adopted habit arising out of social standing in an urban
setting or job demands. And even if a Zambian should find his or her way into a
restaurant, what they choose to eat there may not be as easy to predict as what they eat in
their homes. Thus, for a restaurant manager, areas of uncertainty stem from lack of
knowledge about demographic, ethnic, or economic profile of his customers. Customer‟s
individual tastes, price and service levels preferred by customers, location and
appearance of the restaurant, image, and reputation, etc. the point is that preferences are
determined by forces that are not easily discernible to marketers, and hence the need for
research aimed at assisting managers make proper decisions as to who needs what.
The McCarty and Perreault framework further holds that the marketing manager
manipulates elements of the marketing mix, namely Price, Product, Place and Promotion,
in order to put into effect a marketing program. The marketing manager faces a problem
in that he does not know or is uncertain about the impact of manipulating any one of or
all the element of the marketing mix. For example, while general economic theory holds
that price is inversely related to demand, the concept of price inelasticity holds that in
some instances demand may not be responsive to price changes as predicted by the
general economic theory. Indeed, customers react in different ways to price changes. For
the marketing manager, there is consequently some uncertainty surrounding the
effectiveness of manipulating price to induce a desired demand levels. Similarly, there is
no guarantee that programmed product, distribution or promotion strategies will lead to
desired outcomes. Since the manager is faced with uncertainty as regards the
consequences of his manipulation of any, or all elements of the marketing mix, marketing
research plays the role of reducing uncertainty by providing such information as will
enable a manager more accurately predict the effect of his marketing program.
A marketing manager must finally deal with a third area of uncertainty occasioned with
uncontrollable factors obtaining in the environment in which a firm operates, and whose
impact on the firm‟s marketing activities cannot be determined with certainty. These
factors comprise technological, political, economical, competitive, legal, and culture and
social developments. Because these factors are external to the firm, their impact, whether
positive or negative cannot be easily ascertained, and it is therefore the role of marketing
research to provide information which will hopefully make the marketing manager not
only aware of them but also make informed decision regarding their impact on marketing
operations. Consider, for example, how the opportunities and threats created by the
following external events:
In the pre-liberalization era, many companies operated as monopolies. Because
there was no threat of competition, such companies became somewhat
complacent in the area of product development and enhanced product quality.
With the advent of liberalization, there was need for some re-orientation to cope
with the threat posed by competition. Many local manufacturers and service
providers that had enjoyed protection from competition in pre-liberalization era
has to contend with a customer who had now a wider choice of goods or services
and prudently sought better quality product or price bargains. Companies that
could not meet customer expectation lost ground in market share and profits. The
collapse of many companies in the post liberalization era was due, in large
measure, to their lack of foresight in anticipating the threats to their survival
which loomed into the horizon as a country inched its way towards liberalization
and attendant economic development.
The enactment of the competition and fair trading act of 1995 further reinforced
the spirit of competition that was implicit in the policy of liberalization. Through
this piece of legislation, competition was legally encouraged, or behavior that
undermined competition was prohibited. Specifically, no firm could enjoy a
dominant position such that it could have power to arbitrary determine price and
control production or distribution; firms were prohibited from such collusion as
would otherwise enable share amongst themselves the domestic market, fix
prices, or reduce production; mergers or acquisition intended to kill, restrict or
distort competition were forbidden and; selective distribution which resulted in
restricting access to a commodity was forbidden. Further, in order to protect and
promote consumer welfare, misleading or deceptive conduct and disclaiming
liability for defective goods were prohibited.
The rise of consumerism and corporate social responsibility in Zambia demands
that, in addition to profit consideration, firms just their decisions from a moral and
ethical perspective. Increase in the price of energy and utilities are constantly
being subjected to public scrutiny in spite of the economic rationalizations from
the service providers
Churchill and Lacobucci (2002:8) propose that an alternative way of looking at the role
of marketing research is to examine how management might use it. They suggest that
when marketing research is used for planning, it serves to help managers identify
opportunities that can be taken advantaged of and threats which can be avoided; for
instance, in identifying unexploited markets or alerting management to unfavorable
trends in the market place. When it is used for problem solving, it focuses on the short or
long term strategies in the application of the four elements of the marketing mix. The
issues which marketing research can be used to address include: how should a product be
packaged, what price should be packaged for a product, where, and by whom, should the
product be sold: or how much should be spent on advertising and promotion? Finally,
control oriented marketing research assists managers to become aware of dangers and be
informed about their operations. Questions that might be asked include: what is our
market share by region or by customer type? Are customers satisfied with our service?
How is our company, product or employees perceived?
“the function which links the consumer, customer, and the public to the marketer
through information-information which is used to identify and define the
marketing opportunities and problems; generate, refine and evaluate marketing
actions; monitor marketing performance; and improve our understanding of
marketing as a process.” (Churchill et al; 2002:6)
Of note in this definition is the centrality of information and its application to decision
making
Marketing Manager
There are a number of organizations that provide research to a firm. These comprise:
Syndicated services
These refer to firms that routinely collect information that they provide to firms
subscribing to their services.
Universities
Universities and research institutions usually provide research and consultancy
services in a variety of disciplines.
INFORMATION SYSTEM
Guidance
Decision
Info
Request
Recurrent Info Marketing
Marketing Monitoring Info Information
Managers Requested Info System
Feedback
In short, the principal focus in MIS is to anticipate
What decision each decision maker will regularly make
What information is needed to make these decisions
How often this information should be received.
The advent and development of marketing decision support system arose because of
certain shortcomings of marketing information systems. One shortcoming of an
information system is that it is premised on a standardized or structured information need
package which a typical manager need to make decisions. Most problems encountered
are, however, not structured, requiring structured solutions. There are thus in practice
very few report format that are optimal for different users. Decisions are compromises
requiring flexibility, and decision making is unstructured, with different managers using
different factors and in different ways to arrive at the decision. The second weakness of
the marketing information system is that there is often resistance to changes which are
implicit and inevitable in analyzing, designing, developing and implementing a dynamic
marketing information system. Managers may be reluctant, or simply not know, what
factors they ought to take into account in making a decision about a particular issue.
Thirdly, given a dynamic nature of business and the marketing environment, coupled
with the time required to develop an effective marketing information system, a marketing
information system may not respond in time to issues of the day.
The essential features of the marketing decision support system are that the information
gathered fro business and the environment forms the basis for marketing actions, and that
decision makers or managers are allowed to interact directly with the data and models. In
this regard, its components are:
Computer
System Manager B
Responses
Model Manager C
System
Model System
This refers to routines that allow a decision-maker to manipulate the data into a
form that will be useful to decision-making.
The routines for manipulating the data are usually in the form of the statistical or
mathematical statements and may range fro basic operations to complex analysis
of the presumed relationship between two or more variables
Interface
This refers to the software that allows the manager to interact with the data and
the model.
It is the language system which permits managers with little or no programming
skills to access and explore the database and, using the model system, enable
them to produce information I formats that meets their information needs.
It is this singular feature which distinguishes an information system from a
decision support system.
Managers
The managers, who must make decisions by using information, represent the most
important aspect of the marketing decision support system. If they fail to use it
(properly), the system will be of no value
Among other things, managers must understand and accept the models involved,
and how to do the relevant analyses.
It is also preferable that they are involved in all aspects of the design of the
system.
2. Research Design
A research design is a framework, blueprint, or plan for the collection and
analysis of data. A research design is anchored on how much is known about the
problem being studied. There are three research designs: exploratory, descriptive
and experimental or causal research design. An exploratory research design is
used when there is little or no information about the problem being studied. Its
thrust then is to gain some insight into the problem. In contrast, descriptive
research designs are used when there is some knowledge about the problems and
the variables which relate to it. The thrust of the descriptive research design is to
provide an accurate and detailed description of the research problem. In a causal
or experimental research design, the thrust is to examine the functional
relationship of the variables in the problem model. Most research designs under
this category examine a causal relationship between variables; for example, the
relationship between sales and advertising.
4. Sample Design
Ideally, information should be collected by a census, that is, from the entire
population of interest. In reality, this is not possible. A researcher is then
compelled to collect information from only a part of the target population. The
process of collecting information from a part of, rather than the entire, universe is
known as sampling. To base decision on sample data requires validation of the
sample information. Such validation is done by addressing concerns about the
method used to select the chosen sample and adequacy of the size of the sample.
5. Data Collection
The actual collection of data addresses a number of questions. Not only must the
researcher administer matters to do with recruitment, training and deployment of
staff in the field, he/she must also deal with the greater problem of how to
mitigate errors which can potentially emanate from fieldwork. Such errors include
missing data occasioned by respondents who may be unavailable or unwilling to
answer questions; errors arising from the interviewer-interviewee interaction bias;
errors in recording responses; and outright cheating by interviewers.
Conclusion
The marketing of goods and services necessitates decision-making which enhances
customer satisfaction o a sustainable basis. The uncertainty and complexity of decision
making requires that marketing managers avails themselves of information that will
optimize their decision-making. This is the principal function and role of marketing
research. Marketing managers need information to make critical decisions in relation to
what need and wants to satisfy, how best to satisfy the identified needs and wants, what
new opportunities and threats are emerging in the external environment and what should
be the strategic response to identified environmental trends.
UNIT THREE
A decision problem can be said to exist when a situation obtains which represents a
problem or an opportunity such that a decision maker is required to take some action.
Churchill and Lacobucci (2202) have argued that research problems can arise from three
sources.
The first source is the internal and external environment in which a firm operates. The
external environment, involving technological, economic, political, legal and social
developments, may present opportunities which can be exploited, or problems which may
require some remedial action. New technologies in information and communication, for
example, have created immerse opportunities for better and more efficient ways of
conducting business. Political change from a one party system to multiparty democracy
heralded economic liberalization and privatization, with attendant opportunities and
problems. Economic liberalization, for instance, created the opportunity for expanded
markets but concomitantly presented problems of competition for local businesses which
hence forth ad to compete not only among themselves but also against foreign firms and
imports. The atmosphere of competition was further boosted by the enactment of the
Zambia Fair Trading and Competition Act in 1995. The principal aim of this act was to
encourage competition or to discourage behaviors which restricted, killed or otherwise
undermined competition. To this end, the Act, among other things, seeks to curb
monopolistic tendencies by prohibiting firms from enjoying a dominant position that
would give them power to determine prices or control production and distribution of
goods; from collusion that would enable them to share the domestic market; and from
effecting mergers and acquisition that would restrict, kill or distort competition
A third source of opportunity and problem is a myriad of ideas which come to the
attention of the decision-maker by chance and in a random manner. Such ideas include
complaints from customers who are dissatisfied, suggestions for improvements, reports
by sale people, and deliberation and exchange of ideas at business conferences.
a) An objective function
A problem exists when that which is desired cannot be realized. The objective of the
marketing manager is to maximize sales, and when it cannot be realized, the manager
experiences a problem. To illustrate the point, assume that the management of
FRESCO, a soft drink company, wishes to introduce a new brand of a soft drink as
part of its growth strategy. The objective function of management therefore is to seek
more sales by introducing a new brand. It can be reasonably assumed that the firm is
not entirely satisfied with its current level of sales and consequently wishes to take
some action about this state of affairs.
b) Alternative Courses of Action
A second feature of a decision situation is that there are at least two courses of action
a decision maker can follow, but a decision maker is not certain about which course
of action to follow. In our example, the options open to management are to introduce
the new brand or not to introduce the new brand. Research can assist management by
clarifying the uncertainty about which option to follow.
To answer the first question, management will be guided by their knowledge and
experience of the probable consequences of introducing or not introducing the new
brand. These consequences are known as “payoffs”. To go back to our example of
Fresco, let us assume that, on the basis of their knowledge and experience,
management feels that the introduction of the new brand will led to any of the three
possible outcomes; there could be a highly receptive to the introduction; there could
be an average reception; and there could be poor reaction. These possible reactions
from the market have attendant implications of realizable market share. They are
called states of nature because they are beyond the control of management.
Management must live with the risk of any of them occurring. Let us assume further
that management believes that if the market is highly receptive to the new brand, the
firm‟s market share will increase by more than 5 percent and profit will increase by
K5Million. If the new brand reception will be average, management believes that
market share will be between 1 and 5 percent and profits will rise by K3 million. If
the reception to the new brand will be poor, management believes that market share
will be less than 1 percent and the firm will incur a loss of K5 million. Because
market reduction to the new brand is beyond the control of the firm, management has
estimated the likelihood (probability) of any of these outcomes occurring. The
payoff table shows the respective consequences of each course of action given each
state of nature. The table shows that the likelihood of highly receptive, average and
poor consumer reactions are 0.30, 0.50 and 0.20 respectively.
States of Nature
Highly Receptive Average Poor
(+5%) (1-5%) (<1%)
Decision Option Prob=0.30 Prob=0.50 Prob=0.20
Introduce K500,000 K300,000 K-500,000
Do not Introduce 0 0 0
If management chooses not to carry out research, the optimal decision is to introduce the
new brand.
If management has perfect information, that is, if all uncertainty could be eliminated, the
correct decision could be made depending upon which state of nature existed. Thus, as
indicated in table 3.2, the option to introduce the new brand would be selected if
management was certain of a highly receptive or average reaction against the decision not
to introduce the new brand since the latter option yield nothing. On the other hand, if
management was certain of the poor reaction to the introduction of the new brand, the
selected option would be not to introduce the new brand because of the potential loss of
K5 million.
Table 3.2: Decision under uncertainty
States of nature
Pay off Decision
Highly receptive K500, 000 Introduce
Average K300, 000 Introduce
Poor -K500, 000 Do not introduce
The expected value of a decision under certainty can be computed by multiplying the
reward or penalty by the probability of being told for certain a particular consumer
reaction would obtain and summing:
EV (decision under certainty): (500, 000x0.3) + (300, 000x0.5) + (0x0.2) = 300, 000
The expected value of perfect information is the difference between the expected value of
decision under certainty and the expected value of optimal decision under uncertainty:
The significance of the value of perfect information is that it identifies an upper limit on
the amount that might be spent on research. In other words, the cost of anticipated
research should not exceed the expected value of perfect information.
Assume that there are two possible samples of information that indicate the likelihood of
how consumers are likely to react to the new brand. These are shown in tables 3.4 and
3.5. They indicate conditional probabilities of getting a particular test market result given
each state of nature. For instance, in the case of project A, the table is read as follows: if
the true state of nature is high reception for the new brand, the probability that the
research finding (test market result) will indicate a test market result of over 5% or highly
receptive reception is 0.60; if on the other hand the true state of nature are average
reception or poor reception, the respective probabilities of getting a test market result of
over 5% market share are 0.20 and 0.10. in considering whether or not to use sample
information, management will want to estimate how the new information will change the
new probabilities. For example, if the research project shows that the new brand will
attain a 4% market share, how should the prior probabilities of 0.30, 0.50 and 0.20 for
highly receptive, average and poor reaction respectively change? The premise here is
that the research finding will approximate the true state of consumer reaction. That
is, if the test market result shows a market share of over 5%, the true consumer
reaction to the new brand is most probably highly receptive.
PROJECT A
States of Nature
Test Result of Highly Receptive Average Reception Poor Reception
market share
Over 5 percent 0.60 0.20 0.10
1-5 percent 0.30 0.50 0.20
Below 1 percent 0.10 0.30 0.70
PROJECT B
States of Nature
Test Result of Highly Receptive Average Reception Poor Reception
market share
Over 5 percent 0.70 0.15 0.05
1-5 percent 0.25 0.60 0.15
Below 1 percent 0.05 0.25 0.80
Management and the researcher are required to estimate the conditional probabilities.
Similarly, the probability of research outcomes of 5% and 1-5% market share occurring
are 0.30 and 0.38 respectively.
Revising prior probabilities
Management estimated prior probabilities of 0.30, 0.50 and 0.20 for the existence of the
three consumer reaction-highly receptive, average and poor respectively. After research
results are obtained, management will use them to revise these prior probabilities. For
example, given tat management had assigned a probability of 0.30 for highly receptive
consumer reaction, how might this initial probability change if the research outcome
shows that the consumer reaction resulted in a market share increase of more than 5%, 1-
5% or a decrease in market share to below 1%? The revised probability indicates the
probability of the true state of nature (consumer reaction) in the light of research
outcome. Table 3.5, for example is read as follows: if the research outcome is the
consumer reaction of between 1 and 5% market share, the probability that the true state of
nature is average reaction is 0.66, that the true state of nature is poor is 0.10 and that the
true state of nature is highly receptive is 0.24. These revised probabilities are obtained by
dividing the probability of occurrence of a research outcome by the joint probability. This
calculation is illustrated in table 3.7
Table 3.7: probability of a given state of nature being the true state if a
given research outcome is obtained
POSSIBLE RESEARCH OUTCOMES
STATE OF NATURE OVER 5% 1-5% LESS THAN 1%
HIGHLY RECEPTIVE 0.18÷ 0.30=0.60 0.09 ÷ 0.38=0.24 0.03 ÷ 0.32=0.09
AVERAGE RECEPTION 0.10 ÷ 0.30=0.33 0.25 ÷ 0.38=0.66 0.15 ÷ 0.32=0.47
POOR RECEPTION 0.02 ÷ 0.30=0.07 0.04 ÷ 0.38=0.10 0.14 ÷0.32=0.44
Once a research problem is specified and understood, the next step in t research process is
the formulation of the plan of study for the collection and analysis of data. This plan of
study is known as the research design. It is intended to guide research efforts to ensure
that study is relevant and carried out as cheaply as possible.
A useful way to classify research design is to focus on the principal objectives of the
research. A research design can be exploratory, descriptive or causal.
Because the dimensions of the research study are not precisely known and because
knowledge about the problem is lacking, an exploratory study is characterized with
flexibility. The researcher must sniff around for information, relying on imagination,
ingenuity, judgement or mother luck. In this regard, there are three primary sources of
information. One source of information involves literature search comprising conceptual
literature, trade literature, published data and previous published studies. Information can
also be obtained from key informants, also known as experience surveys. These are
individuals deemed to have some knowledge about the problem at hand. Finally, one may
have recourse to an intensive analysis of a few selected cases.
In contrast to an exploratory study, there will be some knowledge about the problem at
hand in a descriptive study. The thrust of the study is to get more detailed description of
the problem variables. The format for the collection and the analysis of data tends to be
structured and more rigid than in an exploratory research design. As an illustration, we
might e interested in how the purchase of the products varies among people with the
certain level of education, and we might develop the following framework, called a
dummy table, to study the possible relationship between newspaper readership and the
level of education. Such a profile would reveal, for example, which newspaper is
preferred by people of a particular level of education. Of note in such a framework is that
there is some idea of the kind and structure of information required. What a descriptive
research design attempts to do is to provide more detailed information.
Table 4.1: Readership of Newspaper by Education
Level of Education Times Post Mail Total
Up to Grade VII 73 77 75 225
Grade VII to Grade XII 90 20 40 150
University Graduate 40 50 35 125
203 147 150 500
There are three source of information in descriptive studies. One source is the use of
panels. A panel is a select group of people from whom information is sought on a
continuous basis. A major advantage of panel data is that organizational costs are
minimized as information may be sought from the same group of people, albeit over a
number of issues. A second advantage is that one is able to achieve consistency and make
comparisons over time. However, panel interviews suffer from not being representative.
There may also be costs of trying to keep the panel together.
A third source of information is through an in-depth study of few selected cases. This is
known as case study approach. In contrast to a survey, a case study is narrower, intensive
and clearly not representative. However, its advantage over a survey approach is that it
offers in-depth knowledge about the subject by narrowing the scope to a few cases. In a
case study, one attempts to identify factors which are common, or features which are
common to a subgroup, or feature which are unique to a specific case. Sharp contrast can
be revealed better in a case study than in a survey study. A survey study, on the other
hand, aims at providing a general idea of some phenomenon.
A Causal Research Design
A major limitation of the descriptive studies is that they cannot satisfactorily establish
causality, in the sense that X can be said to be the probable cause of Y. causal or
experimental designs are intended to explore cause and effect relationships.
It is apt that the common sense notion of the concept of causality is distinguished from
scientific notion. The following three distinctions are made:
A major implication arising from the scientific notion of causality is not error-free and a
causal relationship should not be stated as if the relationship has been exhausted, is
certain or proven beyond doubt. Causal relationship cannot therefore be conclusively
demonstrated. Nevertheless, inferences of causality can be made on the basis of three
types of evidence:
Experimentation
Experiments are mechanisms for providing evidence of causality. They allow one to
check for all the three types evidence of causality.
The essence of an experiment is for the researcher to manipulate one variable, called the
independent or causal variable, in order to observe the effect of the manipulation on
another variable, called the dependent or effect variable.
Experiments can be of two types. A laboratory experiment is were the setting of the
experimental situation is contrived, or determined by the researcher. A field experiment is
where the situation is a natural one. Consider a situation in which a researcher wishes to
determine whether or not the demand for ice cream is influenced by flavour. In a
laboratory experiment, the researcher could gather, say, children and ask each one of
them to pick a flavour of ice cream they preferred. He could then determine the most
popular flavour from the selections made by the children and infer causality from the
observations made. The researcher contrives the setting for this experiment. In contrast,
in the field experiment, would not have control of, for instance the test units or the timing
of the experiment. Any individual could conceivably buy ice cream as they went on
doing their normal shopping. The purchase of ice cream would not be dictated by the
researcher but by natural events. Causality would thus be inferred under natural
conditions.
In our ice cream example, if the experiment were confined to children by refusing adults
to take part, the researcher would be confident in making a statement that inferred that a
particular flavour of ice cream is most popular among children who took part in the
experiment. He would not be correct to extend his conclusion to beyond the spheres of
the experiment. On the other hand, if every one who came along could buy ice cream of
their choice in a natural course of events, it could be reasonably inferred that the flavour
that was found to be popular in the experiment was equally popular beyond the confines
of the experiment. Nevertheless, in a field experiment, there is a possibility that other
factors, unknown to, or beyond the control of the researcher, may have contributed to the
observed effect. Such factors, if known to the researcher, can be controlled for in a
laboratory experiment but not in a field experiment.
Extraneous Factors
Extraneous factors are events that cannot be manipulated by the research, which
nevertheless have an effect on the variable to be observed. The following are some
common examples of extraneous factors.
History
History refers to events outside the spheres of the experiment but occurring at the same
time as the experimental variable. Sales at Christmas cannot be attributed to advertising
alone because there might be other factors which might spur demand, such as a Christmas
bonus or the festive nature of the occasion.
Maturation
Maturation refers to the changes that occur within the test units during an experiment due
to the passage of time. Where test units are people, maturation refers to the fact that
people get old, get tired and perhaps feel hungry. Responses to the experimental variable
may be affected, for example by hunger or fatigue.
Testing
Testing refers to the changes induced by the very process of experimentation. The testing
effect can be of two types. The first type, known as the main testing effect, is when the
earlier response to an experimental variable conditions subsequent responses. This can
happen, for instance, when earlier responses are used as reference points in responding to
an experimental variable. An example might be a panel of housewives requested by the
company to record monthly expenditure on an item. In their desire to be consistent,
housewives might be influenced in their subsequent answers by the answers they gave
earlier. A second type, known as the interactive testing effect, is where a prior response
affects a subsequent response to an experimental variable. For instance, if some one is
introduced to some product, they are likely to be sensitized and will become more aware
of subsequent advertisement of the product because of the initial alertness that was
created. The reaction to the advertisement is amplified by the prior exposure.
Instrument Variation
Refers to changes that are induced by variation in the measurement instrument or in the
administration of the instrument. Variation in responses may occur, for instance, when
two interviewers are used instead of one.
Statistical Regression
Refers to the changes brought about by the tendency of polar points to move towards te
average during the course of an experiment. In a panel interview, for instance,
interviewers may start off with an initial position on an issue and shift from the initial
position to another position in the course of the interview.
Selection Bias
Most experimental designs make use of an experimental group and a control group.
Selection bias is said to occur when there is no way of certifying that the test unit selected
and assigned to groups were equal prior to the selection and assignment to groups.
Inherent inequality among test units may lead to bias in a way test units responds to the
experimental variable. To minimize the incidence of selection bias, there should be
random selection and assignment to groups.
Experimental Mortality
Refers to effect caused by loss of test units during the course of an experiment. If the
experiment involves one group, mortality occurs when respondents that remain in the
study differ in their responses to the experimental variable from those who withdraw. The
question then is; would the answers have been the same if test unit had remained the
same?
If there are two groups or more, mortality refers to errors caused by differences in
responses due to the loss of respondents from the various groups in the experiment.
Reactive Error
Changes caused by the artificiality of the experimental situation ad/ or the behaviour of
the experimenter. Reactive error manifests itself by way of dampening, emphasizing or
altering the effects of the experimental variables on the test units. Reactive error arises
principally because of the tendency of human treatments to anticipate what the
experimental situation is all about and the attempt to produce the anticipated behaviour.
An example of reactive error is the exaggerated behaviour one might notice from models
at a fashion show.
X O
After-only designs do not adequately control for the effects of other extraneous
factors such as history, maturation, selection and mortality.
b) “Before-After” Design
In this design, a prior measurement is made before the experimental variable is
administered. After a prior measurement is taken, an experimental variable is
introduced and a post-measurement is taken. The impact of the experimental variable
is taken to be the difference between the prior measurement and post-measurement.
01 : Pre-measurement or prior observation
X : Experimental variable introduced
02 : Post-measurement or after observation
02 - 01 : Effect of experimental variable
Although this design is an improvement on the after-only design, it does not control
for the effect of extraneous factors of history, maturation, testing, mortality or
instrument variation.
04 – 02 = Extraneous factors
The only extraneous factor not controlled for in this design is the interactive testing
unit. This is because of the prior measurement taken in the experimental group. The
composition of the difference can now be refined to read as follows:
04 – 02 = Extraneous factors
01 02
X X
03 04 05 06
EG2 CG2
Before measurement
Experimental Variable X
After measurement 05 - 06
Effect of experimental variable 05 - 06
The extraneous variables are controlled for by randomizing and assuming therefore
that they equally affect the two groups and cancel each other. The interactive testing
effect is controlled for by not taking pre-measurements
f) Time Series Experiment
This type of design can be illustrated as follows:
01 02 03 X 04 05 06
As can be illustrated, the time series experiment requires that the researcher has
access to the same test units. What is envisaged is that a series of measurement are
taken prior to the introduction of an experimental variable. These initial
measurements are intended to indicate a pattern in the behaviour being measured.
Then, an experimental variable is introduced and therefore a series of measurements
are again taken. The difference in patterns between the two series of measurement is
ten said to have been due to the introduction of the experimental variable.
Sales
X
Jan Feb Mar Apr May Jun Jul
Obviously, if the pattern changes before the introduction of X, or does not change
despite the introduction of X, we can reasonably infer tat X has no relationship to the
pattern of sales.
A weakness of time-series experimentation is that it cannot control for extraneous
factors such as history.
UNIT FIVE
RESEARCH INFORMATION
We have thus far examined two topics, namely, problem formulation and research
designs. We now turn to discussion of (a) types and sources of research information
and (b) methods of collecting research information.
TYPES OF DATA
There are basically two types of data. The first type is secondary data and the second
type is known as primary data. Secondary data refers to information collected for
some other purpose other than the problem at hand. Primary data on the other hand
refers to information collected specifically for the study at hand. Given a research
problem, it is tempting to rush into survey where one has to generate primary data.
However, it is advisable to check first whether or not information already exists on
the problem to be studied.
Speed
Secondary data can be obtained far more quickly than primary data. If data
already exists, in published form and/or in a central place, it can be obtained at the
fraction of the time one would spend in generating fresh information.
Accessibility
Some secondary data sources provide information which could not be obtained by
a typical organization. For instance, information provided by government sources
is not easily obtainable if a non government source were to solicit the information.
Inaccuracy
Secondary data may not be accurate. Again, this arises principally because the data
was collected for some other purpose and may have been processed by several
intermediaries to suit their interests. The more intermediaries there are the greater the
likelihood of data manipulation. It is therefore to use a primary source rather than a
secondary source. Since secondary data are derived from some other source, it
follows then that there is less likelihood of error with a primary source than with a
secondary source. If data are collected from a source which originally gathered them,
one is afforded an appreciation and explanation of the context or the explanation of
how the data were collected. In contrast, secondary sources do not afford one the
opportunity of such an explanation. Accuracy of secondary data may also be
compromised by the purpose for which it was collected. One should be wary, for
instance, of data published for propaganda purposes, of information contained in an
advertisement or promotional campaign, data published anonymously or in
controversial circumstances, or information used in advancing the interests of an
industrial or commercial or any other group.
Finally, one may assess the accuracy of secondary data by examining the quality of
the data itself. A useful guideline is to look at the ability and reputation of the source
that generated the data and the methodology used in generating the data.
Accounting and financial reports can be supplemented by reports from other departments.
The marketing department might provide some useful information on advertising
expenditure and the effectiveness of advertising in generating sales; the human resource
department might provide pertinent information on employees; and production might
have records on production outputs and levels of productivity.
1. Computerized Databases
These are basically information banks consisting of data and/or textual
information which are made available in computer reader form for electronic
distribution. Of particular interest to marketing studies are bibliographic and
numerical databases which appear in form of articles or reports. Numeric
databases contain numeric data on topics such as sales, population, etc.
3. Government Agencies
Examples here include the Central Statistical Office, Government printers,
Government Gazette and occasional publication by ministries. These sources
carry information o population, income, housing, agriculture, industrial and
commercial sales, employment, etc.
4. Syndicated Services
A wide array of data on both consumer and industrial markets is collected and
sold by commercial organizations. The information sought comprises retail sales,
household purchases, media usage, attitudes, knowledge and behaviours.
5. Directories
Directories are a useful source of information on topics of study. One could
conceivably have a “Directory of Manufacturers in Zambia”, a “Directory of
Telephone Subscribers”, or the “Business Telephone Directory” – yellow pages.
Directories are commonly found in libraries.
7. External Experts
These are individuals outside the research firm whose job provides them with
expertise on the subject being studied. These include university researchers,
government officials associated with trade, commerce and industry, financial
analysts, editors and writers for trade and business publications and distributors.
8. Databases
There are many international databases to which one may get linked to access
information outside one‟s country.
PRIMARY DATA
For the reasons that we discussed earlier, it may not be possible to use secondary data
for the study at hand, in which case we must generate information for the study at
hand. This information can be of the following types:
B. Attitudes/ Opinions
Marketing managers not only want to know the level of consumer
awareness of a firm or its products but also the attitudes or opinions held
about them. Attitudes or opinions are dispositions towards something. It is
often assumed that a favourable attitude (disposition) will lead to an actual
purchase. It is principally for this reasons that marketers are interested in
collecting information about attitudes/ opinions.
C. Purchasing Intentions
Marketers are interested in people‟s anticipated or planned future
behaviour. A typical response model could be categorized as follows: will
definitely buy, will probably buy, not sure or undecided; and will
definitely not buy. A word of caution is in order here. Intentions do not
always translate themselves into actual purchase behaviour.
D. Activities or Behaviour
Many primary data-gathering studies are concerned with finding out what
consumers are doing. How much is spent in the retail supermarket? Who
does the shopping? When is the shopping done? What brand of soft drink
is bought?
E. Knowledge
We also might be interested in what respondents know or do not know
about an object or phenomenon. For instance, the effectiveness of an ad is
usually measured by how much can be recalled about the ad or its
message.
F. Motivation
This refers to a drive or an urge to do something. It is, in marketing
circles, the “why” of consumer behaviour.
QUESTIONNAIRE DESIGN
The primary concerns in the design of the questionnaire are completeness and accuracy.
A good questionnaire must collect all the intended data and must do so as accurately as
possible. While there are no rules about a good questionnaire construction, some
guidelines are worth bearing in mind when designing a questionnaire.
1. Brief
The brief summarizes the boundaries and constraints of the proposed study. All
the people concerned with the study must be brought together and be made to
appreciate the following:
Phrasing the question in terms of others and how they might feel or act. For
example, “do you think that most people drink in Zambia?”
The fixed alternative questions indicate the choice of response that is anticipated.
This format essentially restricts a respondent to a predetermined set of possible
responses. Respondents are asked to choose the alternative that most closely
corresponds to their position on the subject. For example:
It is affordable
It is well packaged
Advantages Disadvantages
Open ended Permit a wide range of Difficult to analyze in
Questions replies cases where large
Enable questions to be quantities of data are
answered in amassed
considerable length Require high skilled
May solicit information interviewers
unknown to the Seldom appropriate if the
researcher computer is to be used in
the analysis of data
Fixed –alternative Easier to administer Require special skills to
Question and respond to develop all possible
More amenable to use responses to a question
of a computer Allow little detail to be
Easier to analyze and given in the replies
interpret May introduce bias by
exposing to all possible
responses
May not capture
respondent‟s true position
Open ended response questions can be successfully used when a study involves a
small number of people. Closed response questions are most appropriate when a
large population has to be studied. They can also be used in screening respondents for
more detailed questioning.
4. Question Phrasing
Question phrasing is about conveying the researcher‟s meaning in a way that will
the respondents to capture the same meaning. Useful guidelines include:
Use simple words in a language that will be readily understood by the
target respondents.
Avoid using words with double meaning. For example, the word,
“plant” in the question, “who is in charge of the plant?” could be
interpreted in two ways: a plant could be understood to mean a factory
or vegetation!
Avoid using words which are ambiguous or imprecise, such as “often”,
“sometimes”, “regularly”, “never” or “occasionally”
Avoid using leading questions. These are questions which suggest
what the answer should be. A question stated “Many people prefer
Roan Antelope mealie-meal. Which one do you prefer?” will give the
impression that the minority position is out of line and may lead to a
respondent to side with the majority investigation
Avoid double barreled questions. These are questions which call for
two or more responses and tend therefore to create confusion for the
respondent. An example might be: “How do you assess Shoprite on
price and quality?” or “How many times in a week do you shop at
shoprite and what items do you buy?”
5. Question Sequence
The order in which questions are asked is important. Questions should be asked in
a logical flow and in a way that allows the respondents to retain interest through
out the interview. It is therefore useful to understand the broad categories of
questions that comprise a questionnaire. The first types are introductory questions.
These attempt to define the respondents as far as practical or relevant and may
include personal characteristics such as:
Respondent‟s name
Respondent‟s sex
Respondent‟s marital status
Respondent‟s age group
Respondent‟s profession
Respondent‟s address
Company name and address
Company‟s major products
Type of ownership
Number of employees, etc
Introductory questions additionally serve to “warm up” or establish rapport with
the respondent. They should therefore not antagonize the respondent. In this
regard, in the case of sensitive and difficult questions, they may be placed at the
end of the questionnaire to avoid situations where the respondents are turned off
and become uncooperative.
The second type of question deal with the subject of the study as indicated in the
brief. Generally, these should be placed after the introductory questions. Where
there is potential danger of respondents becoming uncooperative because of the
nature of any introductory question, it is prudent to ask questions that deal with
the subject of the study prior to asking classification questions. The following are
useful tips in maintaining logical flow of questions:
Use a funnel approach. Start with broad questions and progressively narrow
down the scope. For example:
What improvements are needed in service policy in XYZ Ltd?
How do you like the quality of service in XYZ Ltd?
Using the funnel approach, the latter question should precede the former
question because it is more general than the former question.
Place difficult or sensitive questions late in the questionnaire. If respondents
feel threatened or find it difficult to figure out the meaning of the question,
they are unlikely to proceed answering the rest of the questions.
Use the branch questions diligently. This approach is used to direct
respondents to different places in the questionnaires, based on the response to
the question at hand. For example:
Yes No
i. Do you wear a tie or a cravat?
(If “no”, skip question iii)
ii. Indicate how frequently you wear the tie
Space the questions from each other and avoid question crowding.
Number the questions, especially when branching questions are employed.
It is advisable to have questions on one side of the paper and answers on the other
side of the page; for example:
Yes No
o Does your company have a formal plan?
Tick as appropriate
o Indicate which of these factors are crucial
in the provision of service
Order processing
Price
Delivery
Quality of Product
Use good quality paper.
If possible, indicate the address to which the questionnaire has to be
returned. This should be positioned at the end of the questionnaire
The respondent should be thanked for his help in the study.
If there are rewards for this cooperation, this should presented promptly to
avoid any possible ill feeling
If it is necessary or practical, advise respondents when the results of the
study will be ready and offer to avail them to the respondent.
1) personal Interview
This involves face-to –face communication between the researcher (interviewer) and
the respondent (interviewee). Its relative advantage and shortcomings in comparisons
to other methods are:
Advantages:
It is accurate because it affords the researcher the opportunity to explain or
elaborate points of difficult to the interviewee.
It enables the researcher to collect as much information as possible, provided,
of course, if he is able to meet the expenses of interviewing the number of
people he has targeted.
It enables the researcher to secure the highest response rate as he is personally
present to secure the response.
It is quite flexible as it permits both the researcher and the respondent the
opportunity to adjust the pace and tempo of the interview.
It affords the researcher some control over the sample and the interview.
Disadvantages:
It can be quite expensive if research assistants have to be used; expense
related to training, travel, subsistence, lodging and renumeration can be
substantial.
It is rather slow.
Anonymity is rather compromised by the face-to-face interaction and is
therefore unsuitable for colleting sensitive information.
2) Telephone Interview
This involves asking and responding to questions over the telephone
Advantages:
It is quite fast.
It is in the intermediate range with regard to cost, accuracy and anonymity.
Disadvantage:
Coverage may be affected because not every one has a phone.
Response rate may be adversely affected by the “unsolicited” nature of the
phone call.
Amount of data that can be collected will probably be on the low side
3. Mail Survey
In this method, the interviewer neither come face-to-face with, nor speaks to, the
respondent; rather a questionnaire is posted to the respondent and would normally be
mailed back to the interviewer upon completion.
Advantages:
It is least expensive
Appropriate for ensuring anonymity.
Disadvantages:
Least control of sample and interview
Response rate tend to be slow
4. On-line Interview
This involves conducting interviews electronically.
Advantages:
Quite fast
It is also flexible
Disadvantages:
Quite expensive at least initially
Confidentiality is often compromised,
There is limited control of sample.
OBSERVATION
A second method of observing primary data is by observing the phenomenon of interest
according to prescribed rules given the objectives of the study.
Condition under which Observation method is appropriate
The behaviour being observed must be overt or external. It cannot be applied to
measure inert or internal behaviour, such as motives, attitudes, beliefs or
knowledge.
The behaviour must be repetitive, frequent or predictable. It is not appropriate to
apply when the phenomenon to be observed occur rarely or the occurrence is
unpredictable.
The behaviour must cover a relatively short time. Decision-making process do not
lend themselves to observation because their time span is longer rather than short
Sampling is a common practice in our daily lives. All of us, at one time or another, draw
conclusion about a large group on the basis of the sample: we “taste” Munkoyo before we
purchase a cupful of the drink; we try on a shirt or dress before actual purchase; or sip a
cup of coffee. In all these instances and many others; an attempt is made at studying a
subgroup and using the results so obtained to make conclusions about the large group.
Marketing information may also be obtained through sampling procedures. Thus, instead
of collecting information from a universe, information is always obtained from the
subgroup of the universe and inferences made about the population on the basis of sample
information.
1. WHAT IS SAMPLING?
Sampling is the process of collecting information from a subset of the population of
interest and making conclusion about the population on the basis of sample
information
b) Financial Constraint
Limited resources can force a firm to deal with a subgroup of a universe
instead of obtaining information from all elements in a universe. More
resources would be spent in dealing with a universe than a part of the
universe. For instance, if the population of interest is dispersed over a large
geographical area, requiring a researcher to travel over long distances,
sampling the universe is a cheaper way of obtaining information.
c) Universe Size
Even where there is no time or financial constraint, the sheer size of the
universe would make it impractical or impossible to reach every individual or
unit. Some people may be located in inaccessible places, while others would
be difficult o trace. A sample makes accessibility easier than a universe.
f) Accuracy
In some cases, sample results tend to be more accurate than those obtained
from a census. One is more likely to make errors with large populations than
with a sample. In the interest of greater accuracy, it may be prudent to deal
with a sample rather than a census.
3. SAMPLING PROCESS
The sampling process consists of the following steps:
Defining the population
Specifying the sampling frame
Specifying the sampling unit
Specifying the sampling method
Determining the sample size
The diagram below illustrates these steps. The first step involves dividing the
population by age (18-25 years, 26-49 years and 50 years and over) and sex (male and
female). Thus, if a researcher wanted a sample comprising 300 people
Age
Sex 18-25 years 26-49 years 50+years TOTAL
Male 36 63 48 147
Female 39 63 61 153
Thus, the four elements in the universe might be designated by the letters A, B, C and
D. the possible sample comprising two elements are designated as AB, BC, CD, AC,
BD and numbered 1 to 6 respectively. A draw of the chips would be made, and any
chip drawn would constitute the sample.
Note that this way of drawing a sample is premised on a complete and known list of
all population elements. Secondly, all items in the universe, or all possible samples,
have a known and equal chance of being selected. Lastly, the chances of selection
were not researcher determined. We can see from this simple example that the items
for the sample were randomly selected.
Another and, perhaps, more practical, way of selecting a simple random sample is
through the use of the table of random numbers. Again, the principal of complete and
known list of population elements must be obtained to enable one to assign numbers
to each population element. Thus, given the total number of items in the universe is
known, N, each item in the universe numbered serially from 1 to N. the net step is to
determine the size of the sample, n. then, beginning at any arbitrary point on the table
of random numbers, one begins to select those numbers between 1 and n in their order
of occurrence until a sample of size n is obtained.
The principal of randomness is satisfied because the digits in the table of random
numbers are not arranged, nor do they appear, in any systematic or discernible
manner. Assuming the starting point is chosen arbitrary, there is no way of
predetermining which numbers will be selected from the table. It is as if numbers
appear in the table by chance!
To illustrate the use of a table of random umbers, let us assume a population of 100
students from which we wish to select a sample of 30 students. We would number
each student from 1 to 100. a sample of 30 students would be selected by picking the
first thirty numbers in the table from 1 to 100. The entry point in the table would be
arbitrary, by entering the table at the randomly selected row and column. Having
established the starting point, we could next arbitrary choose in which direction to go,
either along some row or column- upwards, downwards, or horizontally. The first
thirty numbers to be randomly selected would constitute the random sample. Note
that were the number appears more than once, it is only picked the first time.
A stratifies sample bears some similarity to the quota sample only to the extent that the
population is divided into cells representing desired characteristics and that the sample is
premised to have those characteristics. The difference between the stratified and quota
sample is a material one, and this is in a way the sample elements are selected. A
stratified sample is the probability sample and selection of sample element is accordingly
not researcher determined as is the case with quota sampling.
Proportion versus disproportion stratified sample
A distinction can be drawn between a proportionate stratified sample and a
disproportionate stratified sample. A proportionate stratified sample contains elements in
the stratum in proportion to the relative number in the population, while a
disproportionate stratified sample contains elements in the stratum in disproportionate to
the number in the universe. For instance, in table X below, a population of 50, 000
persons has 10,000, 15,000 and 25,000 persons in the high income, middle income and
low income categories respectively.
The size of the proportionate sample for each category is relative to the size in the
universe, while a disproportionate sample for each category bears no relationship to the
size in the universe. Thus, 10,000 high income earners constitute 20% of the population.
Consequently, in a sample of 5,000, they will correspondingly constitute 20% of the
sample.
Disproportionate sampling on the higher side, such as for the middle income earners is
used when there is greater variability among items in the universe which comprise a
selected stratum. On the other hand, less than disproportionate sampling is used when
there is homogeneity among items.
Cluster Sampling
Cluster sampling involves dividing the population into mutually and exhaustive sub
group and then taking a sample of sub groups. If all the elements in the subgroup so
selected are studied, this is called one stage-cluster sampling; f a random sample of
elements is made from the selected sub groups, then this is called two-step cluster
sampling.
The diagram below illustrates the difference between one-stage and two-stage cluster
sampling. Suppose we have a universe comprising sixteen individuals from which we
wish to select four individuals. Under cluster sampling we would divide the population
into mutually and exclusive sub groups. In our example, these groups are denoted A, B. C
and D and each subgroup consist of four individuals. We would then proceed to select at
random a sample of sub group to study. Since we wish to study four individuals and each
sub group consists of four individuals, under one-stage cluster sampling, this would
involve selection of any one of the sub groups. This way, all the elements in the sub
group would be studied.
Under two-stage luster sampling, not only is a random sample of sub groups made but a
further selection of the four individuals must be made from the sub groups. Thus, we
might proceed by selecting randomly any two sub groups, and then randomly select two
individuals from each sub group.
Area Sampling
Area sampling is a type of cluster sampling. It is applied in instances where a detailed list
of universe items is not known, e.g. in a geographical area. Thus, blocks of households or
streets on which houses are located would be used as clusters. A random sample would
be selected by randomly selecting a street(s) and studying all households in the selected
street(s) of houses (one-stage area sampling) or, having selected a street(s) of houses then
selecting a sample of households from each pre-selected street(s) of houses (two-stage
area sampling). An illustration of area sampling is shown on page 32.
3rd street 9 10 11 12
4th street 13 14 15 16 13 14 15 16 14 16 14 16
One-stage Two-stage
1. A population is divided into four streets 1. a population is divided into four streets
Systematic sampling
The five steps in systematic sampling are to ensure that a list of all universe items is
available. Second, the universe items must be arranged in some kind of order. To select a
sample of size n, one begins to select sample items at any interval. For instance, if there
were 26 households and we wish to –
A B C D E F
G H I J K L
M N O P Q R
S T U V W X
Y Z
The following are some of the ways the sample size may be determined:
3.5.2 All-You-Can-Afford
This method is based on cost consideration, in particular the budget for the project. The
focus is on the cost of information to the exclusion of concern about its value in terms of
accuracy.
A = the desired level of accuracy, or the error from sampling that the
the research will allow.
This formula is based on the concept of a sampling distribution of a statistic. Assume the
statistic in question is the mean. The sampling distribution of the mean is the relative
frequency distribution of the means of all possible samples of size N. to illustrate the
sampling distribution of the mean, let us assume we have population of size N=4 and we
wish to draw all possible samples of size n=2 from this population. Assume that our
population consists of four individuals whose annual income, in million of kwacha, is as
indicated in figure 1
A 60
B 80
C 40
D 20
We readily see that the average income of these four individuals is K50million, that is,
M = 60+80+40+20
4
= 50
Assume we wish to draw a sample of size n=2 individuals and compute the average
income of all possible samples of size n=2 drawn from this population. The relative
frequency distribution of the means of all possible samples of size n=2 drawn from this
population would be as shown in figure 2
Figure 2: Income of A, B, C and D
AB 60+80 =70
2
AC 60+40 =50
2
AD 60+20 =40
2
BC 80+40 = 60
2
BD 80+20 = 50
2
CD 40+20 = 30
2
2
1
30 40 50 60 70
A relative frequency of sample means shows the number of time a sample mean
occurred. It I thus a measure of the probability of occurrence of the sample mean value.
The relative frequencies are shown in the histogram in the lower part of figure 2. A
normal curve is shown in the same figure. It can be seen that the distribution is normally
distributed. The normal curve in the lower part of figure 2 is the sampling distribution of
the sample means.
In general, a sampling distribution of the means of simple random samples that are large
(that is, more than 30) has:
i. A normal distribution
ii. A mean equal to the population mean, that is:
70+50+40+60+50+30
6
= 50, 000
iii. A standard deviation called the standard error of the mean, that is equal to the
population standard deviation(s) divided by the square root of the sample size
(√n):
σx = σ
√n
It is important to note that the standard error of the mean is simply the standard
deviation of the sampling distribution of the mean. The only reason that the standard
error of the mean is called that instead of the standard deviation is to indicate the it
implies to the distribution of sample means and not only to a single sample or a
population.
A basic characteristic of a sampling distribution is that the area under it between any
two points can be calculated as long as each point is defined by the number of
standard errors it I away from the mean.
The number of standard errors a point is away from the mean is referred to as the Z
value for that point. For example, the area under one side of area under one side of
the curve between the mean and point that have Z values of 1.0, 2.0, and 3.0,
respectively, are as follows:
Z value (number of standard error) Area under the curve from the mean
to the point defined by the Z value.
1.0 0.3413
2.0 0.4772
3.0 0.4986
The area between to points under a sampling distribution is a probability. This, mean that,
the probability of the sample with a mean falling within the range of one standard error is
0.6826 (0.3412x 2) or 68 percent.
A visual examination of the sampling distribution of the mean in figure 2 shows that the
mean of a sample of size 2 is likely to be quite close to the actual population mean. The
shape of the curve indicates that most of the sample means are clustered near the centre
of the distribution, which also happens to be the true mean. But the mean of the particular
sample of size 2 could be any of the sample means in the distribution, with some of them
being a substantial distance from the true mean and the true value of the mean is the
sampling error. If we examine the formula for the standard error of the mean, σ x = σ
√n
, we can see that the standard error of the Mean becomes smaller as the sample size
increases. In other words, increasing sample size will reduce the potential sampling error
because as the sample increases, the sampling distribution becomes clustered more
closely around the true population value.
Here is how an interval estimate of a mean is applied. A sample is taken and a sample
mean is calculated. We know that this sample mean falls some where within the sampling
distribution, but we do not know at what location. We do know, however, that there is a
probability of 0.3413 (34.13%) that it lies within 1 standard error above and a probability
of 0.3413 (34.13%) that it lies within 1 standard error below the actual population mean.
We may, therefore, make an interval estimate that allows us to be 68.26% (34.13 + 34.13)
confident that the population mean (m) lies within the interval formed by the sample
mean ( x ) plus 1 standard error (σ x ) and the sample means minus 1 standard error, that
is:
x- 1 σ x ≤ M ≤ x + 1 σ x
the 68.26 percent is the confidence coefficient of the estimate
Example:
Assume that we wish to determine the average expenditure per month spent on food.
Assume further that the expenditure for food is normally distributed. Now suppose a
random sample of 30 individuals is selected and is found that the average amount spent
by these 30 individuals is K180, 000 per month, with a standard deviation of K14, 000.
What then is the true monthly expenditure on food?
Solution
We can be 95.44 percent confident that the true average monthly expenditure lies
between K174, 992 and K185, 008
n = σ2 Z2
A2
In our discussion of the concept of a sampling distribution, an estimate of the variance of
the population from which the sample is to be drawn was given by the standard error of
the distribution of the statistic: in the case of the sampling distribution of a mean, that is,
σ x=σ
√n
The z value represents the number of standard errors from the mean as specified by the
confidence coefficient. The expression zσ is in reality the specification of the
√n
level of accuracy. If A is used to denote accuracy desired, then, A = zσ
√n
and solving for n, we have
n = σ2Z2
A2
Example:
The average advertising expenditure per year of the random sample of 50 firms is
K12million with a standard deviation of K1million.
How many firms must be surveyed to ensure that the estimate is not more than
K0.2million out at the 99% confidence level?
Solution
Sample size is given by n = σ2Z2
A2
= 166
Calculation of sample size in calculation problems involving proportions
Marketers are sometimes concerned about the proportion of people who behave in a
certain way. For instance, an advertiser of a product may be interested about the
percentage of people who read his advertisement. In such a case, marketing research will
be dealing with the proportion and, consequently, with the sampling distribution of the
proportion.
The sampling distribution of the proportion is the relative frequency distribution of the
proportion (p) of all possible samples of size n taken from a population of size N. The
same basic reasoning used to determine thee sampling distribution of the mean applies to
the sampling distribution of the proportion for a sample random, thus has:
i. A normal distribution
ii. A mean equal to the population proportion (P)
iii. A standard error (σp) equal to σp = P(1-P)
n
We can now turn to an illustration of an interval estimate of a proportion, and to the
determination of a sample size.
Example
In order to evaluate the success of the new product, a company market tested the product
among 200 housewives of whom 60 favoured the new product.
a) How correct does this estimate the proportion of housewives who favour the
product at the 95% confidence level?
b) How many housewives must be interviewed in order to establish this proportion
within 2% accuracy at the 95% confidence level?
Solution to (a)
Estimating the interval estimate of the proportion of housewives who favour the new
product is given by the interval:
p - z pq ≤ P ≤ p + z pq
n n
24 ≤ P ≤ 36
Solution to (b)
Estimating the sample size is given by: n = z2 pq
A2
It will be observed that the formula for determining the sample size is derived from the
expression z pq, which specifies the level of accuracy in estimating the true proportion.
n
Hence, accuracy can be specified by the expression, z pq , and solving for n gives us the
n
formula for determining the sample size.
TOPIC FOUR: FIELD PROCEDURE AND NON
SAMPLING ERROR
Refusals- these refers to those who refuses to respond to interview. The rate of
refusal depends on:
o Nature of respondent
o Auspices of the research
o Circumstances surrounding the contact.
o The nature of the subject, or
o The interviewer
3. Office errors:
Non sampling errors are not restricted to data collection. They may also be caused by
lapses in editing, coding, tabulating, and analysis of data.
Figure 6.1: The Kahn and Cannel Model of Bias in the Interview
Interviewer Interviewee
Background Background
Characteristics Characteristics
Psychological Psychological
Factors Factors
Behavioural Behavioural
Factors Factors
Source: Robert L. Kahn and Charles Cannell, The Dynamics of Interviewing (New York:
John Wiley & Sons, 1957) in Gilbert Churchill and Dawn Lacobucci, op.cit. pg 542
TOPIC EIGHT: ANALYSIS OF DATA
Introduction
The raw data that is contained in a questionnaire is in an unorganized form. It must be
drawn out and be presented in a form which facilitates understanding as per objectives of
research. The purpose of data analysis is, therefore, to draw meaning from the raw data.
This process starts with editing.
1. EDITING
Editing involves the inspection and, where necessary, the correction of each
questionnaire or observation form. The basic purpose of editing is to impose some
minimum standard on the information collected. This involves checking for the
following:
1.1Completeness
The check for completeness involves checking for any missing data. This can be
manifested by a blank to a specific question. This could be due to an oversight or lack
of knowledge about the issue the respondent was asked about. The immediate
corrective measure is to seek clarification from the field worker. Other ways of
handling missing data comprise:
1.1.4 Leave the question blank and record the answer as a non response- this
must be taken into account when interpreting data and making inferences
about the population.
1.2 Consistency
Inconsistency is said to obtain when any two or more answers contradict each other
when they are supposed to move in one direction. For instance, if a respondent
indicated that he prefers Colgate on one part of the questionnaire and later indicated
that he has never used it, the researcher would find it a problem to attach any
meaning to what the respondent meant. Such an inconsistency may indicate errors
in the collection or recording of a response, or ambiguity in the mind of the
respondent regarding the meaning of the questions.
1.3 Uniformity
Lack of uniformity is when responses are not recorded in the same units. For
instance, if the study is aimed at finding the size of an establishment in terms of
sales, and the respondents indicate the size of the firm in terms of profit, the
response base is not uniform.
1.4 Legibility
Written responses, including symbols or abbreviations used, must e legible or
decipherable. Misspellings must be corrected at this stage.
1.5 Comprehensibility
It is important that responses are understandable to all concerned. One must avoid
situations where only the field worker comprehends a response.
2 CODING
This is a procedure for assigning symbols to responses in order to facilitate
categorization of the responses. The first step in coding is to specify the category of
responses. For instance, sex can be categorized into male and female: education can
be categorized into Grade VII, Grade XII, Graduate; economic status can be
categorized into rich and poor and so on. The second step in coding is to assign a
symbol to each response category. A symbol is usually a numeral. In our example,
the coding could look like this:
An example of Coding
Response Response Symbol
Variable Category
Sex Male 01
Female 02
Education Grade VII 11
Grade XII 12
Graduate 13
Economic Status Poor 21
Rich 22
Coding is relatively easy for closed questions, but relatively difficult for open-
ended questions. In the closed question format, responses are by design categorized.
In the open-ended questions, respondents are free to express an answer in any way
they prefer and this makes it difficulty to anticipate what kind of answers will be
given.
3 TABULATION
Tabulation is a frequency distribution of a variable. Thus, tabulation involves
categorizing responses and counting the number of cases which fall into each
category. The tabulation may take the form of simple (one way) tabulation or a
cross tabulation.
0 2 20%
1 3 30%
2 5 50%
10 100%
It is advisable not to use percentages if the base is small. Base should preferably be
at least 50
(b) Averages-
These are measures of central tendency and can be in the form of an arithmetic
mean, a median or a mode. Caution is called for in the application of averages,
particularly in the following situations:
The non existent average- there are some unusual distributions which do not have
averages, such as a mean or median. For instance, per capita income distorts the
slant in income distribution between the rich and the poor. Per capita income as a
measure of average income tends to obscure significance differences in market
segments.
Nobody is average- the average represents a single point on a line, and a point has
no area. The probability of any person being average is therefore zero. The point
is significant in practical situations. What does average imply? In our example,
nobody consumes 1.3 loaves of bread: they either consume 2, 1 or nothing!
Spurious precision- because of the way averages are computed, one may come up
with too precise a figure which has little practical significance. For instance, an
average consumption of 1.3 loaves of bread per student is arithmetically correct
but it is spurious and has no practical significance.
Range- this refers to the difference between the highest and the smallest values,
such as the difference between the highest and the lowest paid.
∑ xi – u
Standard deviation- this is the same as the average deviation except that to
eliminate negative values, we use the square of the deviations of the individual
values from their arithmetic mean.
The importance of standard deviation is that it is one of the parameters of the
normal curve. In practice, this means that if we know that the population is
normally distributed and we know the standard deviation, we can determine what
percentage of the population falls between selected intervals (as measured in
standard deviations)-viz:
68% of the population will lie within one standard deviation of the mean.
95% of the population will lie within two standard deviation of the mean.
99% of the population will lie within three standard deviation of the mean.
Cross-tabulation
Cross-tabulation is used to examine the relationship between two or more variables. It
involves studying two or more variable simultaneously. In many marketing situations, we
are often forced with decisions that require an understanding of how variables relate to
each other. For instance, the relationship between demand and price has implications for
pricing decisions; the relationship between demand and advertising raises questions about
how much should go into advertising, to which target group advertising should be
directed;; how advertising should be conducted; and when advertising should be carried
out. Cross tabulation shows how two or more variables might be related.
Table 8.1 shows the raw data on the purchase of electronic equipment by 15 students.
The students are shown by age and background. The following are some of the ways this
data may be used.
Table 8.2 shows the relationship between age and purchase. Sixty three (63%) percent of
the students below the age of 20 bought the equipment, while thirty-seven (37%) percent
did not purchase the equipment. Among the older students, that is, the students twenty
years and older, twenty-nine (29%) percent bought the equipment, while seventy-one did
not purchase the equipment. The data suggest that the younger students are more likely to
buy than the older students.
Table 6.3 shows the interaction of age and background in the purchase of equipment.
Among the students below 20 years, seventy-five percent of those in the urban areas
bought the equipment, while their rural counterparts were evenly divided: fifty percent
purchases and fifty percent did not purchase. Among the older students, 100% of rural
students did not purchase, while sixty percent of urban students did not purchase. The
data thus, seem to suggest purchase is related to age and background, and that, more
specifically, the older students with rural backgrounds are not likely purchasers, while the
younger students of urban backgrounds are likely purchasers.
The initial relationship has therefore been retained with the introduction of a third
variable. In some situations, the initial relationship may turn out to be spurious with the
introduction of a third variable. For example, if e confined our analysis to students who
are below 20 years and have a rural background, we notice that the initial relationship
between age and purchase disappears: that is, there is no difference in purchase behaviour
among students with a rural background in this age group. In still other situations, a
relationship may be established with the introduction of a third variable where none
initially existed.
ESTIMATION AND HYPOTHESIS TESTING
There are clearly many ways of analyzing data. What specific meaning we wish to draw
out will depend on the research objectives. There are nevertheless three broad categories
of data analysis.
EXERCISES
1.1Estimating a Population Mean:
Assume we wish to estimate how much money was spent on buying Christmas gifts by
Kitwe residents. A random sample of 50 Kitwe residents is selected and it is found
that the average amount spent on buying Christmas presents was K3.5 million with a
standard deviation of K0.27 million.
We thus wish to estimate the average expenditure during the festive season by
Riverside/ Kitwe residents. Let us assume we desire to estimate the true average
expenditure at the 0.95 confidence level.
Solution:
x+z σ
√n
3.5 + 1.96 0.27
√ 5 0
3.5 + 0.08
3.42 ≤ µ ≤ 3 . 5 8
We can thus be 95% confident that the true average expenditure is between 3.42 and
3.58
Solution:
p + z pq
n
0.12 + 0.04
8% ≤ p ≤ 16%
We can be 95% confident that the true proportion of disinterested viewers lies
between 8% and 16%
In some situations, the parameter may be known but we may wish to know, whether
due to some development, this parameter has changed or has remained the same. In
examining whether or not the magnitude of the parameter has changed, we start off
with the hypothesizes value of the magnitude of the parameter, and the analyze
sample data to see whether or not they support the hypothesis of there being a change.
To justify their request for an upward adjustment in their monthly allowance, students
have taken a sample of 100 students and have found that this sample of 100 students
spent an average of K184, 000.
Our selected sample of mean x = K184, 000 is one of the values of means in this
sampling distribution, provided that the population parameter is K175, 000 that is:
x±z σ
√n
√ n
= 2.5
Should the allowance be revised on the basis of this information? Use ά= 0.05
Solution:
1. Observe a difference
GRZ criterion: K145, 000
0.04500
0.05
0 Z = 1.64
= 2000
667
= 3.0
1.64 3.0
Since computed Z value is greater than specified Z value, we reject null
hypothesis and conclude that there is justification for the review of the allowance.
OR:
If the monthly expenditure is K145, 000, we would expect the sample result to tie
with the interval shown in the diagram below:
µ±Z σ
√n
146, 093
Since the observed value of K147, 500 is outside the acceptance region, we reject
the null hypothesis.
Solution:
Step 1:
The observed difference which can have important implication for the investment
decision is whether the proportion of students who are interested in music is
significant to justify the opening of an outlet at CBU. The proportion of students
interested in music is 22.4% compared to the criterion proportion of less than
20%.
Step 2:
Specification of the null and alternative hypothesis
Null hypothesis: the proportion is less than 20%, that is, p < 20%
Alternative hypothesis: the proportion is 20% or more, that is, p ≥ 20%
Step 3:
Specify the Z value at the given level of significance. Since this is one-tailed test, Z
value is 1.64
Step 4:
Identify the test statistic and its sampling distribution. This is a sampling distribution
of a proportion and a test statistic is given by:
= 22.4 – 20
(20)(80)
625
= 1.5
Step 5:
Decision Rule: reject null hypothesis if the computed Z value is greater than specified
Z value, otherwise, accept null hypothesis since the computed Z value (1.5) is less
than the computed Z value (1.64). We fail to reject the null hypothesis. The observed
proportion of students interested in music is not significant enough to justify opening
a retail outlet at CBU
2.3 Example of the test of the difference between two sample mean
A marketer is seeking to promote his product by exhibiting it at a commercial show.
Before a show, he takes a random sample of 32 and finds that the average amount
spent on his product was K50 million with a standard deviation of K2 million. After
the show, he takes a random sample of 36 and spends that the average amount spent
on his product is K54 million with a standard deviation of K3 million.
Was there an increase in the amount spent on his product? Use 0.01 significant level.
Solution:
1. The observed difference between the two patterns of expenditure is as follows:
The question is whether the K54 million is significant enough to indicate a real
difference from the K50 million.
= 6.5
5. Decision
Since the computed Z value (6.5) is greater than the specified Z value (2.58), we
reject the null hypotheses. The expenditure before and after the show are not the
same.
= 0.73
Conclusion
We fail to reject the null hypothesis and conclude that there is no real difference between
the two populations. That is, the new stock feed is not effective.
Solution:
1. The observed proportions are:
24% for the rural region and 30% for the urban region. The question is whether
the two proportions are significantly different from each other.
= (30 – 24) – 0
(30)(70) (24)(76)
175 225
= 1.3
5. Decision:
Since the computed Z value (1.3) is less than the specified Z value (1.96), we fail
to reject the null hypothesis. The true proportions are the same for the urban and
rural sectors.
ά = 0.05
1. observe difference in frequency distribution between observed and expected
pattern
Observed Expected
Brand A 35 50
Brand B 55 50
Brand C 60 50
ά = 0.05
4. compute X2
Observed Expected
(Female) (Male) (O – E)2
E
Heavy 220 250 3.6
Moderate 270 300 3.0
Light 230 200 4.5
Non users 280 250 3.6
X2 = 14.7
5. Decision: since computed X2 > tabled X2, we reject null hypothesis, and conclude
that there is a difference in consumption patterns between males and females.
1. observe difference
Expected Frequencies
Preference
Sex Coke Fanta Sprite TOTAL
Male 106 93 215 414
Female 6 39 73 118
112 132 288 532
5. Decision: Reject null hypothesis and conclude that sex and preference are not
independent.
Example:
A manufacturer‟s past record shows that 42% of housewives on the Copperbelt use his
product. A survey of 100 housewives in each of the five Copperbelt towns shows that the
numbers using his product are 50, 41, 35, 47 and 39.
ά = 0.01
d. f = (5 -1) = 4
X2 = 13.28
Decision: we fail to reject null hypothesis and conclude that the product is consumed by
42%- there is no regional variations.
MEASUREMENT OF ATTITUDE
1. Concept of attitude
An attitude can be defined as a predisposition to respond to something. It has the
following components:
The Pork producers Council felt this attitude was unjustified as pork had some
desirable characters. They commissioned a study to uncover beliefs and attitudes
about pork. Among other things, the study revealed that:
Following this campaign, the sales of pork picked up, ostensibly because of
change of attitude towards pork accessioned by the pork!
Judges should not reflect their opinion, but rather whether the statement in
itself is Unfavourable, neutral or favourable.
3. The researcher then determines or computes the average rating (scale value) for
each statement. The procedure for determining the scale value for a statement is:
l = the value of the lower boundary of the interval in which the centile fall
C = centile in question
∑p = the um of the proportion below the interval in which the centile falls
= 8.5
4. Having determined the scale value of each statement- which imply that a
statement can have a value in the unfavourable, neutral or favourable range, a
respondent is asked to indicate which statement they agree with.
5. The attitude score is taken to be the average score of the scale value of the
statements agreed with.
(a) It is possible for people showing agreement with widely different scored
statements to end up with the same arithmetical mean score- hence assessed to be
the same.
A: 4, 6, 8 6
B: 6 6
For instance, when A agrees with the statements whose scale value are 4,6 and 8;
representing a neutral disposition of 6, and B agrees with only one statement
whose scale value is 6; both A and B are said to have the same attitude.
(b) It does not measure intensity of feeling; it merely categorizes into type of attitude.
Judges are not generally willing to spend time going through the many statements.
(c) The rating by judges could be influenced by their own attitude; that is, another
group of judges could score differently
Shoprite is far
Shoprite is congested
APPLES
Fresh ……………………… Stale
Affordable ………………... Not affordable
Accessible ………………... Inaccessible
RATING SCALES
Rating scales require a respondent to tick at an appropriate place on the scale to
indicate his or her attitude towards an object. The scale is often in the form of a
continuum on particular attributes of an object. The following are some of the
ways rating scales are applied:
To indicate overall attitude towards objects.
Tinned beef
Like Do not like
Price
Quality
Courtesy
Cleanliness
Convenience
Music
Fanta 3
Sprite 2
Coca-cola 1
Fanta 7
Coca-cola 3
Observation of behaviour
Under this technique, it is presumed that observed behaviour is indicative of a
particular attitude. For instance, there are certain type of behaviour which are
indicative of interest in a phenomenon: window shopping signals interest in goods
on display; attendance of commercial shows is indicative of interest; questions
asked about a product may be a prelude to actual buying; and time spent a product
or checking a price tag is a step away from a decision whether or not to purchase
an item.
Performance of a task
How well or badly a task is performed reflects one‟s attitude towards the
particular task. The saying „you have a bad attitude toward work‟ suggests that
one‟s performance is unsatisfactory.
Physiological reaction
How our bodies involuntarily and physiologically react to object may also
indicate our attitude towards an object. Thus, whether we have tears of joy or
grief entirely depends on whether or not we like what has prompted us to shed
tears. We salivate over food we like!
RESEARCH REPORT
The results of a research report may be reported in written or oral format, or both.
While many readers of a report will not be able to ascertain the quality of a
research design, questionnaires or intricacies of the experiment or field work, they
will, however, easily recognize the quality level of a report. The quality of the
report is all too often used as a major indicator of the research itself.
The guidelines are intended to assist you in writing a good research report. It is
important to note here that “good” is used from the perspective of the person who
is going to read the report- whether it is a decision maker, a scholar or any other
interested party in the issue that was studied. Thus, it should be obvious in writing
a research report that a researcher should keep in mind the audience.
LOGICAL FLOW
A good research report must reflect a clear and organized thinking. A person
reading a report must see how any section of the report relate to the whole. This
requires that sections of the report logically follow each other.
FORMAT OF THE REPORT
There is no best format for all accessions. Considerable variations in the
organization of the report should be dictated by the nature of the audience and the
topic of the report.
The following guidelines are never the less generally applied
(b) Declaration
A declaration is a statement to the effect that the work was completed by the
researcher and the works of others have been duly acknowledged
(d) Abstract
This is one-page summary of the study. It should include a brief statement of
the problem, the research objectives, how and where the research was
conducted, sample description, and a brief description of results and
recommendations.
(e) Introduction
This is the first chapter or section of the report. It should give a background to
the study and may include such questions as why you think the issue you are
investigating is important, problematic, controversial or necessary. The
background is then followed with the precise statement of the problem. It is
necessary to formulate a working statement of the problem e.g.
Why X and not Y
Why then and not now
Why this and not that
It is important that the researcher documents the existence of the problem. The
statement of the problem then follows in which the researcher will specify
what it is he/she wants to accomplish. These objectives must be specific,
workable and measurable. Conclude the introduction by pointing out the
potential significant of the study. For example, will it add to existing theory or
will it test a theory? Will it correct previous research, resolve inconsistencies
in the existing literature, or solve a practical problem.
(k) References
On the separate page under the heading references, list the sources refered to
and cited in the report. The sources should be listed alphabetically by author‟s
last name.
(l) Appendices
Place in the appendix items that will appeal to only a few readers or that may
be needed only for occasional reference. Such items include a copy of the
questionnaire used, detailed statistical tables or special reports.
-The end-