RRM 400 Research Methods and Project Management
RRM 400 Research Methods and Project Management
Chartered Institute of Risk and Security Management is a distance teaching and open learning
institution.
© Chartered Institute of Risk and Security Management: All rights reserved. No part of this publication
may be reproduced, stored in a retrieval system, or transmitted, in any form or by any means, electronic,
mechanical, photocopying, recording or otherwise, without the prior permission of the CIRSM.
To the student
This module contains information obtained from authentic and highly regarded
sources. Reasonable efforts have been made to publish reliable data and
information, but the compiler and institution cannot assume responsibility for
the validity of all materials or the consequences of their use. The compiler and
institution have attempted to acknowledge the authors of all material
reproduced in this module.
CHARTERED INSTITUTE OF RISK AND SECURITY MANAGEMENT
As you embark on your studies with the Chartered Institute of Risk and Security Management
(CIRSM) by open and distance learning, we need to advise our students so that they can make
the best use of our modules and other learning materials, your time and the lecturers who attend
to you virtually. The most important point that students need to note is that in e-learning, there
are no lectures like those found in conventional face-to-face learning. Instead, there are learning
packages that may comprise written modules, audio recordings, video recordings and other
referral materials for extra reading. All these including WhatsApp, Telegram, Twits, Blogs,
Skype, telephone and email can be used to deliver learning to students. As such, at CIRSM, we
expect the lecturers to lecture to students virtually on the stipulated six-hour tutorials designed
to give students robust introductory knowledge to their programmes. We believe that the
teaching and learning task is accomplished by the learning package that students receive at
registration.
What then is the purpose of the six-hour tutorial for each course on offer?
At CIRSM, like any other e-learning programmes, the students are at the core of learning. After
they receive the learning packages and other learning materials, it is obvious that they will come
across concepts/ideas that may not be that easy to understand or that are not so clearly explained.
They may also come across issues that they do not agree with, that actually conflict with the
practice that they are familiar with. Through interaction and discussion groups, friends can bring
ideas that are totally different and new and arguments may begin. Students may also find that
an idea is not clearly explained and they may remain with more questions than answers. They
need someone to help them in such matters. This is where the six-hour tutorial comes in.
For it to work, you need to know that:
This is one requirement in e-learning
The lecturer has to introduce the course adequately for students to progress on
their own.
The student should prepare questions, queries, clarifications, for the topics to the
discussed. For the lecturer to help you effectively, give him/her the concerns
beforehand so that in cases where information has to be gathered, there is sufficient
time to do so. If the questions can get to the lecturer at least two weeks before the
tutorial, that will create enough time for thorough preparation.
In the tutorial, the students are expected and required to take part all the time through
contributing in every way possible. They can give their views, even if they are wrong, (many
students may hold the same wrong views and the discussion will help correct the errors), they
still help them learn the correct thing as much as the correct ideas.
There is also need for both students and the lecturer to be open-minded, frank, inquisitive and
should leave no stone unturned as they analyze ideas and seek clarification on any issues. It has
been found that if tutorials are done correctly, students do better in assignments and
examinations because their ideas are streamlined. By introducing the six-hour tutorial, CIRSM
hopes to help students come in touch with the lecturers who mark their assignments, assess
them, and guide them in preparing for writing examinations and assignments and who run
students’ general academic affairs. This helps students to settle down in their course having
been advised on how to go about their learning.
Professional networking with students is, therefore, upheld by CIRSM.
The six-hour tutorials should be so structured that the tasks for each session
are very clear. Work for each session, as much as possible, follows the
structure given below.
Note that in all the three sessions, students identify the areas that their lecturer should
give help. They also take a very important part in finding answers to the problems
posed. As students, you are the most important part of the solutions to your learning
challenges.
Conclusion
In conclusion, we should be very clear that six hours is too little for lectures and this does not
limit, in view of the provision of fully self-contained learning materials in the package, to look
for supplementary sources to augment this module. We, therefore, urge students not only to
attend the six-hour tutorials for this course, but also to prepare oneself to contribute in the best
way possible so as to maximize beneficiation.
CIRSM
RRM 400
2. Description
This is the precise reporting and/or measurement of the characteristics of some population or
phenomenon under study addressing the ‗What, How, Where, How‘ questions
3. Explanation
It is the discovery and reporting of relationships among different aspects of the phenomenon
under study addressing the ‗Why‘ question. Research Proposal
The meaning of proposal … suggests looking forward, to what the researcher plans to do in the
future. It is a concise and coherent summary of your proposed research. It is a planned tool of
action which sets out the central issues or questions that you intend to address, logical mapping of
activities to be executed, how data will be collected and analysed, factual and instrumental
resources to be used and the time frame in order to achieve the desired result. It outlines the general
area of study within which your research falls, referring to the current state of knowledge and any
recent debates on the topic. By so doing, your research proposal becomes an evaluation instrument
for stakeholders.
• A research proposal gives you an opportunity to demonstrate that you have the aptitude for
graduate level research, for example, by demonstrating that you have the ability to
communicate complex ideas clearly, concisely and critically – your expertise in a particular
area.
• To convince others that one has a worthwhile research project and that one has the
competence and work plan to complete it.
• An academic proposal also helps us to match your research interest with an appropriate
supervisor.
• Should reflect topical issues as covered in Public Media, Donor interests and interests of
organizations/institutions.
• Not redundant, must fill a gap and must reflect that the study a ground breaking one.
Introduction
An introduction is the teaser for your background summary. It is meant to be short and attention
grabbing, and make the reader actually want to read further into the background summary. One
opens up (Introduces) one‘s area of interest for discussion or/and problematises one‘s area of
research inquiry. The aim of an introduction is to open up the subject for discussion, attention, etc.
The introduction also presupposes that the audience are fairly familiar with the subject of one‘s
study. However, among other issues, the introduction should highlight a description of the problem
that will be addressed. In this section the researcher might discuss the nature of the research and
the purpose of the research. Note that on the purpose of the research, you should highlight the
objectives of the study and what one intends to discover from the study. You should also briefly
state the rationale of the research problem. This is where you briefly explain why your research
topic is worthy of study and may make a significant contribution to the existing body of literature.
In fact, an introduction is a mental road map that answers the following questions:
1. What am I studying?
3. What do we already know about this topic or what have other experts discovered about the
topic?
Provide an overview of existing thinking about and/or research into your research problem.
Three
Identify a gap, problems or unresolved issues in the existing knowledge/research that your research
can fill or identify a research focus that will be useful:
• ―The previous research has mistakenly assumed that….‖ or ―Although most experts in the field
believe …., they have overlooked …‖
• ―Despite prior observations of voter behavior in local elections in Harare, it remains unclear
why some single mothers choose to avoid....‖ and/or
State your research question, your hypothesis and your knowledge claim, making sure to place
your research within the gap in existing knowledge.
• State the intent of your study, including the research question and your hypothesis,
Background
It assumes that lack of familiarity of your audience with the problem and the need to provide its
―backstage; could be a spatial (where it happened) or time (historical evolution) context. It relates
to the problem one wants to study to a larger problem.
The background study for your research includes a review of the area being researched, current
information surrounding the issue, previous studies on the issue, and relevant history on the issue.
You should explain the broad background against which you will conduct your research. You
should include a brief overview of the general area of study within which your proposed research
falls, summarising the current state of knowledge and recent debates on the topic. This will allow
you to demonstrate a familiarity with the relevant field as well as the ability to communicate clearly
and concisely.
Use the first paragraph or two of the Background to explain what is known generally in the area
of your study. Cite key references, but do not write an extensive review of literature; instead, direct
the reader to a recent review.
Then focus in on the problem that your study addresses.
Literature Review
Literature review (previous studies) can be defined as the process of surveying what has been done
on the topic or problem one wants to work on. The materials can be purely be theoretical or
empirical. Note that a literature review is not a mere list or summary of books and articles.
However, it is a combination and synthesis in the form of an integrated description of the field of
study which include inter alia: theories, models and studies. Literature review shows the originality
of your proposed study. Sources of relevant literature include: Electronic; references of references,
policy documents, academic books and journals.
- Literature provides a conceptual framework for the reader so that the research question and
methodology can be better understood.
-Literature review demonstrates to the expert reader, notably funding agencies and academics that
the researcher is aware of the breadth and diversity of literature that relates to the research question.
- It situates your research focus within the context of the wider academic community in your field;
reports your critical review of the relevant literature.
- Identifies a gap within that literature that your research will attempt to address.
- It showcases your talents of: understanding, interpretation, analysis, clarity of thought, synthesis,
and development of argument.
- The process of conducting and reporting your literature review can help you clarify your own
thoughts about your study. It can also establish a framework within which to present and analyse
the findings.
- Enables a reader to appreciate that you have up to-date awareness of the relevant work of others,
and that the research question you are asking is relevant.
- Establishes how your research might add to this understanding or challenge existing literature,
theories, methodology and beliefs.
Before you commence literature review, make sure you ask yourself the following questions:
What research question(s) of my study which should be answered by my literature review?
Has anyone else done anything similar to my study?
Is my research relevant to research/practice/theory in my field?
What is already known or understood about this topic?
How might my research add to this understanding, or challenge existing theories and beliefs?
Steps and Process in Literature Review
b) Provide indicative issues one wants to derive from the materials one would have collected.
These could be:
c). Prepare a notebook appropriately classified to reflect the type of information that one is
looking for from the materials collects. d). Compilation of the information collected. e). Make
an evaluation of the materials.
-Theoretical and methodological discourses and rebuttals surrounding the subject to date.
-A summary of study areas and subjects that have been handled through different initiatives.
-A synthesized write up of the review pinpointing the caps which the project/research would
like to focus on.
Research Framework
A research Framework is a structure that provides ―guidance for the researcher as study
questions are fine tuned, methods for measuring variables are selected and analyses are
Page 6 of 136 CIRSM
RRM 400
planned‖ (Liehr and Smith, 1999: 13). Once data are collected and analysed, the framework is
used as a mirror to check whether the findings agree with the framework or whether there are
some discrepancies; where discrepancies exist, a question is asked as to whether or not the
framework can be used to explain them.
Research Theory
A theory is ―a set of interrelated concepts, which structure a systematic view of phenomena for
the purpose of explaining or predicting. A theory is like a blueprint, a guide for modeling a
structure‖ (Imenda 2014). On the other hand, ―A scientific theory is a well-substantiated
explanation of some aspect of the natural world, based on a body of facts that have been repeatedly
confirmed through observation and experiment. Such factsupported theories are not ―guesses‖ but
reliable accounts of the real world. (American Academy for the Advancement of Science, 2010:
1).
Theoretical Framework
It refers to the theory that a researcher chooses to guide him/her in his/her research. ―A theoretical
framework is the application of a theory, or a set of concepts drawn from one and the same theory,
to offer an explanation of an event, or shed some light on a particular phenomenon or research
problem‖ (Imenda, 2014). Note that a theoretical framework consists of theories that seem to be
related and the theory provides a point of focus for attacking the unknown in a specific area. A
theoretical framework is more formal and used for studies based on existing theories. It refers to
the concepts found in Theories/Models. Note that the theories would have been tested and
confirmed to explain the phenomena being studied.
A theoretical framework is formulated to explain why the relationship between two or more
variables exists. However, it should be noted that a theoretical framework has been tested and
proven to test relationships e.g The Theory of Reasoned Action in predicting human behaviuor
(Figure 1). It is grounded in and based on scientific theory. It is derived from specific concepts and
propositions that have been tested and proven to explain the relationship/phenomena.
- To determine what theories and ideas exist in relation to the chosen subject.
- Helps to ‗frame‘ the research and show that one is knowledgeable about the key concepts,
theories, and models that relate to the topic.
- Provides the definitions and models which help in guiding the research.
- Provides scientific justification for the investigation: it shows that the research is not just
coming ―from the blues,‖ but that it is both grounded in and based on scientific theory.
Conceptual Framework
Robson, 2011). Miles and Huberman (1994) defined a conceptual framework as a visual or written
product, one that ―explains, either graphically or in narrative form, the main things to be studied—
the key factors, concepts, or variables—and the presumed relationships among them‖ (p. 18).
It is primarily a conception or model of what is out there that you plan to study, and of what is
going on with these things and why—a tentative theory of the phenomena that you are
investigating.
It incorporates pieces that are borrowed from elsewhere, but the structure, the overall coherence,
is something that you build, not something that exists ready-made. It is important for you to pay
attention to the existing theories and research that are relevant to what you plan to study, because
these are often key sources for understanding what is going on with these phenomena.
- It informs the rest of your design— to help you to assess and refine your goals and objectives,
develop realistic and relevant research questions, select appropriate methods, and identify
potential validity threats to your conclusion.
- It explains observations.
Research objectives
These are concrete statements which describe what the research is trying to achieve at the end of
the study. Research objectives should be closely related to the statement of the problem. In other
words, research objectives are a re-working of the statement of the problem. A well-worded
objective will be SMART, i.e Specific, Measurable, Attainable, Realistic, and Timebound given
the available resources. In addition, a research objective should be Relevant, Feasible, Logical,
It refers to broad developmental goal within which the project/research proposal falls. It provides
a short statement of the development goal being pursued by the research, e.g. ―improve security
issues in programmes‖
Governments or Development partners often express the broad development goal(s) in general
development strategies of broad policy agendas.
Specific objectives
These state precisely what you intend to do. Specific objectives include obtaining answers to
research questions or testing the research hypothesis, contributing factors in a coherent way in a
logical sequence. They must be SMART. In addition, they start with action verbs such as to: assess;
explore; identify; determine; establish; compare; verify; calculate; describe; and, analyse.
Specific objective should also be presented briefly and concisely. They cover the different aspects
of the problem. They should be clearly phrased in operational terms, specifying exactly what
researcher is going to do, where, & for what purpose. They should be realistic when one considers
their local conditions.
The formulation of research objectives helps the researcher to avoid the collection of data which
are not strictly necessary for understanding and solving problem that he or/she has
A problem statement is the description of an issue currently existing which needs to be addressed.
It provides the context for the research study and generates the questions which the research
aims to answer.
The statement of the problem is the focal point of any research.
It is also an analysis of the major factors that may influence the problem and discussion of why
certain factors need more investigation.
It provides a brief description of solutions to problem that have been tried in the past, how well
they have worked, and why further research is needed.
d) Conclude by stating that unless policies and programmes give priority to address the current
situation, the achievement of an ideal situation will always remain a pipe-dream.
Justification
A justification explains the importance of your study; why it is worthwhile undertaking; an agent
spend funds on the study/project; why should you spend so much time on the study/project.
Justification of study stems solely from the outcome of your literature review. From the literature
review, you can establish the following:
- That the methodologies of previous studies employed did not adequately explain the
phenomenon;
- May be that new methodology in other field of studies may contradict the existing
knowledge about the phenomenon and offer fresh insight which you may want to apply;
- May be that the way the problem and its associated concepts were approached and defined
were problematic; and,
- May be that contemporary problem have falsified at the previous claims about the issue
you are investigating.
So justification can show the theoretical, methodological scientific significance of the research and
practical relevance of your study. In addition, by the time you do thorough review, any of the
aforementioned points will come out and that will help provide a good justification for your
research.
You can use your statement of the problem to guide you when writing the justification of your
study. For instance, if you ask the question ―Is there a significant relationship between criminal
activities and education, age, sex, religion, place of residence, region of residence, income?‘, then
the contribution of your research would probably be coming up with technologically advanced
investigating techniques to curb criminal activities performed for instance by people highly
educated people (the white collar jobs). Note that your research in advanced criminal investigating
techniques would be important. This could be a groundbreaking approach that will change the way
criminal investigation is conducted.
You can also use the deductive reasoning approach to explain the significance of your study. This
means that you can start by highlighting the contribution of your study broadly and then taper off
gradually to a specific group or person. That is, the contribution of your study to a specific group
e.g. the society and organizations as a whole, then its contribution to
individuals
Methodology
It is a single paragraph where you would briefly state the target population, study area, research
design, the data collection methods and tools (justifying the selection of each method in brief) and
how you are going to analyse your data. Note that you link the justification to your research
objectives.
Target Population
The sampled population whose characteristic (knowledge, attitudes, beliefs and practices) you are
interested in. Justify the selection of the target population e.g. why you are only interested in
understanding the criminal behaviour of women aged 15 to 69. Study area
This would be the exact location from which you would carry out your study from. Also remember
you have to justify the selection of your study area. Research Design
A research design is the plan and procedure for research. It is the overall scheme or program of
research. It is the plan, structure and strategy of investigation, conceived to obtain answers to
research questions. You would briefly present the format of your study, highlighting and justifying
each of the data collection methods and tools you intend to use in your research. You would state
Page 12 of 136 CIRSM
RRM 400
whether or not the study was purely qualitative, quantitative or whether it triangulated the
qualitative and/or quantitative research methods. You would also briefly how you intend to have
the data recorded, managed and analysed.
In this section you would explicitly present in detail the respective methods and tools which you
intend to use in your study. Remember for each method, you should clearly state the method, how
many, tool, with who, why, sampling, selection of the respondents and how data were collected
and recorded.
How data is going to be recorded and analysed (Data manangement and analysis- briefly)
Ethical issues
Basically ethical issues are meant to enforce good morals and correct the bad ones. Ethical issues
refer to conforming to standards of conduct of a given profession or group.
1. Voluntary participation
- Have right not to discuss certain questions e.g. experiments on ….. - Refusal bearing no
consequences.
2. Risks
There should be no harm to participants, e.g Physical (mostly associated with medical research) and
Psychological (reminding teenagers of events they do not want to e.g rape).
3. Benefits
-State the indirect benefits and their levels of probability (don‘t overstate anything).
4. Anonymity
- Research must not make it possible to link certain responses with specific respondents.
5. Confidentiality
- The researcher identifies given person‘s responses but promises not to do so publicly.
6. Informed Consent
Budget
• Data analysis
• Report writing
• Equipment
• Dissemination
Activity Plan
• If time not specified by donor, allocate reasonable time for the study average one year.
Referencing
It is a method used to demonstrate to your readers that you have conducted a thorough and
appropriate literature search, and reading. Equally, referencing is an acknowledgement that you
have used the ideas and written material belonging to other authors in your own work.
There several referencing styles which include inter alia: The Harvard; Chicago; Vancouver
Convention; Oxford or footnoting system; American Psychological Association (APA); Modern
Language Association (MLA); American Chemical Society (ACS); and, American Institute of
Physics (AIP). A referencing style sets rules of telling you how to acknowledge the thoughts, ideas
and works of others in a particular way. In addition, it is not only a crucial part of successful
academic writing, but also a means to avoid plagiarism.
It should be noted that each referencing style has its own different origins and features. However,
Referencing style requirements cover the two elements of a referencing system: a. in-text citations
such as author-date citations or footnotes b. reference lists or bibliographies. What is of paramount
importance, however, is that you make sure you strictly adhere to the referencing style prescribed
by lecturer/tutor, assignment instructions guide, college handbook. This is largely because
referencing styles are established systems of referencing with consistent rules.
When referencing your work, you should know that consistency is very important. In addition, do
not mix referencing styles within one piece of writing.
• Failure to develop a coherent and persuasive argument for the proposed research.
• Too much rambling– going ―all over the map‖ without a clear sense of direction. (The best
proposals move forward with ease and grace like a seamless river.)
• Sloppy writing
Qualitative research is an approach in which a researcher often makes knowledge claims based
primarily on a constructivist perspective. This perspective uses multiple meanings of individual
experiences, meanings socially and historically constructed, with an intent of developing a theory,
a pattern, or basis of advocacy (Creswell, 2003). Qualitative researchers study things in their
natural settings, attempting to make sense of, or interpret phenomenon in terms of the meaning
people bring to them.
1. Phenomenology
This is a design which deals with the understanding of the essence of a phenomenon and
how individuals experience the world. The concern of the research is to understand social
and psychological phenomena from the perspectives of the people involved. The operative
word in phenomenology is ‗describe‘.
the phenomenological research design, the researcher set aside his/her own experiences in
order to understand those of the participants understudy. In-depth interviews, documentary
review and films/videos are used for the phenomenological research design. Note that in-
depth interview questions will focus on participants‘ experiences, feelings, and beliefs
about the theme.
(b) How do you feel to be a married woman but experiencing sexual rape?
(c) What do you think are the root causes of your problem(s) of sexual rape?
2. Grounded theory
3. Ethnography
It is a strategy of inquiry in which the researcher studies an intact cultural group in a natural
setting over a ‗long‘period of time by collecting primarily observational and interview
data. In ethnography, you immerse yourself in the target participants ‘environment to
understand the goals, cultures, challenges, motivations, and themes that emerge. You will
be able to represent a view of a world as it is structured by the participants under
observation. You will also be able to describe and interpret a group‘s system with regards
to its sociocultural, religious, political and economic organization and also examine a
Page 18 of 136 CIRSM
RRM 400
group‘s observable and learned patterns of behaviour, customs and way of life. Rather than
relying on interviews or surveys, you experience the environment first hand, and sometimes
as a
―participant observer.‖
• Differences among people with regards to, age, residential situation, ethnicity;
• Different behaviours, cultural beliefs, attitudes, norms;
• The structures (policies, rules);
• The processes (community organization, problem solving, decision making); and,
• The statuses (health, economic, educational, social and political).
4. Narrative Inquiry
Strategy of inquiry in which the researcher studies the lives of individuals and asks one or
more individuals to provide cohesive stories about their lives (Clandinin and Connelly,
2000:18). Subjects recount the events of their lives and narrate them into temporal order
and meaning. Such stories would be subsequently presented to the reader(s).
Narrative inquiry is also a social science which represents and understands an individual‘s
experiences. The inquiry serves the researcher who wishes to understand a phenomenon as
an experience rather than to formulate a logical or scientific explanation. (Kramp,
2004:104). As a qualitative research design, narrative inquiry allows the research to provide
a rich description, contextual (background or appropriate) understanding and explanation
of the place or event under observation.
You conduct in-depth interviews, read documents, and look for themes; in other words,
how does an individual story illustrate the larger life influences that created it. Often in-
depth interviews are conducted over weeks, months, or even years, but the final narrative
does not need to be in chronological order. It can be presented as a story (or narrative) with
themes, and can reconcile conflicting stories and highlight tensions and challenges which
can be opportunities for innovation.
5. Case Study
A case study is an exploration of a case over time (activity, process, one or more individuals
or organisations) through detailed in-depth data collection involving multiple sources of
information rich in context (Creswell, 1998:61). It is used when one is interested in
exploring, explaining and describing an organization, entity, company or event.
A case study is exploratory (investigative), explanatory or descriptive.
• The researcher wants to focus on the answers to the ‗Why‘ and ‗How‘ questions;
• If the boundaries between the context and phenomena are not clear.
Multiple methods can be used to gather data case study data. These include: in-depth
interviews, key informant interviews; observation; and, historical documentation.
Data for qualitative research can be collected through a wide range of data sources.
However, a research should always remember to match research objectives with the
appropriate data collection methods.
When ever you are writing any qualitative research method, always explicitly state the
following: method, how many, tool, with who, why, sampling, selection of the
respondents and how data were collected and recorded.
1. Observation
Observational techniques are methods by which a researcher can gather first-hand data
on programmes, processes, or behaviours being studied.
By directly observing operations and activities, the researcher can develop a holistic
perspective (i.e. an understanding of the context within which the issues under
investigation operate).
Whatever the purpose of the study, four questions face the observer:
d) What relationship should exist between the observer and the observed, and how can such a
relationship be established?
Types of Observations
It involves a researcher joining the group he or she is studying, without pronouncing his/her status
to the group. The researcher becomes ‗part of, or an actor‘ in the events he is studying.
The aim of covert participant observation is as far as possible to attain an entirely familiar presence
in the lives of the people being studied i.e., to become invisible and not affect feelings, attitudes
and behaviours of the subjects as much as possible. The researcher participates in and observes the
socio-cultural content of a household, organisation, community and gains important insights into
everyday life.
Involves the researcher being open with the group he is going to study. Before joining a group the
researcher is likely to inform the group's members (either personally or through the agency of a
sponsor) the following: his/her presence; the purpose of the research; its scope; how long the
research will last; and any other relevant information about the study. It is advisable for researchers
to keep a field diary where day-to-day observations, along with the researcher‘s ideas, thoughts
and feelings are recorded.
Merits of observations
Disadvantages of observations
• Structured schedules may not capture critical factors Unstructured observations are
difficult to carry out.
• Costly-labour intensive and involves analysis costs
It is a research technique that collects data through group interaction on a topic determined by the
researcher. FGDs provide normative information or a group generated perspective. In that group,
concepts will be defined broadly. Given that FGDs capitalise on group dynamics, they allow a
small group of respondents to be guided by a skilled moderator into increasing levels of focus and
depth on the key issues of the research topic.
FGDs are a qualitative technique involving between six and twelve people talking spontaneously
and freely on themes considered important to an investigation/research. It has a moderator and a
note taker. Each FGD should last between one and one and half hours.
1. Group interaction which generally stimulate richer responses and allow new and valuable
thoughts to emerge
2. The researcher can observe the discussion and gain first hand insights into the respondents‘
behaviours, attitudes, language (including body language) and feelings.
3. FGDs can be completed more quickly and generally less expensive than a series of in-depth
interviews.
Setting up FGDs
The number of groups for FGD depends on the purpose of the research and the nature of data
required. However, the following general guidelines should be followed:
a) Conduct at least two FGDs for each independent variable considered relevant to the topic.
e.g. if sex is a relevant variable, then you should have 4 groups, two for men and two for
women. If the researcher considers marital status to be important as well, the number of
groups should be eight.
b) Conduct FGDs until information obtained is no longer new or seems repetitive. If during
FGD you find that the information obtained is being repeated and no new information is
forthcoming, then most probably you have done enough FGDs.
c) If responses would vary according to geographical location , e.g. urban areas versus rural
respondents, then separate FGDs should be conducted in every relevant location
d) Economic conditions determine relevant variables and hence the characteristics of each
FGD must reflect the underlying differences.
e) Consider also race when conducting FGDs. Separate FGDs should be conducted with a
specific race.
An FGD guide is the tool used to collect data from FGDs. However, either a structured or a
semi structured FGD guide can be used to collect qualitative data. What you need to appreciate
are the merits and demerits of each type of guide. Remember also, the care with which you
plan and design your FGD guide will influence the quality of the information you obtain.
1. Do not begin with difficult/sensitive questions. Put these near the end so that if respondents
decide not to answer these, you do not lose their willingness to answer earlier questions.
2. Do not make respondents feel they ought to know the answers. Help them by saying,
‗Perhaps you have not had time to give this matter much thought?‘ May be they can find
the answer latter.
3. Decide carefully whether you should avoid emotional or sensitive words. Using words like
greedy, oppressed or immoral may seem to imply a judgment. Such words can cause bias
in answers. Respondents may be reluctant or nervous to give answers.
4. Avoid making assumptions. Why are men alone in this community undertaking criminal
activities?
5. Do not use confusing questions. Avoid questions like, ‗Why would you prefer and at the
same time not prefer your child to be vaccinated?‘ Keep it simple and positive. Ask, ‗Why
would you wish your child to be vaccinated?‘
Page 24 of 136 CIRSM
RRM 400
6. Use both direct and indirect questioning. Direct questions are those ―normal‖ questions
that we can ask to friends, family members, and people who we know well. On the other
hand, indirect questions are little more formal and polite. They are used when talking to a
person we do not know very well, or in professional situations.
Whenever you design your FGD guide make sure the following information is
available:
a) Identification information eg. province, city, the district, ward, enumeration area,
suburb (if data was collected in an urban area), village (rural area), date;
• The number of FGDs to be conducted and specifying the age and sex
composition for each FGD;
• General introduction of the researchers and the objectives of the study should
be done before the discussion begin;
• How questions and suggestions from the participants about the research process
should be recorded.
At the end of each day, the team should summarise the discussion. Cards with different
numbers and colours should be used for recording responses to each item discussed. Cluster
reports should be written combining all the FGD sessions for each respective cluster. At
district level, a cluster report combining all the sites should be written.
3. Interviews
This technique is characterised by extensive probing and open-ended questions. The emphasis is
on obtaining answers to carefully phrased questions. Interviewers are trained to deviate only
minimally from the question wording to ensure uniformity of interview administration. Interviews
are conducted on one-on-one basis between a respondent and a highly skilled interviewer.
Whenever conducting interviews, the following questions should face the interviewer:
• What relationship should exist between the interviewer and the respondent and how can
such a relationship be established.
Types of interviews
These interviews are conducted with the actual people who are experiencing or experienced
a given phenomenon. An in-depth interview guide is the instrument used to collect the data.
The indepth interview guide and the interview itself could be structured, semi structured
and unstructured, but must be uniformly administered to the same group of people.
Unstructured interviews follow the form of conversations more closely and the interviewer
does not follow a rigid form (take care of the trigger question and provide impetus to the
respondent to keep on track). When to use in-depth interviews
• Highly sensitive subject matter - A study among women who have had an abortion
regarding their feelings about sexuality and family planning.
These are interviews with knowledgeable community experts who deal/interact with the target
population on daily basis. They provide valuable insights to the target population‘s behaviour,
attitudes, beliefs, important statistics pertaining to the target population and also key
recommendations for the study.
3.2.1 Autobiography
A written story about someone‘s life (in the form of a book), where the writer is the subject. It is
the subject‘s own expression of his/her personal experience given within a chronological
framework. The term ‗life history‘ is also used in this connection. The autobiography concept
appears to be more precise in the sense that it emphasises the fact that it is the subject who is
expressing her/himself. Many autobiographies are written by people who were involved in
historical events that transcends their personal lives. So reading an autobiography may educate a
reader about the history and events above and beyond the subject of the book.
3.2.2. Biography A story of someone‘s life (written in the form of a book) by a different person.
Note that in biography, the informant (interviewee) is in control of the exploration of the topic.
However, the issues to be included in the biography are suggested by the interviewer. Biographers
often spend ‗years‘ studying their subjects, allowing them to provide insights into motivations,
contexts, as well as events. A biography maintains a more objective tone, thus the reader is more
likely to get the story from several angles (the interviewee‘s and the writer‘s), rather than only
from the angle of the subject of the book. The general rule in biography is that the respondent is
free to explore any field that appears to be related to the topic, even if it is a distant relationship. It
is therefore important for the interviewer to make it clear to the respondent, the purpose of the
study and what information the researcher is seeking. The interviewer should be a partner in the
dialogue, and should avoid putting the informant under any constraints as much as possible. The
role of the interviewer is to provide incentive and encouragement by being a listener.
• In ordinary social environment of the subject, with whom would the interviewer be
identified with.
While length of the interview varies in relation to the objectives of the study, an
autobiography or a biography of a single individual may be a study in itself.
It is important to choose the times when the subject availability permits the most open
dialogue. It should be understood that some interviews require a high level of
concentration from both the interviewer and the informant and that the fatigue factor
must be taken into account.
In the case of a biography, the basic material is furnished in 20, 30, or even 40 hours of
interviews. It is better to come back a second time rather than to rush the interview.
e). Social setting and social class of the protagonist (hero being interviewed)
This is not only important in terms of language, but also in relation to inter-personal
attitudes, which influence the form and content of the communication.
This method is used to capture the status of those people who must have been identified as
appropriate to the research problem. E.g. in the process of participatory methodology, the
community members may identify individuals considered to be very poor (or very rich)
and who may have engaged in activities/projects designed to improve their standard of
living.
The strategy here is to capture a wide spectrum of such individuals in a given community.
The approach can involve the interviewing of both successful and unsuccessful cases. The
idea is to capture the perceptions of the respondents as best as possible in their own words
and language.
5. Documentary Review
The method provides a record of events over a period of time, and a useful overview where
data are ready and available. It may also provide an indication of the state of development
uninfluenced by the evaluator‘s presence. Documentary review assists in-depth evaluation
and assists evaluator's understanding of the context. Public documents (policy documents,
minutes of meetings, newspapers) and private documents (journals, diary or letters) can be
used for documentary review.
6. Audio-visual material
This is an unobtrusive method of data collection which collects data using photographs,
video-tapes and art/objects. Provides an opportunity for the informant to share directly his
or her own ‗reality‘. The methods is creativity because it captures attention visually.
In order to collect these types of data for a study, a target population, community, or study
area must be identified first. It is not possible for researchers to collect data from everyone
in a sample area or community. Therefore, the researcher must gather data from a sample,
or subset, of the population in the study. In quantitative research, the goal would be to
conduct a random sampling that ensured the sample group would be representative of the
entire population, and therefore, the results could be generalized to the entire population.
The goal of qualitative research is to provide indepth understanding and therefore, targets
a specific group, type of individual, event or process. To accomplish this goal, qualitative
research focus on criterion-based sampling techniques to reach their target group. There
are three main types of qualitative sampling: purposeful sampling, quota sampling, and
snowballing sampling. The following descriptions describe the reasons for choosing a
particular method.
1. Convenient sampling
It relies on the availability the respondents (anyone in the target population whom the
researcher comes across). It is also known as accidental sampling (interviewing whoever
happens to stroll by BUT in the target population). The assumption is that by simply being
in a given locality, one can get a reasonably representative cross-section of the target
population. The method is fast, saves time & energy, easy to employ and is cost effective.
It is useful for pre-testing a questionnaire. However, there is no control over representation
of sample, hence results may not be generalized.
2. Purposive sampling
Also known as judgmental sampling. In this type of sampling, participants are selected or
sought after based on pre-selected criteria based on the research question. A researcher
purposively selects certain groups of people for their relevance to the issues being studied
and on the basis of his/her knowledge of the population. This sampling procedure is often
used when studying social phenomena that are rare to be studied using a representative
sample of the population. It ensures that researchers at least get some information from
respondents hard to locate e.g criminal activities among adolescents with dwarfism
syndrome or lymphoma patients. However, there is no assurance that those selected are in
any way representative of some population of more general interest. The sample size may
be predetermined or based on theoretical saturation, which is the point at which the newly
collected no longer provides additional insights.
3. Snow balling
Also known as reputational referral sampling. Each person (with desired attributes)
interviewed is asked to suggest additional people (with the same attributes) for
interviewing. It is often used in the absence of a complete or reliable sampling frame or
when access to subjects for interviewing is difficult. The sampling technique is often used
for studying special populations e.g. drug users, homeless people and members of secret
societies. It is used primarily for explorative research (breaking new grounds). The
interviewer may locate an individual with desired attributes and after interviews, may
request to be referred to other respondents of similar attributes. Technique is good for
gaining access to knowledgeable people who may provide indepth knowledge on a given
phenomenon. However, there are problems with representation & generalizability.
4. Quota sampling
People are selected into a sample on the basis of pre-specified background characteristics
e.g. age, sex, race, marital status, place of residence and religion. A quota is produced in
accordance with percentage of the population composed of a particular stratum e.g. 49%
male and 51% female, 75% rural and 25% urban, 4% white and 96% black. Accidental
sampling is done to identify respondents in each stratum. There is assurance that certain
strata of the population will be included. Hence, resembles probability sampling and strives
for representativeness.
Hint on sampling
Note that once a sampling method has been determined, the researcher must consider the
sample size. In qualitative studies, sampling typically continues until information
redundancy or saturation occurs. This is the point at which no new information is emerging
in the data. Therefore, in qualitative studies is it critical that data collection and analysis
are occurring simultaneously so that the researcher will know when the saturation point is
reached. It is important to understand that the saturation point may occur prematurely if the
researcher has a narrow sampling frame, a skewed analysis of the data, or poor
methodology. Because of this, the researcher must carefully create the research question,
select an appropriate target group, eliminate his or her own biases and analyze data
continuously and thoroughly throughout the process to bring validity to the data collected.
1. When should the observer make notes? The best time for recording observation is
immediately, as this will result in the minimum amount
of selective bias and distortion through memory. However, this is not always possible
as the observer may lose relevant aspects of the observation if s/he is writing and
observing simultaneously. The observer must write their impressions as soon as possible
after and observation.
(b) More commonly though, researchers record their observations through field
notes. b) Interview Data
• Note taking
The following are procedure which should be used to ensure the accuracy of the observation:
(i) Reflexivity. This involves the researcher monitoring, and reflecting on his/her
own influence and on the effect of the social context, on data production. (Foster, 1994:91).
b) Interview
The following are ways for controlling for these potential sources of error:
• To compare one respondent‘s account with other accounts i.e. crosschecking information;
• To exclude interviewer bias from recording of responses, some mechanical method should
be used.
Page 32 of 136 CIRSM
RRM 400
Fatigue, boredom, bias and expectation of a specific answer, among other things, can affect
this ability. If the interviewer keeps on taking notes during the interview, s/he might miss
something that is said, in addition to disrupting the flow of the discussion.
This can be affected by: a confusion of content between interviews; selective retention on
the part of the interviewer; the interviewer‘s attempt to retain too much information.
During the interview, it is the function of the interviewer‘s ability to identify the actual
level of richness of the content being provided. It is also important that the interviewer
steers the respondent away from irrelevant information, and induces richness when
superficial answers are being provided. The research must act upon the information being
received and altering the interview as it takes place. This is really an issue of probing,
focusing and staying on track with respect to the interview objective. A researcher should
display some familiarity with the phenomenon and setting under study and strong
conceptual (theoretical) interests. Use also a multi-disciplinary approach, as opposed to a
narrow grounding or focus in a single discipline. There is also need for good ‗investigative‘
skills, including doggedness (perseverance or determination), the ability to draw people
out, and the ability to ward off premature closure.
Having presented the qualitative data collection methods and the sampling strategies, let me give
you an example of how you would present the data collection methods whenever you are writing
the methodology section of your research/ dissertation. Let me assume that you have been asked
to carry out a research on the topic, ‗Gender based violence (GBV). A case study of Gweru,
Zimbabwe‘. The objectives the study are: to identify the forms of GBV in Gweru; to establish the
underlying factors to the forms of GBV so identified; and, to assess the availability, accessibility
and acceptability of reporting structures in Gweru.
Always remember that you should explicitly state the following: the method; how many; tool;
with who; why (you are justifying the the selection of the method by linking to your objectives);
selection of the respondents; sampling; and how data were collected and recorded – an approach
I have earlier on alluded to.
Data collection methods a) Focus Group Discussions (FGDs) (State the method first)
Fourteen FGDs were undertaken with males and females comprising both the younger and older
generation using a semi-structured FGD guide. (Note that in the first sentence, I have stated the
number of FGDs, participants and the tool). Seven FGDs were conducted with males and seven
others with women. Females were selected so as to test and validate information obtained from
male FGDs. Each FGD had between 6 and 12 participants. Since FGDs are largely exploratory, it
was necessary to get a group generated perception of the forms of violence common in Gweru.
(Note that I am now justifying the use of FGDs but however, linking to my objectives). Moreover,
FGDs provided data on what the community perceived as the underlying causes to GBV. (Another
justification that has been linked to my second objective). The community‘s perceptions and
evaluative analysis of the availability, accessibility, and acceptability of reporting structure for
GBV was also drawn upon in the FGDs (Another justification that has been linked to my third
objective). Selection of respondents for each FGD was based on the following age cohorts: 15-19;
20-29; 30-39; 40-49; 50-59; 60-69; and 70-79 years. Each age cohort had its own FGD. (The
sentences above are showing how I have selected my FGD participants). Convenient sampling
and willingness to participate were the basis of selection of the
FGDs‘ participants. (I am now articulating the sampling strategy). One of the researchers
facilitated the FGDs using the local Shona language. The other researcher recorded the responses
for the participants using a tape recorder and extensive note-taking. (I am highlighting how the
FGDs were conducted and how data were collected).
Twelve face-to face in-depth interviews were undertaken with selected women who suffered from
the identified forms of GBV such as physical violence, spousal rape, rape by a stranger and forced
marriage. These interviews were conducted using an in-depth interview guide. In-depth interviews
Page 34 of 136 CIRSM
RRM 400
helped solicit information about personal experiences of each form of GBV. They also provided
personal regarding the factors underlying each form of GBV. (Note that at times you can use in-
depth interviews to follow up on issues raised from other methods of collecting data such as a
survey, documentary review and key informant interviews). The GBV victims were first identified
through the survey. However, snow-balling was further utilised for the identification of other in-
depth interviewees. The author conducted the in-depth interviews, while she also recorded the data
through notes and tape recordings.
Six key informant interviews were conducted with the Victim Friendly Unit (VFU) police officers,
Musasa Project and Ministry of Health and Child Welfare (MoHCW) focal persons. A semi-
structured key informant interview guide was used to solicit data. Both males and females were
selected from each category. Two officials from each organization were purposively selected. Key
informant interviewees were community experts who provided statistics on the prevalence of the
various forms of GBV in Gweru. They provided data on how they investigate and deal with each
of the identified form of GBV. The diversity of key informants provided varying perspectives on
the socio-cultural, economic, religious, legislative, political and environmental factors underlying
GBV. They also provided information regarding the availability, accessibility and acceptability of
the reporting structures for GBV. Key informants provided confidential incidences, statistics and
recommendations pertaining to GBV in Gweru. Their particular community based knowledge and
understanding of GBV issues provided a logical basis for the development of more practical,
locally oriented and detailed plans, recommendations and solutions. Key informant interviews
were conducted by the author. The same author recorded data through note taking and tape
recording.
Qualitative data from FGDs, the key informant and in-depth interviews should be transcribed,
translated and typed. Data can be analysed using the critical realism approach. This is a scenario
where data is firstly demarcated into thematic segments which inform the objectives of the
research. For instance, let us assume that you analyzing qualitative data for the topic, ‘Factors
influencing criminal activities in Mabvuku, Harare‘ and the objectives of the study are: to identify
the commonly reported criminal activities in Mabvuku; to explore the factors underlying the
criminal activities so identified; and, assess the effectiveness of law enforcing agents in cubing
criminal activities. You should identify and code the themes for each objective. For instance for
the first objective, you must be able identify and code each of the criminal activities that have been
reported from the qualitative data. For the second objective, you should also identify and code
separately the factors underlying criminal activities. Note that you can identify the following
factors which should be coded separately: social; cultural; economic; environmental; political and
religious. Among the social factors, you might be also able to identify separate subthemes which
underlie the social factors. These subthemes could be peer pressure, double and single orphanhood,
divorces, and lack of parental guidance. So basically, you should code the main theme. Within
each segment of the main theme, you should also code the sub segments representing different
aspects of the main theme. Subsequently, assess the prevalence of codes in order to describe the
commanality of views while at the same time discussing similarities and differences in related
codes across distinct original sources/contexts. In addition, compare also the relationships between
codes, that is sub-themes.You can display the output of your analsyis uin the form of a structure
e.g a tree. Note that the stem of each tree would represent a particular theme (code), while the
branches represent the identified issues around the particular theme. You can also attach common
quaotations to the respective thematic general findings.
As you present your qualitative findings, make sure the findings are answering/responding to the
research objectives, research questions, research problem and also showing the research methods
that were use. In addition, remember you are presenting the findings you noted from the identified
themes and sub-themes. Firstly, start with a general statement which highlights a particular theme.
Then explain the theme or the subtheme also highlighting the method(s) which revealed the
finding. You should then substantiate your explanations with selected quotation(s) that are
poignant and/or most representative of the theme or research finding. Below is my example of how
you may present the findings for the topic on Gender based Violence in Gweru:
Qualitative data revealed that employment status of a woman is can influence physical abuse.
Women who were not formally employed were noted to be economically challenged. As a result,
they highly dependent on their husbands for survival. Such dependency was noted as a facilitating
factor to physical abuse during key and in-depth interviews. One woman aged 34 years in an in-
depth interview supported the view that unemployed women are always on the writhe of physical
abuse as she reported that:
Normally unemployed women are mostly subject to physical abuse because of their 100%
dependency on the husband. How I wish I was employed. My husband always beat me with
clinched fists each time I tell him that we no longer have grocery, and more so at the
monthend.
He always accuses me of being a parasite, waiting to suck his energy. Surprisingly, he does
not allow me to sell even pre-paid cell phone top up cards to earn extra cash so that I cease
to be a parasite. I cannot report my case to police because if he is to be locked in cells, that
will be a double tragedy to me. I do not have resources to look after myself and children.
Key informant interviews with Victim Friendly officers also highlighted the view that unemployed
and informally employed women suffer the greatest quandary of physical abuse. One of the officers
stipulated that:
More than 60% of women who have ever reported cases of GBV to this station are either
unemployed or informally employed. They usually say that poverty is disadvantaging them
and it has become the key determinant of GBV. However, it is surprising to note that before
the cases are taken to court, the majority of them withdraw the cases, a development that
likely perpetuates other cases of GBV.
Quantitative data is any data that is in numerical form such as statistics, and percentages. The
researcher analyses the data with the help of statistics and hopes the numbers will yield an unbiased
result that can be generalized to some larger population. Quantitative data collection methods
emphasize objective measurements and the statistical, mathematical, or numerical analysis of data
collected through polls, questionnaires, and surveys, or by manipulating pre-existing statistical data
using computational techniques. Quantitative research focuses on gathering numerical data and
generalizing it across groups of people or to explain a particular phenomenon. Lisa (2008) notes
that quantitative research also focuses on numeric and unchanging data and detailed, convergent
reasoning rather than divergent reasoning [i.e., the generation of a variety of ideas about a research
problem in a spontaneous, free-flowing manner]
• Data are in the form of numbers and statistics, often arranged in tables, charts, figures, or
other non-textual forms.
• The research can be used to generalize concepts more widely, predict future results, or
investigate causal relationships.
• Researcher uses tools, such as questionnaires or computer software, to collect numerical
data.
The overarching aim of a quantitative research study is to classify features, count them, and
construct statistical models in an attempt to explain what is observed.
Target Population
This is the entire population or group that a researcher is interested in studying and analyzing, e.g
all men between 18 and 79 years. The target population defines those units for which the findings
of the survey are meant to generalize.
Sampling
The selection of a small group of individuals from whom data is collected for generalizations to a
larger group of people.
Sampling frame
A list of all the elements from which a sample is drawn e.g. all youths (males and females) aged
15 to 24 in Seke Communal lands.
A Representative Sample
• This is a subset/ ‗cross section‘ of a population that closely matches the characteristics of
its population as a whole. A representative sample is one that has strong external validity
in relationship to the target population the sample is meant to represent. In other words, the
sample is a fairly accurate reflection of the population from which the sample is drawn.
• A representative sample is intended to make some inferences and generalisations about the
whole population.
Let me use Figure 3.1 to show the difference between a population and a sample. Note that the
population is the sampling frame from which the target population can be drawn.
Unrepresentative sample
This is a biased sample which does not reflect the distribution of characteristics of the target group. A
researcher intentionally or unintentionally seeks out respondents who support his theory or views.
Researchers commonly use biased samples because of laziness/sloppiness. They
take samples from what happens to be easily available rather than taking their time and effort to generate
an adequate sample and drawing a justification. Potential consequences of unrepresentative sampling
- Data collected from the unrepresentative sample cannot be generalised to the target
population, and is therefore biased.
- Data collected from the unrepresentative sample does not represent the population from
which it is drawn.
- Biased samples are less likely to contain numbers proportional to the whole population.
Sampling error
Unavoidable, but measures the differences between sample statistics and population
parameters usually resulting from probability sampling methods.
Non-sampling error
It emanates from other factors that reduce the accuracy of estimates based on sample results and
these include:
1. Non-response
• Problem inherent in all interviewing methodologies (door – door, mail, telephone, internet
etc)
Affects the generalisability of results. E.g Focusing only on the army in a study on HIV & AIDS
among the uniformed forces or focusing only on the youths when studying the prevalence of
criminal activities in an area.
from which the sample was drawn, the resulting data are called weighted data. Imagine you have
a target population that is evenly split by gender. If you then interview a sample of 400 people
within this population, 300 of whom are male and 100 female then you would know that you
sample overrepresents men.
Weighting the resulting data can help you to correct this imbalance. Let us assume that in your
study, the target proportions for both men and women are supposed to be 50%. The proportion of
men would therefore need to be
―down weighted‖ from 75% (300 out of 400 interviews) to 50% while the proportion of women
needs to be ―up weighted‖ from 25% to 50%.
In this case, weighting would be multiply the existing female interviews by 2, so that the female
response is amplified in the data. For example, on the gender question we have 100 people
answering female but after weighting this becomes 200 as the "female" data is counted twice. The
male interviews need to be correspondingly down weighted. In this instance we need to get 300
responses to effectively count as 200 so we multiply the male responses by two-thirds (or 0.67).
Before weighting we have 300 males coded on the gender question. Multiplying by two-thirds
gives us 200 males, equaling the number of female responses after weighting. The numbers used
to multiply the responses from each proportion of the sample are called weighting factors. A
summary of the weighting factors for this example is shown in Table 3.1.
Note that typically weighting is used to match the population profile on more than 1 variable to get
as representative a sample as possible. For example, to get a representative sample of a country‘s
population we might weight on a number of demographic variables such as gender, age, region
and social grade.
Uses of weighting
One of the basic principles of probability samples is that every respondent must have a known,
non-zero chance of being selected. In the typical media poll, the sample is designed to represent
all adults 18 years of age and older.
In practice, this also means ―among those residing in telephone households‖ because that is the
way the data are collected. So survey data is weighted for the probability of selecting a respondent.
b). Adjusting for sample design
After all of the data have been collected, some simple frequencies are run on selected demographic
variables so they can be compared to known characteristics of the population obtained from an
external source, such as the Zimbabwe Census. This adjustment is made because some
demographic groups tend to be overrepresented or underrepresented in the sample. For example,
young men are considerably harder to reach at home than older women. So unweighted data
frequently include a larger proportion of older women and a smaller proportion of younger men
than what the Zimbabwe Census reports. A pollster typically makes small adjustments -- called
post-stratification weights -- to bring the sample into line with known population characteristics
such as age, sex, region and education.
Design effect
This is an adjustment that should be used to determine survey sample size. According to Henry
(1990), Frongillo (1996) and United States Census Bureau (2001), design effect is basically the
ratio of the actual variance, under the sampling method actually used, to the variance computed
under the assumption of simple random sampling. For an example, ―The interpretation of a value
of (the design effect) of, say, 3.0, is that the sample variance is 3 times bigger than it would be if
the survey were based on the same sample size but selected randomly. An alternative interpretation
is that only one-third as many sample cases would be needed to measure the given statistic if a
simple random sample were used instead of the cluster sample with its (design effect) of 3.0‖. The
designer effect is used to determine how much larger the sample size or confidence interval needs
to be. In general, for a well-designed study, the design effect usually ranges from 1 to 3. It is not
uncommon, however, for the design effect to be much larger, up to 7 or 8, or even up to 30. Note
that the main components of the design effect are the intra class correlation, and the cluster sample
size.
1. Experimental designs:
The main focus of experimental designs is to determine causation. Manipulation of one or more
variables under carefully controlled conditions is typically common. They are better suited for
explanatory studies. Among the experimental designs, there are true experimental designs and
quasiexperimental designs.
2. Non-experimental designs:
There is no manipulation of variables by the researcher. Variables are observed/studied as they
exist, e.g surveys.
Types of surveys
1. Cross-sectional
Observations are made at one point in time (snapshot approach)
Figure 4.1.
Single unrepeated survey
Suited to descriptive and exploratory studies
3. Longitudinal survey
Conducting a study at different times (Figure 4.2).
How large may a sample be or how big is a ‗representative sample? There are several methods of
determining the sample size. However, you can use the formula n = Z²pq/e². Where: n = sample
size; z = standard normal deviation set at 1.96 corresponding to 95% confidence interval; p =
Page 48 of 136 CIRSM
RRM 400
proportion of the target population to the entire population; q = proportion of the entire population
excluding the target population (1-p); and e = maximum allowable error margin set at 0.05.
Rule of thumb in sample size determination
In the absence of statistical guidelines, a sample size of between 5% - 10% of the population is big
enough to be representative.
Probability sampling
Selection of samples typically involve some random selection mechanism. A sample will be
representative of the population from which it is selected if all members of the population have an
equal chance of being selected. Probability sampling is normally used in large scale surveys. It
provides an efficient method for selecting a sample that adequately reflects variations existing in a
population. Probability sampling is more representative when compared to other sampling
methods. Random selection of respondents reduces conscious and unconscious bias by the
researcher. However, note that probability sampling can be costly and demanding given that it may
be technical.
The researcher must know his/her total population and the sample size. With these two parameters
you firstly calculate the sampling interval (n) = Population size/ sample size. eg. if the population
size 100000, sample size 4000, then the (n) would be 25.
Randomness is in the selection of the first case in the first sampling interval. However, the random
number should be between 1 and 25 which you would call x or any letter. Suppose you select 5,
the number you will select in the next sampling interval is x + 25. This would be 5 + 25 = 30, then
30+n = 35; 35+n =40, etc. Systematic random sampling is simpler to use when compared to simple
random sampling. Randomization is done only once. Nonetheless, the technique may suffer from
sampling bias. c. Stratified Random Sampling
Involves dividing the population into 2 or more strata and then taking either a simple random
sample or a systematic sample from each stratum. Note that the strata must be internally
homogenous with heterogeneity between or among them. Strata are selected on the basis of
variables relevant to the study. They maybe age, sex, economic status, place of residence etc. You
need to sample each stratum according to its percentage in the total population i.e. use the
probability proportional to size to allocate the sample proportionately to each strata.
Disproportionate sampling is when strata are not sampled according to their percentage in the
population. It is done when proportionate sampling yields a very small number of elements in a
population and this does not allow detailed data analysis to be done. It should be noted that
stratification increases precision by reducing sampling error. The main challenge with stratification
can be on deciding which variables to govern stratification e. Cluster sampling
It is used when it is not practical or very difficult to obtain a complete list of all elements in a
population because of the massiveness of the sampling frames. E.g. of massive sampling frames
include all adolescents in Zimbabwe, all university students in Zimbabwe, the population of
Zimbabwe, number of church members in the country etc.
1). Randomly selecting existing clusters in a population through simple random, systematic or
stratified random sampling
OR, creating clusters and randomly selecting some from among them.
2. Randomly selecting individual elements from randomly selected clusters and interviewing them.
Multi-stage cluster sampling is one form of cluster sampling involving various stages at which
sampling is done.
For example, How would you sample adolescents for participation in a national baseline survey
on cyber security?
Page 50 of 136 CIRSM
RRM 400
NB. Obtaining a complete list of all adolescents in Zimbabwe is next to being impossible.
Stage 1: Obtain sampling clusters at national level. List all provinces in Zimbabwe & randomly
select those to collect data from.
Stage 2: Obtain sampling clusters at provincial level. List all districts in the selected provinces &
randomly select some for data collection
Stage 3: Obtain sampling clusters at district level. List all wards in selected districts and randomly
select some for data collection
Stage 4: Obtain sampling clusters at ward level. List all villages in selected wards and randomly
select some for data collection
Stage 5: Obtain sampling clusters at village level. List all households in the villages and randomly
select some for data collection
Stage 6. Randomly select adolescents from selected households and interview them.
It should be noted that cluster sampling is highly efficient and cost effective when compared to
other probability sampling methods. It enables researchers to conduct studies that would otherwise
be impossible. Nonetheless, the technique has more than one sampling error given that it has more
than 1 cluster sample.
NB. In discussing methodology, you discuss sampling down to the selection of individual
participants.
This is the tool or instrument used when collecting survey data which will be subsequently anaysed.
It is mainly characterized by close-ended questions e.g. (yes/no), (strongly, agree, disagree, and
strongly disagree).
Questions in a questionnaire must answer the broad and specific objectives of the study. In
designing the questionnaire, you should be aware of the indicators to be measured. The
questionnaire format should be spread out and orderly. You should choose appropriate response
format and this should be well-spelt out at the beginning of the questionnaire. You should indicate
how the respeonse should be shown (circling, ticking or putting an X where appropriate). The
common features of a questionnaire are shown below. Introduction;
Page 51 of 136 CIRSM
RRM 400
Every questionnaire to have an introduction which explains what the study is about.
Questionnaire to contain clear, specific instructions & introductory comments where appropriate.
- Mostly has close-ended questions, open-ended questions should be few and avoided.
- Response category for closed-ended questions should be exhaustive & mutually exclusive
(except for multiple responses).
- Questions may also be made up of statements measuring attitudes, beliefs and knowledge
(Matrix questions).
- Indicate (by ticking) in the appropriate space provided, whether you Strongly Agree (SA),
Agree (A), Disagree (D), Strongly Disagree (SD), Undecided (U).
- Multiple response question(s). Note that in a multiple response question, a respondent has
an option to give more than one answer. Eg.
b).Spousal rape 1 2
Other… (Specify)…
In designing a questionnaire, make sure you create short and concise questions. Avoid long,
winding and complicated questions as they would confuse the respondents. Also avoid double-
barreled questions. This is a single question or statement that may produce 2 different answers or
combining questions, e.g.
Sex education must be taught at primary schools and the schools must be allowed to provide
condoms to sexually active pupils.
When designing a questionnaire also remember to ask for one piece of information at a time. Avoid
biased and leading questions e.g.
Make sure you avoid asking questions with names of prestigious persons attached to them e.g. Do
you agree with the Prime Minister that -----.Take note of Contingency questions and questions that
do not apply to every respondent (skip patterns). E.g. Have you ever had sex? 1. Yes 2. No (skip
to Question 5)
- Do not start with awkward or embarrassing questions – respondents may just give up.
- Start with easy and non-threatening questions. This encourages respondents to carry on
with the questionnaire. - Ensure options are mutually exclusive
– e.g. ―How many years have you worked in academia: 0-5, 6-10, 11- 15, over 15.‖ Not,
―0-5, 5-10, 10-15…‖
Pre-testing a questionnaire
Involves administering a questionnaire to a very small sample (not in your area of study)
and noting all the problems that arise for the interviewer and subject. This approach
highlights any ambiguity in the wording of your questions. It would enable you to note the
necessary and desirable changes in wording, format and ineffective questions. Methods of
administering a questionnaire
1. Self- administered
Disadvantage
- Telephone interviews
2. Telephone interviews
Advantage
Disadvantage
3. Interviewer administered
Advantage
- Dress appropriately
- Establish rapport
- Be neutral
- An interactive front-end computer system that aids interviewers to ask questions over the
telephone.
- Answers are then keyed into the computer system immediately by the interviewer
- Lower costs
Quality control
The aim is to ensure that quality data are collected. Measures include:
Page 55 of 136 CIRSM
RRM 400
- Pre-testing questionnaire
- Field editing
- Data cleaning
Let me show you how would write the survey method using the method, how many, tool, with
who, why, sampling, selection of the respondents and how data were collected and recorded
approach I had use in qualitative methodology. My example will still use the topic on Gender based
violence in Gweru.
A face-to-face survey, using a questionnaire was carried out with males and female between 15
and 79 years of age. The survey was undertaken in order to quantify and determine the magnitude
of the commonly reported forms of GBV in Gweru, as well as the factors underlying the forms of
GBV so identified. The survey also quantified the extent of the availability, accessibility, and
acceptability of the reporting structures by the target population.
The sample size was calculated using the formula; n = (Z2pq)/e2. Where n = the desired sample
size; Z2 = the standard normal deviate usually set at 1.96 which corresponds with 95% confidence
interval; p = the proportion of males and females (the target population) between 15 and 79 years
to the total population in Senga Gweru; q = the proportion of total population in Senga excluding
15 to 79 year old males and females (1- p); and e2 = desired level of precision set at 0.05. In order
to calculate the sample size, a 95% confidence interval was used. In addition, 0.05 error margin
was assumed to be the degree of accuracy. This means that deviations of sample estimates from
the true population were only considered accurate if the significant level was pegged at 5%. The
test statistics for Z scores was also used.
Senga has a total population of 4710 of whom men and women between exact age 15 and 79 years
constitute 1297 (Note that this is my assumption. You have to use the correct figures and cite the
source). Therefore, the sample size calculation was (1.962*0.28*0.72)/0.052. The calculated
sample size was 303 male and female respondents. However, for robust analysis, the sample size
was set at 500 respondents.
Sampling procedures
You state all the sampling procedures you will have used. It can be a multistage sampling where
you would start off by selecting a ward and enumeration areas, household and then the actual
respondent. You will also need real or the actual figures of the distribution of the population in
your study area to assist you in explaining the sampling procedures.
Descriptive Statistics
Median
The median is the middle value of rank ordered data. Value that separates the higher half of a data
set from the lower half. It can be calculated by arranging all values from lowest to highest and
determining the value in the middle.
If there is an odd number of values in the data set, then the median is the middle value. Assuming
that you are given the data set 3, 2, 1, 6, 7, 4, 9, you must first arrange the data set from the lowest
to the highest. Thus, 1, 2, 3, 4, 6, 7, 9. From this arrangement, the median is 4.
If there is an even number of values in the data set, 3, 2, 1, 6, 7, 4, 9 and 8, then the median is the
mean of the two middle values. Firstly arrange the data set from the lowest to the highest, 1, 2, 3,
4, 6, 7, 8, 9. From this data set, the median is the average of two middle values, 4 and 6 = 5.
Mode
The most frequent measurement. Examples:
a) 132, 139, 131, 138, 132, 139, 133, 137, 139. Mode = 139.
c) 56, 23, 48, 78, 94, 35, 88, 69, 44, 53, 27. There is no mode
Standard deviation
This is a function of sampling error which is meant to measure how much sample statistics are
deviating from the true population parameters. The bigger the sample size, the smaller the standard
error and the more representative the sample is.
The formula for the sample standard deviation of a data set (s) is:
Where:
xi is each value is the data set; x-bar is the
mean; and, n is the number of values in the data set.
Steps in calculation the standard deviation.
squares,
5. Divide the sum of squares (found in Step 4) by the number of numbers in the data set minus
one; that is, (n – 1) to get the sample variance,
Note that the sample variance is the average of the squared differences from the mean.
Example.
You have the following data set: 1, 2, 4, 6, 7, and you are asked to calculate the standard deviation.
Solution.
Calculate:
When data are normally distributed, plotting them on a graph results in an image that is bell-shaped
and symmetrical, like the one shown here. In such a distribution of data, the mean, median, and
mode are all the same value, and coincide with the peak of the curve.
The normal distribution is also often called the bell curve because of its shape. Note that in a normal
distribution, skewness = 0. This means that the mean = median= mode.
a) Bimodal distribution
This is when the data has two peaks (Figure 5.2) Picture.
Data can be skewed either to the left or to the right, Figure 5.3). Note that when data is skewed to
the left, the majority of the data values fall to right of the mean, skewness < 0. However, when
data is skewed to the right, the majority of the data values fall to the left of the mean, skewness >
0.
Kurtosis
Mesokurtic
A mesokurtic distribution is one with tails shaped in roughly the same way as any normal
distribution. The kurtosis of a mesokurtic distribution is neither high nor low, rather it is considered
to be a baseline for the two other classifications. Binomial distributions for which the p value is
close to ½ are also considered to be mesokurtic. Mesokurtic distributions have excess kurtosis of
zero.
Leptokurtic
Leptokurtic distributions are sometimes identified by peaks that are thin and tall. The tails of these
distributions, to both the right and the left, are thick and heavy. Leptokurtic distributions are named
by the prefix "lepto" meaning "skinny." One famous example of a leptokurtic distributions is the
Student‘s t distribution. Leptokurtic distributions have positive excess kurtosis >3.
Platykurtic
These are distributions with slender tails. Many times they possess a peak lower than a mesokurtic
distribution. The name of these types of distributions come from the meaning of the prefix "platy"
meaning "broad." Examples of platykurtic distributions are all uniform distributions and discrete
probability distributions. Platykurtic distributions have negative excess kurtosis<3.
References
Inferential Statistics
Inferential statistics refers to the analysis of data that helps to describe, show or summarise data in
a meaningful way such that patterns may be acknowledged. However it does not allow us to make
conclusions beyond the available data. Inferential statistics is the main rationale for the extension
of concepts of descriptive statistics to inferential statistics. Inferential statistics techniques allow
researchers to use samples to make generalisations about the populations from which the samples
were drawn.
Two important techniques when drawing inferences from a set of data are:
• Parameters that can be calculated for that sample e.g. mean, variance, standard deviation
etc.
• The estimates are used to generalise the whole population taking into consideration some
uncertainties and errors.
• Sometimes you want to check if the postulations you made in your conceptual framework
are correct for a certain population. You also use these samples to check for the accuracy
of your postulations. E.g. an expert may claim that 20% of students at all tertiary institutions
in Zimbabwe are more likely to be thieves.
• This is called a hypothesis and samples of students may be analysed to validate this
hypothesis.
Modelling assumptions
• Homogeneity of variances
• Linearity
• Independence Normality
of data
• The data used in statistical inference is assumed to come from a normal distribution.
Remember, this is a distribution that is symmetric about the mean.
• This means there is a choice on choosing the members of the sample since none will depend
on another.
• This is important when carrying out Chi Square tests. Point Estimates
• If a parameter for a population is not known, you may estimate a value for the parameter
by using a sample.
Example
A railway enthusiast simulates train journeys and records the number, x, to the nearest minute,
trains are late according to the schedule being used. A random sample of 50 journeys gave the
following times.
4 3 10 5 2 14
21 9 22 36 14 34
8 15 41 23 13 7
5 34 26 17 8 43
19 5 23 13 12 10
Given that ∑χ=738 and ∑χ2=16 526, calculate to two decimal places, unbiased estimates of the
mean and the variance of the population from which this sample was drawn.
Solution
Interval estimates
• Information obtained from samples only give a rough idea about the population itself.
• Sometimes we need to know the range of values where we would expect the true value of
the parameter to lie.
• It shows the range of values where you are 95% confident that the true mean will lie.
• The interval should be central, hence it leaves 5% shared on either side of the interval as
shown in Figure 6.1.
Figure 6.1: The 95% confidence interval
• It shows the range of values where we are 99% confident that the true mean will lie.
• The interval should be central, hence it leaves 1% shared on either side of the interval.
Example
A plant produces steel sheets whose weights are known to be normally distributed with a standard
deviation of 2.4kg. A random sample of 36 sheets had a mean weight of 31.4kg. Find the 99%
confidence interval for the mean.
Solution 1.
Data: n
= 36
= 31.4
Solution 2
• If the sample size is small i.e. n<30, and the population standard deviation is unknown, you
use the T-distribution.
• This is a symmetric distribution which has a shape that depends on the degrees of freedom
v.
Table 6.1 shows the values that can be obtained from the T distribution table.
Table 6.1: Values that can be obtained from the T distribution table
Stage 2.
Calculate also the degrees of freedom:
v = n-1
=10 -1
=9
Stage 3.
Hypothesis Testing
This is a systematic way of validating an assumption or idea based on expertise or postulated value.
The modelling assumption of normality is used to carry out all hypothesis tests.
Steps in testing for a hypothesis.
1. Defining a variable
The variable is the property whose values are being analysed e.g. mass of a tomato, length of a pod
e.t.c. There are two types of hypothesis: a). Null hypothesis
This is a hypothesis which assumes that no change has occurred from the assumption made. It is
expressed as:
:
,
a).
• This is a one tailed test which suggests that the mean has increased.
We reject the null hypothesis if the value is in greater than the critical value.
b).
• We reject the null hypothesis if the value of the test statistic is less than the critical value.
c)
We reject the null hypothesis if the test statistic is in one of the critical regions.
• If the variance is known, or the sample size is large(>30), you define the distribution of the
mean as
• If the variance is unknown and the sample size is small(<30), we define the distribution of
the mean using a t distribution
Or
• If the test statistic is in the critical region, reject the null hypothesis and accept the
alternative hypothesis.
• Otherwise you accept the null hypothesis and conclude that the conditions expected have
not changed.
Summary of hypothesis (slide 55)
One-tailed test One-tailed test Two-tailed test
(lower tail) (upper tail)
H0: µ= µo H0: µ= µo H0: µ= µo H1: µ≠ µo
H1: µ< µo H1: µ> µo
z>1.645
Solution
Let X be the mark of a trainee from the particular county
𝐻_0: 𝜇= 70
𝐻_1: 𝜇<70
One tailed test in the lower tail at the 5% significance level.
If 𝐻_0 is true, 𝑋 ~ 𝑁 (70 ,6^2/25)
Test statistic 𝑍= (𝑥 𝜇)/√(𝜎^2/𝑛)
a) Define the variable under test e.g mass, weight, length e.t.c.
b) Formulate the hypothesis
f) Below is a diagram showing the chi square distribution and some values of the distribution.
Example
The table below shows the number of committed crimes for one day during a particular period of
time.
Day of Monday Tuesday Wednesday Thursday Friday
the week
Number 121 87 87 91 114
of crimes
(a) Find the frequencies expected from the hypothesis that the number of crimes does not
depend on the day of the week.
(b) Test at the 5% significance level whether the differences in the observed and expected
frequencies are significant.
Solution
a). Let X be number of absentees on any day.
- : The number of absentees does not depend on the day of the week.
b). Calculate the test statistic using the following table format:
O E (O - E)2
E
121 100 100 4.41 1.69
87 87 100 100 1.69 0.81
91 100 1.96
114
• Degrees of freedom
• The steps of
formulating the
hypothesis and defining a
distribution to calculate
the expected frequencies
will differ.
• the observations from the two groups are dependent on each other. ii). Define the
distribution for expected frequencies:
A contingency table which contains row totals and column totals is used to calculate the expected
frequencies on the basis that is two observations
Where: Row total is the total in the horizontal line containing the observation;
Column total is the total in the vertical line containing the observation; and Grand total is the total of
all the observations.
Example
Members of a forensic investigation team are investigating whether the weather has an effect on their
evidence.
Their results are as follows:
Outcome Weather
Conviction 12 4 16
Acquittal 5 8 13
Further investigation 7 14 21
Total 24 26 50
Formulate a suitable null and alternative hypothesis and use a chi square distribution to test the claim
at the 1% level of significance.
Solution there is independence between the weather and the results
obtained.
Outcome Weather
Total 24 26 50
= (3 1) (2 1) =2.
The table below shows the calculations from the test statistic.
O E (O - E)2
E
12 7.68 2.43
5 6.24 0.246…
7 4 10.08 0.941… 2.243…
8 8.32 0.227…
14 6.76 6.956…
10.92
Conclusion: Since the tests statistic is less that the critical value, we do not reject 𝑯0 and conclude
that there is independence between the performance of the team and the weather.
Analysis of variance
Analysis of variance (abbreviated as ANOVA) is used to test for when multiple sample cases are
involved.
The ANOVA technique enables us to perform a simultaneous test to draw inferences about whether
samples have been drawn from populations having the same mean.
The basic principle of ANOVA is to test for differences among the means of the populations by
examining the amount of variation within each of these samples, relative to the amount of variation
between the samples.
Then the said two estimates of population variance are compared with Ftest, wherein you work out.
F = Estimate of population variance based on between samples variance
Estimate of population variance based on within samples variance
𝐻_0:𝜇_1=𝜇_2=𝜇_3=…=𝜇_𝑛
_1: at least one of the means is significantly different from the others.
There are two types of ANOVA tests
One way ANOVA (comparing means from different classes under one condition)
Two way ANOVA (comparing means from different classes under many conditions)
The F distribution can be used to assess the test statistic calculated as shown above.
• Linear regression is the relationship between two or more variables, or the response of one
variable with respect to another.
• Linear correlation is the measure of the strength of the relationship between the variables.
• There are three main types of correlations as shown on the next page:
Types of correlations
• The independent variable is the one that you can control, or is the stimuli.
• The dependent variable is the variable which shows the response due to an input of the
independent variable.
Regression line y on x
• This is the equation of the line of best fit given in the form:
Where: a is the y intercept (the value of y where the line cuts the y axis); and, b is the gradient of the
line.
The values of a and b can be calculated from some analysed scatter lot using the method of least squares.
and,
• The linear regression correlation coefficient r is the value between -1 and 1 which shows the
linear degree of scatter.
Calculating r
The value of r can be obtained from the formula:
Interpreting r
The following data represents the lengths (χ) and breadths (y) of twelve cuckoos‘ eggs measured in
millimeters (mm).
Page 87 of 136 CIRSM
RRM 400
χ 22.3 23.6 24.2 22.6 22.3 22.3 22.1 23.3 22.2 22.2 21.8 23.2
y 16.5 17.1 17.3 17 16.8 16.4 17.2 16.8 16.7 16.2 16.6 16.4
Solution
To find the equation of the regression line y on x, we use the equations to find a and b.
• Obtain
Use the formula for r and get r = 0.86 (strong positive correlation)
References
Crawshaw, J. and Chambers, J.(2001).Concise Course for Advanced Level Statistics(4th edition).
United Kingdom: Nelson Thornes.
Kothari ,C.R. (2004). Research Methodology: Methods and Techiques (7th edition). New Delhi:New
Age International (P) Ltd., Publishers
In this chapter, I will explain how you would analyse data using the Statistical Package of Social Scientists
(SPSS).
SPSS was specifically designed to analyse data from the social sciences (as its title indicates) but it is
now widely used by a variety of fields and individuals. With SPSS you can perform highly complex data
manipulation and analysis with simple and straight forward instructions/commands. SPSS can take data
from almost any type of file and use them to generate tabulated reports, e.g. from Microsoft Excel. The
SPSS language consists of selfexplanatory commands e.g. the command to perform regression analysis
is ―Regression‖. Starting SPSS
- For you to be able to access SPSS, its software has to be first installed on to your computer.
- Open the SPSS window by double clicking its icon on the ‗desktop’ of your machine or click
Start, Program, then choose SPSS for windows.
- When SPSS first opens, a dialogue box appears asking, ‘what would you like to do?’ and a
number of options are provided (Figure 7.1);
- To create a new data file click on the box for type in data and then click ok.
Figure 7.1: SPSS dialogue box which asks you on ‘what would you like to do?’ in SPSS
To open an existing data file check on open an existing data source and then click OK. This will open
the data View spreadsheet (Figure 7.2).
SPSS Windows/Interfaces
There are three basic windows in SPSS:
a). Data Editor – this displays the contents of the data file. You can have only one data file open at a
time. The data editor has the following windows:
i). Variable view window – this is the dictionary window where variables are defined, characterised and
specified; and,
ii). Data view window – the window used for entering data in the data editor. b). Output Navigator
– this is where all SPSS results are displayed. Once a statistical procedure is run, an output view
window is created that holds results.
c). Syntax window – this is where the choices selected in the dialogue boxes can be saved. These can
then be edited and executed in future. Such operations will automatically appear upon clicking the paste
functions. Defining variables/Creating a dictionary
A good practice in data entry is defining the variables first before entering the data. Defining a
variable means converting questionnaire language to SPSS language.
- To define a variable:
Click on the Variable view tag at the bottom of the screen (Figure 6.3). Figure 6.3 Variable view tag
I will then take you step by step in defining each variable in each column. The first step is to enter
the variable name in the appropriate column (name) correctly following the rules:
Place the arrow in the cell of Row 1 of the NAME column and click.
Then type the variable name.
- The variable type dialogue box opens and then choose the appropriate
- There are eight variable types: numeric; comma; dot; scientific notation; date; dollar; custom
currency; and, string.
Width column
- Is the width of actual data entries that is determined by the number of characters contained in
the variable code.
- The default width of numeric variable entries is eight (8). String variables may be up to fifty
(50) characters.
- Employing the up/down-ward arrows (inside the cell) on the right hand side increases or reduces
the width of the variable entries. You may simply type a new number in the cell.
Decimals column
- The number of digits to the right of the decimal place to be displayed for data entries.
- Does not apply to string data.
- Decimal places are only used when dealing with continuous data e.g height, shoe sizes, or
income
- The values can be altered in the same way as for the width
Label column
- This is the label attached to the variable name. It is the broad definition of a variable.
- It is not confined to eight characters like variable name.
- The label remind you of the meanings of variables.
- This variable label is also displayed in the output from statistical analysis.
Values column
- These are labels attached to the category codes. These show the variable categories represented
by the numeric codes.
- Once an interger code is assigned to each response category (coding) the variable automatically
becomes ‘numeric’.
Clicking on the three period symbols under the respective sixth column of the window opens the
value labels dialogue box as shown in Figure 7.4. Figure 7.4: The dialogue box for the value
labels column
Missing column
- This is done by clicking the three period symbol and filling the resulting missing values dialogue
box accordingly.
Column
- The column can be adjusted in the same way as the width (third column)
Measure column
- Ordinal scale distinguishes variables basing on the element of rank (agree, disagree, neutral)
- Nominal scale distinguishes variables on the basis of a name e.g sex of the respondent, religion
and level of education.
- Scale refers to interval data e.g age groups, ranges of monthly income.
Data entry
You shall enter your data (the responses from the questionnaire) in the Data view window. Note
that if you point your cursor above any one of the columns of the data view you shall see each of
the coded question from the hard copy of your questionnaire. All you need to do is to enter the
respective code (response). In other words, you will be transferring the responses from the hard
copy of your questionnaire to the Data view window of SPSS.
Data cleaning
Data cleaning is the process of ensuring that data has been correctly and accurately entered in
SPSS. It is part of quality control. It is very unusual for real data to arrive and be analysed without
problems and/or errors. In large studies the process of data cleaning consumes considerably more
time and effort than the primary analysis of interest.
Main sources of error
The most interesting part in research is the analysis, interpretation and presentation of findings. It
is equally important to note that when interpreting your findings you should remain guided by your
objectives and conceptual framework. The purpose of analysing data is to obtain usable and useful
information. The analysis, irrespective of whether the data is qualitative or quantitative, may:
describe and summarise the data; identify relationships between variables; compare variables;
identify the difference between variables; and, forecast outcomes
Quantitative data analysis
Having cleaned your data in SPSS, the next stage is to analyse the data. Remember, the choice of
a statistical procedure depends on what you want to present. For instance, a frequency distribution
shows the number of the respondents who completed a particular question; percentages indicate
the proportion of the respondents who answered in a particular way; the mean is the average
number of a score or age; median shows the middle value in a range; and a cross tabulation shows
the relationship between the dependent and independent variable(s). Three types of quantitative
data analysis
Univariate analysis
This is an analysis of the distribution of cases, however, using one variable at a time, e.g. marital
status. The following are steps you should follow in SPSS if you want to analyse one variable e.g.
marital status.
Click "Analyze" from the toll bar-> "Descriptive Statistics" -> "Frequencies" (Figure 8.1)
Once you select frequency distributions, a dialogue box appears, Figure 8.2. You shall see that
from the left side of the dialogue box are the coded variables and the right box (which is usually
empty) is where you are supposed to put the selected variable. In the Figure 8.2, I have scrolled
through the variables until I have found ‗marital status‘ as my variable of interest. So I have clicked
the arrow to move that variable into the "Variable(s):" box. Should you wish to command SPSS to
calculate the measures of central tendency, dispersion, distribution and percentile values, select
the Statistics command which is at the top right side of the opened dialogue box. . You will also
realise that there are icons where you can also select the charts, format, style and Bootsrap of the
selected variable if need be.
Figure 8.2: A dialogue box that would appear upon clicking the ‘frequency distributions’
command
Upon selecting the variable (marital status) you can either click the Paste or
OK command. Note that if you are to click Paste, SPSS takes you to the Syntax window where the
selected option has been saved. If you are to select (highlight) the saved option(s) and click ‗Run‘,
SPSS takes you to the Output window. However, if you are to select the variable and just click
OK, SPSS immediately takes you to the Output Window (Figure 8.3). Remember the Output
window displays the distributions of the selected variable(s).
Given that I had selected marital status as my variable of interest, the Table 8.1 will be displayed
in the SPSS Output window. Note that there are two tables in this output. The first breaks down
the frequency of valid region data versus missing. The second table shows frequency distribution
with four labeled columns.
N Valid 500
Missin g
0
15 3.0 3.0
What is important however is for you to be able to interpret the meaning of the figures in the SPSS
Table 8.1 above. You will note that the first column of the table is showing the coded marital
categories (never married, married/cohabiting, divorced/separated and widowed) and ‘Total’
(representing the sample size). The second column, ‗Frequency,’ shows the frequency distribution
of the respondents in the various marital categories and the sample size of 500. The third column,
‗Percent’, depicts the proportion of the respondents in each marital category to the entire sample
size. Note that the proportion adds up to 100 %. The fourth column, ‘Valid Percent,’ is the percent
when missing data are excluded from the calculations. In other words, these are the percentages of
the respondents who selected each response after you disregarded missing responses. Use the valid
percent column when analyzing your data because it shows the frequency distributions after
disregarding any missing responses. The fourth column, "Cumulative Percent", adds the
percentages of each of the proportions in the third ‗Column‘ from the top of the table to the bottom,
culminating in 100%. This is more useful when the variable of analysis is ranked or ordinal, as it
makes it easy to get a sense of what percentage of cases fall below each rank.
You can analyse and interpret the meaning of the figures displayed from the SPSS output window
using your own table or graph. Note that a table or graph (and not both) should have the following:
title (which should reflect the meaning of the figures) and the sample size (n, for a representative
sample and N for a national study). Make sure you draw in Excel your graphs, e.g. pie chart,
column, line or bar graphs for quality presentation of results. For column, line and bar graphs,
make sure you label the X and Y axis, insert data labels, remove grid lines and format the Y axis
such that the percentage axis is up to 100. Note that percentage distributions in a pie-chart should
add up to 100.
Page 102 of 136 CIRSM
RRM 400
When analyzing and interpreting data, you should start with the text. Within the text, make
reference to the table or figure. In your text, the first statement should be a general statement which
points to the general observation(s) of the frequency distributions in the table or graph. The general
statement should then be substantiated with the presentation of the frequency distributions in the
table. The first time you present the first frequency distribution, you should make reference to your
table or graph in that particular sentence. In presenting the frequency distributions, words which
show the magnitude of the proportion, such as the majority, a significant proportion, a sizeable
proportion, a small proportion should be used.
The following is an example of univariate analysis using data in Table 8.2.
Table 8.2: SPSS Output window showing the distribution of the respondents’ level of
education Statistics (q100) Level of education
N Valid 500
0
Missing
The sample was highly educated. (Note that this is my general observation after noting that only
1% of the sample had never been to school). The majority of the respondents, 99%, reported that
they had attained some form of education. Over two-thirds of the respondents, 70%, reported that
they had attained secondary education (Table 8.3). (Note that if you had drawn a graph then you
say Figure 1). A sizeable proportion of the respondents, 16%, reported that had completed primary
school. Small proportions of the respondents, 7% and 5%, reported that they had completed
secondary and tertiary education, respectively. Only an insignificant proportion of the respondents,
1%, reported that they had never been to school.
n = 500
OR a column graph
n = 500
Statistics
(q301.6)
Voluntari
ly
accompa
nying (q301.7)
a sexual Agreeing to
partner assist with
(q301.2) (q301.4)
(q301.Physi Forced (q301.3) Willful visit (q301.5) Rape domestic
cal abuse marriage Spousal rape to in-laws by a stranger chores
500 500
NValid 500 500 500 500 500
0
0
Missin g 0 0 0 0 0
Frequency Table
(q301.1) Forms of gender based violence which you have ever heard of_
Physical abuse
Freque Perce Valid Cumulativ e
ncy nt Percent Percent
Tota
l 500 100.0 100.0
When analyzing the multiple response question, you draw one table or graph to show the frequency
distributions of all the variables. From each table of the SPSS output, you only select the frequency
distribution of your interest. For instance, in the above SPSS Output, one is interested in knowing
the respondents‘ knowledge about the forms of gender based violence. Thus, you should select the
percentage distributions for those who indicated a
Yes‘response in each variable. Note that the presentation technique is similar to the one I have
highlighted when I gave my example of the univariate analysis of the variable ‗Level of education.
What is however different is that the total percent of a multiple response question does not add up
to 100. Remember, you would have chosen the respondents who would have answered ‗Yes‘ to
each of the variables. I would use the above SPSS Output to show you how a data from a multiple
response question is presented and analysed.
Example
Awareness of the various forms of gender based violence was high although misconceptions were
common. (Note that this is my observation after considering the frequency distributions of
knowledge of the various forms of gender-based violence in Table 8.5.) Physical abuse was the
most commonly known type of gender-based violence as reported by 100% of the respondents
(Table 8.5). Force marriage, as a form of gender-based violence, was reported by 85% of the
respondents. Rape by a stranger was reported by 84% of the respondents. About 82% of the
respondents reported spousal rape. As noted earlier, the study noted misconceptions regarding the
forms of gender-based violence in Gweru. A sizeable proportion of the respondents, 19%, reported
that willful visit to in-laws is a form of genderbased violence. Similarly, 17% of the respondents
reported agreeing to assist with domestic chores. Further, a reasonable proportion of the
respondents, 15%, reported voluntarily accompanying a sexual partner as a form of gender-based
violence.
Table 8.5: Percentage distribution of the respondents’ knowledge about the various forms of
gender-based violence
Form of gender based violence Percentage
Physical abuse 99.6
Forced marriage 85.6
Spousal rape 82.2
Rape by a stranger 84.0
Voluntarily accompanying a sexual partner 15.0
n = 500
Bivariate analysis
It is used when you are interested in establishing the statistical and direction of relationship
between the dependent and independent variable(s). The following are the SPSS procedures for
bivariate analysis.
Click "Analyze" -> "Descriptive Statistics" -> "Frequencies"-> ―Crosstabs‖ (Figure 8.5).
Once you click the above command, a dialogue box (Figure 8.5) opens. To the left side of the
dialogue box are the defined variable, and to the right are options ‗Row‘, ‗Column‘, ‗Statistics‘,
‗Cells‘, ‗Format‘.
Figure 8.6: A dialogue box that would open after clicking the crosstab command
Use the upper and lower arrows to select the dependent and independent variables, respectively.
If you are to click ‗Cells‘, note that another dialogue box opens (Figure 8.7). This box has options
to select the ‘Counts’,
‘Percentages’, and ‗Residuals‘. Then click, ‗Observed‘ counts, and all the ‘Row, Colunm and Total’
percentages and then click ‘Continue’.
Figure 8.7: A dialogue box that would open after clicking the ‘Cells’ command
If you are to click ‗Statistics‘, SPSS takes to another dialogue box (Figure 8.8) which has various
options of statistical analysis. You select the statistical analyse which best suit your interest then
click ‘Continue’.
Figure 8.8: A dialogue box that would open after selecitng the ‘Statisctics’ command
Once you are satisfied with your selected option, click either Paste or OK. Remember, if you click
Paste, SPSS takes you to the Synax Window where the selected options are saved. Highlight the
saved selected options and click ‘Run’. This command will take you to the SPSS Output Window
(Table
8.6). You can also get to the SPSS Output Window by just clicking ‗OK‘ after selecting all your
options. What is important to note is that you are interested in finding out the extent of committing
any crime, however, by marital status. So you can see the number of males who committed any
crime by marital status, the ‗Yes‘ row (the row wise interpretation). You can note that 226 males
ever committed a crime out of the 401 males. However, among the males who ever committed a
crime, 71 had never married/single, 142 were married/cohabiting, 7 were divorced/separated,
while 6 were widowed.
The second row is showing the proportions (in percentages) of males by marital status who ever
committed a crime. For instance for the never married = 71/226 *100 =31.4%. Note that the
denominator is 226 (representing all males who ever committed a crime) and also that all the
proportions add up to 100%. So basically the figures are showing the distributions of all males who
ever committed a crime by marital status. It is equally important to also bear in mind that the
proportion of male respondents who ever committed a crime to the entire sample is 56.4%
(226/401)*100.
The third row is now showing the proportions of males who ever committed a crime within a
particular marital group (the column wise interpretation). For instance, all single/ never married
males were 101 [71 who said yes, + 30 who said No]. So within the never married males alone,
70.3% (71/101*100) reported that they had ever committed a crime. Similarly, within the
married/cohabiting males alone who constituted 268 respondents of the sample 53.0%
(142/268*100), reported that they had ever committed a crime. You can use the two examples to
interpret the proportions within the divorced and widowed marital groups. What you also need to
understand is that all column percentages add up to 100. This is largely because if we say 53.0 of
the never married males reported that they ever committed a crime, then 47%, as shown in the
‗No‘ column reported that they had never committed a crime. However, you are comparing the
commitment of a crime within marital groups, note that the sample size is now 401.
The last table of the cross tabulation shows the results of your statistical tests. It should be noted
that any association (correlation) whether positive or negative can be statistically significant if the
chi-square p-value is less than or equal to 0.05. This is largely because you would have used a 95%
confidence interval (error margin). So the SPSS cross tabulation below, the p- value is 0.006. This
shows that the association between committing a criminal activity is associated with marital status.
Unlike the presentation of the univariate analysis, you have to state the strength of the association
when analyzing and presenting findings from a cross tabulation.
Table 8.6: SPSS Tables of a cross tabulation Crosstabs Case Processing Summary
Cases
Valid Missing
N N N
(q402a) Have you ever
contracted an STD? *
100.0
(q101) Marital status 401 80.2% 99 19.8% 500
%
(q402a) Have you ever committed any crime? * (q101) Marital status Cross tabulat
62.8%
crime? % within 3.1% 2.7% %
(q402a) Have you 100.0
ever committed any 31.4%
crime?
53.0% 56.4%
% within (q101) 38.9% 42.9%
Marital status 70.3% 56.4%
35.4%
100.0 %
% within 6.3% 4.6%
(q402a) Have you
ever 17.1%
committed any
crime %
% of Total 31.4%
7.5% 2.7% 2.0% 43.6%
% within (q101)
Marital status
100.0% 100.0% 100.0
100.0%
% of Total
100.0%
100.0
66.8%
25.2% 4.5% 3.5%
Consistent with my 2 different interpretations of the figures from the SPSS output above (the
column and row interpretations which depend on how you would have selected your dependent
and independent variables), there are also 2 different ways of presenting and interpreting the
findings from a cross tabulation. Remember the techniques of starting with a general statement,
present the percentage distributions to substantiate your assertion, state the direction of the
relationship (inverse, positive and the group which is either more or less likely to respond in a
particular way when compared to other categories), making reference to the table or figure and
indicating the sample size (n).
Example
1. When drawing comparison within males who ever committed a crime.
Criminal activities are common in Gweru. The majority of the respondents, 56%, reported that
they had ever committed a crime (Figure 8.9). Figure 8.9: Percentage distribution of the
respondents who ever committed a crime
n = 226; p = 0.006
Further analysis demonstrated that committing a criminal activity is associated with marital status
(p = 0.006). Married respondents were more likely to report that they had ever committed a crime
when compared to all marital groups. For instance, while, 63% of the married respondents reported
that they had ever committed a crime, only 3% of the widowed respondents reported the same
(Table 8.7).
Table 8.7: Percentage distribution of the respondents who ever committed a crime by marital
status
Marital status Percentage
Never married 31.4
Married/cohabiting 62.8
Divorced/separated 3.1
Widowed 2.7
Total 100.0
n = 226
N.B. From the first presentation and analysis, it seems as if married respondents are more likely
to commit criminal activities when compared to the rest of the marital groups. However, note that
the first presentation could be biased given that the cross tabulation output shows that married
respondents dominated the sample. They constituted 66.8% of the entire sample. Thus, when
interpreting data, one should be mindful of the aforementioned observation. This is primarily
because if you are to design policies and programmes aimed at curbing criminal activities and use
the first approach to analyse your data, you might end up designing programme targeting married
men. Yet in actual fact you should also remember that married men constituted more than two
thirds of the sample size.
2. When drawing comparison of males who ever committed a crime within the different marital
groups.
Criminal activities are common in Mutare. The majority of the respondents, 56%, reported that
they had ever committed a crime (Table 8.8). Further analysis also demonstrated that committing
a crime is associated with marital status (p=0.006). Never married respondents were more likely
to report that they had ever committed a crime when compared to all marital groups. For instance,
while 70% of the never married respondents reported that they had ever committed a crime, only
39% of the divorced respondents reported the same.
Table 8.8: Percentage distribution of the respondents who ever committed a crime by marital
status
Marital status Percentage P value
Never married 70.3 0.006
Married/cohabiting 53.0
Divorced/separated 38.9
Widowed 42.9
Total 56.4
n = 401
N.B. The second presentation and analysis of the cross tabulation seem to be more realistic. This
is primarily because one is comparing the commitment of criminal activities using respondents
within their marital groups. In this case, note that never married males were more likely to commit
criminal activities when compared to other marital groups.
Triangulation
Triangulation involves checking the validity of findings and or/observational data themselves by cross-
checking (double or triple checking) with other sources of data (O‘Donoghue and Punch, 2003). The
technique is meant to: increase the credibility and validity of results; cross exam data from multiple
sources to search for regularities in research data (verification); give a more detailed and balanced picture
of the situation (Altrichter et al, 1996); avoid intrinsic biases and the problems that come from single
method, single observer, single-theory studies (Alexander, 2001); and, map out or explain more fully
the richness and complexity of human behaviour by studying it from one or more stand points (Cohen
and Manion, 1986). Triangulation is also used when building a case using different approaches. For
instance, detectives always use triangulation successful in establishing and corroborating findings with
little, yet elaborate instrumentation. They often use a modus operandi approach which consists largely
of triangulating independent indices. A detective would amass fingerprints, hair samples, alibis and
eyewitness accounts to identify a culprit. Note that such an approach produces a case which presumably
fits one suspect far better than others. Also note that all signs presumably point to one similar conclusion.
Further there are different kinds of measurements which provide repeated identification.
Goal of triangulation
Alexander (2001) asserts that the purpose of triangulation in specific contexts is to obtain confirmation
of findings through convergence of different perspectives seen to represent reality. However, ordinarily,
some researchers share a misconception that the goal of triangulation is to arrive at inconsistencies across
data sources or approaches. They argue that such inconsistencies may be likely given the relative
strengths and weaknesses of the different approaches. However, Patton (2002) argues that the
inconsistencies should not be seen as weakening the evidence, but should be viewed as an opportunity
to uncover the deeper meaning of the data.
of how the different investigators view the issue. If the findings from the different investigators
arrive at the same conclusion, then, the team‘s confidence in the findings would be heightened.
3. Theory Triangulation
This is a situation of pitting alternative theories against the same body of data. The main
idea will be to avoid the risk of researchers reaching theoretical conclusions, selecting only
those data which suit their pet views or developing small scale theory which has little
relevance beyond the immediate situation.
To triangulate theory the research should:
• Draw up a list of propositions which might explain or have relevance to the research
problem to be investigated;
• Identify the different ways each proposition might be interpreted;
• Carry out the research to ascertain the propositions which hold water;
• Discard those propositions which prove untenable;
• Carry out further research to identify the most likely interpretations still in the ring;
• Review the propositions which passed and failed the empirical test, along with their
parent theory; and,
• Arrive at the new theoretical understanding of the problem.
Depending on the outcome of the empirical work, this might include insights from a number of
apparently conflicting theories.
4. Environmental Triangulation
It involves the use of different locations, settings and other key factors related to the environment
in which the study took place. These factors could be time, day and season. The key to
environmental triangulation is to identify the environmental factors, if any, that might influence
the information that is received during the study. These environmental factors could be changed to
see if the findings are the same across settings. If the findings remain the same under varying
environmental conditions, then the validity has been established.
5. Methodological Triangulation
It involves the use of multiple qualitative and/or quantitative research methods during research or
to study the programme. Note that a study can be purely qualitative. However, within the
qualitative study, you can still triangulate your qualitative research methods to check for the
validity of the findings and also similarities and differences among the emerging issues from the
data. You can also triangulate quantitative and qualitative research methods. In this scenario, you
would use a survey to quantify the magnitude of your variables. Nonetheless, you would also use
the qualitative research methods such as the key informant interviews, FGDs, documentary review
and in-depth interviews to explain, describe and explore the factors underlying the quantified
variables. If the conclusions from each of the methods are the same, then validity is established.
When writing the methodology of Methodological Triangulation, the following sub-headings must be
explicitly stated:
Methodology for triangulated data collection methods a). Study area (Justify the
selection of the area)
d). Data collection methods (present each method write up to the selection of the respondent.
Note my examples in the preceding chapters). E.g
1. Survey
2. Key informant interviews
3. Focus group discussions
4. In-depth interviews
N.B. If you have used qualitative methods only, then present those up to the selection of the respondent.
The study revealed that physically abused women are most commonly susceptible to experiencing
reproductive health problems (R.H). The respondents universally reported that they experienced at
least one R.H problem as a result of physical abuse. However, excessive bleeding was reported by
the majority, 43% of the respondents (Table 8.1). A significant proportion of the respondents, 20%,
reported that they had suffered from a miscarriage. Fourteen percent reported that they had suffered
a premature birth. A still birth and broken wounds and bones were reported by 13% and 10% of
Miscarriages 20.0
Total 100.0
n = 126
Qualitative data supported that physical abuse has adverse impact on women‘s reproductive health.
Miscarriages were commonly reported by the respondents during in-depth interviews and FGDs.
Five years ago I experienced a miscarriage that almost took my life when my husband
attacked me with booted feet and clenched fists on my stomach whilst I was pregnant. I
abused women. Note this remark during a FGD of women aged 30-39 years, which received
Pregnant women who are always suffer from severe physical abuse from their husband in
this community normally end up having miscarriages. These miscarriages are mainly a
Deaths were also typical reported as consequences of physical abuse among women. During a
FGD of women between 40 and 49 years of age, one respondents, reported that some physically
abused women would die upon arrival to the hospital. Note her remark, which received some agony
I have a neighbour who was pronounced dead on arrival to hospital due to excessive
bleeding. The bleeding was a result of physical abuse whilst she was pregnant.
a) Literature Review
As you write your literature review, remember you are reviewing empirical and theoretical
literature up to the most recent and relevant. In addition, you should be responding to your
objectives. Literature review is not a mere presentation of data from various sources. However,
it is a synthesis of literature while at the same time explicitly indicating the gap(s) to be filled by
your current study.
b) Conceptual framework
- This should be presented at the end of the literature review chapter.
- Remember, you will be explaining the proposed relationship(s) between your dependent and
independent variables.
Chapter 3 – Methodology
a) Introduction
When presenting your methodology, make sure the following sunheadings are clearly
articulated:
b) Study area
c) Target population
d) Study Design
- Indicate whether or not you have triangulated qualitative and/or quantitative research methods.
Brief state the method highlighting why you have used each of the method.
a) Introduction
b) Background characteristics
• Percentage distribution of the demographic and socio-economic characteristics of the
respondents
c) Respective sub-sections responding to the study objectives
• Remember when presenting your findings, you are responding to your objectives, while at the
same time being guided by your conceptual framework.
• If presenting both quantitative and qualitative data, they must be presented in a complementary
fashion not as separate entities.
• State the implications of your key findings. In other words, you will be answering the question
‗So what?‖
c) Conclusions
• It is like the last chord of a song.
• Restate the objectives of your dissertation.
• Briefly summarise the main points of your findings.
• Remember, recommendations will be derived from the presented summary of findings.
d) Recommendations
• These are suggestions of the best course of action given the findings from your research.
• State what you propose to be done and the relevant implementer(s).
e) Further research
• Indicate the issues you have noted from your study which however you feel should be researched
further.
• You will be highlighting important knowledge gaps which you will have identified from your
study that call for further research.
REFERENCING
You acknowledge ALL the sources which you will have cited in dissertation. Remember, there are
different referencing styles. However, make sure you stick to one referencing style recommended
by your institution. As for CIRSM, use the Harvard Style of Referencing.
References
Blaxter L., Hughes C. & Tight M. (2001) How to research. Buckingham: Open University.
3. Merriam S. (1988) Case study research in education: a qualitative approach. San Francisco,
CA:Jossey-Bass.
6. Taylor D. & Procter M. (2008) The literature review: a few tips of conducting it. Health
Services Writing Centre:University of Toronto. http://www.utoronto.ca/writing/litrev.html
7. Wellington J., Bathmaker A., Hunt C., McCulloch G., & Sikes P. (2005) Succeeding with your
doctorate. London:Sage
Chinn P.L and Kramer M.K. (1999) Theory and Nursing: A Systematic Approach. 5th Edition. St Louis,
USA: Mosby.
Imenda S. (2014) Is There a Conceptual Difference between Theoretical and Conceptual Frameworks?
Journal of Social Science, 38(2): 185-195 Liehr P. and Smith M.J. (1999) Middle range theory:
Spinning research and practice to create knowledge for the new millennium. Advances in Nursing
Science, 21(4): 81-91.
Shi L. and D.A. Singh. (2011) The Nation's Health, Eighth Edition. Jones & Bartlett Learning: Sudbury,
MA
Sinclair M. (2007) Editorial: A guide to understanding theoretical and conceptual frameworks. Evidence
Based Midwifery 5(2): 39
Ajzen and Fishbein (1980) Understanding Attitudes and Predicting Social Behaviour, p8
Miles, MB. & Huberman, AM. (1994). Qualitative Data Analysis. (2nd edition). Thousand Oaks, CA:
Sage Publications.
Berg, B. L., & Lune, H. (2004). Qualitative research methods for the social sciences (Vol. 5). Boston:
Pearson.
Coyne, I. T. (1997). Sampling in qualitative research. Purposeful and theoretical sampling; merging or
clear boundaries?. Journal of advanced nursing, 26(3), 623-630.
Malterud, K. (2001). Qualitative research: standards, challenges, and guidelines. The lancet, 358(9280),
483-488.
Marshall, M. N. (1996). Sampling for qualitative research. Family practice, 13(6), 522-526.
Patton, M. Q. (2005). Qualitative research. John Wiley & Sons, Ltd.
Sandelowski, Margarete. "Sample size in qualitative research." Research in nursing & health2 (1995):
179-183.
Strauss, A., & Corbin, J. M. (1990). Basics of qualitative research: Grounded theory procedures and
techniques. Sage Publications, Inc.
Babbie, Earl R. The Practice of Social Research. 12th ed. Belmont, CA: Wadsworth Cengage, 2010;
Brians, Craig Leonard et al. Empirical Political Analysis: Quantitative and Qualitative Research
Methods. 8th ed. Boston, MA: Longman, 2011.
Given, Lisa M. (2008). The Sage encyclopedia of qualitative research methods. Los Angeles, Calif.:
Sage Publications
US Census Bureau. Technical Paper 63: Current Population Survey - Design and Methodology.
<http://www.bls.census.gov/cps/tp/tp63.htm> accessed 12 January, 2001. Page 14-8.
Frongillo, EA. Analysis of Complex Surveys. October 22, 1996. Cornell University, Office of Statistical
Consulting. College of Human Ecology and
Agricultural and Life Sciences, Division of Nutritional Sciences
<http://www.human.cornell.edu/Admin/StatCons/StatNews/stnews12.htm>
Henry, GT. Practical Sampling. Sage Publications, 1990. Applied social research methods series,
volume 21.
Turner, AG. Sampling Topics for Disability Surveys. United Nations Statistics
Division, Technical Notes, December 1996. Available at
<http://www.undp.org/popin/demotss/tcndec96/tony.htm>
11. Turner
12. 13 also. Alecxih, L, Corea J and Marker D. Deriving state-level estimates from three national
surveys: A statistical assessment and state tabulations. Department of Health & Human
Services/ASPE. May 4, 1998. <http://aspe.hhs.gov/health/reports/st_est/> Accessed 11 January
11, 2001. Section II B, Precision of estimates.
16. Flores-Cervantes, I., Brick, J.M., and DiGaetano, R. Report No. 4: 1997
NSAF Variance Estimation, March 1999. One of the National Survey of America's Families
Methodology Series, at http://newfederalism.urban.org/nsaf/methodology1997.html
17 & 18. Carolina Population Center. Russia Longitudinal Monitoring Survey.
<http://www.cpc.unc.edu/projects/rlms/home.html>.
Coon, D. & Mitterer, J. O. (2010). Introduction to psychology: Gateways to mind and behavior with
concept maps. Belmont, CA: Wadsworth; 2010.
Jakob, Alexander 2001 (February), On the Triangulation of Quantitative and Qualitative Data in
Typological Social Research: Reflections on a Typology of Conceptualizing 'Uncertainty' in the
Context of Employment Biographies, Forum: Qualitative Social Research, Volume 2, Number 1.