Kathrynn A. Adams, Eva K. McGuire - Student Study Guide With IBM SPSS Workbook For Research Methods, Statistics, and Applications-SAGE Publications (2023)
Kathrynn A. Adams, Eva K. McGuire - Student Study Guide With IBM SPSS Workbook For Research Methods, Statistics, and Applications-SAGE Publications (2023)
Third Edition
Kathrynn A. Adams
Guilford College
Eva K. McGuire
Guilford College
FOR INFORMATION: Copyright © 2023 by SAGE Publications, Inc.
Preface xii
Prefacexii
IBM® SPSS® Workbook Part 2: Analyses to Examine One Variable at a Time 193
Descriptive Statistics 193
Nominal Variables 193
Interval or Ratio Variables 194
Writing up Results 199
Descriptive Statistics Practice Exercises 202
Confidence Intervals 204
Data Entry 204
Calculating the Confidence Interval for the Mean 204
Writing up Results 205
Confidence Intervals Practice Exercise 206
One-Sample t Test 207
Writing up Results 208
One-Sample t Test Practice Exercise 208
Chi-Square Goodness of Fit 210
Writing up Results 210
Chi-Square Goodness of Fit Practice Exercise 211
The Student Study Guide and IBM® SPSS® Workbook is a companion for the textbook Research Methods,
Statistics, and Applications. Students can use the study guide and workbook as a self-guided tool to
reinforce and apply concepts from the textbook, or it can be used as an in-class or in-lab workbook, or
both. Professors also may wish to assign exercises from the study guide and workbook as homework.
In the textbook, we included in-chapter practice and applications. The study guide and workbook
provides students with additional opportunities to review, practice, and apply essential knowledge and
skills related to research methods and statistics.
Whereas many study guides repeat information from the textbook, we designed the study guide and
workbook to supplement, rather than supplant, the textbook, and therefore we keep chapter summaries
very brief. Additionally, the study guide and workbook goes beyond the textbook by including step-
by-step directions for using the data analysis program, IBM® SPSS®, interpreting output, and writing
up results.
Many study guides focus on recognition and recall of material, and we acknowledge that gaining basic
knowledge is an essential initial step in the learning process. We provide students with recognition
activities via multiple-choice quizzes at the beginning of each study guide chapter, with the answers
appearing immediately afterward so that students can easily check their understanding of basic concepts.
Additionally, we include review exercises to encourage deeper processing by having students recall,
rather than merely recognize, key terms.
A unique feature of this study guide and workbook is that it encourages students to build on their
foundational knowledge. The majority of the exercises require students to think critically and actively
engage with the material. These application exercises require more effort and involvement than
recognition or recall tasks. Such exercises encourage students to make meaningful connections and
are more likely to have enduring effects on their understanding and retention of research processes.
Additionally, most chapters of the study guide include a “Your Research” exercise so that students can
apply key terms and concepts to their own research projects.
Chapters in the Student Study Guide and IBM® SPSS® Workbook contain the following sections:
Learning Objectives
Chapter Summary
Chapter Quiz
Chapter Exercises
Your Research
Learning Objectives are listed at the beginning of each chapter in both the textbook and the study
guide and workbook. The learning objectives are designed to help students reorient to the textbook
chapter. Moreover, a reminder of the learning objectives encourages students to consider what they have
learned and what areas require additional review.
The Chapter Summary provides students with a very brief overview of the material covered in the
textbook chapter. Like the learning objectives, the summary is designed to reorient the students to the
textbook chapter.
The Chapter Quiz includes 10 multiple-choice questions. It is designed to be a review of key concepts.
Answers are provided directly following the quiz to ensure that students adequately understand key
concepts before going on to the chapter exercises, which require deeper processing and application.
xii
Preface xiii
The Chapter Exercises match the organization of the textbook chapters and are designed to help
students achieve the learning objectives. The exercises designated as “Review” require students to
recall key terms and concepts. The exercises designated as “Application” go beyond simple definitions
and require students to apply key terms and concepts. These application exercises lend themselves
particularly well to in-class or in-lab activities and discussion, especially when students are required to
complete the fill-in-the blank sections prior to class or lab.
Students: Instructors have access to the answer keys.
Instructors: You can download the complete answer keys via the Instructor’s Resource page on the
textbook website and then share with your students as you see fit. There are also answer keys for only
the odd-numbered answers. Some professors find it useful to provide all or half the answers to students
ahead of time so that they may check their own work. Other professors assign study guide exercises as
graded or ungraded homework assignments or use the exercises in class and do not wish the students to
have advance access to the answers. We have found assigning exercises from the study guide as ungraded
homework (checked simply as complete or incomplete) helps to ensure students come to class prepared
and with good questions about the material. We then share the answers during or after class.
In the Your Research section, students apply key concepts and skills from the chapter to a research
topic of their choice. We have found that encouraging students to think about and develop their own
area of research promotes deeper understanding and integration of the material. The exercises in this
section may be used as homework assignments, and they are especially useful in classes that require a
semester- or year-long research project. Having students consider how different concepts apply to a
topic of their choice can also be useful in the absence of such a requirement. The questions about the
research project are worded to apply to a great range of topics that might be selected by students or
professors. In addition, the format allows students to compare their responses when working as a group
on a research project.
The IBM® SPSS® Workbook provides step-by-step directions for data analysis and interpretation. There
are practice exercises to help students gain competence using the program, interpreting output, and
writing up results. As with the chapter exercises, instructors have online access to the answer keys. The
workbook is designed for students brand new to SPSS. We also hope that it will be a useful resource that
students can keep and refer to should they use the program in future coursework, graduate programs,
or careers.
New to this edition: For the third edition, the IBM® SPSS® Workbook is a separate section divided into
five parts. We believe this will make finding the directions for specific analyses easier, and students
can easily refer back to the workbook throughout their academic careers and beyond. The workbook
organization matches the guidelines on selecting statistical analyses and includes similar flowcharts
found in the final chapter of the textbook, “Putting it all Together”. We also provide directions to an
alternative data analysis program, jamovi, on the Instructor Resource site.
PUBLISHER’S ACKNOWLEDGMENTS
xiv
ABOUT THE AUTHORS
xv
1
THINKING LIKE A RESEARCHER
LEARNING OBJECTIVES
• Explain the connection between thinking critically and thinking like a researcher.
• Describe the scientific approach, including the challenges and benefits of taking a scientific approach.
• Outline and synthesize the steps of the research process.
• Identify and apply ethical principles and standards.
• Formulate a big picture of the nature of science and how science progresses.
CHAPTER SUMMARY
Students in the social and behavioral sciences are often curious about personal and social phenomena, and learning about
research methods and statistics will give you the tools necessary to engage your curiosity on a deeper level. Critical thinking is
key to research, and researchers put critical thinking to action throughout the entire process of carrying out a research study.
Additionally, researchers think by taking a scientific approach. You are encouraged to consider why the scientific approach
is important and to consider the scientific method as a process rather than a list of steps that must be followed in order. A
complete overview of the research process from start to finish is included in this chapter. The goal is to introduce you to
key concepts in the context of the overall scientific process, and later chapters detail the steps and concepts in more depth.
The next part of the chapter focuses on ethics. The chapter provides a brief history of ethical violations that led to the
development of current ethics codes. Key ethical principles and standards are discussed, and you are encouraged to
consider how ethical standards help to uphold broader ethical principles. The chapter ends with a discussion of proof and
progress in science, debunking the notion that any single study leads to proof and, instead, encouraging you to consider
how multiple studies move the field forward.
CHAPTER 1 QUIZ
Test your knowledge of the key terms from the chapter.
1. Which of the following might be a variable in a study?
a. male
b. participants
c. gender
d. method
1
2 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
9. When the participants in Milgram’s obedience studies asked to stop, the experimenter ordered
them to continue. This violates the ethical standard of
a. confidentiality
b. the right to withdraw
c. informed consent
d. noncoercive incentives
10. Which of the following must be a part of all research studies?
a. informed consent
b. debriefing
c. confidentiality
d. anonymity
CHAPTER 1 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
Application: Think critically about the information you have received about this research methods
course and consider how you might gather new evidence to evaluate such information.
1. Identify one thing you heard about this class: _____________________________________
4. What additional evidence would help you evaluate the information, and how might you go
about gathering this evidence?
2. How might you have instead taken a scientific approach? What would be the benefits and
challenges to taking a scientific approach in this situation?
Step 4:
Step 5:
Step 6:
Step 7:
2. Replace the crossed out words to summarize how the steps are synthesized within the
scientific method:
3. For those questions you identified in question 1 that could not be examined with an
experiment, identify the most appropriate type of research to help answer the question.
6 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
We found that Nonjudge, the ability to refrain from judging one’s own cognitions, emotions,
and bodily sensations, predicted lower levels of depression, anxiety, and stress. Furthermore,
Act-aware, the ability to maintain awareness of daily activities, predicted lower levels of
depression.
Imagine that three students summarized this excerpt. Consider whether each student plagiarized Cash
and Whittingham (2010) and explain your answer.
Student 1
Cash and Whittingham (2010) found that Nonjudge, the ability to refrain from judging one’s own
cognitions, emotions, and bodily sensations, predicted lower levels of depression, anxiety, and stress.
Furthermore, Act-aware, the ability to maintain awareness of daily activities, predicted lower levels of
depression.
plagiarism or not plagiarism
because .
Student 2
Cash and Whittingham (2010) found that participants who were most aware during their daily activities
reported few symptoms of depression and those who were most accepting of their own thoughts,
feelings, and sensations reported fewer symptoms of both depression and anxiety.
plagiarism or not plagiarism
because .
Student 3
“Nonjudge, the ability to refrain from judging one’s own cognitions, emotions, and bodily
sensations,” and “Act-aware, the ability to maintain awareness of daily activities,” are linked to
improved mood.
_____ plagiarism or ____ not plagiarism
because _________________________________________________________.
2. A researcher wants to observe interactions in a social media group. The group members are
anonymous, and the group is open, in that anyone can log on and view conversations. Imagine
you are a member of the Institutional Review Board. List three questions that you think are
most important to ask and identify why each is important, based on ethical principles and
standards.
3. What might be some of the next steps we take to examine the relationship between
mindfulness training and stress?
8 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
4. If someone asked you how science helps us understand something, how might you answer?
Your Research
Take a Scientific Approach to Identify a Research Topic 3. Now evaluate your list and cross out or modify any that
1. Identify a topic that interests you: • are biased, in that you think you already know the
answer.
2. Come up with a list of 7–10 questions on this topic. Try to
build these questions from textbooks, research articles, • are not testable (you cannot disprove them).
or previous coursework, but you can also include a few
• necessitate equipment or a population to which
questions based on observations or experiences.
you do not have easy access.
a.
• do not fit the criteria, if any, laid out by your
b. professor (e.g., your professor may require you
to do an experiment, and not all questions can be
c.
experimental).
d.
4. What questions remain? These might serve as a
e. good starting point. Keep in mind that your research
question will change as you read more research in this
f.
area.
g.
h.
i.
j.
2
BUILD A SOLID FOUNDATION FOR
YOUR STUDY BASED ON PAST
RESEARCH
LEARNING OBJECTIVES
CHAPTER SUMMARY
Chapter 2 focuses on finding and evaluating past research on a topic. The chapter is designed to help you discern
different sources, including the difference between primary and secondary sources, the difference between scholarly
and popular sources, and the various types of scholarly work that you might find in an academic journal (i.e., primary
research articles, literature reviews, and commentaries) as well as other scholarly sources (conference papers and
posters, unpublished manuscripts, books, theses and dissertations, etc.). You also learn the various ways to search for
relevant research.
How to read and evaluate primary research articles is another major focus of this chapter. The key sections of a primary
research article are outlined, and their purpose discussed. An article on the topic of academic achievement is used
to illustrate the different parts of a research article, and finding and referring to this article will greatly enhance your
understanding of key concepts. Specific guidelines for citing and referencing sources are provided. The chapter ends with
a discussion of building on past research and some specific strategies for doing so.
9
10 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
CHAPTER 2 QUIZ
Test your knowledge of the key terms from the chapter.
1. A may have been written by an expert or by a layperson and is
designed to educate or entertain.
a. scholarly source
b. popular source
c. meta-analysis
d. primary research article
2. Which of the following is considered a scholarly source?
a. articles published in academic journals
b. articles published in newspapers
c. textbooks
d. educational pamphlets
3. Which of the following is considered a primary research source?
a. any article in an academic journal
b. textbooks
c. reports of original research
d. literature reviews
4. A review of past research would be found in
a. a meta-analysis.
b. a literature review.
c. a primary research article.
d. All of the above
5. A review of past research without a report of new, original research is always a .
a. secondary research source
b. primary research source
c. popular source
d. scholarly source
6. When researchers collect and report on new data, those data are
a. primary data.
b. secondary data.
c. either a or b.
d. none of these.
7. A primary research source analyzes .
a. primary data
b. secondary data
c. either a or b
d. none of these
Chapter 2 ■ Build a Solid Foundation for Your Study Based on Past Research 11
CHAPTER 2 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
Application
1. The comedian Aziz Ansari wrote a book called Modern Romance. In addition to sharing
personal anecdotes, he reports results of interviews and focus groups that he and his research
partner conducted.
a. Is this a popular or scholarly source?
b. Are the reports of interviews and focus groups a primary or secondary research source?
2. Paul Eastwick, PhD, and colleagues conducted a meta-analysis examining the role of physical
attractiveness in dating preferences and actual dating behaviors. Their findings were published
in 2014 in the academic journal Psychological Bulletin.
a. Is this a popular or scholarly source?
3. René Dailey, PhD, and colleagues conducted a study about on-again/off-again dating
relationships. Their findings were published in 2020 in the academic journal Personal Relationships.
a. Is this a popular or scholarly source?
4. “Better than you think: The impact of TV on your relationship” is an article on the website luvze.
com. The author of the article, Gary Lewandowski, PhD, summarizes the results of two research
studies (neither of which he conducted) and explains what the results mean to the average person.
a. Is this a popular or scholarly source?
5. Use Figure 2.1 in the textbook to help you answer the following questions.
a. Which of the sources from questions 1 to 4 above might you use to generate ideas for a
research topic but not include in your written report?
b. Which of the sources from questions 1 to 4 above might you use to identify patterns and
gaps in past research and to help you identify original research sources?
c. Which of the sources from questions 1 to 4 above represents the type of source that
should make up the majority of your sources for a research project?
6. Imagine you were using the online search engine Google (likely against your professor’s
recommendations) and you came across a report of original research.
a. How would you determine whether or not the source was a scholarly one?
b. Suppose that you determine the source is, in fact, scholarly. How else should you evaluate
it to determine whether it is appropriate to use as a source for a research project?
Chapter 2 ■ Build a Solid Foundation for Your Study Based on Past Research 13
3. Find the article titled “The biometric antecedents to happiness” from the search results. Is this
a primary or secondary research article? How do you know?
Parts of a Primary
Research Article Purpose
Abstract
Method
Application
Look at the example published article in Appendix B in the textbook.
1. What is the title of the article?
2. Who are the authors?
3. Is this a primary or secondary research source?
4. Can you tell from reading the abstract whether or not the authors conducted an experiment?
Why or why not?
5. Summarize, in your own words, the key point of the first paragraph of the Introduction.
6. Summarize, in your own words, the key point of the second paragraph of the Introduction.
10. What is the first limitation noted by the authors? How might a future research study address
that limitation?
b. On a computer, type up an APA-style References section with these sources. Save your
document and check with your professor on how to turn it in, if applicable.
16 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Application B
1. Find and correct the five APA-format citation errors in the paragraph:
Many of us like to believe that “beauty comes from within” and that that inner beauty is more
important than physical beauty. However, research suggests that physical attractiveness is the single
most important factor that impacts our desire to date someone (Kurzban and Weeden, 2005; Li et al.,
2013). An early cross cultural survey conducted by Buss & Schmitt found that across the world physical
attractiveness of one’s partner was more important for men than it was for women (1993). A more
recent meta-analysis found that gender differences only occur in ratings of ideal romantic partners and
ratings of hypothetical targets. Both men and women generally find physical attractiveness equally
important in actual face-to-face interactions (Eastwick, Luchies, Finkel & Hunt, 2014). Moreover,
although we might all desire an attractive partner, we tend to wind up with someone who matches our
level of physical attractiveness (Miller, R. S., 2012). Thus, our stated preferences do not fully predict our
actual dating behaviors (Eastwick et al., 2014).
2. Find and correct the five APA-format errors in the references.
References
Miller, R. S. (2012). Intimate relationships (7th ed.). M
cGraw-Hill.
Buss, D. M., and Schmitt, D. P. (1993). Sexual strategies theory: An evolutionary perspective on
human mating. Psychological Review, 100(2), 204–232. https://doi.org/10.1037/0033- 295X.100.2.204
Eastwick, P. W., Luchies, L. B., Finkel, E. J., & Hunt, L. L. (2014). The predictive validity of
ideal partner preferences: A review and meta-analysis. Psychological Bulletin, 140(3), 623–665. https://doi
.org/10.1037/a0032432
Kurzban, R., & Weeden, J. (2005). HurryDate: Mate Preferences in Action. Evolution and Human
Behavior, 26(3), 227-244. https://doi.org/10.1016/j.evolhumbehav. 2004.08.012
Li, N. P., Yong, J. C., Tov, W., Sng, O., Fletcher, G. J. O., Valentine, K. A., Jiang, Y.F., Balliet, D.
(2013). Mate preferences do predict attraction and choices in the early stages of mate selection. Journal
of personality and social psychology, 105(5), 757–776. https://doi.org/10.1037/a0033777
Exercise 2.5: The Big Picture: Use the Past to Inform the Present
Learning Objective: Formulate a big picture of how to build on past research.
Review: List the four ways you might build on past research.
1.
2.
3.
4.
Your Research
Find Research on Your Topic c. Did the results support the hypothesis you
identified? Explain.
1. What database is most commonly used in your discipline?
LEARNING OBJECTIVES
• Explain how to operationally define constructs using qualitative and quantitative measures.
• Compare the four scales of measurement in terms of their four attributes.
• Identify and describe the advantages and disadvantages of self-report measures.
• Identify and describe the advantages and disadvantages of behavioral and physiological measures.
• Identify and describe the advantages and disadvantages of archival research.
• Formulate a big picture of how to choose measures for your study.
CHAPTER SUMMARY
Defining your variables is a critical early step of research. This step should be accomplished after you have reviewed the
relevant literature for ideas about possible variables you might examine, measure, or manipulate in your study. You must
clearly identify the constructs of interest and how these constructs will be operationally defined (or measured). It is also
important to identify the scale of measurement (nominal, ordinal, interval, ratio) of each variable. You should consider
the type of measure that best fits the study (descriptive, correlational, or experimental) you plan to conduct. Options
range from self-report (questionnaires, interviews) to behavioral and physiological measures (observations, unobtrusive
measures, performance-based, physiological) to archival (secondary data, records and documents). Understanding
the advantages and disadvantages of the different types of measures will assist you in your decision-making. How you
define and measure your variables will affect how you conduct your research, the analyses you can perform, and the
conclusions you can draw about your results.
CHAPTER 3 QUIZ
Test your knowledge of the key terms from the chapter.
1. A precise explanation of a variable in terms of how it is measured or manipulated is called a(n)
a. qualitative measure.
b. operational definition.
c. construct.
d. quantitative measure.
18
Chapter 3 ■ Measuring Your Variables 19
2. A researcher is interested in children’s attention during reading lessons. She has second grade
teachers write a summary of a different target child’s behavior during the lesson each day for a
week. This measure is an example of a(n)
a. ratio measure.
b. operational measure.
c. qualitative measure.
d. quantitative measure.
3. The four attributes that determine the preciseness of a scale of measurement are
a. rating, ranking, ratio, equal intervals.
b. equal intervals, categories, ranking, uniqueness.
c. true zero, identity, order, equal intervals.
d. sequencing, rating, true zero, identity.
4. Runners rate their energy level on a 10-point scale as they finish the race. This measure
represents the scale of measurement.
a. interval
b. ordinal
c. ratio
d. nominal
5. Self-reports may be inaccurate due to the
a. descriptive nature of surveys.
b. external validity of surveys.
c. social desirability bias.
d. internal validity of surveys.
6. Questionnaire items that use a Likert-type response are one example of a(n)
a. close-ended response format.
b. open-ended response format.
c. scale score.
d. items that will need re-coding.
7. One disadvantage of conducting interview research is
a. participants are less likely to take the research seriously.
b. the response rate is likely to decrease.
c. maintaining anonymity is harder than with questionnaires.
d. the accuracy of responses is likely to increase.
8. In comparison to survey research, observational research
a. examines what people actually do, not what they say they do.
b. is less time consuming than survey research.
c. is less prone to observer bias.
d. requires less training of experimenters.
20 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
CHAPTER 3 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
5. Suppose your classmate tells you, “There is only one way to operationally define each
construct we study.” How would you respond to their statement?
6. If you decided to ask people to rate how much they love their partner on a scale from 1 to 10, you
would be operationally defining love using a measure because it is numerical.
Chapter 3 ■ Measuring Your Variables 21
7. If you interviewed college students and asked them to describe the love they feel for their
partner, you would be operationally defining love using a measure
because it does not use numbers.
:
Each number on the scale has a unique
meaning.
:
Numbers on a scale are ordered.
:
The distance between numbers on a scale
is equal.
:
The scale has a fixed zero.
Application
1. Draw a line between each item on the left with the term on the right that best matches it.
2. Suppose you are studying attitudes toward the Black Lives Matter movement using a
questionnaire. Write four questions for your questionnaire using each of the four scales of
measurement once. List the scale of measurement before each question.
22 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
3. In the following table, summarize the advantages and disadvantages of the following types of
self-report.
Mailed questionnaire
Online questionnaire
Structured interview
Unstructured interview
4. Two reasons you should consider using already existing questionnaires are:
Application
1. Look over the items in a questionnaire assessing anxiety (higher scores = higher anxiety):
i. I almost never feel anxious.
Strongly disagree Disagree Neutral Agree Strongly Agree
ii. I worry about everything.
Strongly disagree Disagree Agree Strongly Agree
iii. My schoolwork and my procrastination make me anxious.
Strongly disagree Disagree Neutral Agree Strongly Agree
a. Which of these is a reverse-scored item?
b. Which of these is a double-barreled item?
c. How could you rewrite the item you identified in b?
e. After you fix the item you identified in b, explain the next steps you would take to
compute a total anxiety score.
3. Would it be better to interview or survey students about their anxiety? Explain the rationale
for your choice.
Review
1. Match the example on the left with the type of measure on the right.
2. In the table below, summarize the reason(s) you might choose to use each of the following
types of measures.
Observation
Unobtrusive
Performance-based
Physiological
24 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Application
Imagine that you are interested in understanding how students at your college or university view the
women’s basketball team (or another sports team, if your college or university does not have a women’s
basketball team).
1. What types of observations might you make that could help you better understand how
students view the team, and where would you make them? Be clear if your observation is
overt or covert, naturalistic or contrived, and if you include a participant observer and/or a
confederate.
Disadvantages: _____________________________________________________________
3. List 2 different behaviors you might record during your observation study using the timing of
a behavior.
5. Add an unobtrusive measure of student views of the team to your observational study.
Chapter 3 ■ Measuring Your Variables 25
Application
Assume you are still studying student views of the women’s basketball team (from the Application
section of Exercise 3.4).
1. How could archival data help you understand student views of the team as a supplement to
the observational data?
3. What would be the advantages and disadvantages of conducting archival research on this topic?
Advantages: _______________________________________________________________
Disadvantages: _____________________________________________________________
Exercise 3.6: The Big Picture: How to Choose Measures for Your Study
Learning Objectives: Formulate a big picture of how to choose measures for your study.
Application
Retrieve the article:
Justin, T., Kaszycki, A., Geden, M., & Bunde, J. (2020). Some stress is good stress: The challenge-
hindrance framework, academic self-efficacy, and academic outcomes. Journal of Educational
Psychology, 112(8), 1632–1643. http://dx.doi.org/10.1037/eduedu0000478
Read the introduction and the method sections of the study and use the information provided
to answer the following questions.
1. List the variables in the measures section and the scale of measurement for each one.
26 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
2. What type of measure (questionnaire, interview, observational, archival, etc.) are each of the
variables?
4. Based on the literature reviewed in the introduction of the article, how did the researchers
decide on their measures?
Your Research
Find a primary research article on your topic. Choose 3. What type of measure was the variable (questionnaire,
one variable from the article and answer the following interview, observational, archival, etc.)?
questions.
LEARNING OBJECTIVES
CHAPTER SUMMARY
Reliability indicates consistency and validity indicates accuracy.
Chapter 4 introduces these concepts and explains how these terms relate to measurement. The chapter then describes
several ways to calculate and evaluate the reliability of a scale, including internal consistency (coefficient alpha and split-
half ), test-retest, alternative forms, and interrater. Likewise, different ways to assess validity are described, and these can be
conceptualized broadly as face validity versus construct validity. Researchers are more concerned with construct validity.
There are several ways to assess construct validity, including content validity, convergent validity, divergent validity, and
criterion validity (either concurrent or predictive). Examples to illustrate each of the different types of measure reliability
and validity are provided.
Next, reliability and validity at the study level are discussed. Reliability of a study indicates that results have been replicated
in other studies, and validity of a study refers to both internal validity (the amount of control) and external validity (the
generalizability of findings). Researchers should consider the balance between reliability and validity in studies they
conduct as well as in studies they scrutinize before developing their research.
27
28 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
CHAPTER 4 QUIZ
Test your knowledge of the key terms from the chapter.
1. Validity and reliability are dealt with from two perspectives or in regard to
a. personal issues and professional examples.
b. the study level and the measurement level.
c. the manipulation of variables and the data collection.
d. participant confounds and experimenter measures.
2. A scale that measures what it is supposed to measure is
a. reliable.
b. operational.
c. true.
d. valid.
3. The internal consistency of a scale is questioned when
a. α = .80.
b. α < .70.
c. α > .70.
d. α > .90.
4. How is interrater reliability different from the other forms of measurement reliability?
a. It requires ratings by at least two of the participants.
b. It is computed only on forced-choice response formats.
c. It requires that at least two observers or raters provide independent scores.
d. It is measures the global sense of consistency rather than that of individual participants.
5. Researchers are often suspicious of face validity because it is often based on:
a. a single study.
b. multiple studies with different findings but similar procedures.
c. a single person’s view.
d. multiple experts’ views.
6. Divergent validity is supported when a researcher finds a correlation
between the scores of two tests measuring constructs.
a. positive; different
b. positive; similar
c. negative; different
d. no; similar
7. Criterion validity assesses the correlation between
a. a scale and its alternate form.
b. a scale and a behavioral measure.
c. two scales measuring the same concept.
d. the even and odd items in a scale.
Chapter 4 ■ The Cornerstones of Good Research 29
8. Many studies in psychology use students in lower-level psychology classes as participants. This
fact means the of these studies may be limited to students in these classes.
a. external validity
b. internal validity
c. reliability
d. control
9. A researcher repeats a study in exactly the same way. This is called a and tests
study .
a. literal replication; validity
b. conceptual replication; validity
c. literal replication; reliability
d. conceptual replication; reliability
10. The degree to which we can say that changes in one variable were caused by changes in
another variable is called
a. replication
b. correlation
c. external validity
d. internal validity
Quiz 4 Answer Key
1. b. 2. d. 3. b. 4. c. 5. c. 6. c. 7. b. 8. a. 9. c. 10. d.
CHAPTER 4 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
3. Imagine that your friend tells you that spinach tastes bad but admits she has never tried it.
You might question the of her conclusion about spinach.
4. Suppose a researcher uses time spent on the website for a research methods class as a measure
of motivation in the class.
a. Is time on the website a reliable measure? Why or why not?
I.
A. Cronbach’s alpha
II.
Application
1. A political scientist wants to test the measurement reliability of this single item: “On a
scale from 1 to 7, with 7 = very likely, how likely are you to vote in a political election?”
Identify the type of measurement reliability they should assess and explain how they
should assess it.
2. A mental health researcher wants to examine the reliability of a five-item measure of state (or
current) anxiety.
a. What type of measurement reliability would you recommend they assess, and how would
they do it?
b. What type of reliability should the researcher not assess? Explain why not.
Chapter 4 ■ The Cornerstones of Good Research 31
b. The academic confidence scale has 10 items which are scored on a 5-point Likert-type
scale ranging from 1 = Strongly disagree to 5 = Strongly agree. The professor finds that
Cronbach’s alpha = .80 in the administration during the first week. What would you say
about the internal consistency of the scale?
c. What if the professor had found Cronbach’s alpha = .40? What would you then say about
the scale?
d. How else could the professor measure the internal consistency of the confidence scale?
4. Two observers record whether people avert their gaze when a uniformed police officer enters
the room. What type of reliability should be used to assess the consistency between the two
observers’ scores?
5. Why is it important that the GRE (or LSAT or MEDCAT) have high alternate forms
reliability?
I.
A.
B.
C.
D. Criterion validity
i.
ii.
32 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Application
1. A political scientist asked, “On a scale from 1 to 7, with 7 = very likely, how likely are you to vote in
a political election?” and followed up with participants to determine whether or not they voted in
the next election. They compared participants’ self-report to their voting behavior 6 months later.
a. What type of validity would be assessed? .
b. Identify a different type of measurement validity the researcher might use to evaluate the
self-report question. Explain how they would evaluate this type of validity.
2. A mental health researcher wants to examine the validity of a five-item measure of state (or
current) anxiety. How could the researcher assess the congruent validity of the measure?
b. Suppose the professor finds that students’ scores on the academic confidence scale in the
third week are positively correlated with the number of hours they reported studying that
week. What type of validity has the professor assessed?
Application
List all of the terms below that apply to each of the research examples.
study reliability study validity
literal replication internal validity
conceptual replication external validity
confound
1. A researcher finds that using content-related humor in her class presentations for one section
of a course results in more engagement by students (as measured by attendance and number
of student questions) than in her other section in which she did not include humor. The
researcher isn’t sure that her results were due to the humor she included, and she repeats the
same procedure in her two sessions of the same course the next semester.
2. A colleague in another discipline then decides to try telling jokes in one section of his course
and not in the other. He finds the same result of increased student engagement.
3. Another colleague who tells jokes in one section of his course found no difference
in engagement by the students in the joke versus no joke course sections. In their
end-of-semester feedback, however, students in both sections reported that the
professor sang and danced in all his classes, and the students in both sections rated
him equally funny.
4. A professor tells jokes in an Introduction to Criminal Justice class and the students report that
the professor is funny. The same professor tells the same jokes in an upper-level criminal
justice class and the students report that the professor is not funny.
Torres-Harding, S. R., Andrade, A. L., Jr., & Romero Diaz, C. E. (2012). The Racial Microaggressions
Scale (RMAS): A new scale to measure experiences of racial microaggressions in people of color.
Cultural Diversity and Ethnic Minority Psychology, 18(2), 153–164. http://dx.doi.org/10.1037/
a0027658
Excerpts:
The Racial Microaggression Scale. This scale was developed to assess the themes and
categories of racial microaggressions presented and discussed in the literature. Specific themes
within the larger categories of microinsults and microinvalidations were used as the underlying
basis for the questionnaire items (Sue, Capodilupo, et al., 2007). Members of the research team
developed each item to closely match the themes and categories described in the literature. Then,
members of the research team reviewed items for readability and comprehension. In addition,
for each set of items below, specific statements, thoughts, descriptions, and phrasing expressed or
quoted by participants in the qualitative investigations (cited below) were used to inform item
development. Feedback on the scale was sought from other university students who were familiar
34 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
with the racial microaggressions construct. Similarly, during the administration of the study
measures, feedback was sought regarding reactions to the items from both university students
and community members. (p. 155)
…For each item, respondents were asked to indicate how often they have encountered a
particular racial microaggression on a 4-point Likert-type scale (0 = never, 1 = a little/rarely, 2 =
sometimes/a moderate amount, 3 = often/frequently). (p. 156)
Next, internal consistency of the revised six-factor model was examined using Cronbach’s alphas
for the entire sample (n = 377). The Cronbach’s alphas were found to be very good: Environmental
Invalidations (α = .81); Foreigner/Not Belonging (α = .78); Sexualization (α = .83); Low-Achieving/
Undesirable Culture (α = .87); Criminality (α = .85), and Invisibility (α = .89). (p. 158)
4. Name the type(s) of measurement validity addressed and describe what Torres-Harding
found.
Pezdek, K., Abed, E., & Cormia, A. (2021). Elevated stress impairs the accuracy of eyewitness memory
but not the confidence–accuracy relationship. Journal of Experimental Psychology: Applied, 27(1),
158–169. http://dx.doi.org/10.1037/xap0000316
Excerpts:
The final sample (N = 106) included 56 participants in the high-stress condition and
50 in the low-stress condition. 2 The mean age did not differ between the high-stress
(mean [M] = 38.1 years, SD = 11.85, range 22– 67) and the low-stress condition (M =
37.5 years, SD = 12.18, range 19 – 64). Both groups were primarily male (male = 57%)
and were primarily Caucasian (76%)...
Materials and procedure. The face stimuli were 48 White male faces selected from a
database of faces used by Meissner, Brigham, and Butz (2005)...
Forty-eight (24 high-stress and 24 low-stress)3 images were sourced from the IAPS (Lang
et al., 2008). IAPS images are normed on arousal and valence. Arousal refers to physical and
Chapter 4 ■ The Cornerstones of Good Research 35
mental activation and varies from 1 (lowest activation) to 9 (highest activation). Valence refers
to the intrinsic positivity or negativity of an image and varies from 1 (most negative) to 9 (most
positive). Mean arousal was significantly greater for the high-stress (M arousal = 6.47, SD = 0.33)
than the low-stress images (M arousal = 2.96, SD = 0.65), t(66) = 28.10, p < .001, d = 6.81. The
mean valence was also significantly more negative for the high-stress (M valence = 2.34, SD =
0.51) than the low-stress images (M valence = 5.00, SD = 0.08), t(66) = 29.80, p < .001, d = 7.23.
The low-stress IAPS images were specifically selected to be neutral in Experiment 1.
Participants were instructed to try to remember both the image and the face because
both would be important later in the study. An old/new recognition-memory test
followed, in which participants viewed 48 test faces (24 old and 24 new). The test faces
were presented in a different random order for each participant, and the 48 faces were
counterbalanced so that across participants, each face served approximately equally often
as a target (old) and foil (new) face. Participants were instructed to respond “old” or
“new” to each test face and to indicate their confidence in each response on a scale of 1
(completely guessing) to 5 (absolutely sure I’m correct). (p. 161)
Summary of Results
Although the study procedure and analysis were much more complicated than this brief description
suggests, the major findings were participants in the high-stress group were significantly less likely than
those in the low-stress group to correctly identify a face. However, correct identification did not differ
between the high- and low-stress groups when participants were highly confident of their choice.
1. What type of study (descriptive, correlation, experiment, quasi-experiment) is described? How
do you know?
Your Research
Find a primary research article on your topic. It may be 4. What evidence, if any, is provided about the measure’s
the article you reviewed for YOUR RESEARCH Chapter 3. validity?
Choose a variable from the article and answer the following
questions.
LEARNING OBJECTIVES
CHAPTER SUMMARY
A descriptive study examines the who, what, when, where, and how, but it does not examine relationships among
them. In this chapter, we first discuss when it is appropriate to use a descriptive study and how to evaluate the validity
of descriptive research. The second half of the chapter focuses on sampling. The chapter explains how to identify a
population and follows with specific procedures for obtaining different types of probability or nonprobability samples
from the population. Issues regarding sample size for probability and nonprobability samples are also discussed. The
chapter ends with an introduction to other types of studies that will be detailed in later chapters.
CHAPTER 5 QUIZ
Test your knowledge of the key terms from the chapter.
1. is how common a behavior, attitude, characteristic, or condition is; a is a
pattern over time.
a. Trend; prevalence
b. Prevalence; trend
c. Population; subpopulation
d. Subpopulation; population
37
38 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
2. Which of the following types of validity is more important in a descriptive study than in
other types of studies?
a. measurement validity
b. internal validity
c. external validity
d. criterion validity
3. When does a pilot study happen?
a. Before the main study is conducted.
b. After the main study is conducted.
c. While the main study is being conducted.
d. The timing of a pilot study varies.
4. If you are interested in examining how students at your school view a specific sports team, the
for your study is all the students at your school.
a. sampling bias
b. sample
c. population
d. subpopulation
5. In question 4, athletes at your school would be a
a. sampling bias.
b. sample.
c. population.
d. subpopulation.
6. The students who participated in your study on views of the sports team are your
a. sampling bias.
b. sample.
c. population.
d. subpopulation.
7. If athletes are overrepresented in your sample of students, you have a
a. sampling bias.
b. sample.
c. population.
d. subpopulation.
8. A type of probability sampling in which every single member of the population has an equal
chance of being selected for the sample is known as
a. convenience sampling.
b. cluster sampling.
c. random assignment.
d. simple random sampling.
Chapter 5 ■ Basics of Research Design 39
9. If you rely on participants’ recruiting other participants for you, you are likely using
a. snowball sampling.
b. cluster sampling.
c. random assignment.
d. quota sampling.
10. When a researcher seeks out the full range of extremes in the population for representation in
her sample, this is known as
a. extreme sampling.
b. cluster sampling.
c. maximum variation sampling.
d. wide-range sampling.
Quiz 5 Answer Key
1. b. 2. c. 3. a. 4. c. 5. d. 6. b. 7. a. 8. d. 9. a. 10. c.
CHAPTER 5 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
3. Suppose that a researcher examined weekly exercise routines and mental health issues
including symptoms of depression and anxiety.
a. In the research report, the researcher reported frequency and type of exercise, as well
as frequency and severity of mental health issues within the sample. This represents a
design.
b. In the same research report, the researcher examined the correlation between frequency of
exercise and severity of mental health issues. This represents a design.
c. If the researcher also examined whether or not increased exercise resulted in a decrease in
severity of mental health issues, this would represent a design.
d. Is it reasonable to report results from multiple designs in a single research report? Explain.
_________________________________________________________________________
_________________________________________________________________________
Review
1. A classmate tells you “random assignment” is essential to probability sampling. Help explain
why the term “random assignment” is inappropriate in this context and identify the correct
term to use instead:
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
2. Explain the difference between random sampling with replacement and random sampling
without replacement.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
Application
1. Imagine you wanted to interview the actors who were in the first season of the television series
Game of Thrones. You know that it would not be possible to interview all the actors, so you
decide to use probability sampling in order to obtain a representative sample.
You define the population as all the actors who played named characters during the first
season (so you would exclude extras who did not play a character with a name).
You obtain a list of all the named characters from online sources.
a. What would be the next steps if you used simple random sampling?
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
b. Suppose you wanted your sample to accurately represent the proportion of female actors
and male actors in the first season. What type of probability sampling would you use and
how would you obtain your sample?
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
c. There were 129 named characters on the first season of Game of Thrones.
i. Use either the formula in Chapter 5 of the textbook or the online calculator found at
www.surveysystem.com/sscalc.htm to calculate the minimum sample size you would
need to have a confidence interval of 5 (meaning that the margin of error will be
about 5%) and a confidence level of 95% (meaning that there is a 95% likelihood that
you will achieve your desired margin of error).
ii. If you increased your confidence interval to 12 and kept your confidence level at
95%, what is the minimum sample size you would need?
42 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
2. Imagine you want to conduct a study about the political attitudes of adults who live in your
neighborhood. You realize you cannot interview everyone in your neighborhood and instead
decide to randomly select streets within the neighborhood.
a. What type of sampling is this?
b. What would be your next steps?
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Your Research
Designing a Descriptive Study on Your Topic 3. Choose a specific sampling procedure. Define the
population and then outline how you will obtain a sample
1. Identify one question that you could examine with a
from your population.
descriptive study and that you believe is important to
study (based on what you know about existing research in
this area).
LEARNING OBJECTIVES
CHAPTER SUMMARY
Describing a sample requires critical thinking about both ethical and practical issues in choosing the characteristics of
the sample to describe. Quantitative (numerical) analyses are a popular way to summarize sample characteristics, and
the majority of the chapter focuses on descriptive statistics. Specific descriptive statistics are organized by their overall
purpose: to describe how often a score appears in the sample; to describe the central tendency; and to describe the
variability. Definitions, formulas, and example calculations are included for each type of descriptive statistic. Note
that statistical notations that are most common in published research are used throughout the book (e.g., M and SD).
In this chapter, an emphasis is placed on not only understanding different types of descriptive statistics and their purpose,
but also choosing the appropriate descriptive statistics based on the scale of measurement and, for interval or ratio
variables, the shape of the distribution of scores. The chapter ends with a reminder of the importance of thinking critically
about the quality of your data, the characteristics of your sample, and the meaning of your results.
CHAPTER 6 QUIZ
Test your knowledge of the key terms from the chapter.
1. If you wanted to know what proportion of your sample consisted of left-handed people, you would be
wondering about the
a. frequency.
b. cumulative percentage.
c. mean.
d. percentage.
44
Chapter 6 ■ Describing Your Sample 45
2. The point in a distribution at which 50% of the scores are below and 50% of the scores are
above is known as the
a. standard deviation.
b. mode.
c. median.
d. mean.
4. When you report the mean, you should also report the that tells you the
degree to which scores differ from the mean.
a. standard deviation
b. median
c. range
d. possible minimum and maximum
5. If you reported that participants in your study had ages between 18 and 64, you would be
reporting the
a. standard deviation.
b. observed minimum and maximum.
c. possible minimum and maximum.
d. mean.
9. Which of the following is the best statistic to evaluate an individual score within a distribution?
a. mean score
b. standard deviation
c. observed maximum score
d. z score
10. What is the difference between a percentage and a percentile?
a. A percentage is the proportion of a score in a sample whereas a percentile is the proportion
scores that fall under a set value.
b. A percentage is the proportion of a score in a sample whereas a percentile is the percentage
of a score in the sample between a set interval.
c. A percentage can link directly to a letter grade whereas a percentile cannot.
d. There is no difference.
CHAPTER 6 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
2. In addition to asking about satisfaction, what questions would you include to describe your sample?
1 Store 11 Online
2 Catalog 12 Online
3 Store 13 Store
4 Store 14 Store
5 Store 15 Online
6 Online 16 Store
7 Store 17 Catalog
8 Online 18 Online
9 Online 19 Store
10 Store 20 Store
2. A highly selective university honors society only recruits incoming students who are in the top
10 of their class in high school. Following are the rankings of their 15 members, in order from
lowest to highest ranking:
7, 7, 5, 5, 4, 3, 3, 3, 2, 1, 1, 1, 1, 1, 1
3. A professor asks students how many hours they predict they will need to study for an
upcoming exam, with the following results: 1, 2, 2, 2, 3, 3, 3, 3, 3, 4, 4, 5
a. Complete the frequency table:
Hours (predicted) f
b. Calculate by hand and report the mean of these data. Show your work.
c. Calculate by hand and report the standard deviation for these data. Show your
work.
48 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
4. After the exam, the professor asks the students how many hours they actually spent studying,
with the following results: 1, 1, 1, 2, 2, 2, 2, 3, 3, 3, 4, 8
a. Complete the frequency table:
Hours (actual) f
b. Calculate the cumulative percentage of those who studied only one or two hours. Show
your work.
Nominal data
Ordinal data
Uniform distribution
Application
1. Refer to the data and your answers from Exercise 6.2, question 1 (shopping preferences).
a. What type of graph is appropriate to depict this data? Explain why.
__________________________________________________________________
__________________________________________________________________
Chapter 6 ■ Describing Your Sample 49
b. Draw the graph to summarize the results. Be sure to label your axes.
c. Write a sentence or two describing the results as you would in a Results section of an
APA-style report. ________________________________________________________
______________________________________________________________________
______________________________________________________________________
2. Refer to your answers from Exercise 6.2, question 2 (class rankings). Recall that the data are:
7, 7, 5, 5, 4, 3, 3, 3, 2, 1, 1, 1, 1, 1, 1
a. What scale of measurement is represented?
b. Write a sentence or two describing the results as you would in a Results section of an
APA-style report. ________________________________________________________
3. Refer to your answers from Exercise 6.2, question 3 (predicted study hours). Recall that the
data are: 1, 2, 2, 2, 3, 3, 3, 3, 3, 4, 4, 5
a. What two types of graphs are appropriate to depict this data? Explain why.
4. Refer to the data and your answers from Exercise 6.2, question 4 (actual study hours). Recall
that the data are: 1, 1, 1, 2, 2, 2, 2, 3, 3, 3, 4, 8
a. Draw a histogram to summarize the results. Be sure to label your axes.
c. What statistic should you calculate to confirm your assessment of the shape of the
distribution?
d. Assuming you confirm your assessment of the shape of the distribution, what measure of
central tendency and variability should you report?
g. Write a sentence or two describing the results as you would in a Results section of an
APA-style report.
______________________________________________________________________
______________________________________________________________________
1. Professor K converts all his students’ test scores to z scores. Julie has a z score of 0.5 and Alice
has a z score of –2.7. How would you help Julie and Alice interpret their scores?
2. What is the difference between earning 95% on an exam and being in the 95th percentile on
an exam?
3. If you scored in the 50th percentile on an exam, how would you interpret your performance?
What is your z score?
Chapter 6 ■ Describing Your Sample 51
Exercise 6.5: The Big Picture: Know Your Data and Your Sample
Learning Objective: Formulate a big picture of how to describe your sample.
Review
1. List five things that might negatively impact your ability to accurately interpret your
descriptive statistics.
3. How does the sampling technique impact how you interpret descriptive statistics?
Your Research
Find a primary research article on your topic. Look through 3. Did the author(s) report the central tendency and
the Method and Results sections. variability to describe any of the variables? If so, explain
what variables and what measures of central tendency
1. Did the author(s) use frequencies to describe a
and variability the author(s) reported.
characteristic of the sample? If so, explain.
2. Did the author(s) use percentages to describe a 4. Did the authors use z scores or percentiles? If so,
characteristic of the sample? If so, explain. explain.
7
BEYOND DESCRIPTIVES
Making Inferences Based
on Your Sample
LEARNING OBJECTIVES
• Explain how inferential statistics are used to determine whether the finding of a study is unusual.
• Illustrate the hypothesis testing process.
• Describe when to reject or retain a null hypothesis and the types of errors associated with each of these
decisions.
• Distinguish between statistical significance, effect size, confidence intervals, and practical significance.
• Formulate a big picture of how to interpret a study’s results.
CHAPTER SUMMARY
The chapter is focused on inferential statistics that allow us to make inferences about a population from findings with a
sample. It introduces many terms and the process that you will use when analyzing a study’s quantitative data. Sampling
distributions are a distribution of some statistic and provide the basis for deciding whether our sample is representative (or
not) of the population—or when a difference makes a difference. The decision-making process is called hypothesis testing
(or null hypothesis significance testing) and involves multiple steps: stating null and alternative hypotheses, defining the
sampling distribution, setting the criterion level (.05 or .01), computing a statistic, and deciding whether to reject or
retain the null hypothesis. The alternative hypothesis determines the region of rejection in the sampling distribution and
can be one-tailed (directional) or two-tailed (nondirectional). If the results of our analysis fall in the region of rejection,
we reject the null hypothesis and support the alternative hypothesis, but if the statistic falls in the region of acceptance,
we retain the null hypothesis.
We design studies to try to reject a false null hypothesis (called power), but because hypothesis testing is based on
probability, the decision to reject or retain the hypothesis is associated with a probability of error. Type I error occurs
when we reject a true null hypothesis, and Type II error occurs when we retain a false null hypothesis. Ways to decrease
the probability of either type of error, as well as ways to increase power, are described. Finally, effect sizes (magnitude of
an effect), confidence intervals (margin of error), and practical significance (everyday implications) are described as tools
to enhance our understanding of the results of a study.
52
Chapter 7 ■ Beyond Descriptives 53
CHAPTER 7 QUIZ
Test your knowledge of key terms from the chapter.
1. The purpose of inferential statistics is to allow researchers to determine whether .
a. the computed statistic is reliable
b. a study has internal validity
c. an outcome can be generalized to samples of different sizes
d. the outcome of a study is typical or unusual for a particular sampling distribution
2. The process of determining the probability of obtaining a particular result is called .
a. hypothesis testing
b. gambling
c. theoretical possibilities
d. statistical analysis
3. A sampling distribution is created by .
a. collecting samples of various sizes from a population
b. drawing many samples of the same size from a population and computing the same
statistic for each sample
c. repeating a study multiple times with the same size samples from different
populations
d. drawing different size samples from a population and computing their mean
4. In an experiment the alternative hypothesis predicts .
a. a difference between groups
b. no difference between groups
c. a difference between the sampling distribution and a population
d. the degree of similarity between groups
5. When the results for a study fall in the region of rejection it implies that .
a. our sample does not belong to the sampling distribution as defined by the null hypothesis
and we should reject the null hypothesis
b. our sample does not belong to the sampling distribution as defined by the alternative
hypothesis and we should retain the null hypothesis
c. our sample belongs to the sampling distribution and the researcher should retain the null
hypothesis
d. there was probably a confound in the study and we should reject the alternative hypothesis
6. The critical value of a statistic defines the .
a. top 2.5% of the distribution for a one-tailed hypothesis
b. bottom 2.5% of the distribution for a one-tailed hypothesis
c. extreme 5% of the distribution for a one-tailed hypothesis and extreme 2.5% of the
distribution for a two-tailed hypothesis
d. extreme 2.5% of the distribution for a one-tailed hypothesis and extreme 5% of the
distribution for a two-tailed hypothesis
54 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
CHAPTER 7 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying the key concepts.
Descriptive Inferential
Sample
Parameter
mean (M)
sigma (s)
Chapter 7 ■ Beyond Descriptives 55
Application
1. Suppose you are interested in comparing the number of crimes committed by those in
urban areas versus those in rural areas. Name the type of hypothesis for each statement
below, and, if the statement is an alternative hypothesis, identify whether it is directional or
nondirectional.
a. “There will be no difference in the number of crimes committed by those in urban areas
and those in rural areas.”
b. “The number of crimes committed in urban areas will differ from the number committed
in rural areas.”
c. “The number of crimes committed in urban areas will be greater than the number
committed in rural areas.”
2. Define a sampling distribution for the following hypothesis: There is no difference in the
percentage of income spent on housing by elderly households and the percentage of income
spent on housing by all households. Hint: First name the percentage spent by all households
(estimate a reasonable percentage).
56 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
3. In the figures below, the numbers on the x-axis are standard deviation units away from the
mean. For example, −2 = 2 SD’s below the M, while 1 = 1 SD above the M.
A B
–3 –2 –1 0 1 2 3 –3 –2 –1 0 1 2 3
Figure A: Draw the region of rejection and region of acceptance for a two-tailed test where p < .05.
Figure B: Draw the region of rejection and region of acceptance for a one-tailed test where p < .05.
4. List the seven steps that you would follow to test the following hypothesis: “Males are more
likely than females to physically bully their classmates.” Make your steps relevant to this
specific study (e.g., state the null and alternative hypotheses for this study).
a.
b.
c.
d.
e.
f.
g.
5. If you find that, in your sample, girls verbally bully their classmates more often than boys (p = .04),
a. Are the results statistically significant?
b. Would you reject or retain the null hypothesis?
c. What is the probability of a Type I error? Type II error?
d. Name two ways you could reduce your chance of a Type I error in the study.
i.
ii.
6. If you find, in your sample, that males physically bully their classmates more often than girls
(p = .07),
a. Are the results statistically significant?
b. Would you reject or retain the null hypothesis?
c. Is there a probability of a Type I error? Type II error?
7. Two studies were conducted on bullying at an elementary school. Study 1 selected a
convenience sample of 10 boys and 10 girls from grades K through 5. Study 2 selected a
convenience sample of 25 boys and 25 girls from the fourth grade.
Which study is likely to have more power? Why? (Hint: There are two
reasons.)
a.
b.
1. Draw a line from the term/phrase in first column that best matches the one in the second
column.
2. Fill in the table, but remember that these numbers are guidelines and not designed to be strict
cutoffs.
Small/Weak
Medium/Moderate
Large/Strong
58 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Application
1. Interpret each of the following effect sizes as weak, moderate, or strong:
a. 2% of the variability accounted for
b. 50% of the variability accounted for
c. d = 0.48
d. d = 1.20
e. d = 0.12
f. 10% of the variability accounted for
g. d = 0.25
2. A teacher reports that exam grades ranged from 64% to 98%, with a mean of 79.85
(SD = 10.18), 95% CI [75.09, 84.61]. The confidence interval tells you that you are
confident that the sample mean of represents a population mean that falls
between and .
3. Another teacher reports exam scores based on academic year.
First-year students: M = 75.25, SD = 5.16, 95% CI [71.70, 78.88]
Second-year students: M = 80.33, SD = 7.33, 95% CI [74.11, 86.55]
a. Do the confidence intervals for the population means overlap?
b. A second-year student brags that their year outperformed first-year students. Based on the
confidence intervals, how might you counter this student’s claim?
c. What would you say about the practical significance of these results?
Review
What are the benefits of reporting each of the following in describing the results of a study?
1. Statistical significance
2. Effect size
3. Confidence interval
Chapter 7 ■ Beyond Descriptives 59
4. Practical significance
Application
1. A study examined whether convenience affected recycling behavior. The researchers
operationally defined recycling behavior based on the weight (in pounds) of material
recycled by each household. They found that households who were supplied a large
recycling bin and weekly pickup (M = 25.00, SD = 5.00, 95% CI [20.25, 29.75]) recycled
significantly more material than the households who were sent flyers encouraging them to
recycle (M = 10.00, SD = 4.00, 95% CI [6.50, 13.50], p = .003). Convenience accounted for
22% of the variance in recycling.
a. Explain what the p value indicates and how you would interpret it.
b. Interpret the effect size.
c. Interpret the confidence intervals.
d. Interpret the practical significance of the study.
2. The researchers replicated the study in a different city and found that those supplied with a
recycling bin (M = 25.00, SD = 7.00, 95% CI [16.20, 33.80]) recycled more than those who
were just given an informational flyer (M = 10.00, SD = 5.50, 95% CI [2.50, 17.50]). This
time, they found that p = .10, Cohen’s d = 0.25.
a. Explain what the p value indicates and how you would interpret it.
b. Interpret the effect size.
c. Interpret the confidence intervals.
d. Interpret the practical significance of the study.
60 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Your Research
Find a primary research article on your topic. If possible, 1. State the research question that interests you and
find an article that has a simple design. If the article explain why the article you selected is related to this
describes multiple studies, pick one of the studies as the question.
3. Now examine the Method section. d. Did the authors report an effect size?
What do you learn about the sample and procedure of If yes, list it here and interpret its meaning.
the study that may help the researchers to avoid Type II
errors (e.g., sample size, error in the research design,
strength of effect)? You may want to review the section
e. Did the authors include a confidence interval for
“Reducing the Chance of a Type II Error” in Chapter 7
their findings? . If yes, list the confidence
before answering this question.
interval(s) and interpret the meaning.
Sample:
LEARNING OBJECTIVES
• Describe the process of choosing the appropriate inferential statistic to compare a sample and
known expected value.
• Compare a sample to a known population value when you have interval or ratio data using a one-sample
t test.
• Compute the two types of effect size when comparing a sample to a known population.
• Compute the confidence interval when comparing a sample to a known population value.
• Formulate a big picture of how to examine one variable at a time.
CHAPTER SUMMARY
The chapter describes the statistics used to compare a sample score to a known or expected score. The appropriate
statistical test depends upon the scale of measurement. The chi-square goodness of fit is the appropriate test for nominal
data, and the one-sample t test is the appropriate test for interval or ratio data. This chapter focuses on the one-sample
t test. Using an example comparing a sample of first-year students’ scores on a citation format knowledge quiz to the
national average for the quiz, the steps of the hypothesis testing process are described. For the statistical analysis, you
learned how to compute a one-sample t test, which is used when you have interval or ratio data. Then two options for the
effect size (eta squared and Cohen’s d) were covered. Finally, you learned how to compute a 95% confidence interval for
the difference between the sample mean and the known mean in a study. This section concluded with instruction about
how to write up Results and Discussion sections for a one-sample study when you have computed these statistics.
CHAPTER 8 QUIZ
Test your knowledge of key terms from the chapter.
1. If we want to find whether the first-year class at our college has higher scores on the SAT writing test than the
national average for the test, we would compute .
a. the effect size
b. practical significance
c. a one-sample t test
d. an alternative hypothesis
62
Chapter 8 ■ Comparing Your Sample to a Known or Expected Score 63
2. You want to find out whether a significantly greater percentage of students at your university
binge-watch television series than the national percentage for binge-watching. Can you
compute a one-sample t test on these data?
a. Yes, you have sample data and a national value for comparison.
b. No, the data are a nominal scale and a chi-square goodness of fit test is
needed.
c. No, students cannot be legitimately compared to a national sample of all television
viewers.
d. You do not have enough information to make a decision.
8. The two types of effect size that are computed for a one-sample t test are
and .
a. Cohen’s d; eta squared (η2)
b. estimated sample size (SEM); percentiles
c. percentage of variability accounted for; Cohen’s df
d. eta squared (η2); confidence intervals
9. Typically we compute the confidence interval.
a. 10%
b. 50%
c. 75%
d. 95%
10. A researcher compared the average salary for a sample of first-generation college graduates
with the average for all college graduates one year postgraduation and found t(99) = –1.59,
p = .07, d = .62. Is there a possibility of a Type II error?
a. Yes, the probability of a Type II error is 7%.
b. Yes, but we do not know the exact probability.
c. No, you cannot have a Type II error when the null hypothesis is rejected.
d. No, you cannot have a Type II error when the effect size is moderate or large.
CHAPTER 8 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying the key concepts.
Review
1. The assumptions of a one-sample t test include the following:
a.
b.
c.
2. In order to calculate a one-sample t test, we must have four statistics. Name them and note
their symbol.
STATISTIC SYMBOL
3. The t value is the value that defines the region of rejection for the
sampling distribution if the null hypothesis is true.
4. To be statistically significant, the calculated or computed t value must be
(greater than/less than) the critical t value.
Application
1. Do fathers who have completed a parenting class spend more time each day talking to their
toddlers than the average number of minutes reported in a national survey of fathers?
a. The researcher should compute a (type of statistical test).
b. Should your alternative hypothesis for this study be directional or nondirectional?
Explain why.
2. A professor finds that the age of students in her class is skewed, and she wants to compare
the median age in her class to that of the university. Why should the professor not use a one-
sample t test to analyze this data?
66 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
3. A political scientist asks voters leaving the polling center for whom they voted in the city
council election; she plans to compare her data to the actual voting outcome. Why should the
researcher not use a one-sample t test to analyze this data?
4. Suppose you found for the study of fathers in Question 1 the result of t(14) = 3.12.
a. The number of participants in this study is . How do you know?
b. The critical t value for this finding for a two-tailed test at the .05 level is .
(Hint: Use Table C.4 in Appendix C of the textbook.)
c. Would your result be significant for p < .05 for a two-tailed test?
d. How do you know?
b. Would this result be significant for p < .01 for a one-tailed test?
How do you know?
c. Why is there a difference in your conclusions for the one- and two-tailed tests?
6. After surveying drivers at multiple sites throughout the country, the National Transportation
Board reports that drivers report using their cell phones during 60% of their trips in their car.
You think that better-educated people will use their cell phones less frequently while driving.
You ask 25 of your classmates during what percentage of trips in their car (of any duration) do
they use their cell phones. Your classmates report a mean of 52%, with a standard deviation of
10%. Do your classmates use their cell phones significantly less than the general population
when driving?
a. State your null hypothesis.
H0: ___________________________________________________________________
c. df =
Chapter 8 ■ Comparing Your Sample to a Known or Expected Score 67
Application
1. A researcher compares the number of calories consumed daily by a sample of obese
5-year-olds to the recommended calories for this age group. As part of her analysis, she
finds η2 = .15.
a. The name of this test is .
b. She should interpret this finding to mean
which suggests a (weak/moderate/strong) effect.
2. Another researcher compares the amount of time a sample of high school juniors spent on
their phones during a school day to the average time found in a national survey of high school
students. He finds d = .35.
a. The name of this test is .
b. He should interpret this result to mean which
suggests a (weak/moderate/strong) effect.
3. a. Compute the effect size using Cohen’s d for Application question 6 from Exercise 8.2
(about cell phone use while driving).
Application
1. The researcher who assessed the time high school juniors spent on their phones (measured
in minutes) during a school day finds (M – µ) = 4.50, 95% CI [3.30, 5.70]. How would you
interpret this finding?
2. Does a researcher prefer a small or large confidence interval? Why?
3. a. Compute the confidence interval for Application question 6 from Exercise 8.2 (about cell
phone use while driving).
1.
3.
4.
1.
3.
5.
Application
1. As part of a complex study of substance users in Australia, the participants’ scores
(M = 53.7, SD = 19.6) on a subjective well being scale were compared with the national
norm (µ = 75.97) for the scale. The analysis found a significant difference, t(131) = 13.1,
Chapter 8 ■ Comparing Your Sample to a Known or Expected Score 69
p < .001 (Tait, 2018). Using the statistics provided in the report, d = 0.57 and 95% CI
[ –25.65, –18.89].
a. Is this an appropriate use of a one-sample t test? Why or why not?
______________________________________________________________________
______________________________________________________________________
b. Interpret the findings of Tait’s (2018) study.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
2. Retrieve the following results from the study for the drivers using cell phones
a. t test and df (formatted in APA style)
(Exercise 8.2, Application question 6c., d.)
b. Cohen’s d (Exercise 8.3, Application
question 3a.)
c. confidence interval (Exercise 8.4,
Application question 3a.)
d. Write a Results section for your findings. Use APA format and include the descriptive
statistics, type of statistical test and results of the test, effect size, and confidence interval.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
e. Write a Discussion section for your findings. Include the findings, interpretation/
explanation/implication of the findings, limitations of the study, and possible next studies.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
3. The Pew Research Center sampled a large national group on basic science facts using a
12-item quiz. The average correct score on the quiz was 7.9. You decide to see whether
a sample of 16 students in a study group at a local bookstore is more knowledgeable than
the general adult public. The output from a one-sample t test comparing the students to
the national average is below.
One-Sample Statistics
One-Sample Test
a. Write a Results section following APA format. Be sure to check with your instructor for
any specific guidelines that apply for your class.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
b. Write a Discussion section following APA format.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
9
EXAMINING RELATIONSHIPS
AMONG YOUR VARIABLES
Correlational Design
LEARNING OBJECTIVES
CHAPTER SUMMARY
The chapter begins with a discussion of correlation research, which investigates the relationship between variables.
Correlation designs are employed for many reasons, such as when the manipulation of variables is impossible or unethical,
in a pilot study to check whether a relationship exists before testing for causation in an experiment, to test validity or
reliability of measures, or to increase external validity. The major drawback of correlation designs is that they do not meet
the requirements for causation. The ethical interpretation of correlation research and analysis is reviewed throughout the
chapter in terms of focusing on relationships and not implying causation, even when a relationship is strong.
The chapter describes types of correlation statistics that are used to analyze the relationship between two variables. These
statistics may be the primary or secondary analysis in a study, and, thus, correlation statistics are used in experimental as
well as correlational designs. The type of correlation statistic that is appropriate is dependent on the scale of measurement
of the two variables being analyzed. The most commonly used correlation is Pearson’s r, which examines the relationship
between two interval or ratio variables. Other correlation statistics include the point-biserial correlation (rpb) between a
dichotomous and interval/ratio variable (described in this chapter), the chi-square test for independence (χ2) between two
nominal variables, and the Spearman rho (rs) between two ordinal variables (both described in Chapter 14).
When there is a statistically significant Pearson’s correlation, a researcher may choose to compute a regression equation
or the equation for the line of best fit for the data (Y ′ = bX + a). This equation can then be used to predict one of the
variables in the relationship from the other. In addition, multiple regression, an analysis that predicts a variable based on
its relationship to two or more other variables, is briefly described.
71
72 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
CHAPTER 9 QUIZ
Test your knowledge of key terms from the chapter.
1. A correlational design .
a. describes the relationship between two variables
b. is a type of study that tests the hypothesis that variables are related
c. determines causation between variables
d. examines causal relationships between three or more variables
2. Correlational designs can have greater external validity than experiments because
correlational designs .
a. better represent the everyday world where we do not manipulate or
control events
b. study true relationships while experiments only manipulate relationships
c. are more attractive to participants
d. are easier to conduct
3. We can increase the power of a correlational study by
a. using measures with floor or ceiling effects.
b. using measures that are valid and reliable.
c. restricting the range of scores for each measure.
d. only using a very small random sample.
4. A correlation coefficient provides these two pieces of information:
a. strength and direction of a relationship.
b. direction and cause of a relationship.
c. strength and prediction of each variable.
d. direction and effect size of a relationship.
5. The value of Pearson’s r ranges between .
a. 0.0 and +1.0
b. –1.0 and 0.0
c. –1.0 and +1.0
d. 0.0 and 10.0
6. The point-biserial correlation coefficient (r pb) is used to analyze the
a. effect of a nominal variable on an interval/ratio variable.
b. effect of an interval/ratio variable on a nominal variable.
c. correlation between variables measured on interval and ratio scales.
d. correlation between a dichotomous and an interval/ratio variable.
7. You might compute a linear regression equation when you .
a. compute a significant correlation between two variables.
b. find a significant correlation for two interval/ratio variables.
c. find a significant rpb.
d. have one dichotomous and one interval/ratio measure and a significant rpb.
Chapter 9 ■ Examining Relationships Among Your Variables 73
CHAPTER 9 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying the key concepts.
Review
1. Fill in the chart below.
2. What is the major disadvantage of correlation designs?
_________________________________________________________________________
3. A valid correlation depends on obtaining a variation in the scores for each measure.
a. When the obtained scores all fall at the upper end of the possible scores, the measure may
have a effect.
b. When the obtained scores all fall at the lower end of the possible scores, the measure may
reflect a effect.
Application
1. Why are the following studies appropriate for a correlation design?
when the scores of one variable increase as the scores of Pearson’s correlation coefficient
another variable decrease
when the scores of two variables either increase together or perfect correlation
decrease together
r=0 no correlation
r = –.76 positive correlation
r = 1.0 negative correlation
graph that depicts the participant scores on two measures strong correlation
statistic assessing the relationship between 2 interval or scattergram
ratio variables
Application
1. The more alcohol consumed, the poorer a person’s scores on a driving skills simulation. This is
an example of a (positive/negative) correlation.
2. As the number of likes for a news story on Facebook increases, the time spent viewing the
story also increases. This is an example of a (positive/negative)
correlation.
3. Suppose you find that the correlation between hours of study and scores on a statistics exam
is .34, while the correlation between math anxiety and the statistics exam is –.63. Which
relationship is stronger? Explain your answer.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
4. The correlation between hours of study and scores on a statistics exam (r = .34) would
be interpreted as a (weak, moderate, strong) correlation, while the
correlation between math anxiety and statistics exam (r = –.63) would be interpreted as a
(weak/moderate/strong) correlation.
5. Examine the graph below.
a. How would you interpret the graph?
______________________________________________________________________
b. What statistic(s) would you compute to check that your interpretation of the contents of
the graph was correct?
150000.00
125000.00
100000.00
Salary
75000.00
50000.00
25000.00
60.00
Total Reminiscing
40.00
20.00
.00
5.00 10.00 15.00 20.00 25.00 30.00 35.00
Total of satisfaction 1, satisfaction 2,
satisfaction 3, satisfaction 4, and satisfaction 5
Application
1. A researcher finds for a group of 30 residents in an assisted living facility that general health status
(assessed on a 20-point scale) and minutes spent exercising per week are correlated, r = .32.
a. State a null hypothesis for the study.
______________________________________________________________________
b. State a directional alternative hypothesis for the study.
______________________________________________________________________
c. What is the df for the study? (Hint: for a Pearson’s r, df = N – 2.)
d. Is the relationship statistically significant for a two-tailed test at p < .05? (Hint: see Table
C.5 in Appendix C of the textbook.) Explain your answer.
______________________________________________________________________
______________________________________________________________________
2. A developmental psychologist observes seven children with their parents, rating levels of
family support and child behavior problems. The data appear below. (Note that this is a very
small sample size for demonstration purposes. If you were conducting an actual study, you
would want a larger sample size.)
1 8 4
2 5 4
3 2 7
4 9 3
5 7 4
6 5 7
7 3 5
Chapter 9 ■ Examining Relationships Among Your Variables 77
d. Using the formula in Chapter 9 of the textbook, calculate Pearson’s r. (Hint: Add
columns to the table for the needed parts of the formula.)
e. Calculate the df
f. Using Table C.5 in Appendix C of the textbook, what is the critical r for this study for a
two-tailed test at p < .05?
g. Is the relationship statistically significant? In other words, is the r you calculated at or
stronger than the critical r?
CORRELATIONS
Openness to New
Gender Stereotypes Experiences
N 21 21
N 21 21
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Chapter 9 ■ Examining Relationships Among Your Variables 79
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Application
1. The relationship between living status (live alone or lives with others) and life satisfaction
(rating scale) can be analyzed using the . The statistical notation for this
correlation statistic is .
2. A researcher is interested in the relationship between owning a home (yes/no) and educational
attainment (years of education completed). After surveying 100 individuals at a local farmer’s
market, he finds rpb = .23.
______________________________________________________________________
______________________________________________________________________
c. Is the relationship significant at the p < .01 for a two-tailed test? ________ Explain your
answer.
______________________________________________________________________
______________________________________________________________________
CORRELATIONS
Pet Owner Happiness
N 30 30
N 30 30
a. Even though the output above says this is a Pearson’s r, what correlation statistic was
computed?
b. Is the relationship statistically significant? How do you know?
CORRELATIONS
Frustration Annual Health
Tolerance Self-Control Income Status
Frustration Pearson Correlation 1 .704* .673* –.436
Tolerance
Sig. (2-tailed) .023 .033 .208
N 10 10 10 10
Self-Control Pearson Correlation .704* 1 .842** –.748*
Sig. (2-tailed) .023 .002 .013
N 10 10 10 10
Annual Income Pearson Correlation .673* .842** 1 –.529
Sig. (2-tailed) .033 .002 .116
N 10 10 10 10
Health Status Pearson Correlation –.436 –.748* –.529 1
Sig. (2-tailed) .208 .013 .116
N 10 10 10 20
Chapter 9 ■ Examining Relationships Among Your Variables 81
Review
1. You could use the regression equation to predict one variable based on another if
.
2. The variable is the predicted variable in a regression.
3. The is the value used to predict another variable.
4. Sometimes we are interested in using a relationship between three or more variables to predict
a variable and, in this case, we compute a , which is symbolized as
.
Review
1. The value that results when you enter a particular X value in a regression equation is called
and is symbolized by .
2. Each X value and its predicted Y value falls on the .
3. The regression equation is always in the format of Y' = bX + a, with b = and
a= .
4. The direction of the slope (positive or negative) is determined by the .
5. The average difference between predicted Y values (Y’) and the actual Y values in the data is
called the and is symbolized by .
6. A small standard error of estimate suggests that .
7. is the proportion of variability accounted for by knowing the
relationship between two variables and is symbolized by .
8. The larger r 2, the (more/less) accurately we can predict Y from X.
82 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Application
A researcher asked 35 participants to rate the importance of environmental issues and how much time
they spend outside. The output is shown below:
Model Summary
Model R R Square Adjusted R Square Std. Error of the Estimate
1 .381 a
.145 .119 5.47401
a. Predictors: (Constant), I think the environment is the most important political issue today.
ANOVAa
Model Sum of Squares df Mean Square F Sig.
1 Regression 167.847 1 167.847 5.601 .024b
Residual 988.839 33 29.965
Total 1156.686 34
a. Dependent Variable: On average, how many hours do you spend outside each week?
b. Predictors: (Constant), I think the environment is the most important political issue today.
Coefficientsa
Unstandardized Standardized
Coefficients Coefficients 95.0% Confidence Interval for B
Std.
Model B Error Beta t Sig. Lower Bound Upper Bound
1 (Constant) -3.008 4.518 -.666 .510 -12.199 6.183
I think the 2.818 1.190 .381 2.367 .024 .396 5.240
environment is the
most important
political issue today.
a. Dependent Variable: On average, how many hours do you spend outside each week?
20.00
Observed
Linear
15.00
10.00
5.00
.00
3.00 3.50 4.00 4.50 5.00
I think the environment is the most important
political issue today.
Chapter 9 ■ Examining Relationships Among Your Variables 83
2. The effect size is . (name the exact value). You interpret this
as meaning . The effect size is .
(strong/moderate/weak).
6. a. W hat is the average deviation between actual hours spent outside and the hours predicted
by the regression equation?
b. If a person rates as 4 (mostly agree) that the environment is the most important political
issue, how many hours do you predict he or she spends outside, according to the
regression equation?
7. Describe these findings as you might in a Results section, using APA format.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
8. Interpret the findings as you would in a Discussion section.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
84 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Correlational Statistics
Point-biserial correlation
coefficient
2. Name the most appropriate inferential statistic and its statistical symbol for the following
questions:
a. Is a measure of hardiness (scores can range from 20 to 100) reliable? High school students
complete a scale assessing the trait of hardiness and then complete the same scale again
one week later.
______________________________________________________________________
b. Are visits to a mental health clinic (yes/no) related to family stress as assessed by a scale
ranging from 0 to 20?
______________________________________________________________________
c. A school counselor and teacher rank students in the class in terms of frequency of
misbehavior. Are the counselor and teacher rankings correlated?
______________________________________________________________________
d. Is SES (high/middle/low) related to political views (conservative/moderate/liberal)?
e. Is suicide ideation (scores range from 0 to 84) related to work-related traumatic stress
(scores range from 0 to 10) among police officers?
______________________________________________________________________
f. If compulsivity and attention to detail are strongly correlated, what is the score for
attention to detail if compulsivity = 25?
______________________________________________________________________
g. Is participation in after-school activities in high school (yes/no) related to self-reported
acts of delinquency (yes/no)?
______________________________________________________________________
h. What is the line of best fit for the relationship between years on a job and salary?
______________________________________________________________________
Chapter 9 ■ Examining Relationships Among Your Variables 85
Your Research
Find and Evaluate a Nonexperimental Study on Your Chapters 8 and 9). List them and explain how the authors
Topic used the statistic.
1. Use your library database to find a primary research
article that describes a nonexperimental study on your
topic (or related to your topic).
2. After you have found an appropriate nonexperimental
study, read the entire article carefully.
3. Look through the entire Results section to identify any
inferential statistics with which you are familiar (from
10
EXAMINING CAUSALITY
LEARNING OBJECTIVES
CHAPTER SUMMARY
How do you determine that one variable caused a change in another variable? Chapter 10 begins by outlining the requirements
for causality, including sequence, correlation, and the ability to rule out alternative explanations. Internal validity is the extent
to which a research design ruled out alternative explanations, and the key threats to internal validity are discussed in detail in
this chapter. These threats are inherent in the nonexperimental designs, such as a pre-/posttest or quasi-experimental designs.
An experiment is the only type of design that can demonstrate causality, although simply conducting a study is not
enough to show causality. This chapter discusses basic issues in designing an experiment including practical and ethical
issues in recruiting participants, random assignment, maintaining experimental control, manipulating the independent
variable, and measuring the dependent variable. The chapter focuses on increasing internal validity of experiments while
acknowledging the importance of balancing internal and external validity. The chapter ends with a discussion of the
strengths and limitations of experimental designs.
CHAPTER 10 QUIZ
Test your knowledge of the key terms from the chapter.
1. is the degree to which one can say that one variable caused a change in another.
a. Power
b. Experiment
c. Internal validity
d. Confound
86
Chapter 10 ■ Examining Causality 87
CHAPTER 10 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
1. What are some terms that researchers use to suggest that one variable caused a change in
another variable?
2. Students sometimes confuse the words “effect” and “affect.” Remember that when discussing
causality, effect is a noun and affect is a verb. Practice by filling in the blanks below with the
word “effect” or “affect”
a. Researchers have examined the of smell on memory.
b. The color of a car might the likelihood of getting into an accident.
c. We hypothesize that spending time in nature will mood.
d. Do you think that taking practice quizzes will have an on your exam score?
3. List the three requirements for causality:
4. What condition(s) for causality are lacking in a correlational design?
Chapter 10 ■ Examining Causality 89
Application
1. The local animal shelter intends to call people who recently adopted a pet to see if having
a pet has reduced their loneliness. Will they be able to show a causal relationship? Why or
why not?
2. Your friend crammed for an exam but did very well, and then concludes that cramming for
exams is more effective than spacing studying over time. Identify at least one alternative
explanation for causality.
a pretest
time
Application
1. A political candidate’s approval rating was very low, a mere 20%, prior to unveiling his
economic plan. After the plan was shared, the candidate’s approval rating rose to 28%.
a. What type of design is this? Be specific. ___________________________________
90 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
b. Identify the threats to internal validity that help explain why this design
cannot demonstrate that the economic plan is the reason for the increased
approval ratings:
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
2. A marketing executive wants to know whether a new website might improve sales. The
executive plans to compare the new website to the original one.
a. Identify two ways that the marketing director can minimize the testing threat to internal
validity.
______________________________________________________________________
______________________________________________________________________
b. Why would it be a problem if the marketing executive gave one group up to 30 minutes
to explore the new website, but gave the other group only 10 minutes to explore the old
website? Identify the specific threat to internal validity.
______________________________________________________________________
______________________________________________________________________
c. Why would it be a problem if the marketing executive had one group look at the new
website in pairs and the other group look at the old website individually? Identify the
specific threat to internal validity.
______________________________________________________________________
______________________________________________________________________
d. Why would it be a problem if the marketing executive had all the people who were under
30 explore the new website and had all those who were 30 and older explore the old
website? Identify the specific threat to internal validity.
______________________________________________________________________
______________________________________________________________________
e. Why would it be a problem if the marketing executive asked those in one group to report
the items they would like to purchase and looked at the online shopping carts of those in
the other group? Identify the specific threat to internal validity.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Chapter 10 ■ Examining Causality 91
Recruit participants
Prescreen for:
Keep everything else constant (the same) across the two conditions
Your Research
Find and Evaluate an Experiment on Your Topic 5. Was there a manipulation check? If so, describe.
Use your library database to find a primary research article that
describes an experiment on your topic (or related to your topic). 6. Identify at least one way, other than random assignment,
Limit your search to experiments published in the last 10 years. that the researchers controlled threats to internal
See Chapter 2 for search strategies. validity.
1. How do you know the study is an experiment?
7. Identify the DV(s). How did the researchers operationally
define the DV(s)?
2. How did the researchers recruit participants (or obtain
animal subjects)?
8. What, if any, threats are there to the internal validity of
the study? (Hint: review the Discussion section.)
3. What type of random assignment did they use?
4. Identify the IV(s). How did the researchers manipulate the 9. How might you build on this study? Include ways that you
IV(s)? might address any threats you identified above.
11
INDEPENDENT-GROUPS DESIGNS
LEARNING OBJECTIVES
CHAPTER SUMMARY
In this chapter, you learned about independent-groups designs in which the participants in different groups are unrelated.
These designs can be correlational, quasi-experimental, or experimental studies. Although the statistical tests used
to analyze the three types of independent-groups designs are the same, the interpretation of the results differs. Both
correlational and quasi-experimental results are described as a correlational relationship between the predictor (IV) and
outcome (DV), while experiments are described as a causal relationship between the IV and DV. The chapter reviewed
the controls and considerations you learned about in previous chapters regarding designing powerful independent-groups
studies.
The simplest independent-groups design involves two groups, and you learned how to analyze the data from such studies
using an independent-samples t test. This statistical test assumes that you have two unrelated groups, an IV (predictor)
and DV (outcome), interval/ratio data, normally distributed DV, and similar variability of scores in the two groups
(homogeneity of variance). The independent-samples t test compares the means of the two groups in relation to an
estimate of the standard deviation of the sampling distribution (standard error of the difference between the means). You
also compute an effect size (Cohen’s d or rpb2) and confidence intervals, either by hand or by using a statistical package
(e.g., SPSS, Excel).
You then learned about multiple-groups designs when you have one IV (predictor) with three or more independent
groups. The advantages of multiple-groups designs (in comparison to two-group designs) were described. The findings
of these designs are analyzed using a one-way analysis of variance (ANOVA), which compares the variability between
groups (treatment variance) to the variability within groups (error variance). The assumptions are similar to those for the
95
96 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
independent-samples t test. If you find a significant F (the end statistic for the ANOVA), you compute
a post hoc test, which examines differences between all possible pairs of the groups in the study. Eta
squared (η2) is the effect size appropriate for the ANOVA, and it is interpreted the same way as rpb2. In
reporting your results for a t test or one-way ANOVA, you should make clear the type of relationship
you have examined (correlational or causal) and discuss the practical and statistical interpretation of
your findings.
CHAPTER 11 QUIZ
Test your knowledge of the key terms from the chapter.
1. Correlational designs include a(n) instead of an IV and a(n)
instead of a DV.
a. predictor; estimated value
b. predictor; outcome
c. outcome; predictor
d. estimated effect; estimated value
2. The interpretation of the results for quasi-experimental designs differs from that of an
experiment because
a. quasi-experiments examine relationships while experiments examine causation.
b. quasi-experiments examine causation between two variables while experiments examine
causation between three or more variables.
c. experiments involve the manipulation of a variable while quasi-experiments do not.
d. experiments manipulate an IV while quasi-experiments manipulate a DV.
3. In a simple experiment, the IV is on a(n) scale of measurement and the DV is
on a(n) scale.
a. nominal; ordinal or ratio
b. ordinal; ratio or interval
c. interval or ratio; nominal
d. nominal; interval or ratio
4. In a simple experiment examining the impact of violent games on aggressive thoughts, we
might have participants play a violent game for 45 min rather than 5 min. This decision
reflects our attempt to maximize power by
a. controlling conditions in our experiment.
b. ensuring that our IV is valid and reliable.
c. using a strong manipulation of our IV.
d. using similar participants in our study.
5. To determine where our mean difference falls on the sampling distribution that predicts no
difference we need to know the standard deviation of the sampling distribution. This term is
called .
a. standard error of the means.
b. standard error of the estimate.
c. standard error of the difference between means.
d. standard deviation of the samples.
Chapter 11 ■ Independent-Groups Designs 97
CHAPTER 11 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
Application
1. A study finds that children who read at home every day are more likely to enjoy school than
children who never read at home. This is a/an (type of design) independent-
groups design; reading at home is the variable and enjoying school is the
variable.
2. A researcher interested in environmental issues completes a study on recycling. She arranges
to have one neighborhood’s recycling picked up each week and a second neighborhood’s
recycling picked up each month. The amount of recycling (in pounds of material) for each
resident in the two neighborhoods is measured over 6 months.
a. What type of study is this?
b. What type of variable does timing of recycling represent?
c. What type of variable does amount of recycled material represent?
d. Can we make assumptions about causality regarding this study? Explain.
______________________________________________________________________
______________________________________________________________________
3. How would you make the recycling study an experiment?
__________________________________________________________________________
__________________________________________________________________________
a. _________________________________________________________________________
b. _________________________________________________________________________
c. _________________________________________________________________________
5. Name two steps you could take to ensure the reliability and validity of the DV in
this study.
__________________________________________________________________________
__________________________________________________________________________
6. Name two potential confounds that you would want to eliminate from this study.
__________________________________________________________________________
__________________________________________________________________________
7. What ethical concerns might arise from the study and how might you address them?
__________________________________________________________________________
__________________________________________________________________________
__________________________________________________________________________
__________________________________________________________________________
__________________________________________________________________________
2. Circle which of the following assumptions should be met in order to appropriately compute
an independent-samples t test.
3. An independent-samples t test is used to determine whether the means of two groups differ
more than we would expect by .
5. Match the symbols on the left (which are associated with analyzing data from two
independent-samples) with the appropriate term or phrase on the right.
µ1 ≠ µ2 a. an effect size that reflects the standard deviation units between the
means of two groups
SD2 pooled c. Range that we are 95% confident that our mean difference varies
t = 0.0 e. a
n effect size that notes the percentage of variability in the DV accounted
for by the IV
rpb2 i. t value that represents the extreme 2.5 % of a distribution for a two-tailed
test
6. In the formula for an independent-samples t test, the difference between the means is divided
by which is symbolized by .
7. The standard error of the difference between means is (larger or smaller) than the
standard deviation of each group in a study because .
_________________________________________________________________________
8. To find the tcrit for a study, we need the following: , , and
.
9. Suppose for one of our groups SD = 1.50 and for the other group SD =15.00. What
assumption of a t test might we be worried about?
_________________________________________________________________________
Application A
A study investigates the effect of different methods of taking notes in class (entering notes on a laptop or
writing notes in longhand on paper) on student retention of material. Students (N = 27) are randomly
assigned to the two conditions. They hear the same 30-minute lecture and 30 minutes later take a short-
answer test on the material (maximum score = 25).
1. What type of independent-groups design is this study?
Explain what characteristics define the design you selected.
_________________________________________________________________________
_________________________________________________________________________
2. State a directional alternative hypothesis for this study.
_________________________________________________________________________
_________________________________________________________________________
3. For this study: IV = ; levels of the IV are
and .
Chapter 11 ■ Independent-Groups Designs 101
DV = ____________________
df = ____________ for a two-tailed test at p < .05, tcrit = ___________________
4. Suppose Mlaptop = 18; Mlonghand = 22; t(25) = 5.32. Are these results statistically significant for a
two-tailed test at p < .05? Explain.
_________________________________________________________________________
_________________________________________________________________________
5. The researcher finds rpb2 = .20. Interpret the meaning of this finding.
_________________________________________________________________________
_________________________________________________________________________
6. Suppose 95% CI [2.65, 5.38] for the study. Write a sentence describing the meaning of this result.
_________________________________________________________________________
_________________________________________________________________________
7. In another study exploring the same variables, 95% CI [1.65, 7.32]. Which of these confidence
intervals would a researcher be more satisfied with? . Explain your choice.
_________________________________________________________________________
_________________________________________________________________________
8. A second study examines the effect of the two types of note taking on a multiple-choice test
(rather than a short-answer test). The results show t(38) = 1.98. Are these results statistically
significant for a two-tailed test at p < .05? Explain.
_________________________________________________________________________
_________________________________________________________________________
9. The researcher for this study finds that d = .15. Interpret the meaning of this finding.
_________________________________________________________________________
_________________________________________________________________________
Application B
A developmental psychologist observed seven children with their parents. She recorded whether or not
the parents used any harsh language (yes or no) and rated the level of child behavior problems exhibited.
The data appear below.
1 No 4
2 No 4
3 Yes 7
4 No 3
5 No 4
6 Yes 7
7 Yes 5
102 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
1. Using the formula in Chapter 11 of the textbook or the Computational Formula in Appendix
D.6, calculate t
2. Calculate the df
3. Using Table C.4 in Appendix C, are the results statistically significant for a two-tailed test at
p < .05? For p < .01?
4. Compute an effect size and interpret its meaning.
5. Compute the confidence interval for the mean difference and interpret its meaning.
Application C
A student is interested in whether faculty who use humor in the classroom are liked more than faculty
who do not use humor. She arranges for a graduate student to be the “guest teacher” in two sections of
the same class. The content of the class is the same but the graduate student uses humor in one section
and omits humor in the other class. At the end of each class, the students complete a survey about the
guest teacher, including how much they liked the class.
Group Statistics
Teach Group N Mean Std. Deviation Std. Error Mean
Likability Humor 12 8.1250 2.97050 .85751
No humor 12 6.1667 1.19342 .34451
Standardizer a
Point Estimate Lower Upper
5. How does this change your interpretation of and conclusions about the output?
7. Write a few sentences summarizing the findings as you would in a Results section.
104 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
9. Name a possible flaw in the study and at least one suggestion for future studies.
Application D
Are male presidents of community colleges paid a higher salary than female presidents at these
institutions? A researcher documents the salaries of 22 presidents who were hired within the last 2 years.
Use the output in the following tables to answer questions 4-7. Values/dollars are expressed in thousands.
Group Statistics
Levene’s Test
for Equality of
Variances t test for Equality of Means
7. Based on these findings, what conclusions can you make about gender and community
college presidents’ salaries?
6. F = variance/ variance.
7. If our null hypothesis is true (our treatment has no effect), we would expect F =
.
8. Complete the 5 assumptions for computing a one-way ANOVA:
IV has ______________________groups.
Groups are ________________________.
DV is ____________________________scale.
DV is ____________________________.
Variability in groups is ________________; also called __________________________.
9. The values computed for an ANOVA are presented in a table.
10. The first step in computing an ANOVA is to subtract the mean from each of its scores and
then square the deviation scores and sum them. This value is called the
.
11. We also subtract the total mean from each group mean, square the deviation scores and sum
them. This value is called the .
12. We have two dfs in a one-way ANOVA, one for groups and one for
groups.
13. Using the formulas in Chapter 11 of the textbook, complete the following Summary Table for
an experiment comparing four independent groups:
14. a. Calculate the effect size (η2) using the formula in Chapter 11 of the textbook:
______________________________________________________________________
b. What does this effect size tell you?
______________________________________________________________________
15. Based on the summary table:
a. How many total participants are in the study?
b. If the n is equal in each condition, how many participants are in each condition?
c. Is the result statistically significant at p < .05? How can you tell?
______________________________________________________________________
d. What test should you compute next and why should you do this?
______________________________________________________________________
e. How many paired comparisons are there for this study?
Application A
A social psychologist is interested in the impact of men’s clothing on views of their attractiveness.
College students are randomly assigned to a condition in the study. The output for the study is below.
Chapter 11 ■ Independent-Groups Designs 107
Descriptives
Attractiveness
95% Confidence Interval for
Mean
Std. Upper
N Mean Deviation Std. Error Lower Bound Bound Minimum Maximum
Jeans & Polo 8 7.1250 .76111 .26909 6.4887 7.7613 5.50 8.00
Jeans & T-shirt 8 6.0500 .84007 .29701 5.3477 6.7523 4.50 6.80
Suit 8 4.5000 1.06904 .37796 3.6063 5.3937 3.00 6.00
Pants & vest 8 7.8750 .79057 .27951 7.2141 8.5359 7.00 9.00
Total 32 6.3875 1.53281 .27097 5.8349 6.9401 3.00 9.00
ANOVA
Attractiveness
Sum of Squares df Mean Square F Sig.
Between Groups 51.465 3 17.155 22.477 .000
Within Groups 21.370 28 .763
Total 72.835 31
5. Was the homogeneity of variance assumption violated in this study? Explain how you know.
_________________________________________________________________________
6. What is the effect size for the study and its strength?
7. List the comparisons that met the criteria for statistical significance at the p < .05. Indicate
the exact p value for each of these comparisons, except report p < .001 when SPSS reports a p
of .000.
_________________________________________________________________________
_________________________________________________________________________
8. Report your findings as if you were writing a Results section. Be sure to identify the IV and
DV, descriptive statistics, and all of the required statistical analyses and their findings. Use
APA format.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
9. Interpret the results as you would in a Discussion section. Include one limitation and one way
to address the limitation.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
Application B
Iam Phuny is investigating the effect of facial expression on ratings of females’ job success. Participants
are randomly assigned to view a picture showing only the head and shoulders of a woman (similar to
one that would appear in an online announcement on a business website or newsletter). They then rate
the female on several characteristics using a 10-point scale (1= not at all to 10 = extremely). Iam is only
interested in ratings of job success. The results of the analysis are below.
Descriptives
Report
Job Success
Facial Express Mean N Std. Deviation
Neutral 7.4667 15 1.95911
Frown 5.7333 15 2.31352
Smile 5.6000 15 2.26148
Total 6.2667 45 2.30020
Chapter 11 ■ Independent-Groups Designs 109
ANOVA
Job Success
Total 232.800 44
Multiple Comparisons
Dependent Variable: Job Success
LSD
95% Confidence
Interval
Mean
(I) Facial (J) Facial Difference Std. Lower Upper
Express Express (I–J) Error Sig. Bound Bound
Smile 1.86667 *
.79735 .024 .2575 3.4758
*
The mean difference is significant at the 0.05 level.
7. Describe two potential threats to internal validity and what the researcher can do to avoid/
minimize each threat.
i. ______________________________________________________________________
______________________________________________________________________
ii. ______________________________________________________________________
______________________________________________________________________
8. What is the total number of participants in the study?
9. What is the probability of a Type I error? Type II error?
10. Graph the results (the group means).
11. Describe the results in APA format.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
12. Comment on the external validity of the study.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
13. What would you recommend to employed females?
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
14. Discuss the statistical significance, effect size, and practical significance for this study.
_________________________________________________________________________
_________________________________________________________________________
_________________________________________________________________________
b. Jack is conducting a study on math anxiety. He has participants take the same 10-item
math test, but the groups differ in how long they have to complete the test (10 minutes,
20 minutes, 30 minutes). He assigns the first 25 participants to the 30-minute group, the
next 25 participants to the 20-minute group, and the last 25 participants to the 10-minute
group. He finds that the 30-minute group members score significantly better on the exam
and report lower anxiety, and he concludes that math teachers should allow more time for
all tests because it can cause a decrease in math anxiety.
______________________________________________________________________
______________________________________________________________________
c. Sharon has just completed a study on the effect of different fonts on reading
comprehension. Her analysis shows F(2, 47) = 1.04, p = .45. She reports that she had a
45% chance of a Type I error and no chance of a Type II error.
______________________________________________________________________
______________________________________________________________________
2. For the following studies, name the type of design and what characteristics define the design,
the appropriate statistical analyses, and the appropriate interpretation. We have completed
some of the table for you.
Inferential Statistic
Study Design Characteristics and Effect Size Interpretation
Does choice impact enjoyment? Fifth graders in two two groups, IV independent- samples correlation
one class were assigned a book by their teacher independent manipulated, DV is t test, with either rpb2 or
while students in a second class chose from 10 groups, quasi- interval, no random Cohen’s d as the effect
books. After two weeks, students rated how much experiment assignment size and confidence
they enjoyed reading their book on 10-point scale. interval for the mean
difference
Your Research
1. State a hypothesis for a simple experiment on your topic 4. What inferential statistic would you use to test your
(one IV, with two levels, and one DV). This should be based hypothesis? Explain.
on past research on your topic.
5. Which of these two designs is more appropriate, given the
2. What inferential statistic would you use to test your state of the research on this topic? Explain.
hypothesis? Explain.
3. Turn your simple experiment into a multiple-groups
design by adding at least one more level to your DV. This
should be based on past research on your topic.
12
DEPENDENT-GROUPS DESIGNS
LEARNING OBJECTIVES
CHAPTER SUMMARY
In this chapter, you learned about two types of dependent designs: matching and repeated measures. Their major advantage
is the increased power they provide by decreasing variability caused by the unique characteristics of participants. By
decreasing error variability, you should be able to better see the effect of your IV (in experiments) or predictor variable (in
correlational studies). The two types of dependent designs reduce participant variability in different ways. In a matched
design, you identify a variable that is highly correlated to the dependent (or outcome) variable and match participants on
this variable. You then ensure that one of the matched partners is in each of the study conditions. In a repeated-measures
design, each participant experiences each condition in the study. Dependent designs may involve two groups or multiple
groups, but the rationale and implementation are the same.
As with all designs, the dependent designs have potential disadvantages. Both dependent designs require extra steps
that can complicate the implementation of the study. In a matched design, you must be able to identify an appropriate
matching variable that doesn’t sensitize the participants to your hypothesis or the purpose of the study. Repeated-measures
designs are vulnerable to sensitization, fatigue, practice, and carryover effects and require counterbalancing of the order of
treatments. In addition, you can’t use a repeated-measures design when your IV creates a permanent change.
In addition to learning about the types of dependent designs, you learned about the statistics used to analyze the data
generated from them. The dependent-samples (or paired-samples or repeated-measures) t test examines the differences
between matched pairs, or the same participant under different conditions, and compares this mean difference to the
error within conditions. The dependent-samples (or within subjects or repeated measures) one-way ANOVA is used
113
114 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
to analyze the data from a dependent multiple-groups study. This ANOVA separates error variability
into the variability associated with a particular participant (in a repeated-measures study) or with the
matched participants (in a matched design) and the variability that is error due to other sources (subject
by condition interaction).
As a researcher, you should consider the advantages and disadvantages of different designs before deciding
what type of design (independent or dependent and correlation, quasi-experiment, or experiment) will
best test your hypothesis.
CHAPTER 12 QUIZ
Test your knowledge of the key terms from the chapter.
1. Where does the power of the dependent-groups design come from?
a. the large sample size
b. the stronger manipulation
c. the more sensitive dependent variable
d. the decrease in random error that is created by participant characteristics
2. The matching variable you choose should be ______
a. highly correlated with your dependent variable.
b. the same as your dependent variable.
c. the same as your independent variable.
d. something that increases the variance between groups.
3. Which of the following is true about the relationship between design type and external validity?
a. A matched design with the same variety of participant characteristics as an independent
design will have less external validity than the independent design.
b. A matched design with the same variety of participant characteristics as an independent
design will have the same external validity as the independent design.
c. A matched design with the same variety of participant characteristics as an independent
design will have more external validity than the independent design.
d. There is no relationship between design type and external validity.
4. Counterbalancing eliminates ______
a. attrition.
b. order effects.
c. mortality.
d. power.
5. If the impact of the treatment or measurement lasts longer than the time between different
conditions, this is known as ______
a. boredom.
b. practice.
c. fatigue.
d. carryover.
Chapter 12 ■ Dependent-Groups Designs 115
6. The average difference between the scores of matched pairs or the scores for the same
participants across two conditions is known as ______
a. the variance.
b. the mean difference.
c. the standard deviation.
d. the standard error of the mean difference.
7. ______ is when participants are randomly assigned to different sequences of conditions
so that each condition is represented in each order an equal number of times but not all
sequences are represented.
a. Complete counterbalancing
b. Partial counterbalancing
c. Latin Square counterbalancing
d. Randomized partial counterbalancing
8. The assumption that the variances of the differences between all the combinations of pairs of
groups are equal is known as ______
a. sphericity.
b. deviance of variance.
c. skewness.
d. counterbalancing.
9. The error associated with participants in the same condition or interaction sums of squares is
symbolized as ______
a. SSs.
b. SSAxB .
c. SSi.
d. Iss.
10. ______ is used to assess the effect size of a dependent-groups ANOVA.
a. Pearson’s r
b. rpb2
c. Partial eta
d. Partial eta squared
CHAPTER 12 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
Review
1. Put an I by the characteristics of independent-groups designs and D by characteristics of
dependent-groups designs. (Hint: some characteristics fit both types of designs.)
_____________ Can be correlational, quasi-experiments, or experiments
_____________ Can examine causation or correlation
_____________ Are the more powerful design
_____________ May have the same participants in more than one condition
_____________ Achieve equal groups through random assignment
_____________ Decrease random error created by participant characteristics
_____________ More vulnerable to confounds due to participant variables
_____________ More sensitive to changes in the DV created by changes in the IV
_____________ Can be matched or repeated measures designs
2. Distinguish matched and repeated measures designs.
3. a. What is different about the matching process for correlational and quasi-experiments and
the matching process for experiments?
b. How is the matching process the same for the different types of dependent designs?
4. Another name for a repeated-measures design is design.
5. Explain the difference between a repeated-measures design and the use of multiple trials in a study.
5. Name three potential problems of repeated-measures designs and how these problems might
occur.
a. _______________________________________________________________________
b. _______________________________________________________________________
c. _______________________________________________________________________
b. What matching variable might you suggest for this study and why?
2. Another researcher decides to replicate the language instruction study but matches
participants on the variable you suggested and on their time since arrival in the United States.
Explain why this might be a good or bad idea.
assigned to the two groups, and enter the matched pairs’ participant numbers in the
last column.
2 18 12 13
3 12 13 14
4 14 14 17
5 21 15 20
6 15 16 23
7 11 17 15
8 16 18 25
9 6 19 7
10 5 20 6
1. A gerontologist studying the effect of mental stimulation on the alertness of those 90 years or
older has 8 participants available. They are randomly assigned to the order of two conditions:
playing a word game for 10 minutes and watching a comedy segment for 10 minutes.
Alertness is assessed after each session. In order not to overtax the energy of the participants,
the researcher has the participants complete both of the 10-minute sessions back-to-back.
What potential confound(s) can you identify in this study and how would you correct it/
them?
2. A high school basketball coach wants to try a different approach to teaching her players how
to make free throws. She has a session one day when she gives her regular instruction, has
players practice free throws for 15 minutes, and then has each team member attempt 10 free
throws. The next day, she follows the same pattern but tries a new method of instruction
before the free throw practice and attempts. She finds that the players make an average of 1.5
more free throws on the second day. Name a potential confound in this exercise and explain
how to correct it.
3. A teacher wants to examine which of two methods of teaching long division is more effective.
Can the teacher use a repeated-measures design? Why or why not?
4. Ima Ipad is a senior psychology major. For her senior thesis, she wants to study the effect
of sleep deprivation on driving skills. After months of negotiating with the engineering
department, she is granted access to the needed driving simulation equipment. She must
Chapter 12 ■ Dependent-Groups Designs 119
now decide whether to use an independent or dependent design. What would you advise
her and why?
Application A
Suppose we find in a study comparing student retention in an online (M = 7.08, SD = 1.08)
versus in-person (M = 7.83, SD = 1.53) lecture with 24 students paired on their GPA that tobt =
2.46. Retention was measured with a 10-point multiple choice test.
1. df = ________________
2. tcrit for a two-tailed test, p < .05 = ________________
3. Do the two lecture presentation modes result in a statistically significant difference in student
retention? ________________ How do you know?
4. If SED = .30, compute the 95% confidence interval using the following formula:
(SED) (–tcrit) + (MD) ≤ µD ≤ (SED) (+ tcrit) + (MD)
120 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
5. If SDD = 1.06, compute Cohen’s d as the effect size for the study and comment on its strength.
(See the effect size formula for two dependent samples in Chapter 12.)
6. Comment on the practical significance of the study’s findings.
Application B
Suppose you are the gerontologist above who is examining the effect of two types of stimulation (30
minutes of word games or comedy on TV) on the alertness of those who are over 90 years of age. Each
participant is randomly assigned to the order of the two stimulations, which are provided two days
apart. Alertness is measured on a 25-point scale immediately following each stimulation session.
95% Confidence
Interval of the
Std.
Difference
Error
Mean Std. Deviation Mean Lower Upper t Df Sig. (2-tailed)
Pair 1 word game – comedy 1.50000 1.69031 .59761 .08687 2.91313 2.510 7 .040
Chapter 12 ■ Dependent-Groups Designs 121
4. df = ______________________
5. How many participants were in the study? ______________________ How do you know?
6. Does mental alertness differ significantly for elderly people after playing word games vs.
watching comedy? Explain your response.
7. Compute the effect size for the results using the formula for dependent samples.
8. Describe the findings in APA format, including all information that is expected in a Results
section.
9. Interpret the results as you would in a Discussion section. Include one limitation and one way
to address the limitation.
Application C
Are people with harder-to-pronounce names considered more threatening than those with easier-to-
pronounce names? A researcher matches participants (college students) on a scale measuring openness
to new experiences. One of each matched pair of students is assigned to rate people with easy-to-
pronounce names while the other rates people with hard-to-pronounce names with the same number
of letters. All students view the same pictures of a diverse group of 10 adults who are rated on several
characteristics. The findings for ratings of “threatening” are shown below. Threatening was rated on a
7-point scale, with higher scores reflecting more perceived threat.
Pair 1 easy pronounce −.93333 1.94447 .50206 −2.01014 .14348 −1.859 14 .084
- hard
pronounce
Review
Application
1. Suppose a weight loss program was examining the effect of weighing every day (1), every other
day (2), every third day (3), or once a week (4) on participants’ confidence that they could lose
weight. Participants were matched on their age and BMI.
a. The possible number of sequences for this study is _________________________.
b. If the experimenter randomly assigned participants to all possible sequences, this would be
an example of _________________________ counterbalancing.
c. The experimenter decided to randomly assign each group of participants to the following
sequences: 1234, 2341, 3412, 4123.
This is an example of _________________________ counterbalancing.
d. Evaluate the matching variables. Can you think of a more effective matching variable?
124 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
2. A researcher explores the impact of time outside on third graders’ completion of work in the
classroom. Each day of one week each child spends 10, 15, 20, 25, or 30 minutes outside
(every child spends each of the time periods outside over the course of one week). The
completion of work immediately after the child returns to the classroom is measured.
a. What type of design is this? _______________________________
b. What potential confound do you see in the study?
______________________________________________________________________
c. What additional step would the researcher need to take to avoid this confound? Explain
in detail what the researcher would need to do.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
3. A researcher is investigating the impact of verbal reinforcement on 4-year-olds’ scores on a
vocabulary test. The test is individually administered and requires the examiner to read a
question and then have the child choose a picture from several choices. He uses a sample
of children who receive 10 questions followed by verbal reinforcement (“good”) for correct
answers only (C), 10 questions followed by verbal reinforcement for both correct and
incorrect answers (B), and 10 questions followed by no reinforcement (N).
a. How many different orders of the reinforcement conditions are possible?
______________________________________________________________________
b. Show how you would use complete counterbalancing to decide the order of the conditions
(C, B, N) that each child would receive. Remember that the number of orders should
match the number you named in the previous question.
______________________________________________________________________
c. How would the process of assigning children to conditions differ if you used Latin square
counterbalancing? Name the specific orders you might use.
______________________________________________________________________
______________________________________________________________________
Equal variances for the differences between Measurement variable that is normally
pairs of groups distributed
Dependent groups
Chapter 12 ■ Dependent-Groups Designs 125
3. The term used to describe equal variances for the differences between groups in a one-way
dependent-groups ANOVA is _____________________.
4. Match the symbols on the left (associated with analyzing data from a within-subjects
ANOVA) with the appropriate term or phrase on the right.
SSS µ1 ≠ µ 2 ≠ µ 3
ηpartial
2
(k – 1)(kps – 1)
Application A
1. Complete the following summary table.
Source SS DF MS F
Condition (A) 126.20 2 ?? ??
Subject ?? ??
Interaction (A x S) 22.47 ?? ??
Total 224.47 29
2. Are the results of this study statistically significant? _________________ How do you know?
3. Compute the effect size using information from the Summary Table above. _______________
4. Interpret the meaning of the effect size.
5. What next step (in terms of analysis) should you take?
6. How many paired comparisons should you make in the post hoc tests?
7. Suppose the following descriptive statistics are found for the groups in the study.
Descriptive Statistics
Mean Std. Deviation N
Compute LSD and complete the matrix below. Note with an asterisk which paired-comparisons meet
the criteria for statistical significance at p < .05.
126 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
1 2 3
Means (15.400) (15.500) (19.800)
Group 1 — ?? ??
Group 2 — — ??
Group 3 — — —
*
p < .05
Application B
1. State a null and an alternative hypothesis for a study that examined the effect of weighing
every day (1), every other day (2), every third day (3), or once a week (4) on participants’
confidence that they could lose weight.
H0: ___________________________________________________________
Ha: ___________________________________________________________
2. Complete the Summary Table below for the weigh-in study.
Source SS DF MS F
Weighing (A) 24.60 ?? ?? ??
Subject 72.72 36
Interaction (A x S) ?? ?? ??
Total 151.32 39
3. What is Fcrit for p < .01? ___________ Are these results significant at p < .01? ___________
4. Are these results significant at p < .05? ___________ How do you know?
5. Compute the effect size and interpret its meaning.
6. How many paired comparisons should you make in the post hoc tests? ___________
7. The descriptive statistics for the groups are shown below. Compute LSD and draw a matrix to
show which groups differ significantly from one another.
Daily 2.6
Weekly 7.2
Application C
A researcher replicated the weigh-in study described in Application B in a geographic region that is
known for its high rate of obesity and examined the effect of weighing every day (1), every other day
(2), every third day (3), or once a week (4) on participants’ confidence that they could lose weight. The
output for this study is found below.
Chapter 12 ■ Dependent-Groups Designs 127
Descriptive Statistics
Mean Std. Deviation N
Daily 2.8750 1.24642 8
every other day 3.0000 1.30931 8
every third day 4.0000 1.06904 8
Weekly 6.5000 1.60357 8
Tests the null hypothesis that the error covariance matrix of the orthonormalized transformed dependent variables is
proportional to an identity matrix.
a. Design: Intercept Within Subjects Design: factor1
b. May be used to adjust the degrees of freedom for the averaged tests of significance. Corrected tests are displayed in
the Tests of Within-Subjects Effects table.
Pairwise Comparisons
Measure: weigh-in
(I) factor1 (J) factor1 Mean Difference (I-J) Std. Error Sig.b Lower Bound Upper Bound
1 2 −.125 .441 1.000 −1.727 1.477
3 −1.125 .479 .308 −2.868 .618
4 −3.625* .420 .000 −5.152 −2.098
2 1 .125 .441 1.000 −1.477 1.727
3 −1.000 .423 .299 −2.536 .536
4 −3.500* .567 .003 −5.561 −1.439
3 1 1.125 .479 .308 −.618 2.868
2 1.000 .423 .299 −.536 2.536
4 −2.500* .681 .048 −4.977 −.023
4 1 3.625* .420 .000 2.098 5.152
2 3.500* .567 .003 1.439 5.561
3 2.500* .681 .048 .023 4.977
Based on estimated marginal means
. The mean difference is significant at the .05 level.
*
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
b. A social service director offers clients the option to attend a 6-week job training course
or a 6-week life skills course. Of the 50 clients contacted, 34 agree to participate in the
courses. After matching the participating clients on depression, the director assigns one of
each matched pair to the course taught on the night most convenient for them. Six clients
drop out of the courses over the 6 weeks (4 from job training and 2 from life skills). At
the end of the 6-week period, the remaining matched clients report significantly less
depression after taking the life skills course than the job skills course, t(10) = 3.65, p < .05,
d = .20. The director concludes on the basis of these findings that the agency will now
Chapter 12 ■ Dependent-Groups Designs 129
require all clients to attend the life skills course because attendance in the course causes a
reduction in depression.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
c. A psychologist is hired to try to increase the productivity of employees at a call center.
He decides to test the effect of different types of appreciation on increasing productivity.
He matches the employees at the call center on extraversion and randomly assigns one
of each group to one of three types of appreciation (a weekly free meal provided at work,
an extra hour during one lunch each week, or a free movie pass each week). The study
runs for one month, after which productivity of each employee in the study is measured.
The researcher finds no difference in the productivity of the groups F(2, 38) = 1.58,
p = .16, η 2partial = .20 . The researcher concludes that different types of appreciation do not
increase productivity.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
d. A trainer who offers diversity workshops is interested in the effect of different techniques
on reducing prejudice toward Black individuals in the workplace. Twenty managers of
small businesses volunteer to take her workshop which consists of 4 two-hour zoom
sessions, offered one a week for four weeks. Each session employs a different focus to
reduce prejudice (1. general information and discussion about prejudice, 2. information
and role-plays about attracting and hiring Black applicants, 3. facts about prejudice and
microaggressions toward Black employees and discussion about how to build positive
relationships, 4. role-playing positive interactions with Black employees). Participants
are randomly assigned to the order of the 4 sessions and complete a different form of the
prejudice scale at the end of each session. The analysis finds no difference in the prejudice
scores at the end of the workshop. Upon further examination, however, the trainer
discovers that participation, questions, and discussion were greatly reduced in the last 2
sessions regardless of their focus. ____________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
130 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
2. For the following studies, name the type of design and the characteristics that define the
design, the appropriate statistical analyses, and the appropriate interpretation. We have
completed some of this for you.
Inferential Statistic
Study Design Characteristics and Effect Size Interpretation
Does alcohol positively impact video game Dependent two Matched pairs, 1 Dependent- samples Causation
performance? Young adults are matched on matched groups, IV on a nominal t test, Cohen’s d
the amount of time they report playing video experiment scale with random or squared point-
games and randomly assigned to drink either assignment of pairs biserial correlation
12 oz. of water or 12 oz. of alcohol. Game to manipulated as effect size,
scores are recorded after the participants condition, DV on ratio Confidence interval
play a video game for 30 minutes. scale
Are sports fans more likely to attend to ads Multiple conditions Correlation
promoting beer, cars, food, or technology? (4) are manipulated,
Patrons at a sports bar are observed as Participants are
prerecorded ads are shown during a pro exposed to all
football game. The percentage of the relevant conditions but not
ads they watch is recorded. randomly assigned
to their order, IV is
nominal, DV is ratio
Your Research
1. In the “Your Research” exercise in Chapter 11 of this b. Briefly describe the procedures you would follow
study guide, you selected either a simple experiment for the design you chose.
or a multiple-group design. Restate the hypothesis for
the design you chose.
c. What inferential statistic would you use to test
your hypothesis? Explain.
2. Is it possible to use a dependent-groups design, either
matching or repeated measures, to test this hypothesis?
If no, explain why not.
a. Which would be a better design, matching or
repeated measures? Explain.
13
FACTORIAL DESIGNS
LEARNING OBJECTIVES
CHAPTER SUMMARY
Examining the impact of one variable on another might provide an incomplete picture, and often it is important to
understand the interaction of multiple variables. Chapter 13 begins by introducing basic concepts in factorial designs,
including the different types of factorials, factorial notation, and the difference between main effects and interaction
effects. Next is a discussion of the rationale for factorial designs, with an emphasis on the benefits of examining complex
relationships and controlling extraneous or confounding variables.
The second part of the chapter focuses on 2 × 2 designs. The chapter includes information on how to graph these designs
and provides examples of different graphs with and without interaction effects. Strategies for articulating interaction
hypotheses are provided, with three typical interaction patterns outlined along with example graphs demonstrating each
pattern. The chapter provides details on how to analyze independent-groups designs using formulas and provides an
example output generated by SPSS. The chapter also provides an overview of independent designs with more than two
factors or with a nominal outcome or dependent variable, dependent-groups factorial designs, and mixed designs. The
chapter ends by suggesting that the factorial design allows one to avoid overly simple explanations and reminds students
of the importance of ethical and critical thinking when analyzing results.
CHAPTER 13 QUIZ
Test your knowledge of the key terms from the chapter.
1. A(n) is a general name for a predictor or independent variable.
a. IV b. DV
c. factor d. factorial
132
Chapter 13 ■ Factorial Designs 133
9. A researcher hypothesizes that children will develop a stronger sense of self-esteem when
their families have regular dinners together and that this is especially true for children with
learning disabilities. The best analysis to test this hypothesis is
a. an independent-samples t test.
b. a one-way between-subjects ANOVA.
c. a mixed ANOVA.
d. a two-way between-subjects ANOVA.
CHAPTER 13 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
Application
Use the following variables to create specific types of factorial designs:
b. Drinking tea before going to sleep will improve the quantity and
quality of sleep compared to drinking water.
c. Distractions will impede performance on the math test but
will improve performance on a creativity test, and the opposite effect will occur in a
distraction-free environment.
d. Compared to a placebo, taking medicine to go to sleep will
increase the quantity of sleep but decrease the quality of sleep.
e. Both television viewing and social media use prior to bedtime will
decrease the quality of sleep.
2. For the interaction you identified above, identify:
a. Factor 1: ; levels:
b. Factor 2: ; levels:
c. Outcome/DV:
3. Which of the following graphs best fit the interaction:
10 9 9
9 8 8
8 7 7
7
Performance
Performance
Performance
6 6
6
5 5
5
4 4 4
3 3 3
2 2 2
1 1 1
0 0 0
Math test Creativity test Math test Creativity test Math test Creativity test
A. B. C.
no distractions distractions
Pet
none cat
50–64
50—64 4 7
Age
65—80 4 7 65–80
5
Happiness
0
No pet Cat
Chapter 13 ■ Factorial Designs 137
5. Does the steepness and direction of the lines in the graph indicate
a. A main effect for pet?
b. A main effect for age?
c. An interaction?
6. Graph the results of a study on income:
Degree
Associates Bachelors
English 40,000 55,000
Major
70
60
50
40
30
Associates Bachelors
7. Does the steepness and direction of the lines in the graph indicate
a. A main effect for degree? _______________
b. A main effect for major? _______________
c. An interaction? _______________
8. Graph the results of a study on quiz scores:
A or B C or less
Attended 9.5 8
Did not attend 7.5 6
10
A or B
9 C or less
8
Quiz Score
4
Attended study session Did not attend
9. Does the steepness and direction of the lines in the graph indicate
a. A main effect for study session? ______________
b. A main effect for current grade? ______________
c. An interaction? ______________
138 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
10. For each interaction you identified above, describe the pattern:
_________________________________________________________________________
_________________________________________________________________________
5. If the researcher found a statistically significant interaction and wanted to determine if the
difference in cell means were statistically significant, the researcher could conduct follow-up
post hoc analyses by conducting . (Identify the specific
type of analysis.)
Application A
Apply the formulas using hand calculations.
A researcher conducts a 2 × 2 factorial with 80 participants and equal numbers in all four cells.
1. Fill in the missing information in the ANOVA summary table:
2. Look at Appendix C.6 to determine whether the F value for the main effects and interaction
meet the criteria for statistical significance at p < .05.
Interaction effect
3. Compute the effect size (η 2partial ) for each main effect and the interaction using the formula
ηpartial
2
= SSB / ( SSB + SSw ) .
η 2partial
Main effect for factor 1
Interaction effect
Application B
Interpret output from a data analysis program.
A researcher hypothesizes that ADHD symptoms will exacerbate the distracting impact of a cell phone
alert during a lecture. Abbreviated SPSS output appears below:
Use these results to fill in the blanks below. Round your answers to two decimal places, except when
important information for the p value would be lost with rounding (e.g., p = .004 or p = .045).
1. What type of analysis is appropriate when you have three factors and an interval or ratio
outcome variable and…
a. all factors are independent
b. all factors are dependent
c. two factors are independent and the other is dependent
2. If you have five independent factors and an interval or ratio outcome:
a. what type of ANOVA should you conduct?
b. how many main effects would you test?
c. how many interactions would you test? Recall that the formula for the number of
interactions is:
2k – 1 – k
where k = number of factors
Chapter 13 ■ Factorial Designs 141
Application
1. A researcher designs a study to predict number of calories consumed per week based on
marital status (married vs. cohabitating vs. single) and home ownership (rent vs. own).
a. What type of design is this? Include the factorial notation in your answer.
Application
Indicate which of these tests you should use to analyze the following designs:
a. chi-square test of independence
b. two-way between-subjects ANOVA
c. two-way within-subjects ANOVA
d. mixed ANOVA
Your Research
Develop an interaction hypothesis for a 2 × 2 as gender, age, or personality, will moderate the
experimental or hybrid factorial design. effect.
1. State a hypothesis for a simple experiment on your 3. Based on the option you chose and the research
topic (with one IV, with two levels, and one DV). This article(s), develop an interaction hypothesis that
should be based on past research on your topic. suggests one of the following:
a. A certain condition (option 1) or participant
characteristic (option 2) will strengthen or weaken
2. Go back to the research on your topic to decide between the effect you stated in your simple experiment
these two options: hypothesis.
1. Option 1 (experimental factorial): Does research b. The simple experiment hypothesis you stated only
suggest that the IV will impact the DV under all occurs under a certain condition or for certain
conditions? If not, find at least one research participants.
article that suggests that a second IV will c. The direction (e.g., positive or negative) of the
moderate the effect. effect of your IV on your DV depends on a certain
condition or participant characteristic.
2. Option 2 (hybrid factorial): Does research suggest
that the IV will impact the DV for all participants?
If not, find at least one research article that
suggests that a participant characteristic, such 4. Graph your hypothesized interaction.
14
NONPARAMETRIC STATISTICS
LEARNING OBJECTIVES
CHAPTER SUMMARY
Nonparametric statistics are used with nominal or ordinal data, or when the data violate the assumptions required for
parametric statistics. Chapter 14 outlines the key differences between nonparametric and parametric statistics and then
provides information about when to use the various nonparametric tests.
Because chi-square analyses (nominal data) and Spearman’s rho (ordinal data) are relatively common tests in the social
sciences, the chapter details the computations for these tests and also provides examples of how to interpret these analyses
using a data analysis program. For the other nonparametric tests, the computations are included in Appendix D rather
than in the chapter. The chapter focuses on selecting the appropriate nonparametric test based on the type of data
(nominal vs. ordinal) and design (independent vs. dependent groups).
CHAPTER 14 QUIZ
Test your knowledge of the key terms from the chapter.
1. Nonparametric statistics are used when we have ______________
a. ordinal or nominal data.
b. normally distributed variables.
c. homogeneity of variance.
d. the need for a more powerful test.
14 3
144 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
CHAPTER 14 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
Parametric Nonparametric
Interval or ratio data
Distribution free
Complicated computations
Powerful
146 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Each member in the dataset belongs to only Expected frequency of at least 20 in each
one category category
1. A researcher is studying whether people in different age groups (18–30, 31–45, 46–60, 60+)
were equally likely to vote in the last election. He computes chi-square goodness of fit and
finds χ2(3, N = 400) = 5.35.
a. The null hypothesis for this study is
H0:
b. The alternative hypothesis for this study is
Ha:
c. In this study df = ______________.
d. What is the minimum number of voters that should be included in the sample?
e. How many voters were in the sample?
Chapter 14 ■ Nonparametric Statistics 147
10 20 30
BIRTH QUARTILE
Total 50
TEST STATISTICS
Birth Quartile
Chi-Square 10.960 a
Df 3
a
. 0 cells (0.0%) have expected frequencies less than 5. The minimum expected cell frequency is 12.5.
a. Did the researcher conduct a chi-square test for equal or unequal frequencies?
____________________________ How do you know?
b. Is the result of the analysis significant? ______________ Explain how you know.
c. What is the probability of a Type I error? ______________ Type II error?
d. Did the researcher find support for the Relative Age Effect? ______ Explain your answer.
e. Write a few sentences that you think the researcher should include in her Discussion section.
f. Name at least one limitation that the researcher could mention and a future study that
could address this limitation.
g. What recommendation might the researcher make to coaches of high school soccer teams
(or principals of high schools)?
150 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
1. The statistic used to analyze the relationship between two nominal variables is
, with the statistical notation .
2. When we examine the relationship between two nominal variables, the table that summarizes
the data is called a .
3. The only difference in the assumptions for a chi-square goodness of fit test and a chi-square
test for independence is .
4. Obtaining the expected frequencies for a chi-square test for independence is more
complicated (than for a chi-square goodness of fit test) because ______________
.
5. The chi-square test for independence is analogous to the parametric test.
6. Answer the following questions using the information in the study: Is developmental status
(adolescence vs. emerging adult) related to unhealthy dieting among females? Tenth-grade
and college senior females who self-report that they are dieting describe their diet practice as
moderate (e.g., exercise, balanced diet, reduced fats), unhealthful (e.g., skipping meals, eating
only one food a day), or dangerous (e.g., fasting, vomiting, using laxatives).
a. These data can be summarized in a × contingency table.
b. The minimum sample size for this study is .
c. The appropriate statistic to analyze these data is .
d. The appropriate effect size statistic for these data is because .
e. The df = .
7. Answer questions a–d using the information in the following question: Is race (African
American/Caucasian) related to having been stopped for a traffic violation in the last month
(yes/no)?
a. This is a × contingency table.
b. The appropriate effect size statistics for this analysis is ______________.
c. The df = ______________
d. Suppose the researcher found χ2 = 5.62. Look at Appendix C.7 in the textbook. Is this
relationship significant at p < .05? ______________. Explain.
8. Answer the following questions according to the output:
Employment
Yes Count 4 8 13 25
Total Count 10 12 28 50
CHI-SQUARE TESTS
N of Valid Cases 50
a
. 0 cells (0.0%) have expected count less than 5. The minimum expected count is 5.00.
SYMMETRIC MEASURES
N of Valid Cases 50
Stolenbase
Stole at least
No bases stolen one base Total
Total Count 26 9 35
CHI-SQUARE TESTS
Asymptotic
Significance Exact Sig. Exact Sig.
Value df (two-sided) (two-sided) (one-sided)
N of Valid Cases 35
a
2 cells (50.0%) have expected count less than 5. The minimum expected count is 4.37.
b
Computed only for a 2 × 2 table.
SYMMETRIC MEASURES
Approximate
Value Significance
Nominal by Phi –.344 .042
Nominal Cramer’s V .344 .042
Number of Valid Cases 35
d. Are the results your classmate found consistent with Sulloway and Zweigenhaft’s (2010)
finding? Explain.
1. What nonparametric test for nominal data corresponds to the dependent-groups ANOVA?
2. The appropriate statistic to compute when you have two related-samples with nominal data is
.
3. Which nonparametric test for dependent groups requires dichotomous data?
4. Young adults matched on their game playing habits play one of two versions of a popular
video game. Half of the students play a 15-minute updated version that uses avatars and
half of the students see an updated version without avatars. The students note whether they
“would play again,” “would not play again,” or “not sure” if they would play the game again.
What test should be computed to determine whether either version of the game is more likely
to be played?
154 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
CORRELATIONS
Language
Prematurity Skills
N 15 15
N 15 15
*
Correlation is significant at the 0.05 level (2-tailed).
A 1 3
B 2 2
C 3 4
D 4 1
E 5 5
F 6 7
G 7 6
H 8 8
b. Compute the appropriate analysis by hand, with a data analysis program, or both.
c. Are the results significant at p < .05? Explain.
d. How should Ben interpret his findings?
1. One of the objections to the transformation of interval or ratio data to an ordinal scale is
.
2. Jesse collects data on the average minutes of daily practice on the guitar. He decides to
convert the data to ranks. If his original data are 80, 50, 50, 45, 40, 35, 30, 25, 20, 15, what
would his ranked data be if they represented the same order as the original data?
3. Deedee is testing the effect of a new advising method on the study time of college students.
Twelve students are matched on their GPAs and one of each pair is advised by the new
method and one by the typical method. At the end of a month, all the students are ranked on
their average daily study time. How would Deedee determine whether there is a difference in
the study times of the two groups? (Hint: What test should she compute?)
4. A musician wonders if songs about breakups tend to rank higher than songs about falling in
love on the Billboard top 10 chart for this year.
a. What inferential statistic should the musician conduct? Explain.
b. If the musician wanted to compare rankings of breakup songs versus falling in love for
the Billboard top 100 over the past 50 years, what inferential statistic is now appropriate?
Explain.
5. You should choose between using a Kruskal-Wallis H test and a Friedman χ2 to analyze your
data based on .
Your Research
Select a primary research article on your topic.
15
FOCUSING ON THE INDIVIDUAL
Case Studies and Single N Designs
LEARNING OBJECTIVES
CHAPTER SUMMARY
Previous chapters have focused on sample-based studies. The primary goal of sample-based studies is to extrapolate results
to the population from which the sample was drawn, but it is not clear whether the results from such studies will be
relevant or helpful to a specific individual. Chapter 15 outlines two alternatives to sample-based designs: the case study
and the single N design. The case study is typically qualitative and the chapter provides an overview of some of the most
common qualitative analyses used with case studies. The single N design is quantitative, and graphs are essential to their
interpretation. The chapter includes explanation of effective use of each of these designs, along with their strengths and
limitations. The chapter ends with a discussion of when to choose a sample, case study, or single N design.
CHAPTER 15 QUIZ
Test your knowledge of the key terms from the chapter.
1. Which of the following statements about sample-based studies is FALSE?
a. Their primary goal is to extrapolate the results from the sample to the population.
b. They can be used to infer if a relationship found in a sample represents a relationship that exists in the population.
c. They can be used to understand a specific person, event, or organization.
d. They can include a large number of participants.
157
158 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
CHAPTER 15 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
1. List at least two ways that one might one analyze the content of a case based on words and
phrases.
2. Outline the steps in thematic analysis.
3. Outline the steps in a priori content analysis.
4. Explain the similarities between analytic generalization and analytic induction.
5. Explain the differences between analytic generalization and analytic induction.
Chapter 15 ■ Focusing on the Individual 161
Application A
1. Client A keeps track of how many cigarettes she smokes over a 10-day period and finds the
following results: 18, 20, 19, 20, 21, 18, 19, 20, 21, 20. Is this a stable baseline? Explain.
2. Client B keeps track of how many cigarettes he smokes and finds the following results: 2, 5,
10, 7, 1, 0, 5, 8, 12, 9. Is this a stable baseline? Explain.
3. Client C keeps track of how many cigarettes she smokes and finds the following results: 0, 3,
5, 7, 9, 11, 13, 15, 17, 19. Is this a stable baseline? Explain.
4. Client D keeps track of how many cigarettes he smokes and finds the following results: 22,
20, 20, 16, 17, 15, 16, 16, 14, 12. Is this a stable baseline? Explain.
162 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
5. If you wanted to implement and evaluate an intervention to help these clients reduce the amount
they smoked, which two clients would be most ready to begin the manipulation phase? Explain.
Application B
1. A yoga student consistently meditated 10 minutes each day for 5 days. In order to increase the
meditation time, the student tries to add nature sounds to the routine and is able to meditate
for 14 minutes that day, 13 minutes the next day, and 15 minutes the following 3 days.
a. Graph these results.
b. Based on visual inspection, is there sufficient evidence to say that the intervention was
effective? Explain.
c. The yoga student is considering adding a return-to-baseline phase (ABA design).
i. If the intervention is effective, what would the student expect to see? Graph an
example ABA design (make up data for the second baseline):
ii. If the intervention is not effective, what would the student expect to see? Graph an
example ABA design (make up data for the second baseline):
2. An individual is trying to drink less cola and wonders whether a simple note to himself
saying, “Drink water, not cola!” will help. He conducts a multiple-baseline across settings to
test the effectiveness of the note and finds the following:
Day 1 2 3 4 5 6 7 8 9 10 11 12
6
5
4
3
2
1
0
1 5 10 15 20
Days
4. If an AB design suggests that an individual spent more time meditating during the
intervention stage than the baseline, would it be ethically appropriate to add a return-to-
baseline phase? Explain.
5. If an AB design suggests that an individual consistently drove while intoxicated during
the baseline but stopped during the intervention, would it be ethically appropriate to add a
return-to-baseline phase? Explain.
Exercise 15.5: The Big Picture: Choosing Between a Sample, Case Study,
or Single N Design
Learning Objective: Formulate a big picture of how to choose between a sample, case study, and single N
design.
164 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Application
Identify which would be the best design for the following questions. Explain.
1. To determine whether a drug use treatment program is likely to be effective for a population.
2. To determine whether a drug use treatment program is effective for an individual.
3. To determine the process by which your high school principal decided which programs were
beneficial for students and evaluated the cost/benefit ratio of implementing the program.
4. To understand how the director of the film Moonlight felt about winning the 2017 Academy
Award for Best Picture after La La Land was first mistakenly given the award.
5. To understand how viewers felt when the film Moonlight was awarded the 2017 Academy
Award for Best Picture after La La Land was first mistakenly given the award and whether
reactions varied based on gender, age, or race.
6. To determine whether turning off your Internet access at home increases or decreases the
amount of time you study.
Your Research
Is a Case Study or Single N Design a Good Choice for
Your Topic?
1. Does your topic cover a unique phenomenon? Or do
you have a reason to gain in-depth information about 3. If you chose the case study method, explain whether or
one individual or a handful of individuals? not an embedded case study or using grounded theory
• If your answer is no to either of these questions, would be part of your methodology.
stop here. You should stick with one of the sample- OR, if you chose the single N design, identify the
based studies discussed in previous chapters. specific design (AB, reversal, multiple baseline,
• If your answer is yes to either of these questions, multiple manipulation) you would use and why.
answer questions 2 and 3.
2. Which is a better choice for your topic, a case study or
single N design? Explain.
16
HOW TO DECIDE?
Choosing a Research Design and
Selecting the Correct Analysis
LEARNING OBJECTIVES
CHAPTER SUMMARY
Chapter 16 is a review chapter, which means you can take stock of all that you have learned and focus on some key takeaway
points about research methods and analyses. The first part of the chapter includes information about the decision-making
process involved in choosing a research design. As always, you are reminded to base your study on past research, to consider
internal and external validity, and to determine whether your goal is to describe, predict, or explain. Additionally, you should
consider whether or not you will examine groups, and, if so, you will need to decide the number of groups, whether to
conduct an independent- or dependent-groups design, and whether or not to conduct a factorial design.
The second part of the chapter focuses on selecting the appropriate statistical analyses. The focus is on selecting analyses
based on the type of data collected. The tables and figures in the chapter can help guide your decision-making process
for any current or future research project. Remember that a single research study might contain different designs and
different analyses, and the chapter highlights two examples from the research to illustrate this point.
We hope you take a moment to savor all that you have learned about research methods. We also hope that you will take
these lessons with you as you move on to new challenges and opportunities.
CHAPTER 16 QUIZ
Test your knowledge of the key terms from the chapter.
1. generally have stronger external validity than .
a. Correlational designs; descriptive designs c. Experiments; quasi-experiments
b. Correlational designs; experiments d. Experiments; correlational designs
165
166 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
3. A researcher wants to examine how beliefs in climate change have changed over time. The
researcher should design a(n) study.
a. descriptive
b. correlational
c. quasi-experimental
d. experimental
4. A researcher wants to know if those who live in urban areas are more likely to believe in
climate change. The researcher should design a(n) study.
a. descriptive
b. correlational
c. quasi-experimental
d. experimental
CHAPTER 16 EXERCISES
Achieve the learning objectives of the chapter by reviewing and applying key concepts.
Application
1. Using Figure 16.1 in the textbook, decide whether a descriptive, correlational, or experimental
design would be the best way to help answer each of the following questions:
a. How much fast food do Americans eat?
b. Do married individuals eat more or less fast food than single individuals?
c. Has television viewing increased, decreased, or stayed the same over the last 10 years?
d. Can watching comedies on television decrease depression?
e. Does binge-watching a television series in one night versus watching the series over several
weeks have an effect on anxiety?
f. Is there a relationship between age and binge-watching television?
2. For the correlational designs above, identify the one(s) that
a. Will definitely examine groups:
b. May or may not examine groups:
3. For the question, “Does binge-watching a television series in one night versus watching the
series over several weeks have an effect on anxiety?” what are some reasons for expanding
from the current two-group design to a multiple-groups design?
4. Look at Figure 16.2 in the textbook to answer the following:
a. For the question, “Can watching comedies on television decrease depression?” explain
why you might choose
i. an independent-groups design
ii. a dependent, matched-groups design
iii. a dependent, repeated-measures design
b. For the question, “Do married individuals eat more fast food than single individuals?”
two out of the three designs in Figure 16.2 are possible choices.
i. Which two are they and why?
Chapter 16 ■ How to Decide? 169
ii. Explain how you would decide between these two choices.
Application
1. Use the figures in textbook, as indicated below, to help identify the appropriate statistical
analyses for each of the following questions.
Does the question include just one variable? → see Figure 16.4
Does the question examine a relationship (with no groups) → see Figure 16.5
Does the question compare groups? → see Figure 16.6
a. How much fast food do Americans eat?
b. Do married individuals eat more or less fast food than single individuals?
i. No matching:
ii. Individuals matched on socioeconomic status:
c. Does the amount of fast food eaten by married versus single individuals depend on
whether the individual is male or female?
i. No matching:
ii. Individuals matched on socioeconomic status:
d. Does binge-watching a television series in one night versus watching the series over several
weeks have an effect on anxiety?
i. Independent groups design:
ii. Dependent groups design:
e. Are there differences in mood after watching a comedy, drama, or horror movie?
i. Independent groups design:
ii. Dependent groups design:
2. Identify three to five inferential statistics that you covered in your class. Come up with
research questions that you could test with each statistic. We have provided an example
for you to get you started, but this table is designed to help you review inferential statistics
important in your class. We expect that your professor may have skipped or glossed over some
of the statistics in this textbook.
Your Research
1. After reading past research on your topic, what are the 3. If you are evaluating groups, use Figure 16.2 in the
primary questions that deserve further investigation? textbook to help you decide whether your groups should
be independent or matched or whether you should use
repeated measures. Note your decision here.
Choose one of these that interests you the most, you 4. Does it make sense to do a factorial design? Why or why
believe you could objectively study, and you could not?
carry out given your time and resource limitations. Put
an asterisk (*) next to the question you chose.
2. Use Figure 16.1 of the textbook to help determine what 5. What inferential statistic would you use to help answer
type of research design you might use to help answer the the question? Explain.
research question you chose. Note your decision here.
IBM® SPSS®*
WORKBOOK PART 1
FUNDAMENTALS OF DATA ANALYSIS
You should omit any questionable data. For example, if it is clear that a participant did not answer questions
correctly or if they seemed to be answering haphazardly, you will want to omit that data. You can choose to
omit part of a participant’s data (one question or one scale) by leaving it blank (“.”) in SPSS. Alternatively,
you may decide that all the data from a participant must be discarded. In either case, be sure that you note
any data that you chose not to enter and why. You will include this in the Method section of your research
report.
It is unethical to eliminate any piece of data because it is not what you expected or does not fit your hypothesis.
You should discuss with your professor when deciding to eliminate data that seems to suggest the participant
misunderstood what was being asked or did not take the study seriously.
It is OK if a participant left some items blank. Deciding not to answer a question is the right of the participant
and a few missing variables is to be expected in most research studies.
Although you can store non-numerical responses in the data file as “string” variables, you cannot use them in
SPSS statistical analyses. You will need to code into numbers any open-ended questions or other non-numerical
responses you plan to analyze with SPSS®. For example, instead of entering “male” as a participant’s gender, you
should code male as a number (0, 1, 2, the number doesn’t matter as long as you are consistent in your coding
and record your codes so that you remember them).
It is a good idea to put an ID number on each of the measures or observer sheets and then enter that number
into SPSS. This will provide a link between your raw data and your SPSS data file. An ID is particularly
helpful when you find a data entry error in your SPSS file.
*IBM® SPSS Statistics / SPSS is a registered trademark of International Business Machines Corporation
173
174 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
DATA ENTRY
Every variable in your dataset must have a unique name. If you are entering data for a class research
project, be sure you use the variable names your professor tells you to use. If this is your own data
set, you can name the variables anything you want as long as there are no spaces and the name begins
with a letter (not a number). Versions prior to SPSS 27 do not allow for symbols in the variable name.
For example, variable names for the questionnaire below might be as follows:
4. In the “Label” column, you can provide more information about what each variable measures.
Adding a label serves as a reminder for you, but you can skip this step if you have a small data
set and the variable names provide enough of a reminder.
The label allows you to add more detail about each variable.
176 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
5. The “Values” column serves as a reminder for how your nominal, ordinal, and interval
variables are coded. You do not have to enter values, but it is a good idea so that you can easily
access data codes without having to return to your raw data or code sheets.
Click on a cell, then click on ... and the Value Labels box will appear.
The example above is for the first question on our Research Methods questionnaire
(named “learningexp”). In the image, we had already entered value labels for the values 1 through
3 and are in the process of finishing our last value label.
Value → 1 Value Label → strongly disagree; Add
Value → 2 Value Label → disagree; Add
Value → 3 Value Label → agree; Add
Value → 4 Value Label → strongly agree; Add
When you have finished adding all the value labels, you click OK.
Hints:
You do not need value labels for ratio data.
Your value labels will depend on how your variables are coded.
{{ For nominal variables (such as gender, in our example), you will decide how to code the
categories. For our example, we decided to code male as 1, female as 2, and nonbinary
gender as 3, but we could have reversed that or used different numbers.
{{ Some interval scales might not have value labels for every score, and you can just enter
the lowest- and/or highest-value labels. For example, if we asked participants to rate how
happy they are on a scale from 1 to 10, with 10 being extremely happy, we might just
enter the value label for the value 10 (extremely happy).
If you have multiple variables with the same value labels, you can copy and paste.
IBM® SPSS® Workbook Part 1 ■ Fundamentals of Data Analysis 177
6. The “Measure” column identifies the scale of measurement for each of your variables. Because
interval and ratio data are treated the same for analysis purposes, these are both identified as a
“scale” in SPSS.
Enter Scores
Be sure to SAVE your data often! Data files are .sav files.
1. Select
all the 2. Send them to
variables the “variables” box
(either by clicking the
one at a arrow.
time OR
use shift +
left click).
An output window will appear, as shown below. Inspect your data to determine if any values in the
output do not make sense (e.g., they are out of the possible range of values based on how your data are
coded).
IBM® SPSS® Workbook Part 1 ■ Fundamentals of Data Analysis 179
180 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Make sure that all items are coded in the same direction and recode variables if
necessary.
Check that the scale has good internal consistency (a type of measurement reliability). Here
we will show you how to calculate Cronbach’s Alpha.
Compute a scale score.
(NOTE: If the measure has separate subscales, follow these directions for each subscale.)
Recoding Variables
Make sure that all the items are coded in the same direction. Recode any that are coded in the opposite
direction.
For example, the four items below are part of the “Attitudes Toward Research Methods” measurement
scale.
It is best to use “Recode into Different Variables” so that you can keep track of changes you have made
to your data set.
When you recode into a different variable, you create a brand-new variable in your data set.
An easy strategy for naming your recoded variables is to add an “R” to the end of the original
name (in our example, we will recode “afraid” into “afraidR”).
After recoding, the new recoded variable will show up at the end of your data set. The
original variable will remain in your data set.
IBM® SPSS® Workbook Part 1 ■ Fundamentals of Data Analysis 181
How to Recode:
On the Menu Bar, click Transform → Recode into different variables
The “Recode” box will open:
In our example dataset, the original variable was coded on a 4-point scale. To recode, we need to reverse
the codes so that a higher score indicates more positive attitudes:
1. 1 → 4
2. 2 → 3
3. 3 → 2
4. 4 → 1
Hints:
Your old and new values will depend on how your original variable was coded.
You need to enter new values for ALL your old values. Even if the old and new values are the
same number, you need to enter it (i.e., 3→3).
You can recode multiple variables at the same time as long as the old and new values are the
same for all the variables.
Your recoded variable will show up at the end of your dataset. If you like, you can click and
drag it to a different position in your dataset.
You can check that you did the recoding correctly by looking at the recoded variable in data
view. Select a few participants and compare the original value to the recoded value.
Recoding can be used for other transformations as well. For example, you could convert a
4-point scale to two categories (low vs. high).
Reliability
N %
Cronbach’s alpha is the measure of
Cases Valid 10 100.0 internal consistency and tells us how
reliable the measurement
Excludeda 0 .0 scale is.
Total 10 100.0
The higher the alpha, the better
a. Listwise deletion based on all variables in the procedure. reliability. Ideally, the alpha should be
.70 or above, although an alpha as
low as .60 can be acceptable.
Reliability Statistics
.762 4
Item-Total Statistics
Scale Statistics
Note: For scales developed and validated by other researchers, it is usually best to keep all the
items if possible. For such scales, only omit items in cases where the alpha is below .70 and
omitting items will raise the alpha.
In the example output, we have a pretty decent alpha with all four items (.76). However, if
we omit the “love RM” question, the alpha will increase to .88, and this is a big increase.
Plus, this question doesn’t seem to fit as well with the other items because it assesses a general
feeling of love. Consequently, we chose to omit it from further analyses.
To omit an item, you don’t actually delete it from your dataset. You simply leave it out when
you compute a scale score in the next step.
3. Click “OK.”
MANAGING DATA
IBM® SPSS® can help you create groups, select groups, and select a random sample of cases. We will use
a different data set on a study of self-esteem for the examples in this section.
Creating Groups
You might want to identify a subpopulation or group of women or athletes. This is easily accomplished
because gender and athlete status are both nominal variables (they are already categories). But what if
you have interval or ratio data? If you wanted to examine a subpopulation or group who were under 21,
for example, you could convert that ratio data into categories by creating groups in SPSS.
6. Enter a range of
scores.
In this example:
55–64 = 1
65–74 = 2
75 through
highest = 3
7. Click “Continue”
and then “OK.”
3. Click on “Median”
and then “Continue.”
OUTPUT
Frequencies
Statistics
selfesteem The output reports
N the median.
Valid 25
Missing 0 In this example, the
median = 14.
Median 14.0000
selfesteem
Frequency Percent Valid Percent Cumulative Percent
Valid 6.00 1 4.0 4.0 4.0
8.00 2 8.0 8.0 12.0
9.00 1 4.0 4.0 16.0 The frequency
10.00 4 16.0 16.0 32.0 table tells you how
12.00 4 16.0 16.0 48.0 scores are
distributed.
14.00 1 4.0 4.0 52.0
15.00 4 16.0 16.0 68.0 In this example,
52% of the
17.00 1 4.0 4.0 72.0 scores fall at
18.00 1 4.0 4.0 76.0 or below
19.00 1 4.0 4.0 80.0 the median of 14.
2. Recode the variable just as you would when using a cutoff score.
On the Menu Bar, click Transform → Recode into Different Variables
1. Click on the variable you want to split. 2. Send the variable to the input
output box. 3. Name and label the new “Output” variable. 4. Click “Change.”
5. Click Old and New Values.”
In this example,
Lowest through 14 = 1.
3. Click “OK”
Remember: When you use the Split Data or Select Cases commands, all the subsequent analyses you do
will follow this command until you return to the command screen and click “All cases.”
On a scale of 1 to 5, with 5 indicating strongly agree, how much do you agree that
Honesty and integrity in my academic work are strong values for me.
ID Age Gender #1 #2 #3 #4 #5
1 19 Male 5 2 1 4 4
2 21 Male 3 4 3 1 3
3 23 Female 3 2 1 3 4
4 18 Male 4 2 1 3 5
5 24 Female 4 1 2 4 3
6 50 Female 4 2 1 4 4
7 20 Female 3 3 4 3 1
8 19 Male 4 3 4 4 1
9 28 Male 3 2 2 3 3
10 26 Female 1 4 3 1 3
1. Enter the data using SPSS®. You can make up whatever variable names make sense to you. All
variables should be numeric, so be sure to code gender as a number.
2. Look over the questionnaire. You will want a higher number to indicate stronger values for
academic honesty. Right now, there are some questions that need to be reverse-coded. Using
SPSS®, recode these into different variables so that a score of 1 = 5, 2 = 4, 3 = 3, 4 = 2, 5 = 1).
3. Check the reliability of the scale by calculating Cronbach’s alpha. Remember to use the
RECODED variables.
a. What is the alpha (α)?
b. What decisions should you make based on these results?
_____________________________________________________________________
_____________________________________________________________________
4. Compute the total score on the value of academic honesty questionnaire. Remember to use
the RECODED variables here, not the original ones.
5. Check your answers and make any corrections to the dataset, if necessary.
Practice Exercise 2
1. Go to https://edge.sagepub.com/adams3e
2. Click on Data Sets.
3. Open the file called Reminiscence Study, and view these files:
a. Description of the Reminiscence datasets (reminiscence description.pdf )
b. Reminiscence data set 1 (reminiscence1.sav)
4. Read over the description of the Reminiscence dataset, paying special attention to how
variables are named and coded. It may be helpful to print this file.
5. Note that there are several measurement scales in the survey.
a. There are two subscales from the Savouring Beliefs Inventory (Bryant, 2003):
i. Savouring the Moment (8 items)
ii. Anticipating the Future (8 items)
b. Satisfaction with Life (5 items; Diener et al., 1985)
IBM® SPSS® Workbook Part 1 ■ Fundamentals of Data Analysis 191
c. Reminiscence Strategies (9 items created for this survey). Note that you can also
create subscales for the strategies, such as cognitive (3 items) or behavioral strategies
(6 items).
6. Download and open the dataset.
7. Choose one of the measurement scales (or subscales):
a. Look at the individual items for the measurement scale or subscale you chose. Do you
need to recode any of the items for the scale you chose? If so, recode the variables in the
data set using SPSS.
b. Using SPSS, calculate Cronbach’s alpha for the scale you chose.
α=
c. Compute the scale score.
8. If you would like more practice, repeat with another of the measurement scales.
Practice Exercise 3
1. Go to https://edge.sagepub.com/adams3e
2. Click on Datasets
Open the file called “Game of Thrones.” Here you will find two files:
{{ Description of the Game of Thrones dataset (.pdf)
{{ The dataset (.sav)
3. Read over the description of the Game of Thrones dataset, paying special attention to how
variables are named and coded. Note that this is real data based on the first season of the
television series.
a. Use simple random sampling to select a sample of 44 of the 129 characters (which will
give you a 12-point confidence interval with a 95% confidence level).
Use the Select Cases command and click “Copy Selected Cases into a New Dataset.” Click on
“Sample” and select a random sample of 44 from the first 129 cases.
b. Go back to the original dataset with all 129 cases. Suppose you want to have a random
sample with about equal numbers of characters who appeared in a few seasons and who
appeared in many seasons. To do this, you’ll need to follow several steps.
i. First, you’ll need to convert the “totalseasons” variable, that is now ratio, into a
nominal variable with two groups. Because you want about equal numbers, use a
median split (see How to Create Groups Based on a Median Split for directions).
ii. Use the Frequency command to calculate the median.
iii. Use the Recode into Different Variables command to create a new variable, coded
as 1 (scores at or below the median) and 2 (scores above the median). Name it
whatever you like.
Suppose you want a stratified random sample of 44 that represents the distribution in the
population. Look at the frequency table. Notice that 75 scores in the population (58.1%)
fall at or below the median. In a stratified random sample of 44, you will therefore want
about 26 out of 44 (about 58%) to be from the low category and the remaining 18 of the 44
to be from the high category. Next, we will walk you through the steps to randomly select
this sample.
c. Use the Split File command to organize your dataset by your new variable. Cases coded
as 1 (low number of seasons) should all be listed first in the dataset. Then use the Select
192 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Cases command to select a random sample of 26 from the first 75 cases. Click on “Copy
Cases into a New Dataset.” This new dataset will represent your sample from the low
number of seasons.
d. Go back to the original dataset. Use the sort command (Data → Sort Cases) and sort
cases by your new variable. Click “Descending” so that now those coded as 2 will appear
first in your dataset. Use the Select Cases command to select a random sample of 18
from the first 54 cases. Click on “Copy Cases into a New Dataset.” This new dataset will
represent your sample from the high number of seasons.
Extra Practice
The following datasets are available for download. Choose one or more and practice some of the SPSS
skills you have learned.
Describe Descriptive
Confidence
Statistics
Is the goal to Interval
describe the sample chi-square
OR make inferences Make No goodness
Are you
to the population? inferences nominal of fit
comparing What is the
scores to a scale of
known or Yes measurement
expected for the interval one-sample
value? variable? or ratio t test
DESCRIPTIVE STATISTICS
Nominal Variables
Calculate the frequency to describe nominal data such as gender, marital status, religious affiliation, etc.
1. Click
on the
variable(s)
you want to
analyze in
the variable
list and 2. Be sure that “Display frequency
send to tables” is checked.
“Variable(s).”
3. Click “OK.”
193
194 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
FREQUENCIES
Statistics
Participant’s Gender
N Valid 10
Missing 0
Participant’s Gender
• The percentage of the sample in each group or category (50% of the sample was male
and 50% female)
2. Click “Statistics.”
IBM® SPSS® Workbook Part 2 ■ Analyses to Examine One Variable at a Time 195
3. Click on the
Statistics you want
to run.
• Select more options
than you will report,
including the Mean,
Median, SD, Min,
Max, and skewness.
• Use the skewness
statistic to decide
which measure of
central tendency
and variability
to report.
Click “Continue”
when you are done.
4. Click “Charts.”
5. Click
“Histograms” 6. Click “OK”
and “Show when you
normal are finished
curve…” selecting
your statistics
Then click and charts.
“Continue.”
OUTPUT
Frequencies
Statistics
In the output, you will first see the statistics you
participant's age requested.
N Valid 10 Notice here the skewness statistic (G1) is 2.22.
Missing 0 Recall from the textbook that there are two ways to
Mean 29.6000 determine skewness:
Median 28.5000 1. G1 is greater than +/−2.
Std. Deviation 10.89546 2. The absolute value of G1 is greater than twice the
standard error of the skew.
Skewness 2.224
By both these criteria, age is skewed in this sample.
Std. Error of Skewness .687
Therefore, you should not report the mean and
Minimum 19.00 standard deviation. Instead report the median and the
range (or the min and max).
Maximum 58.00
196 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
PARTICIPANT’S AGE
Valid Cumulative
Valid Frequency Percent Percent Percent The frequency table
outlines the frequency
19.00 1 10.0 10.0 10.0 for every age reported.
21.00 1 10.0 10.0 20.0
Looking at the cumu-
23.00 1 10.0 10.0 30.0 lative percent can be
25.00 1 10.0 10.0 40.0 useful with skewed
data:
28.00 1 10.0 10.0 50.0
29.00 1 10.0 10.0 60.0 “The data were posi-
tively skewed due to
30.00 1 10.0 10.0 70.0
one participant who
31.00 1 10.0 10.0 80.0 was 58. The remaining
32.00 1 10.0 10.0 90.0 ninety percent of the
sample were between
58.00 1 10.0 10.0 100.0
the ages of 19 and 32.”
Total 10 100.0 100.0
Note that the histogram can give you an idea of what the data look like, and it might suggest a positive
or negative skew. It can also alert you to a bimodal or uniform distribution that will not be detected by
the skewness statistic (G1).
If the histogram appears skewed, base the final decision about whether or not the distribution is skewed
on the skewness statistic (G1), not the graph.
REMEMBER, there are two ways to assess skewness:
1. If the skewness statistic (G1) is between -2 and 2, the distribution meets the criteria for a
normal distribution. If G1 is greater than +/-2, the distribution is skewed.
OR
2. If the absolute value of the skewness statistic (G1) is more than twice the standard error of the
skew (SES), the distribution is skewed.
If you have a skewed distribution, the sign of G1 (+ or -) will tell you the direction of the skew (positive
or negative).
2. Click “Options.”
Note that if
you want to
convert
data to z
scores,
click this
box.
IBM® SPSS® Workbook Part 2 ■ Analyses to Examine One Variable at a Time 197
3. Click on the
statistics you
want to run.
Note you do not
have the option to
calculate the
median.
Click “Continue.”
OUTPUT
Descriptive Statistics
N Minimum Maximum Mean Std. Skewness Std. Error
Statistic Statistic Statistic Statistic Deviation Statistic
Statistic
Total RM 10 4.00 11.00 8.1000 2.23358 −.754 .687
attitudes
Valid N 10
(listwise)
Notice that many of the same types of statistics are obtained from the “Descriptives” command
as from the “Frequencies” command.
A key benefit of using the “Descriptives” command is that you can easily convert data to
standardized z scores.
A downside is that it does not provide the median, frequency table, or charts that are
particularly useful if you have skewed data.
2. Run descriptive statistics for the variable of interest (attitudes toward research methods, in
this example).
a. (See earlier directions for this analysis.)
b. The analyses will exclude anyone who did not meet the criteria. In this example, we have
descriptive statistics for the nine participants who were under 58:
OUTPUT
Descriptive Statistics
Std. Skewness
Minimum Maximum Mean Deviation Std.
N Statistic Statistic Statistic Statistic Statistic Statistic Error
Total RM 9 4.00 11.00 8.2222 2.33333 –.951 .717
attitudes
Valid N 9
(listwise)
3. If you want to examine the entire sample again, you must turn off the “Select Cases”
command:
Click Data → Select Cases, select “All cases” and then “OK.”
Split File
Suppose you want to compare the scores of two or more groups in your sample. For example, you might
want to compare the mean attitudes toward research methods scores for males and females.
OUTPUT
Descriptive Statistics
Std. Skewness
Deviation
Statistic
participant’s N Minimum Maximum Mean Std.
gender Statistic Statistic Statistic Statistic Statistic Error
male Total RM 5 4.00 11.00 8.4000 2.70185 –1.339 .913
attitudes
Valid N 5
(listwise)
female Total RM 5 5.00 10.00 7.8000 1.92354 –.590 .913
attitudes
Valid N 5
(listwise)
IBM® SPSS® Workbook Part 2 ■ Analyses to Examine One Variable at a Time 199
3. If you want to examine the entire sample again, you must turn off the “Split File” command:
Click Data → Split File, select “Analyze all cases” and then “OK.”
Writing up Results
General Guidelines
The goal in a Results section is to summarize the statistics that you believe are most
important and useful in understanding your study. You should be selective in the results you
report and how you report them. Do not just provide a laundry list of data that overwhelms
the reader!
Good writing skills are just as important in a Results section as in any other writing
assignment. For example, you should use correct grammar and sentence structure, vary your
sentence length and structure, avoid passive voice when possible, avoid awkward statements,
be concise, and so on.
In a Results section, you will be using statistical notation that is likely new to you. Use the
examples in this workbook as models but modify the wording according to your study and
data. Think about what your data and results mean and then consider the best way to explain
your results to your reader.
When reporting numbers,
{{ Use words for numbers that start a sentence (e.g., Seventy participants took surveys) or are
less than 10 (e.g., There were three groups).
{{ Use digits when the number doesn’t start the sentence, when stating the number of
participants, when reporting numbers 10 and over, or when reporting numbers in a series
in which at least one number is 10 or above. For example, “There were 47 children who
ranged in age from 7 to 15 (M = 10.60, SD = 2.56).”
Round most statistics to two decimal places (see section 6.36 of the APA Publication Manual
for more information).
Only use a leading zero (a zero before the decimal point) when it is possible for the statistic
to be greater than one but the result is less (e.g., report M = .80 if the highest possible mean
is one or less, but report M = 0.80 if the mean could have been higher than one).
If you find that the text is getting too wordy or complicated, consider creating a table or
graph in Word to help report your results. You can explain what the results mean in the text
of your report, and then refer to the table for the specific statistics. (Note that your table or
graph should follow APA format−you should never cut and paste SPSS output into a research
report!)
Method
Participants
Ten students (five male and five female) enrolled in a psychology research methods course
agreed to participate in the study. The students ranged in age from 19 to 58, with a median
of 28.50. The inclusion of the 58-year-old student (who was female) skewed the age of the
sample. All the other students were between the ages of 19 and 32.
Procedure
On the first day of class during the fall 2016 semester, students enrolled in the psychology
research methods course (N = 22) were asked to volunteer to take a brief online questionnaire
about their attitudes toward the course. Students were asked to complete the questionnaire
right after class and to wait to do so until the instructor left the room. The students were
informed that their answers would be anonymous and confidential and that they could with-
draw from the study at any time without penalty. Because the questions were associated with
a course they were taking, students were told that their participation or lack of participation
IBM® SPSS® Workbook Part 2 ■ Analyses to Examine One Variable at a Time 201
would not impact their course grade and that the professor would not know which students
opted to participate and which students opted out. The students had to acknowledge that they
read and understood this before completing the questionnaire.
Materials
The questionnaire consisted of four items assessing the students’ attitudes toward
research methods (“Research Methods is a good learning experience,” “I am afraid of Research
Methods” [recoded], “I look forward to Research Methods,” and “I love Research Methods”).
The instructor created the items based on feedback and comments from previous classes.
Each item was rated on a 4-point Likert scale with 1 = strongly disagree and 4 = strongly agree.
Cronbach’s alpha was calculated on the four items and results indicated good internal consis-
tency (α = .76). However, the reliability analysis indicated that omitting the last question (“I love
Research Methods”) would raise the alpha to .88. Because this was a substantial increase, and
because the item was relatively vague, it was omitted from analyses. A total scale score was
calculated by summing the three remaining items.
Results
The total of three attitude questions made up the Research Methods Attitude Scale and
scores could range from 3 to 12. A score of 7.50 designates the middle and would indicate a
neutral attitude. A score over 7.50 indicates a positive attitude. In this sample, scores ranged
from 4 to 11 (M = 8.10, SD = 2.23).
Because the age in our sample was positively skewed due to one woman who was 58, we
ran analyses excluding this participant. The mean score increased just slightly with this exclu-
sion (M = 8.44, SD = 2.51).
Analyses were conducted to compare male and female students. Because excluding the
58-year-old female did not seem to have a large impact on the data, we ran these analyses
with all 10 participants. Males in our sample reported more positive attitudes toward Research
Methods (M = 8.80, SD = 2.95) than females in our sample (M = 7.80, SD = 1.92).
Discussion
Results of this study suggest that the students who took the survey had slightly more posi-
tive than negative views toward the course at the beginning of the semester. This increased
slightly when the 58-year-old participant was excluded from analyses. Male students scored in
the positive range, whereas female students had less positive and closer to a neutral attitude
toward the course.
It is important to note that statistical significance tests were not run, so these results can
only be used to describe the sample of 10 students. Moreover, although males in the sample
tended to have more positive attitudes than females, we cannot say that this is a statistically
significant difference.
Less than half of the class opted to take the survey, and it is unclear what motivated some to
participate and others not to and, more important, whether those who participated were very
different from those who did not. Future studies should be done to attempt to obtain a larger
and representative sample of Research Methods students. It would also be interesting to know
how attitudes toward the course change throughout the semester and beyond, and whether
attitudes predict student performance in the class.
202 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
1. What evidence is presented that supports the reliability of the Research Methods Attitude
Scale?
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
2. What evidence is presented that supports the construct validity of the Research Methods
Attitudes Scale?
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
3. Rewrite one of the paragraphs in the Results section, in your own words.
______________________________________________________________________
______________________________________________________________________
______________________________________________________________________
Practice Exercise 2
Calculating frequencies is also a good way to check for data entry errors.
1. Go to https://edge.sagepub.com/adams3e
2. Click on “Datasets,” open the file called “Reminiscence Study,” and then open the
“reminiscence with errors.sav” data file. This is real data from a study conducted as part of a
class project. However, this file has errors added to the demographic data for the purpose of
this practice exercise.
a. Using SPSS calculate frequencies for the variables named age, gender, and collegestudent.
3. Look at the frequency tables in the output to identify the two errors.
a. What is one of the errors, and how do you know it is an error?
_______________________________________________________________________
b. What is the other error, and how do you know it is an error?
_______________________________________________________________________
4. Now, you might be wondering what to do about errors such as these. If you have access to
the raw data, you could identify the cases (by ID number) and examine the original data to
identify the correct values. It would be a good idea to also do some spot checks to be sure
there were no other data entry errors.
a. In this example, you do not have access to the raw data. All you can do in such a situation is
delete the erroneous value. First sort the data (click on data → sort cases and sort by the variable
with the error), then scan the variable column to find the error, and then delete the error.
b. Delete the two errors you identified and then calculate the appropriate statistics to
describe age, and re-run the frequency of the variable collegestudent.
IBM® SPSS® Workbook Part 2 ■ Analyses to Examine One Variable at a Time 203
c. Write a brief description of the age, gender, and college student distribution in the sample
as you would in a Participants section of a research report. Be sure you only report the
correct information (without errors) and use APA format.
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________
Practice Exercise 3
1. Go to https://edge.sagepub.com/adams3e
2. Click on “Datasets” and open the file called “Academic Honesty Study.” This is real data from
a study conducted as part of a class project.
a. Review the file called “academic honesty survey codesheet.pdf ” to learn about the
variables and codes.
b. Open the data file called “academic honesty.sav.”
3. Using SPSS, calculate the appropriate descriptive statistics to answer these questions:
a. Do most of the participants believe that those who plagiarize should have the opportunity
to redo the assignment? (Hint: Calculate descriptive statistics using the variable called
“responseC” in Data View and “response/redo” in the Frequencies screen)
_______________________________________________________________________
_______________________________________________________________________
b. Do those who report they have never plagiarized believe that those who do should
have the opportunity to redo the assignment? (Hint: Use the “Select if “command to
select only those who reported that they have never plagiarized, and then run the same
descriptive statistics you did for question 3a.)
_______________________________________________________________________
_______________________________________________________________________
c. Do those who identified as undergraduates differ from those who were not currently
undergraduates in their belief that those who plagiarize should have the opportunity
to redo the assignment? (Hint: Turn off the “Select if” command, then split the file to
compare those who are currently in college to those who are not, and then run the same
descriptive statistics as you did above.)
_______________________________________________________________________
_______________________________________________________________________
d. Write up the results of your analyses as you would in a Results section of a research report.
Remember not to refer to SPSS variable names (such as “responseC”) and use APA format.
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________
_______________________________________________________________________
204 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
e. For additional practice, run additional descriptive statistics on the practice data set.
Consider what questions you can ask based on the survey questions, and determine what
analyses to run in SPSS to help answer those questions.
f. You might also practice with one of the data sets available for download, listed at the end
of the SPSS Workbook Part 1.
CONFIDENCE INTERVALS
Data Entry
You can use SPSS® to calculate confidence intervals for means by running an analysis called a one-sample
t test. In order to calculate the confidence interval for a mean the variable must be measured on an
interval or ratio scale. In the example below, we have 20 students and their final exam scores (out of
100), which is a ratio variable.
You can then calculate descriptive statistics and verify that the distribution meets the criteria for a
normal curve, using either the Frequency or Descriptives command. The following analysis will repeat
some of that information.
2. To calculate a
confidence interval, leave
1. Send the variable from
the test value as 0.
the sample to “Test
Variable(s).” (Note: the default is a 95%
confidence interval, to
change this, click “Options.”)
3. Click “OK.”
OUTPUT
One-Sample Statistics
Std. Std. Error The first table provides
N Mean Deviation Mean descriptive statistics for the
sample.
finalexam 20 79.8500 10.17880 2.27605
One-Sample Test
Test Value = 0
95% Confidence Interval
of the Difference
t df Sig. (2-tailed) Mean Difference Lower Upper
finalexam 35.083 19 .000 79.85000 75.0862 84.6138
When you are interested in the confidence interval for the mean, focus on the last
two columns of the second table and ignore the rest.
Writing up Results
When reporting a confidence interval for a mean, include the following:
The mean and standard deviation (M, SD)
What confidence level you are using (e.g., 95%, 99%)
The confidence interval, typically but not always, is formatted as CI [lower, upper]
You may also opt to include other descriptive information, such as the sample size or minimum and
maximum scores or other information you gained from conducting descriptive statistics.
Examples:
Twenty students completed a final exam. The mean score was 79.85 (SD = 10.18), 95% CI
[75.09, 84.61].
OR
Student scores on the final exam ranged from 64 to 98 out of 100 possible (M = 79.85, SD =
10.18), 95% CI [75.09, 84.61].
OR
Final exam scores for 20 students indicated an average score of C+/B- (M = 79.85, SD = 10.18).
At a 95% confidence level, we estimate that the population mean falls between 75.09 and 84.61.
206 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Waiter 1 Waiter 2
.15 .18
.16 .12
.15 .25
.12 .30
.18 .15
.22 .10
.15 .20
.25 .18
.20 .22
.14 .15
One-Sample t Test
A one-sample t test compares a mean in the sample to a known or expected mean from the population
or another source. The data must be interval or ratio. For example, suppose a professor wants to compare
their current students’ exam scores to the average from past classes, which they know is 75%.
4. Click “OK.”
OUTPUT
T Test
One-Sample Statistics
N Mean Std. Deviation Std. Error Mean The first table provides
descriptive statistics for the
Finalexam 20 79.8500 10.17880 2.27605
sample.
One-Sample Test
Test Value = 75
t df Sig. (2-tailed) Mean Difference 95% Confidence Interval of the Difference
Lower Upper
finalexam 2.131 19 .046 4.85000 .0862 9.6138
The second table provides your t statistic, df, and p value. The third provides estimates of the Cohen’s d effect size as well as
confidence interval for d.
In this example, our t is statistically significant at p < .05 and at minimum we would report:
t(19) = 2.13, p = .046, d = 0.48.
208 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Writing up Results
Include:
{{ the degrees of freedom in parentheses
{{ t value rounded to two decimals, with a leading zero if t is less than 1.
{{ the p value rounded to two or three decimals
Do not round to two decimal places when doing so changes the interpretation of
statistically significance (e.g., do not round .046) or changes the p to zero (e.g., do not
round .002).
If the p value at three decimal places is zero, report p < .001.
{{ Cohen’s d rounded to two decimals, with a leading zero if d is less than 1.
You might also include confidence intervals. Check with your instructor.
1. Enter the data and compute the appropriate test to respond to your belief that your classmates
will use their cell phones significantly less than the national average of 60% reported by the
National Transportation Board .
2. Can you support your hypothesis? Why or why not?
____________________________________________________________________
____________________________________________________________________
3. What is the probability of a Type I error? Type II error?
4. What is the effect size?
IBM® SPSS® Workbook Part 2 ■ Analyses to Examine One Variable at a Time 209
3. Click “OK.”
OUTPUT
Chi-Square Test
Frequencies
Cointoss
Observed N Expected N Residual
Heads 18 15.0 3.0 The first table provides the
observed and expected values
Tails 12 15.0 –3.0 and total N.
Total 30
TEST STATISTICS
Cointoss
The second table provides the χ2, df, and p value.
Chi-Square 1.200 a
In this example, the χ2 is not statistically significant and we
Df 1
would report χ2(1, N = 30) = 1.20, p = .27.
Asymp. Sig. .273
Writing up Results
When reporting the results of a chi-square goodness of fit, include the following:
The frequency or percentage from your sample and the comparison value.
Whether or not results were statistically significant.
The results of the chi-square goodness of fit test formatted as in our example: χ2 (1, N = 30) =
1.20, p = .27.
a. Include:
{{ the degrees of freedom and sample size in parentheses
{{ the χ2 value rounded to two decimals, with a leading zero if the value is less than 1
{{ the p value rounded to two or three decimals as needed for interpretation (note in our
example we rounded to two decimal places).
IBM® SPSS® Workbook Part 2 ■ Analyses to Examine One Variable at a Time 211
1. There had been much publicity about the major role of female characters in the television series
Game of Thrones. Using SPSS, calculate chi-square goodness of fit analyses to determine if:
a. the ratio of named male to female characters was greater than what would be expected by
chance.
b. the ratio of named male to female characters was statistically significantly greater than
two to one (e.g., twice as many males than females).
2. There had been much talk about how many characters have been killed on the show. Did
more characters die than stay alive by the end of Season 6 than would be expected by chance
alone? Using SPSS, calculate a chi-square goodness of fit.
3. Write a few sentences summarizing the results as you would in a Results section.
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
____________________________________________________________________
IBM® SPSS®
WORKBOOK PART 3
ANALYSES TO EXAMINE
RELATIONSHIPS (NO GROUPS)
Compare Groups
nominal
Data Entry
chi-square
nominal
test of
Hereindependence
are data that have
3+ groups Cochran Q Test
Mixed
been entered for 10 participants. interval
ANOVA
Note that each participant has a or ratio
pair of data entered—one value
Mann-Whitney
for2 each
groups variable
U Test in the relationship 2 groups Wilcoxon T Test
ordinal (in this example, the variables are
ordinal
frustration tolerance and self
control, bothKruskal-Wallis
measured on an Friedman 2 test
3+ groups H Test 3+ groups
interval scale).
independent- dependent-
2 groups samples t test samples t test
2 groups
(and/or point-biserial r)
interval interval
or ratio 3+ groups or ratio 3+ groups
one -way
one-way within-
between- subjects
subjects ANOVA
2+ ANOVA 2+
factors factors
212
IBM® SPSS® Workbook Part 3 ■ Analyses to Examine Relationships (No Groups) 213
Send your Y variable to the “Y axis” (the vertical axis) box and your X variable to the “X axis” (the
horizontal axis) box.
X is the predictor variable (in this example, self-control) and Y is the predicted variable (in this example,
frustration tolerance). When you do not have a clear predictor and outcome, it doesn’t matter how you
identify X and Y.
Click “OK.”
9.00
8.00
A scatterplot will appear in the
output.
Frustration Tolerance
7.00
In the example to the left, we see
that there is a positive and mostly
6.00 linear relationship between
self-control (X ) and frustration
tolerance (Y ). Notice that there are
some data points that don’t fit with
5.00
the overall pattern, but they are
not extreme enough to be
considered outliers.
4.00
2. Click “OK.”
OUTPUT
Correlations
Frustration r = .704
Tolerance Self-control
p = .023 (and is statistically
Frustration Pearson Correlation 1 .704*
significant at the p < .05
Tolerance Sig. (2-tailed) .023 criteria)
N 10 10 Notice that this information
Self-control Pearson Correlation .704* 1 is repeated at the top and
bottom of the correlation
Sig. (2-tailed) .023
table.
N 10 10
You can also have SPSS calculate a confidence interval for the correlation coefficient by clicking on
that option after you send your X and Y variables to the “variable box.” A new box will open where you
can indicate the confidence interval you prefer (e.g., 95%). If your version of SPSS does not include
this option, check with your professor if you would like to learn about how to calculate the confidence
interval for r by hand.
To find the proportion of variance accounted for, simply square r. In this example, r2 = .50, indicating
that 50% of the variance in one variable is accounted for by the other.
Correlations
Frustration Annual
Tolerance Self-Control Income
Frustration Tolerance Pearson Correlation 1 .704* .673*
Sig. (2-tailed) .023 .033
N 10 10 10
Self-Control Pearson Correlation .704* 1 .842**
Sig. (2-tailed) .023 .002
N 10 10 10
Annual Income Pearson Correlation .673* .842** 1
Sig. (2-tailed) .033 .002
N 10 10 10
Sig. (2-tailed) .208 .013 .116
N 10 10 10
The output is a matrix that correlates each variable with all the other variables. Notice also that
each of the correlations is listed twice, so you need to pay attention only to the top or bottom half
of the matrix.
For this example, self-control is significantly positively correlated with frustration tolerance (r =
.70, p = .023) and annual income (r = .84, p = .002) and frustration tolerance and annual income are
also significantly positively correlated (r = .67, p = .033).
Writing Up Results
When you write up results from a Pearson’s r, you will include the following information:
Use this same format for a Spearman’s rho or point-biserial correlation, but use the appropriate
statistical notation (rs and rpb, respectively).
Examples:
Option 1: Report all the results in text. Note that in this example we reported all of our p values
at three decimal places for consistency. We need to report the result of p = .002 because rounding
would have resulted in p as zero. We could have rounded p = .023 to p = .02 and p = .033 to
p = .03.
Results
We calculated a series of Pearson’s r correlations to examine the relationship between self-control,
frustration tolerance, and income. Self-control had strong, positive, and statistically significant
correlations with frustration tolerance (r = .70, p = .023) and annual income (r = .84, p = .002).
The correlation between frustration tolerance and annual income was strong, positive, and statistically
significant (r = .67, p = .033).
Option 2: To avoid the redundancy of listing all the correlation coefficients, you might consider using
a table to summarize the results. You would highlight the major findings in the Results section and refer
to the table.
216 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Results
Pearson’s r correlations indicated that the relationship between self-control, frustration tolerance,
and income were strong and positive. All correlations were also statistically significant at the p < .05
criteria (see Table 1).
(In an APA-style research report, Tables appear after the Reference section.)
Table 1
Correlations Among Study Variables
LINEAR REGRESSION
Linear regression is used when you want to examine a linear relationship between two or more variables.
Simple linear regression is used when you have two interval/ratio variables, and you wish to predict the
value of one based on the other. Multiple regression is used when you have more than two variables.
To calculate a linear regression, click
2. Click “OK.”
You can also request a graph of the regression line (line of best fit) for a simple linear regression:
3. Click “OK.”
OUTPUT (abbreviated)
Regression
Model Summary
When you only have 2 variables,
Adjusted R Std. Error of
Model R R Square Square the Estimate R = r and R Square = r 2.
ANOVA a
Sum of Mean The ANOVA table tells you if the r is
Model Squares df Square F Sig. statistically significant.
1 Regression 11.960 1 11.960 7.881 .023 b
Adjusted R Std. Error of
Model R R Square Square the Estimate R = r and R Square = r 2.
ANOVA a
Sum of Mean The ANOVA table tells you if the r is
Model Squares df Square F Sig. statistically significant.
1 Regression 11.960 1 11.960 7.881 .023 b
Residual 12.140 8 1.517
Total 24.100 9
Coefficients a
Unstandardized Coefficients Standardized Coefficients
Model B Std. Error Beta t Sig.
1 (Constant) 1.657 1.838 .901 .394
Self-control .144 .051 .704 2.807 .023
If your r is statistically significant, you can calculate the regression equation (Y' = bX + a). You find
the a and b for your regression equation in the B column.
The value labeled “Constant” (1.657) is a (the y intercept) and the value labeled as the x variable
(.144) is b (the slope). These values are used to create the regression equation. In this case,
Frustration Tolerance
9.00
5.00
Observed
Linear
4.00
25.00 30.00 35.00 40.00 45.00 50.00
Self-Control
Now that you have the regression equation, Y ′ = .14X + 1.66, you can use it to predict frustration
tolerance (Y ) based on specific self-control (X ) values.
If self-control = 40, then we predict frustration tolerance will be
The Y ′ of 7.26 will lie on the line of best fit, while the actual value for frustration tolerance = 6.00
when self-control = 40. See the scatter plot above. This difference contributes to the standard error of
the estimate (average difference between actual and predicted Y values), which we can see in the above
output is 1.23.
IBM® SPSS® Workbook Part 3 ■ Analyses to Examine Relationships (No Groups) 219
Participant Time online Find partner? Participant Time online Find partner?
1 60 4 11 10 1
2 5 1 12 300 4
3 14 2 13 45 2
4 30 3 14 15 1
5 30 4 15 60 4
6 60 2 16 240 5
7 120 2 17 60 3
8 60 3 18 15 1
9 120 5 19 45 3
10 120 4 20 150 4
1. Using SPSS, enter the data and calculate the statistics below.
a. Compute the appropriate descriptive statistics for the two variables. Describe them.
b. What is the appropriate correlational statistic to compute for these data?
c. Compute a scatterplot for the data. What type of relationship does it suggest?
d. State a null and directional alternative hypothesis for the relationship.
e. Compute the correlation coefficient.
f. Is the relationship between time on the Internet and belief in finding a partner on the
Internet significant? How do you know?
g. What proportion of variability is accounted for by the relationship?
h. Is it appropriate to predict belief in finding a partner on the Internet from the time spent
online? Explain your answer.
220 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
i. Compute the regression for the relationship in question f and write the regression
equation.
j. Comment on the standard error of estimate (small/moderate/large). How do you know
how to interpret this value?
k. If a person spends 90 minutes on the Internet, what is the predicted score on the
likelihood to believe they will find a romantic partner on the Internet?
l. Describe your findings as you would in a Results section, using APA format.
m. Write a Discussion section for your results. Include all the information that belongs in
this section.
2. Two judges for a poetry contest were asked to rank the 10 finalists’ poems from 1 = best to
10 = least good. Their rankings are below:
Judge 1 1 2 3 4 5 6 7 8 9 10
Judge 2 4 7 3 1 9 2 10 8 5 6
Decision
DecisionTree
TreeWhen
When Comparing Independent
Comparing Independent Groups
Groups or Conditions
or Conditions
chi-square test
nominal
for independence
What is the Mann-Whitney U Test
scale of 2 groups
measurement ordinal
Kruskal-Wallis H Test
for the 3+ groups independent-samples t test
outcome/DV? (and/or point-biserial r)
2 groups
INDEPENDENT-SAMPLES t TEST
You need at least two variables to run an independent-samples t test. One variable is the IV (if you have an experiment)
or grouping variable (predictor if you have a correlational design); the other is the test variable (DV for experiment or
outcome for correlation).
221
222 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Data Entry
The variable view would look like this:
The “Define Groups” box will open in order for you to identify how you coded your groups or conditions.
In this example:
Group 1(no gesturing) = 0
Group 2 (gesturing) = 1
OUTPUT
T Test
Group Statistics
experimental Std. Error
condition N Mean Std. Deviation Mean
Step 2: Look at the t test results in the appropriate row. The p value of the t test tells you
whether or not there is a statistically significant difference between the means of the two
groups. In other words, the t test is what tests your hypothesis.
224 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Step 1. First look at the significance level If Levene’s is not statistically significant
of the Levene’s test.* (p ≥ .05), then use the data from the 1st row (equal variances
assumed).
Levene’s
Test for
Equality of
Variances t test for Equality of Means
95% Confidence Interval
of the Difference
Sig. Mean Std. Error
F Sig. t df (2-tailed) Difference Difference Lower Upper
Creativity Equal .180 .675 1st −2.570 22 .017 −1.83333 .71333 −3.31269 −.35398
variances
assumed
Equal 2nd −2.570 20.999 .018 −1.83333 .71333 −3.31269 −.35398
variances
not
assumed
In this example, Levene’s is not statistically significant because p = .68 is greater than .05.
Consequently, we can assume equal variances and will use the t test data from the 1st row.
Levene’s
Test for
Equality of
Variances t-test for Equality of Means
95% Confidence
Interval of the
Difference
Sig. Mean Std. Error
F Sig. t df (2-tailed) Difference Difference Lower Upper
Creativity Equal .180 .675 −2.570 22 .017 −1.83333 .71333 −3.31269 −.35398
variances
assumed
Equal −2.570 20.999 .018 −1.83333 .71333 −3.31269 −.35398
variances
not
assumed
In this example, we will look at the first row’s data in order to determine if there is a
statistically significant difference in creativity between the gesturing and no gesturing
conditions.
We see that p = .02 (rounded from .017), and this meets the criteria for statistical
significance at the p < .05 level.
We would report: t(22) = -2.57, p = .02.
We would report Cohen’s d = 1.05 (rounded and omitting the sign). This indicates that there is a little
over a full standard deviation difference between the two conditions.
Alternatively, you can calculate and report the proportion of variance accounted for, which is rpb2 for
a two independent-groups design. To do this, you would calculate a point-biserial correlation following
the directions in Part 3. Note that the steps and output indicate a Pearson’s r, but you would interpret
it as a rpb.
Correlations
gesturing creativity
Gesturing Pearson Correlation 1 .481*
Square the correlation coefficient to calculate
Sig. (2-tailed) .017
the proportion of variance accounted for.
N 24 24
In this example, rpb = .48 and rpb2 = .23,
Creativity Pearson Correlation .481* 1 indicating that 23% of the variance in creativity
is accounted for by the gesturing condition.
Sig. (2-tailed) .017
N 24 24
In your write-up, describe the effect size (either Cohen’s d or rpb2 ) as weak, moderate, or strong. The
chart below provides guidelines rather than absolute cutoffs for this description. For example, you may
use “moderate to strong” to describe an effect size.
Effect Size
Proportion of Variance
Accounted for Cohen’s d Interpretation
≈ 1% ≈.20 Small/Weak
≈ 9% ≈.50 Medium/Moderate
A Nonexperimental Example
Do children who speak more than one language (multilingual) come up with more or less ways to use
an object than those who speak one language (unilingual)?
OUTPUT
Group Statistics
language N Mean Std. Deviation Std. Error Mean
number of uses for multilingual 28 6.5714 4.46740 .84426
object
unilingual 29 6.4483 1.76445 .32765
IBM® SPSS® Workbook Part 4 ■ Analyses to Compare Independent Groups 227
Levene's Test
for Equality of
Variances t-test for Equality of Means
Step 1. In this example, Levene’s Step 2. Looking at the 2nd row, we find that the difference
is statistically significant (p = .03). between means is not statistically significant (p = .89).
Consequently, we cannot assume equal Because we violated the assumption of equal variances,
variances and will use the t test results we had to use a less powerful test with fewer degrees of
from the 2nd row. freedom (in this case, df = 34.98).
We would report: t(34.98) = 0.14, p = .89.
Note that in this example, we rounded all the values to two decimal places. Additionally, we can use
SPSS to calculate Cohen’s d to find d = 0.04.
Writing up Results
When reporting the results of an independent-samples t test, include the following:
7.00 4.00
6.00 4.00
7.00 7.00
IBM® SPSS® Workbook Part 4 ■ Analyses to Compare Independent Groups 229
6.00 5.00
5.00 7.00
5.00 7.00
7.00 5.00
4.00 4.00
6.00 6.00
7.00 4.00
8.00 5.00
7.00 7.00
5.00 3.00
7.00 5.00
5.00 2.00
Practice Exercise 2
A math professor decides to investigate whether stereotype threat is operating in her class for students
of color. She randomly assigns students in her classes to one of two instructions for a practice test.
In one condition, students are told that they are taking a practice test and to do their best. In the
other condition, students are told that they are taking a standardized test that was designed by several
professors at MIT, but to do their best. The test scores for her students of color are shown below.
90.00 95.00
85.00 65.00
70.00 35.00
75.00 65.00
80.00 70.00
75.00 60.00
75.00 80.00
80.00 50.00
85.00 85.00
85.00 75.00
You can calculate the analysis with just two groups, but the independent-samples t test is the preferred
*
Data Entry
To run a one-way ANOVA, you need at least two variables. One variable is the grouping predictor or
IV, the other is the outcome or DV.
Data entry for the one-way ANOVA is very similar to the independent-samples t test, except that your
IV will have three or more levels.
In the multiple-group experiment example, we have a variable called “plants” that is coded as 0 =
zero plants, 1 = one plant, 2 = three plants and entered in one column. We also have a variable called
“productive” that is scored on an interval scale and entered in a second column.
5. Click
“Descriptive,”
“Homogeneity of
variance test,”
and “Means plot.”
6.Click
“Continue”
and then
“Post Hoc.”
7.Click “LSD”
or one of
the other
tests.
OUTPUT
Descriptives
Productive
95% Confidence
Interval for Mean
N Mean Std. Deviation Std. Error Lower Bound Upper Bound Minimum Maximum
The next output table depicts the results of the Levene’s test. In this example, Levene’s is not statistically
significant (p = .29), and therefore, we have met the assumption of equal variances. If you find a statistically
significant Levene’s, make sure the data are normally distributed and check for outliers.
The ANOVA table will tell you if there is a statistically significant difference between your groups. In this
example, F is statistically significant at p < .001. We would report: F(2, 39) = 8.70, p < .001.
ANOVA
productive
Total 59.905 41
In the effect size table, we see that η2 = .31, Which means that 31% of the variance in perceived productivity is
accounted for by the plants in the office. We might also report the confidence interval of the effect size:
CI [0.07, 0.48]
Lower Upper
Only report results of post hoc tests if the ANOVA result (F) is statistically significant. Because our F was
statistically significant, we would report the comparisons. In this example, the difference between zero and one
plant (p < .001) and one and three plants (p = .006) was statistically significant, but we do not have a statistically
significant difference between zero and three plants (p = .28).
Multiple Comparisons
(I) plants (J) plants Mean Difference (I-J) Std. Error Sig. 95% Confidence Interval
Means Plots
8.00
differences between
your groups.
7.00
6.50
6.00
Writing up Results
Include the following:
Examples:
Note: An η2 around .01 is weak, around .06 is moderate, and around .16 is strong (Cohen, 1988).
You might also report confidence intervals. Check with your instructor.
Example
Results
Employees estimated the productivity of the occupant of an office with zero plants (M = 6.29,
SD = 0.83), one plant (M = 7.86, SD = 1.03), or three plants (M = 6.71, SD = 1.20). A one-way
between-subjects ANOVA revealed a statistically significant difference between conditions F(2,
39) = 8.70, p < .001, η2 = .31. Post hoc least significant difference (LSD) tests further indicated
statistically significant differences between zero and one plant p < .001) and between one and
three plants (p = .006). However, the difference between zero and three plants was not statistically
significant (p = .28).
236 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
70 25 95
65 30 55
90 55 70
85 60 80
65 35 60
55 40 75
35 65 100
70 15 90
40 75 40
60 30 55
30 35 85
25 45 80
7. Interpret the results as you would in a Discussion section. Include one limitation and one way
to address the limitation.
__________________________________________________________________________
__________________________________________________________________________
__________________________________________________________________________
__________________________________________________________________________
Practice Exercise 2
Studies suggest that time outside increases one’s sense of well-being, but does more time increase one’s
well-being or is there only an initial gain and then a leveling of well-being? A researcher has students
spend no time (control), 10 minutes, 30 minutes, or 90 minutes outside and then has them complete a
well-being scale (higher scores reflect a greater sense of well-being). The following data result:
IV conditions:
Nature Magazines Family Magazines
5 2
4 3
Never been pregnant before 5 3
3 4
History: 4 2
3 8
5 6
Has been pregnant 4 7
5 6
3 8
In the example dataset that follows, we have entered these variables in SPSS:
2. Click “Options.”
Select Descriptive Statistics, Estimates of Effect Size,
and Homogeneity tests. Then click “Continue.”
OUTPUT (abbreviated)
Levene's Test of Equality of Error Variancesa,b Read the first line of the output
Levene Statistic df1 df2 Sig. box. In this example, Levene’s is
not statistically significant. If it
fear Based on Mean .220 3 16 .881 was, check that the outcome/DV is
Based on Median .267 3 16 .848 normally distributed and check for
outliers. A significant Levene’s may
Based on Median and with .267 3 15.835 .848
occur when there is a statistically
adjusted df
significant interaction effect.
Based on trimmed mean .211 3 16 .887
Tests the null hypothesis that the error variance of the dependent variable is equal
across groups.
a. Dependent variable: fear
b. Design: Intercept + magazine + history + magazine * history
Main Effect for Factor 1 Main Effect for Factor 2 Interaction Effect: F(1, 16) =
(magazine condition): (history): F(1, 16) = 23.53, 28.47, p < .001, η2partial = .64.
F(1, 16) = 3.77, p = .07, p < .001, η2partial = .60.
η2partial = .19.
Writing up Results
Include the following:
{{ If statistically significant, you should report the results as well as an explanation of the
interaction, with the cell means. A table is a good way to report the cell means and
standard deviations. (Tables are placed after the References in an APA-style report.)
You might also report confidence intervals. Check with your instructor.
IBM® SPSS® Workbook Part 4 ■ Analyses to Compare Independent Groups 241
Example
Results
A 2 (history of pregnancy) × 2 (magazine type) between-subjects ANOVA was conducted to
examine factors that impact a pregnant women’s fear of delivery. There was no significant difference in
fear levels for women who were given family-oriented magazines vs. nature magazines (M = 4.90, SD
= 2.38 vs. M = 4.10, SD = 0.88), F(1, 16) = 3.77, p = .07, ηpartial
2
= .19. Women who had never been
pregnant (M = 3.50, SD = 1.08) had lower levels of fear than women who had been pregnant (M = 5.50,
SD = 1.84). The relationship between history and fear of delivery was strong and statistically significant,
F(1, 16) = 23.53, p < .001, ηpartial
2
= .60.
The main effects were qualified by a statistically significant interaction effect, F(1, 16) = 28.47,
p < .001, which accounted for 64% of the variance in fear. Women who had never been pregnant
reported much lower fears about their upcoming delivery when exposed to family magazines than when
exposed to nature magazines. The opposite effect was found for women who had been pregnant in the
past. These women reported much higher fears when exposed to family magazines rather than nature
magazines (see Table 1).
Table 1
Fear as Function of Pregnancy History and Magazine Group
M SD M SD
History of Pregnancy
You might also want to include a graph to demonstrate a significant interaction. Don’t cut and paste
from the output; rather, create a graph using MSWord or data processing software such as Excel.
Figure 1
6
Fear of delivery
2
never pregnant
1 has been pregnant
0
Nature Family
Magazine condition
242 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Descriptive Statistics
Posture condition forget condition Mean Std. Deviation N
1.00 good posture 1.00 no forgetting 51
2.00 forgot keys 56
Total 107
2.00 poor posture 1.00 no forgetting 55
2.00 forgot keys 56
Total 111
Total 1.00 no forgetting 106
2.00 forgot keys 112
Total 3.38 .888 218
Practice Exercise 2
A teacher has heard that adding a humorous cartoon to an exam might ease tension and even raise
scores. She wonders whether adding an inspirational quote might also have a positive effect and possibly
strengthen the effect of a cartoon. She creates four different versions of an extra-credit quiz and randomly
assigns students to condition. The quiz results for each condition are as follows:
Inspirational Quote
no quote quote
no cartoon 5 6
3 5
4 4
Humorous Cartoon
8 4
6 7
Cartoon 6 4
7 3
9 6
6 6
9 2
1. Using SPSS, enter the data and calculate a between-subjects two-way ANOVA.
2. Write a Results section and put the cell means and standard deviations in a table.
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
________________________________________________________________________
Table 1
The Effect of Humor and Inspirational Quotes on Quiz Scores
Practice Exercise 3
Go to https://edge.sagepub.com/adams3e. Click on “Datasets” and open the file called “Reminiscence
Study”:
Read over the description of the datasets (reminiscence description.pdf )
Download the compiled dataset (reminiscence compiled dataset.sav)
NONPARAMETRIC STATISTICS
Conducting a Chi-Square Test for Independence
Conducting a Chi-Square Test for Independence
Data Entry
4. Click
3. Click
“Cells.”
“Continue.”
4. ClickObserved
Select “Cells” &
Expected.
Select Observed &
Expected
5. Click Continue and then
5. Click “Continue” and “OK.”
“OK.”
OUTPUT (abbreviated)
Chi-Square Tests
Asymptotic We use the Pearson Chi-Square and
Significance would report:
Value df (2-sided) χ2(2, N = 20) = 9.30, p = .01.
Pearson Chi-Square 9.304 a
2 .010
A statistically significant χ2 means
Likelihood Ratio 10.483 2 .005 that our observed values were
Linear-by-Linear Association 7.879 1 .005 significantly different than the
expected values.
N of Valid Cases 20
a
5 cells (83.3%) have expected count less than 5. The minimum expected count
is 1.40.
Symmetric Measures
Approximate Square the phi or V to find the proportion
Value Significance of variance accounted for. In this example,
Nominal by Nominal Phi .682 .010 we square .68 and find V 2 = .46, or 46% of
the variance accounted for.
Cramer’s V .682 .010
N of Valid Cases 20
Writing Up Results
Example
Results
Results of a chi-square test for independence revealed that reports of health had a statistically
significant relationship with, and accounted for a large amount of the variance in, age perceptions.
Participants view those who are healthy as their correct age (n = 8) or as younger (n = 4) more than
those who are unhealthy (n = 0 and n = 0, respectively). Participants guess older ages for those who are
unhealthy (n = 5) more than for those who are healthy (n = 1), χ2(2, N = 20) = 9.30, p = .01, V 2 = .46.
IBM® SPSS® Workbook Part 4 ■ Analyses to Compare Independent Groups 247
Status
Male 10 20 30
Female 30 15 10
Nonbinary 3 2 0
McNemar Test
2 groups
nominal Cochran Q Test
3+ groups
DEPENDENT-SAMPLES t TEST
You compute a dependent-samples (or matched-pairs) t test when you are examining differences between two related
groups. The IV (if you have an experiment) or grouping variable (predictor, if you have a correlational design) is the
related variable, and the DV for an experiment (or outcome, for correlation) is the measured variable.
248
IBM® SPSS® Workbook Part 5 ■ Analyses for Dependent Groups and Mixed Designs 249
SPSS calls this test a paired-samples t test, but this name should only be used in a report if you
had matched (paired) samples.
If you have a repeated-measures design, you could use the term repeated-measures t test.
Dependent-samples t test or within-subjects t test are more general terms that can apply to either
matched or repeated-measures designs.
In a repeated-measures
design, each row represents
one participant.
Use the same steps for matched groups and repeated measures.
OUTPUT
Paired Samples Statistics
Std. Error
Mean N Std. Deviation Mean The first table provides
descriptive information for
Pair 1 peaceful 6.9167 12 1.24011 .35799
each of your groups.
stressful 6.4167 12 1.31137 .37856
N Correlation Sig.
Pair 1 peaceful & stressful 12 .862 .000
IBM® SPSS® Workbook Part 5 ■ Analyses for Dependent Groups and Mixed Designs 251
The third table shows the results for the paired-samples t test. From the example we would report
t(11) = 2.57, p = .026.
The final table provides the effect size. For our example, we would report d = 0.74
Writing up Results
The write-up is very similar to an independent-samples t test (see Part 4). However, you should be
sure your language is consistent with your dependent design, either matched-groups or repeated-
measures.
Also, you do not need to worry about Levene’s test. Include the following in your report:
9 8
8 9
8 7
7 6
8 8
7 6
6 5
6 4
5 5
5 4
Practice Exercise 2
Research has found that physicians often ignore complaints in older patients that they take seriously
in younger patients. In an effort to reduce this prejudice before it becomes too ingrained, a researcher
designs a study to have fourth-year medical students see videos of 10 patients (who are really actors) a
month apart. In one sequence of the 10 videos, an actor is made up to look about 70 and talks about
being retired, and in the other sequence of the 10 videos, the same actor is depicted as his own age (30).
The actor presents similar complaints in each video. Students are randomly assigned to the order of
viewing these two sequences of videos a month apart. After viewing each of the videos, the students list
the questions they would ask the patient, their diagnosis of the complaint(s), and their rating for the
health of the “patient” on a 15-point scale, with higher scores representing better health. The researcher
is interested in comparing the responses to the same complaints for the different “age” patient. The data
for these videos are below:
9.00 14.00
12.00 15.00
9.00 10.00
11.00 11.00
14.00 15.00
10.00 8.00
9.00 10.00
8.00 8.00
12.00 14.00
11.00 15.00
11.00 8.00
9.00 10.00
5.00 9.00
13.00 12.00
10.00 12.00
SPSS calls the analysis repeated measures ANOVA; you can use this term to describe analyses
for a repeated-measures design.
Use the term matched groups ANOVA when you used matching.
You can use the term dependent-designs ANOVA or within-subjects ANOVA for either design.
This ANOVA analysis will give you an F like the independent-designs ANOVA, but this
ANOVA is calculated as follows:
F = Between group differences (MSA) ÷ Within group differences minus the variability
associated with individuals (MSAxS)
When you report an F statistic, you need to report two types of df:
Within-groups (interaction error) dfAxS = df for groups times the number of participants per condition
minus one or (k – 1)(kps – 1)
You also need to report the effect size. Eta (h) is a correlation coefficient used to describe the
magnitude of a relationship containing two or more levels. Partial eta 2 ( η 2partial ) is the effect
size used with a dependent ANOVA and reflects the proportion of variance in the test variable
(DV) accounted for by the grouping variable (IV), with the error unique to individuals
removed.
If you find a statistically significant F, you will need to report the results of post hoc tests to
explain which groups differ significantly.
Matched-Groups Example
A researcher investigates the impact of Internet use on college students’ impulsivity scores. Impulsivity is
measured by the Continuous Performance Task, which is an individually administered test that requires
the participant to attend to and discriminate among stimuli. Higher scores indicate higher impulsivity.
Students are matched on their ability to identify identical objects from a set of similar objects. All
students agree to spend a day (8 hours) in the research lab. One group is given unlimited Internet access,
another group is allowed access for 3 hours, and the third group is given no access.
256 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Data Entry
In a matched-groups
design, each row
represents one matched
group.
In this example, each
matched group has a
unique ID number, and
then impulsivity scores
for each matched
participant in the
unlimited, three hour,
and no internet access
conditions.
Use the same steps for matched groups and repeated measures.
6. Select “Descriptive
statistics” and
IBM® SPSS® Workbook Part 5 ■ Analyses for Dependent Groups and Mixed Designs 257
6. Select “Descriptive
statistics” and
“Estimates of effect
size.”
7. Click “Continue.”
OUTPUT (abbreviated)
We will only look at the following tables in the output:
“Descriptive Statistics”
“Mauchly’s Test of Sphericity”
“Tests of Within-Subjects Effects”
“Pairwise Comparisons”
Descriptive Statistics
Mean Std. Deviation N
The “Descriptives Statistics” table
Unlimited 6.2000 1.37321 15 displays the mean and standard
Threehours 5.8667 1.30201 15 deviation for each level of your
factor (IV or predictor).
nointernet 5.3333 1.04654 15
258 Student Study Guide With IBM® SPSS® Workbook for Research Methods, Statistics, and Applications
Like Levene’s test, Mauchly’s test of sphericity tests an assumption, and we hope that the test is NOT
statistically significant. In this example, Mauchly’s test of sphericity is not statistically significant (p ≥ .05)
and we can assume sphericity.
If Mauchly’s test of sphericity fails (i.e., it is statistically significant at p < .05), you would use an alternative
test such as Greenhouse-Geisser or Huynh-Feldt in the “Tests of Within-Subjects Effects” table.
Use the “Sphericity Assumed” results when Mauchly’s test is not statistically significant.
In this example, F(2, 28) = 5.38, p = .01, ηpartial
2
= .28
Because we have a statistically significant F, we must report the post hoc test results.
IBM® SPSS® Workbook Part 5 ■ Analyses for Dependent Groups and Mixed Designs 259
Pairwise Comparisons
Measure: MEASURE_1
95% Confidence Interval for
Mean Difference Differenceb
(I) factor1 (J) factor1 (I−J) Std. Error Sig.b Lower Bound Upper Bound
1 2 .333 .187 .288 −.175 .841
3 .867* .291 .030 .077 1.656
2 1 −.333 .187 .288 −.841 .175
3 .533 .307 .311 −.300 1.366
3 1 −.867* .291 .030 −1.656 −.077
2 −.533 .307 .311 −1.366 .300
The “Pairwise Comparisons” table depicts the results of the post hoc tests. We only report these
comparisons if our F was statistically significant.
In this example, we used a Bonferroni correction that accounts for the increased chance of a Type I error
due to computing multiple comparisons. Here we find the corrected p values for all our comparisons:
• (1) unlimited access vs. (2) 3-hour access: p = .29
• (1) unlimited access vs. (3) no access: p = .03
• (2) 3-hour access vs. (3) no access: p = .31
Writing up Results
The write-up is very similar to a one-way between-samples ANOVA (see Part 4). However, you should
be sure your language is consistent with your dependent design, either matched-groups or repeated-
measures.
Include the following:
Practice Exercise 2
A researcher who was examining the effect of different schedules of verbal reinforcement on
correct answers in a vocabulary test randomly assigns the 4-year-olds to the order of reinforcement
IBM® SPSS® Workbook Part 5 ■ Analyses for Dependent Groups and Mixed Designs 261
conditions and records the number of correct responses for each condition. Scores for each
condition are below:
8 7 7
7 5 5
6 5 6
6 5 6
7 4 5
6 6 7
8 5 6
7 6 6
5 5 7
6 6 5
NONPARAMETRIC STATISTICS
McNemar and Cochran Q Tests
Conducting a McNemar Test