Lecture 4.bb - Measurement - Part2

This document provides an overview of key concepts related to measurement in hypothesis testing, including: 1. Hypotheses consist of a predictor (independent) variable and an outcome (dependent) variable. Studies investigate "the effect of X on Y." 2. Reliability refers to the consistency of a measurement and is important for minimizing error. There are two types of reliability: test-retest and internal consistency. 3. Validity refers to how well a measurement captures the intended construct. There are three types of validity: face validity, content validity, and construct validity. 4. Multi-item scales are often used to measure complex constructs reliably by combining scores across multiple questions measuring the same concept

Uploaded by

Rodrigo Vázquez

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

11 views

Lecture 4.bb - Measurement - Part2

Uploaded by

Rodrigo Vázquez

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 22

Lecture 4

Ch. 3. Measurement, Part 2

Thurs, Feb. 17th, 2022
Review of what we covered last class
1. Measurement assigns values to variables.
2. Variables represent the operational definitions of constructs.
3. Variables can be scaled in different ways (nominal, ordinal, interval,
or ratio) so that their values are interpretable in the context of your
study.
4. Your study tests a hypothesis about whether (and how) at least two
variables correlate.
Let’s stay focused on the “big picture” of hypothesis testing – as it
relates to measurement – for just a minute longer.
Hypotheses consist of predictor and outcome variables.
• Any study can be described as investigating “the effect of X on Y.”
• Theories are relatively useless without specifying a “direction.”
• Not all studies are designed to support a conclusion about the direction of
the effect. (Later in the semester we will learn more about this.)
• You are probably very familiar with the phrase “correlation does not equal causation.”
• But (good) hypotheses are stated this way.
• Thus, enter the Predictor Variable.
• Also called the “independent variable” (IV)
• The X variable. The cause. The predictor. The catalyst. The stimulus.
• And allow me to introduce the Outcome Variable.
• Also called the “dependent variable” (DV)
• The Y variable. The effect. The outcome. The reaction. The response.
Reliability
A fundamental law of measurement
Measured Value = True Value + Error
• In most cases, some non-zero amount of error variance will obscure the
true value from the measured value.
• Errors can be broken down into two kinds:
• Non-systematic errors (random factors that are out of the researcher’s control)
• Ex: respondents are in a bad mood during the study, or in a good mood!
• Ex: temperature and air pressure when measuring the height of the Empire State Building
• Systematic errors (non-random factors, unintentionally due to decisions of the
researcher)
• Ex: measuring well-being/happiness in Eastern cultures requires different kinds of instruments
• Ex: researchers define the “bottom” of the ESB as being the 1st floor and not the ground floor
• Good measurement will minimize the effects of non-systematic errors.
Reliability
• The extent to which a measurement instrument provides consistent
results over time.
• If I repeatedly use the same thermometer to measure my daughter’s
temperature, will I get the same reading every time?
• If yes, the thermometer is reliable.
• If no, the thermometer is not reliable.
• If I repeatedly use the same scale to measure my weight, the same ruler to
measure my height, the same oven to cook my food, the same IQ test to
measure my intelligence, etc., etc., etc.
• The corollary: If the true value changes, then a reliable measure will
indicate that change with precision, with minimal error.
Two kinds of reliability that we care about
• Test-retest reliability
• Administer the same test twice, to the same group of respondents, after a window
of time.
• If scores on the two tests correlate strongly, then you can assume high reliability.
• Internal consistency (the textbook does not do a good job of explaining this concept)
• Assume that you have a multi-item scale
• ex: IQ test (intelligence) consists of 100 items and questions.
• ex: Myers-Briggs test (personality) consists of 93 items and questions.
• On average, how strongly do the items correlate with each other?
• Individual items should correlate with each other (thus, we call this internal consistency).
Multi-Item Scales
Recall: Complex and abstract constructs require
special attention.
• Thoughtful operational definitions, often consisting of album
sales
multiple dimensions.
dancing
• Every dimension must therefore be measured separately.
As you did during the in-class experiential exercise.
• Constructs may be uni-dimensional but require KPop
quality
multiple items.
• Many consumer attitudes require multi-item measures.
popularity
• Let’s look at a few examples. rapping

• One example (brand loyalty) of multiple dimensions per vocals

construct, measuring one item per dimension.
• Two examples (health consciousness, media skepticism) of
single dimension per construct, measuring multiple items.
Multiple dimensions of brand loyalty:
• Number of items purchased in the past from a brand
• The monetary value of past purchases from a brand
• Self-reported likelihood of buying again in the future
• Self-reported likelihood of recommending the brand to friends or family
• Self-reported emotional attachment to a brand
• Self-reported tendency to forgive a brand
These variables would be unlikely to display internal consistency (they do not
necessarily all correlate with each other), but they could all be measured
separately in a single study, to reflect different dimensions of brand loyalty.
Or, Health Consciousness is uni-dimensional,
measured using 6 items.
• Health Consciousness: the readiness to undertake actions to improve one’s
well-being
• On a scale from 1 to 7, please indicate to what extent you agree/disagree
with the following statements (1 = strongly disagree, 7 = strongly agree).
• I reflect about my health a lot.
• I'm very self conscious about my health.
• I'm alert to changes in my health.
• I'm usually aware of my health.
• I take responsibility for the state of my health.
• I'm aware of the state of my health as I go through the day.
• Individual scores are obtained by summing (or averaging) item scores.
Media Skepticism: Uni-dimensional, measured
using 5 items.
• Media Skepticism: degree to which individuals discount and distrust
information presented by the mass media.
• After watching a news program, participants rate the program on a 1-4 scale
(1 = not at all true, 4 = very true):
• The program was not very accurate in its portrayal of the problem.
• Most of the story was staged for entertainment purposes.
• The presentation was slanted and unfair.
• In think the story was fair and unbiased.
• I think important facts were purposely left out of the story.
• Individual scores are obtained by summing (or averaging) item scores.
Multi-item scales produce composite scores.
The procedure:
1. Measure a construct (e.g., quality, value) with multiple items.
2. Calculate the sum or the average of those items.
3. The resulting average is more likely to reflect the true value than any of
the items can reflect on their own.
In other words, multi-item scales exhibit greater reliability than 1-item scales!

We prefer multi-item scales for the same reasons that we employ multiple judges for sporting contests. The
average score across multiple judges is more likely to reflect the true value after accounting for inter-judge error.
Validity
Validity
• The extent to which an instrument measures what it is supposed to
measure.
• Validity requires reliability (minimizing error) plus an estimate of the
construct that is true to the construct’s form.
• Example: what does the SAT actually measure?
• intelligence? college-preparedness? success in life? the ability to “pass tests?”
• Example: what does GPA actually measure?
• It is a valid measure of “success in school,” but any interpretation beyond that is invalid.
• Can a measure be reliable but not valid?
• Yes! See next slide.
Reliability vs. Validity
Kinds of validity we care about
1. Face validity
• You should be able to infer the construct being measured by reading the test
questions.
• As constructs become complex and abstract, face validity loses its appeal.
• Ex: I want to measure “how likely is it that you are lying to me?” without you knowing
that I am measuring your likelihood of lying.
2. Content validity
• Items should assess the construct as broadly, and from as many angles, as the
complexity of the construct dictates.
• Ex: Stony Brook asks all graduating seniors, before they leave: “how satisfied are you
with your experience at Stony Brook University?” on a 1-5 scale.
• Content validity is low, because there are likely many different factors that should be
assessed (e.g., housing, cost of tuition, safety, food, fees, faculty, social life, quality of
education, etc, etc, etc.)
3. Construct validity
Kinds of validity we care about
3. Construct validity
• The instrument should behave consistently with its underlying theory.
• Essentially, a valid measure should demonstrate validity by relating in predictable
patterns with other variables.
• This kind of validity is the most academic. Used for theory-building more than for
common research applications.
Continuous vs. Discrete Data
Continuous vs. discrete data
• Continuous: an infinite number of possible values between any two
points on the measurement scale.
• Many ratio scales are continuous, but not all.
• Discrete: the variable can only take on a limited number of values.
• Nominal and ordinal scales are, by definition, discrete variables.
• Interval scales are usually discrete, but we bypass this by using multi-item
scales and averaging individual items to create a composite score per person.
• The best way to think about interval vs. ratio and continuous vs.
discrete is on the next slide.
Don’t think of the distinction as a defining characteristic.
Think of it as an additional layer.

ratio

discrete continuous

interval
Don’t think of the distinction as a defining characteristic.
Think of it as an additional layer.

ratio
number of drugs milligrams of a drug
number of sales from repeat customers percentage of market share
number of correct responses on Exam 1 your Exam 1 score as a percentage of 100

discrete continuous
Likert scale ratings Nasdaq composite index
1-10 pain scale composite variables from multi-item scales
SAT scores your overall GPA

interval

Quantitative Measurement, Reliablity, Validity
No ratings yet
Quantitative Measurement, Reliablity, Validity
36 pages
Validity and Reliability in Research
100% (1)
Validity and Reliability in Research
18 pages
Warman SRH Pump Info
No ratings yet
Warman SRH Pump Info
4 pages
Reliability Dan Validity
No ratings yet
Reliability Dan Validity
12 pages
.... Week 8
No ratings yet
.... Week 8
25 pages
Validity and Reliabily
No ratings yet
Validity and Reliabily
41 pages
Ch13 Validity and Reliabity
No ratings yet
Ch13 Validity and Reliabity
41 pages
PSY 210 Chapter 10(1) (1)
No ratings yet
PSY 210 Chapter 10(1) (1)
23 pages
A1181590628 - 23746 - 19 - 2020 - Measurement and Scaling RECAP-3
No ratings yet
A1181590628 - 23746 - 19 - 2020 - Measurement and Scaling RECAP-3
20 pages
RMM Lecture 17 Criteria For Good Measurement 2006
No ratings yet
RMM Lecture 17 Criteria For Good Measurement 2006
31 pages
Week 3.1 Validity and Reliability
No ratings yet
Week 3.1 Validity and Reliability
25 pages
Chapter 5 - Measurement Techniques
No ratings yet
Chapter 5 - Measurement Techniques
46 pages
Criteria of Measurement Quality
No ratings yet
Criteria of Measurement Quality
20 pages
Measurement
No ratings yet
Measurement
34 pages
BRM Chap 13
No ratings yet
BRM Chap 13
46 pages
Construct Reability Validity
No ratings yet
Construct Reability Validity
37 pages
Lecture 2
No ratings yet
Lecture 2
29 pages
RM Unit II Important Questions
No ratings yet
RM Unit II Important Questions
14 pages
Conceptualization, Operationalization, and Measurement
No ratings yet
Conceptualization, Operationalization, and Measurement
21 pages
Unit 9 Measurements - Short
No ratings yet
Unit 9 Measurements - Short
27 pages
Attitude
No ratings yet
Attitude
16 pages
Psychological Measurement
100% (1)
Psychological Measurement
21 pages
As Psychology (Lecture 2)
No ratings yet
As Psychology (Lecture 2)
35 pages
Session-5-Measuring and Scaling Concepts
No ratings yet
Session-5-Measuring and Scaling Concepts
20 pages
Chapter6 Validity
No ratings yet
Chapter6 Validity
35 pages
Validity and Reliability
No ratings yet
Validity and Reliability
13 pages
Measurement in Research
No ratings yet
Measurement in Research
40 pages
Attitude Measurementfinal1
No ratings yet
Attitude Measurementfinal1
45 pages
L2-ResearchDesign_BRSM-Lecture2
No ratings yet
L2-ResearchDesign_BRSM-Lecture2
60 pages
M1 - Research Instrument
No ratings yet
M1 - Research Instrument
5 pages
Rufano n. Report on Validity
No ratings yet
Rufano n. Report on Validity
26 pages
MSc Statistics Booster
No ratings yet
MSc Statistics Booster
107 pages
Lecture 1 - Introd. Reliability - and - Validit
No ratings yet
Lecture 1 - Introd. Reliability - and - Validit
15 pages
Validity and Reliability
No ratings yet
Validity and Reliability
19 pages
Business Research Methods (BRM) DR Seema Garg
No ratings yet
Business Research Methods (BRM) DR Seema Garg
51 pages
Measuring The Variables
No ratings yet
Measuring The Variables
17 pages
Essentials of A Good Psychological Test
No ratings yet
Essentials of A Good Psychological Test
6 pages
PSY 101L: Psychological Testing: Prof. A.K.M. Rezaul Karim, PH.D
No ratings yet
PSY 101L: Psychological Testing: Prof. A.K.M. Rezaul Karim, PH.D
50 pages
Testing&Assessment - TEST QUALITIES
No ratings yet
Testing&Assessment - TEST QUALITIES
95 pages
Business Research Methods
No ratings yet
Business Research Methods
94 pages
Midterm Exam Review Session: What You've Learned So Far
No ratings yet
Midterm Exam Review Session: What You've Learned So Far
12 pages
10-Measurement of Variables - Scaling, Reliability, and Validity
No ratings yet
10-Measurement of Variables - Scaling, Reliability, and Validity
26 pages
Unit 3 1
No ratings yet
Unit 3 1
51 pages
Methods of Research Lession 4
No ratings yet
Methods of Research Lession 4
55 pages
Session 09 - Measuring The Variables
No ratings yet
Session 09 - Measuring The Variables
18 pages
Validity
100% (2)
Validity
17 pages
4 Qualitative and Quantitative Measurement Part 4 1 17062024 025904am
No ratings yet
4 Qualitative and Quantitative Measurement Part 4 1 17062024 025904am
32 pages
BRM chp07
No ratings yet
BRM chp07
29 pages
Measurement: Measurement Is The Process of Observing and
No ratings yet
Measurement: Measurement Is The Process of Observing and
88 pages
Research Methodology: Measurement
No ratings yet
Research Methodology: Measurement
92 pages
Prsentaion of Pyschometrics
No ratings yet
Prsentaion of Pyschometrics
20 pages
5 A Assessment Practices
No ratings yet
5 A Assessment Practices
24 pages
4 Defining and Measuring Variables
No ratings yet
4 Defining and Measuring Variables
27 pages
Methods of Research-Lession 4
No ratings yet
Methods of Research-Lession 4
57 pages
SWK_340_Powerpoint_Chapter_9._Defining_and_measuring_concepts_rDeCarloTextbook
No ratings yet
SWK_340_Powerpoint_Chapter_9._Defining_and_measuring_concepts_rDeCarloTextbook
22 pages
Measurement - Scaling, Reliability, Validity
No ratings yet
Measurement - Scaling, Reliability, Validity
42 pages
Defining and Measuring Variables
No ratings yet
Defining and Measuring Variables
24 pages
Judgnomics: Applying Measurements to Certainty
From Everand
Judgnomics: Applying Measurements to Certainty
Robert Burbank
No ratings yet
Analytical Writing Insights on the GRE General Test
From Everand
Analytical Writing Insights on the GRE General Test
Vibrant Publishers
No ratings yet
Revision Exercises in Basic Engineering Mechanics
From Everand
Revision Exercises in Basic Engineering Mechanics
Gregory Pastoll
No ratings yet
Research in Psychology
From Everand
Research in Psychology
Connor Whiteley
No ratings yet
Assignment 2
No ratings yet
Assignment 2
12 pages
Ep Lab Manual Electrical-1
No ratings yet
Ep Lab Manual Electrical-1
22 pages
CB Mods Super
100% (5)
CB Mods Super
34 pages
Analysis of Pressure Buildup Tests in A Naturally Fractured Reservoir
No ratings yet
Analysis of Pressure Buildup Tests in A Naturally Fractured Reservoir
6 pages
Demostrative Pronouns
No ratings yet
Demostrative Pronouns
16 pages
2 Supply Demand and Elasticity
No ratings yet
2 Supply Demand and Elasticity
43 pages
JC 50
No ratings yet
JC 50
6 pages
Ehrenkranz 1999
No ratings yet
Ehrenkranz 1999
12 pages
Chemistry Notes Vtu
67% (3)
Chemistry Notes Vtu
160 pages
Failure Analysis of A Gas Turbine Nozzle: Z. Mazur, A. Hernandez-Rossette, R. Garcia-Illescas, A. Luna-Ramirez
No ratings yet
Failure Analysis of A Gas Turbine Nozzle: Z. Mazur, A. Hernandez-Rossette, R. Garcia-Illescas, A. Luna-Ramirez
9 pages
Exp 5 (C 245) Complexometery
No ratings yet
Exp 5 (C 245) Complexometery
4 pages
Tutorial A2: Creating A Smart Contract: 10 Minutes
No ratings yet
Tutorial A2: Creating A Smart Contract: 10 Minutes
8 pages
Icecce49384 2020 9179470
No ratings yet
Icecce49384 2020 9179470
5 pages
Linear - Equations - Worksheet 8 Class
No ratings yet
Linear - Equations - Worksheet 8 Class
3 pages
651 PROCONNECT Blood Pressure Monitor: UA-651CN
No ratings yet
651 PROCONNECT Blood Pressure Monitor: UA-651CN
2 pages
SCConsulting Sols
No ratings yet
SCConsulting Sols
8 pages
Inventory Systems For Independent Demand
No ratings yet
Inventory Systems For Independent Demand
4 pages
Relative Valuation
No ratings yet
Relative Valuation
96 pages
4 Marks
No ratings yet
4 Marks
4 pages
Ds Lab Program
No ratings yet
Ds Lab Program
6 pages
Week 7 (Empty Notes)
No ratings yet
Week 7 (Empty Notes)
17 pages
Data Types and Expressions: C# Programming: From Problem Analysis To Program Design
No ratings yet
Data Types and Expressions: C# Programming: From Problem Analysis To Program Design
58 pages
Work, Energy & Power Class 11
No ratings yet
Work, Energy & Power Class 11
48 pages
C Program Solutions
No ratings yet
C Program Solutions
9 pages
Statistics Assignment
No ratings yet
Statistics Assignment
11 pages
Literature Review of Temperature Controlled Fan
100% (1)
Literature Review of Temperature Controlled Fan
8 pages
TOS Lecture
No ratings yet
TOS Lecture
6 pages
Smim01 Ingles Mecánica Industrial: Teacher: Karen Lillo Contreras
No ratings yet
Smim01 Ingles Mecánica Industrial: Teacher: Karen Lillo Contreras
4 pages

Lecture 4.bb - Measurement - Part2

Uploaded by

Lecture 4.bb - Measurement - Part2

Uploaded by

Lecture 4

Ch. 3. Measurement, Part 2

• One example (brand loyalty) of multiple dimensions per vocals

You might also like