LESSON 6
Establishing Test Validity and Reliability
Reported by: RACHEL ANN F. SABANDO
Key Explanations
1: Consistent response expected with
same participants
2: Consistency with same/equivalent
test at different times
3: Consistency across items measuring
same characteristic
What is Reliability?
Reliability is the consistency of responses to a measure under three conditions:
1. When retested on the same person
2. When retested on the same or an equivalent measure
3. Similarity of responses across items measuring the same characteristic
Factors Affecting Reliability
1. Number of Items in a Test
The more items a test has, the higher the likelihood of reliability. The probability of obtaining consistent scores is high because of the large pool of items.
2. Individual Differences of Participants
Every participant possesses characteristics that affect their performance in a test, such as fatigue, concentration, innate ability, perseverance, and motivation. These individual factors change over time and affect the consistency of the answers in a test.
3. External Environment
The external environment may include room temperature, noise level, depth of instruction, exposure to materials, and quality of instruction, which could affect changes in the responses of examinees in a test.
What are the different ways to
establish test reliability?
*Key Determinants of Reliability Method Selection
1. Variable Measured (e.g., stable traits like IQ vs. transient states like mood)
2. Test Type (e.g., multiple-choice vs. performance-based)
3. Number of Test Versions Available
Method in Testing Reliability
1. Test-retest
How is this reliability done?
- You have a test, and you need to administer it at
one time to a group of examinees. Administer it again at
another time to the *same group* of examinees.
- There is a time interval of not more than 6 months between
the first and second administration of tests that measure stable
characteristics, such as standardized aptitude tests. The post-
test can be given with a minimum time interval of 30 minutes.
- The responses in the test should more or less be the same
across the two points in time.
- Applicability: Test-retest is applicable for tests that measure
stable variables, such as aptitude and psychomotor measures
(e.g., typing test, tasks in physical education).
What statistic is used?
- Correlate the test scores from the first and the second administration. A significant and positive correlation indicates that the test has temporal stability over time. Use the **Pearson Product Moment Correlation (Pearson r)** because test data are usually on an interval scale.
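The test-retest correlation can be sketched in Python. The scores below are hypothetical; the function implements the standard Pearson r formula.

```python
import math

def pearson_r(x, y):
    """Pearson product-moment correlation between paired score lists."""
    n = len(x)
    sx, sy = sum(x), sum(y)
    num = n * sum(a * b for a, b in zip(x, y)) - sx * sy
    den = math.sqrt((n * sum(v * v for v in x) - sx ** 2) *
                    (n * sum(v * v for v in y) - sy ** 2))
    return num / den

# Hypothetical data: the same 5 examinees tested at two points in time
first = [10, 12, 15, 9, 14]
second = [11, 13, 15, 8, 14]
r = pearson_r(first, second)  # a high positive r suggests temporal stability
```

A significance check against a critical value (discussed later in this lesson) would then tell whether the obtained r is trustworthy.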
2. Parallel forms
How is this reliability done?
- There are two versions of a test. The items need to
exactly measure the same skill. Each test version is called a
*form*.
- Administer one form at one time and the other form at
another time to the *same* group of participants.
- The responses on the two forms should be more or less the
same.
- Parallel forms are applicable if there are two versions of the
test (e.g., entrance examinations, licensure examinations).
What statistic is used?
-Correlate the test results for the first form and the second
form using **Pearson r**. A significant and positive
correlation coefficient indicates consistency between forms.
3. Split-Half
How is this reliability done?
- Administer a test to a group of examinees. Split the
items into halves (usually odd-even technique).
- Correlate the sum of points in odd-numbered items with
the sum of points in even-numbered items. Each examinee
will have two scores from the same test.
- Used when the test has a large number of items.
What statistic is used?
1. Correlate the two sets of scores using **Pearson
r**.
2. Apply the **Spearman-Brown Coefficient** to adjust
for test length.
- A significant positive correlation indicates internal
consistency.
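The two steps above can be sketched as follows, using hypothetical dichotomous item scores and the standard Spearman-Brown correction.

```python
import math

def pearson_r(x, y):
    """Pearson product-moment correlation between paired score lists."""
    n = len(x)
    sx, sy = sum(x), sum(y)
    num = n * sum(a * b for a, b in zip(x, y)) - sx * sy
    den = math.sqrt((n * sum(v * v for v in x) - sx ** 2) *
                    (n * sum(v * v for v in y) - sy ** 2))
    return num / den

# Hypothetical item scores: 5 examinees x 6 right/wrong items
items = [[1, 1, 1, 0, 1, 1],
         [1, 0, 1, 1, 0, 1],
         [0, 1, 0, 0, 1, 0],
         [1, 1, 1, 1, 1, 1],
         [0, 0, 1, 0, 0, 1]]
odd = [sum(row[0::2]) for row in items]   # items 1, 3, 5
even = [sum(row[1::2]) for row in items]  # items 2, 4, 6
r_half = pearson_r(odd, even)
# Spearman-Brown correction: adjusts the half-test r to full test length
r_full = 2 * r_half / (1 + r_half)
```

The correction is needed because each half contains only half the items, and shorter tests are less reliable; r_full estimates the reliability of the whole test.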
4. Test of Internal Consistency Using Kuder-Richardson and Cronbach's Alpha
How is this reliability done?
- Determine if scores for each item are consistently
answered by examinees.
- Works for tests with many items or Likert-scale
inventories (e.g., "strongly agree" to "strongly disagree").
What statistic is used?
- **Cronbach’s alpha** or **Kuder-Richardson
(KR-20/21)**.
- A value ≥ 0.60 indicates internal consistency.
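A minimal sketch of Cronbach's alpha, using hypothetical Likert-scale responses (alpha = k/(k-1) × (1 − Σ item variances / total-score variance)):

```python
def cronbach_alpha(scores):
    """scores: one list of item responses per examinee."""
    k = len(scores[0])                      # number of items
    def pvar(xs):                           # population variance
        m = sum(xs) / len(xs)
        return sum((x - m) ** 2 for x in xs) / len(xs)
    item_vars = [pvar([row[i] for row in scores]) for i in range(k)]
    total_var = pvar([sum(row) for row in scores])
    return (k / (k - 1)) * (1 - sum(item_vars) / total_var)

# Hypothetical Likert responses: 5 examinees x 4 items (1-5 scale)
responses = [[5, 4, 5, 4],
             [4, 4, 4, 3],
             [3, 3, 2, 3],
             [5, 5, 4, 5],
             [2, 3, 2, 2]]
alpha = cronbach_alpha(responses)  # >= 0.60 is taken as internally consistent
```

KR-20 is the special case of this formula for items scored right/wrong (0/1).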
5. Inter-rater Reliability
How is this reliability done?
- Measures consistency among multiple raters
using the same rubric.
- Used when assessments require multiple raters
(e.g., performance evaluations).
What statistic is used?
- **Kendall's coefficient of concordance (Kendall's W)**.
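Kendall's W can be sketched for the no-ties case, where each rater ranks the same set of examinees (the ratings below are hypothetical):

```python
def kendalls_w(rankings):
    """rankings: one rank list per rater over the same n examinees
    (ranks 1..n, no ties). W runs from 0 (no agreement) to 1 (perfect)."""
    m = len(rankings)                       # number of raters
    n = len(rankings[0])                    # number of examinees
    rank_sums = [sum(r[i] for r in rankings) for i in range(n)]
    mean = sum(rank_sums) / n
    s = sum((rs - mean) ** 2 for rs in rank_sums)
    return 12 * s / (m ** 2 * (n ** 3 - n))

# Hypothetical example: 3 raters rank 4 performances with the same rubric
ratings = [[1, 2, 3, 4],
           [1, 3, 2, 4],
           [2, 1, 3, 4]]
w = kendalls_w(ratings)
```

When all raters produce identical rankings, W equals 1, indicating perfect agreement.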
Linear Regression
Linear regression shows the relationship between two sets of scores from the same test administered at different times.
Visual Representation
Scatterplot showing Monday (X) vs. Tuesday (Y) test scores
- Each point = one student's paired scores
- Straight line = regression line
**Interpretation**:
- Tight cluster along line → High
reliability
- Scattered points → Low reliability
Computation of Pearson r Correlation
The index of the linear relationship is called a correlation coefficient. When the points in a scatterplot tend to fall along the regression line, the correlation is said to be strong. When the direction of the scatterplot is directly proportional, the correlation coefficient will have a positive value. If the relationship is inverse, the correlation coefficient will have a negative value. The statistical analysis used to determine the correlation coefficient is called the Pearson r. How the Pearson r is obtained is illustrated below.
Formula
r = [NΣXY − (ΣX)(ΣY)] / √{[NΣX² − (ΣX)²][NΣY² − (ΣY)²]}
where:
ΣX - add all the X scores (Monday scores)
ΣY - add all the Y scores (Tuesday scores)
X² - square the value of the X scores (Monday scores)
Y² - square the value of the Y scores (Tuesday scores)
XY - multiply the X and Y scores
ΣX² - add the squared values of X
ΣY² - add all the squared values of Y
ΣXY - add all the products of X and Y
* The value of a correlation coefficient does not exceed 1.00 or -1.00. A value of 1.00 or -1.00 indicates a perfect correlation. In tests of reliability, though, we aim for a high positive correlation, which means that there is consistency in the way the students answered the test.
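The sums defined above plug directly into the Pearson r formula. A worked sketch with hypothetical Monday/Tuesday scores:

```python
import math

# Hypothetical Monday (X) and Tuesday (Y) scores for 5 students
x = [8, 6, 9, 5, 7]
y = [9, 6, 8, 5, 7]
n = len(x)
sum_x = sum(x)                              # ΣX
sum_y = sum(y)                              # ΣY
sum_x2 = sum(v * v for v in x)              # ΣX²
sum_y2 = sum(v * v for v in y)              # ΣY²
sum_xy = sum(a * b for a, b in zip(x, y))   # ΣXY
# Pearson r from the raw-score sums
r = (n * sum_xy - sum_x * sum_y) / math.sqrt(
    (n * sum_x2 - sum_x ** 2) * (n * sum_y2 - sum_y ** 2))
```

For these scores r works out to 0.90, a high positive correlation, so the two administrations are consistent.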
3. Difference between a positive and a negative correlation
*When the value of the correlation coefficient is positive, it means
that the higher the scores in X, the higher the scores in Y. This is
called a positive correlation. In the case of the two spelling scores, a
positive correlation is obtained.
*When the value of the correlation coefficient is
negative, it means that the higher the scores in X, the
lower the scores in Y and vice versa. This is called a
negative correlation. When the same test is administered
to the same group of participants, usually a positive
correlation indicates reliability or consistency of the
scores.
4. Determining the Strength of a Correlation
Strength Guidelines
| r Value   | Interpretation |
| 0.80-1.00 | Very strong    |
| 0.60-0.79 | Strong         |
| 0.40-0.59 | Moderate       |
5. Determining the Significance of the Correlation
The correlation obtained between two variables could be due to chance. To rule this out, the correlation is tested for significance. When a correlation is significant, it means that the relationship between the two variables is unlikely to have occurred by chance.
In order to determine whether a correlation coefficient is significant, it is compared with an expected value called the critical value. When the computed value is greater than the critical value, there is more than a 95% chance that the two variables are truly correlated, and the correlation is significant.
Method:
- Compare the computed correlation coefficient to the
*critical value* in the statistical table.
- If the computed value is higher than the critical value,
the correlation is significant.
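An equivalent form of this check converts r into a t statistic and compares it with the critical t value. The r below is hypothetical; 2.306 is the standard two-tailed critical t at α = .05 with df = 8.

```python
import math

def t_statistic(r, n):
    """t statistic for testing whether a correlation differs from zero
    (degrees of freedom = n - 2)."""
    return r * math.sqrt(n - 2) / math.sqrt(1 - r * r)

r, n = 0.82, 10          # hypothetical computed r for 10 examinees
t = t_statistic(r, n)
T_CRIT = 2.306           # two-tailed critical t at alpha = .05, df = 8
significant = abs(t) > T_CRIT
```

Here t exceeds the critical value, so the correlation would be declared significant at the .05 level.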
Example of Cronbach’s Alpha for Internal
Consistency:
Five (5) students answered a checklist regarding
cleanliness (scale: 1–5).
5 - always   4 - often   3 - sometimes   2 - rarely   1 - never