Chapter 13 Interpreting Test Scores
Chapter 13 Interpreting Test Scores
This chapter begins by briefly laying down a set of general procedures for test
construction. These are then illustrated by two examples: an achievement test and a
placement test. Finally there is a short section on validation.
CONTENT
This refers not to the content of a single, particular version of a test, but to the
entire content of any number of versions. Samples of this content will appear in
individual versions of the test.
The way in which content is described will vary with its nature. The content of a
grammar test, for example, may simply list all the relevant structures. The content of a
test of a language skill, on the other hand , may be specified along a number of
dimensions. The following provides a possible framework for doing this. It is not meant
to be prescriptive; readers may wish to describe test content differently. The important
thing is that content should be as fully specified as possible.
Operations (the task that candidates have to be able to carry out). For a reading test
these might include, for example: scan text to locate specific information; guess
meaning of unknown words from context.
Types of text For example a writing test these might include: letters, forms,
academic essays up to three pages in length.
Addresses This refers to the kinds of people that the candidate is expected to be able
to write or speak to (for example native speakers of the same ages and status); or the
people for whom reading and listening materials are primarily intended (for example
native-speaker university students).
Topics
Topics are selected according to suitability for the candidate and the type test.
an achievement test
Specifications
CONTENT
Types of text The text should be academic (taken from textbooks and journal
papers).
Addresses Academics at post graduate level and beyond.
Topics The subject areas will have to be as natural as possible, since the students
are from a variety of social science and business disciplines. (economics, sociology,
management, etc.).
Operations These are based on the stated objectives of the course, and include
broad and underlying skills:
Broad skills:
1. Scan extensive written texts for pieces of information.
2. Construe the meaning of complex, closely argued passages.
Underlying skills:
(Those which are regarded as of particular importance for the development of the
broad skills, and which are given particular attention on the course.)
3. Guessing the meaning of unfamiliar words from context
4. Identifying referents of pronouns etc., often some distance removed in the text.
Example:
Cross-sectional studies have indicated that intelligence declines after the age of
thirty. Part of the explanation for this may be that certain intelligence test tasks
require , something of which we are less capable as we grow older.
5 meaning-from-context items from detailed reading passages
Example:
For each of the following, find a single word in the text with an equivalent
meaning. Note: the words in the text may have an ending such as ing, -s, etc.
Example:
What does each of the following refer to in the text? Be very precise/
the former (line 43)
SCORING PROCEDURES
There will be detailed key, making scoring, almost entirely objective. There will
nevertheless be independent double scoring. Scorers will be trained to ignore
irrelevant (for example grammatical) inaccuracy in responses.
SAMPLING
Texts will be chosen from as wide a range of topics and types of writing as is
compatible with specifications. Draft items will only be written after the suitability of
the texts has been agreed.