Historical and Philosophical Foundations of Assessment: TF Discuss Current Trends and Controversial
Historical and Philosophical Foundations of Assessment: TF Discuss Current Trends and Controversial
OVERVIEW
Testing is an integral part of our society. Any- grade students. As a result, all 50 states now
one who has attended grade school or college, have testing programs and report testing results
served in the military, or applied for a job has to their community. The use of tests should be
taken one or more tests. It is almost impossible viewed within the context of the broader
to pick up a newspaper, listen to a talk show, or concept of assessment. It is vital that counselors
watch a documentary without testing being and other helping professionals understand the
mentioned. Testing can have significant impact assessment process, the purpose of tests, how to
on all our lives. For example, in 2002, President choose appropriate tests, test interpretation, and
Bush signed into law the No Child Left Behind the use of test results in developing treatment
Act of 2001, containing four basic education re- plans or educational plans.
form principles for kindergarten through 12th-
OBJECTIVES
After studying this chapter, you should be able ^ Understand the history of assessment
to tf Discuss current trends and controversial
t/ Discuss several uses and values of testing issues in testing
Chapter 1 Historical and Philosophical Foundations of Assessment
TESTING AND ASSESSMENT
The use of tests has greatly expanded during the 20th century. More thanjone million
standardized tests are used each school day in American schools alone (Lyman, 1998).
Add to that the tests given in industry, mental health clinics, the military, government,
business, and so on, and that number increases dramatically. The growth in test sales is
impressive, from less than $7 million in 1955 to over $263 million in 1997 (Clarke,
Madaus, Horn, & Ramos, 2001). Between 1990 and 1997 alone, sales increased by
about 50%, or $88 million.
The use of computers to administer and score tests has changed the field of as-
sessment in terms of reducing the time for scoring and analyzing test results and as-
suring their accuracy. There are tests to measure of a wide variety of attributes including
intelligence, achievement, career preferences, personality type, and much more.
Often, the terms testing and assessment are used interchangeably It is important to
know that the use of tests is only one part of the overall process of assessment. ^4^-
sessment is defined by the American Psychological Association (APA) as a process that
integrates test information with information from other sources; a process for evaluating
behavior, psychological constructs, and/or characteristics of individuals or groups for
the purpose of making decisions regarding classification, selection, placement, di-
agnosis, or intervention (2000). The term test refers to a measurement procedure for
assessing psychological characteristics in which a sample of an examinee's behavior is
obtained and subsequently evaluated and scored using a standardized process (APA,
2000). Because tests provide only one source of information, professionals may also
gather client information from interviews, records, or from other professionals.
A test user is the person or persons responsible for the selection, administration, and
scoring of tests; for the analysis, interpretation, and communication of test results; and
for any decisions or actions that are based, in part, on test scores. Generally, individuals
who simply administer tests, score tests, and communicate simple or "canned" test
results are not classified as test users.
VALUE OF TESTING
Although testing is a specific task that is different from counseling, it continues to be an
important part of the counseling process (Maxmen & Ward, 1995). Though it can occur
at several points throughout the counseling relationship, it will almost always occur at
the outset of the counseling process (Seligman, 1996). The assessment process is used
for the following reasons:
• To obtain information about the client's presenting problem
• To make an accurate diagnosis
• To determine the client's goals for counseling
• To gather information to aid in the development of a treatment plan
• To gather information to aid in the development of an educational plan
• To gather baseline data to measure the client's progress in counseling
• To determine a client's suitability for a certain treatment program or modality
Historical Contexts
• To assess the effectiveness of an education/counseling program
• To carry out research studies
In addition to providing useful information about clients, test information can be es-
pecially valuable in educational decision making because it enhances placement deci-
sions. Public Law 94-142 requires exceptional students to be placed in the least
restrictive environment, and test information helps educational personnel place these
individuals in environments that will best facilitate their learning. Test data are utilized
with different criteria in the selection of students for gifted programs, for admission to
colleges and universities, and for admission to vocational and technical programs.
Advanced placement (AP) programs and college-level examination programs (CLEPs)
are examples of widely used placement programs. Teachers are also aided by test data in
their selection of instructional strategies and in their evaluation of curriculum programs.
Another way of looking at the value of test data is to consider how different groups
such as administrators, teachers, counselors, psychologists, curriculum experts, students,
researchers, the public, and the legislature use the data. All of these groups are
consumers of test information.
STANDARDIZED AND NONSTANDARDIZED TESTS
There are several classification systems for tests. Tests can be classified based on their
purpose, how they wTere constructed, their content, or how they are administered,
scored, and interpreted (Aiken, 2003)- A common classification is standardized and
nonstandardized tests. Standardization implies uniformity of procedure in administering
and scoring the test; therefore, standardized tests will have specific directions for
administration, specific instructions for the test taker, and specific scoring procedures.
Standardized tests have generally proven reliability, or consistency, of scores and val-
idity, the ability of the test to measure what it is designed to measure. Some general
categories of standardized tests include achievement tests, aptitude tests, intelligence
tests, personality inventories, and interest inventories. Nonstandardized tests are in-
formally constructed tests without proven reliability or validity. These include inter-
views, observation, questionnaires, and secondhand information.
HISTORICAL CONTEXTS
Testing is not a new idea, even though the objective test movement began only at the
turn of the century (see Table 1.1). Around 2200 B.C., the Chinese used essay exami-
nations to help select civil service employees. The philosophies of Socrates and Plato
emphasized the importance of assessing an individual's competencies and aptitudes in
vocational selection. Throughout the centuries, philosophers and educators have devised
certain scales or items to provide teachers and parents with useful information to help
their children. Fitzherbert (1470-1538) identified some items to screen individuals with
retardation from those without; for example, being able to count to 20 pence, being able
to tell one's age, and being able to identify one's father or mother.
Chapter 1 Historical and Philosophical Foundations of Assessment
Table 1.1
Major events in testing during the 20th century.
Juan Huarte (1530-1589) was probably the first author to suggest formal mental
testing. His book title was translated as The Trial of Wits: Discovering the Great
Differences of Wits Among Men and What Sorts of Learning Suit Best with Each
Genius. Jean Esquirol (1772-1840), a French physician, proposed that there are
several levels of mental deficiency and that language is a valid psychological
criterion for differentiating among levels. Eduardo Seguin (1812-1880) also
worked with individuals with mental
Historical Contexts
Table
1.1
Jensen's How Much Can We Boost IQ and Mental Measurements Yearbook reviews
Scholastic Achievement? Kuder in *
Occupational Interest Survey Cattail's Bibliographic Retrieval Service Test
Theory of Fluid and Crystallized Critiques, Vols. 1-7 Detroit Test of
Intelligence Kirk and McCarthy's Illinois Learning Aptitude IV Differential Ability
Test of Scales Neurobehavioral Functioning
Psycholinguistic Abilities Inventory Nagiieri's Nonverbal Aptitude
Bayley Scales of Infant Test: Individual
Development Administration
1970 Family Educational Rights and Test of Nonverbal Intelligence 1
Privacy Act to New York State Truth in -3 Beta III
Testing Act Bracken's Basic Concepts Scale:
1979 Public Law 94-142 Revised Bayley Scale of Infant
System of Multicultural Pluralistic Development Millon's Index of
Assessment Wechsler Intelligence Scale Personality Types
for Children-Revised Revised Standards 1990 Sixteen Personality Factor
for Educational and Questionnaire V to Tests in Print IV
Psychological Testing Rokeach 2000 Wechsler Intelligence Scale for
Value Survey Peabody Picture Children IV Wechsler Adult
Vocabulary Test Seventh and Eighth Intelligence Scale 111 Wechsler
Mental Measurements Preschool and Primary Scale of
Yearbooks Intelligence III
Use of computers in testing Wechsler Individual Achievement Test
McCarthy Scales of Children's Wechsler Abbreviated Scale of
Abilities Intelligence Early Reading Diagnostic
1980 Thorndike, Hagen, and Startler's revision Assessment: Revised Early Mathematics
of the to Stanford-Binet Diagnostic Assessment Wide Range
1989 Kaufman Assessment Battery for Achievement Test, 3rd Edition State Trait
Children Anger Expression Inventory: II Stanford-
Ninth and Tenth Mental Measurements Binet V
Yearbooks Minnesota Multiphasic Dynamic Assessment of Test
Personality Inventory II Revised Accommodations BarOn's Emotional
Standards for Educational and Quotient Inventory First Step: Screening
Psychological Testing Test for Evaluating
Computer-adaptive and computer-assisted Preschoolers
testing Wechsler Adult Intelligence Scale- Tests in Print V
Revised Nader/Nairn The Reign of ETS
Tests in Print HI