BF02289233
BF02289233
3
S E ~ M B E I ~ , 1958
Saunders' approach requires that the kurtosis (fourth moment over second
moment squared) of all loadings and their reflections be a maximum,
(3) q2 = nr ~_, ~_~ a~./(~_, ~E] a~.)2 = maximum.
zero loadings. For orthogonal factors, criteria (2), (3), and (4) are equivalent
because of tile invariance of the sum of the communalities, ~ ] ~ ] , a~ .
(This term, as well as other constants, disappear when differentiated in the
calculus problem involved in finding the required critical point.)
Indeed, it turns out that they are also equivalent to Carroll's criterion
in the orthogonal case. Minimizing (1) is equivalent to maximizing (4)
since the squared eommunality of a test is
constant = ( ~ ] a ~ ) 2 = ~]a~, + 2 ~a~,a~,
, s s<t
= q3 + 2L
Thus, since the quartimax criterion plus twice Carroll's criterion is a con-
stant, maximizing q3 is equivalent to minimizing f.
Neuhaus and Wrigley- realized that none of these criteria can be real-
istically applied without the aid of an electronic computer--the calculations
involved are too extensive for a desk calculator or punched card mechanical
computers. Consequently, they programmed the quartimax method for the
Illiac* and provided a rather extensive numerical investigation of the empirical
properties of the quartimax method.
Their results were perhaps more encouraging than Carroll's. Under
the restriction of orthogonality Carroll's criterion (or the equivalent quartimax
method) does not show nearly so obvious a bias as does Carroll's criterion
when the restriction of orthogonality is removed. However, as an explication
of orthogona] simple structure, the quartimax method does have a systematic
bias which will be more fully examined in the next section.
*The Illiac is the University of Illinois electronic computer. Subsequently, the quar ti-
max criterion has been programmed for the CRC-102A (Neuhaus), and the IBM 701
(Kaiser). The varimax criterion described in the next two sections has been programmed
for SWAC at UCLA (Comrey), the IBM 701 (Kaiser), Illiac (Dickman), and the IBM
650 (Vandenberg).
190 PSYCHOt~IETRIKA
To obtain the total criterion for the entire factor matrix, (5) could then be
summed over all tests to give
(6) q* = ~ {It E (a~.)~ - ( ~ a~.)2]/r21 -
t s s
And for the criterion for all factors, define the maximum simplicity of a
factor matrix as the maximization of
(S) v'* = ~ v * , = ~ {In ~--~.(a~,)2 -- (~,a~,)2]/n=},
summing over all possible pairs of columns for the criterion. Criterion (9)
then bears the analogous relationship to Carroll's criterion (1) that the
varimax criterion (8) does to the quartimax criterion (6).
Some distinctions between quartimax and varimax orthogonal solutions
can be illustrated numerically. In Table 1 solutions for Thurstone's eleven-
variable box problem ([12], pp. 373-375) are given. It will be noted that the
quartimax solution [7] could hardly be called a simple structure. There is
a large general factor, and the second factor seems only vaguely concerned
Table I
Test " X Y Z X Y Z X Y Z
x 9o o5 oo 68 65 05 91 19 16
y 04 85 O1 83 -47 O0 05 93 25
z 03 05 79 42 -08 79 ii 17 88
xy 62 63 -06 99 ii -OL 6h 74 2O
yz -o5 5h 57 71 -ho 56 o2 65 75
x2y 82 37 -01 92 41 03 8h 51 22
xy 2 35 76 02 96 -18 03 37 86 28
(x 2 * z2) ½ 52 -07 65 59 38 68 60 09 77
xyz 42 h3 L3 88 Oh L5 L8 58 65
with dimensions of boxes. On the other hand, the raw varimax solution
closely parallels Thurstone's original subjective solution, given the restric-
tion of orthogonality.
In Table 2 are solutions for Holzinger and Harman's 24 psychological
tests ([3], pp. 229-233). Both the quartimax [7] and the raw varimax methods
seem to duplicate the subjectively rotated simple structure patterns. But
the respective variance contributions of the factors are perhaps more interest-
ing. It is seen that the dispersion of the ~i a~ for the subjective solution
is less than the corresponding figures for the two analytic methods. In other
words, Holzinger and Harman have made the factors a little more level or
192 PSYCHOMETRIKA
Table 2
i i0 32 62 20 37 19 60 07 2~ 20 65 13 lh 19 67 17
2 07 15 41 13 24 07 38 04 17 07 42 08 IO 07 43 IO
3 iO 12 53 13 31 el 48 el 22 02 52 06 15 02 5h 08
4 15 18 53 12 36 07 h6 -oi 27 08 52 Oh 20 09 ~h 07
5 75 15 26 15 81 14 -02 -04 78 21 12 06 75 21 22 13
6 72 05 28 25 81 03 OO 06 78 i0 13 lh 75 IO 23 21
7 81 08 27 Ii 85 07 -Oh -IO 8h 15 IO O0 82 16 21 08
8 54 26 38 14 66 20 20 -04 60 25 31 05 54 26 38 12
9 76 -04 29 30 86 -06 -02 io 84 el 12 19 80 el 22 25
IO 28 66 -19 I~ 23 70 -12 Ii 17 71 -08 17 15 70 -06 24
II 27 61 -Oh 29 31 62 el 23 22 63 06 29 17 60 08 36
12 13 72 09 03 16 69 19 -oi 06 70 23 Oh 02 69 23 ii
13 2h 63 31 02 35 57 32 -08 2h 59 39 -01 18 59 41 06
14 23 19 -02 48 32 19 -03 42 26 20 el h6 22 16 04 50
15 ii l~ 08 50 25 ii I0 h5 17 ii 13 48 12 07 lh 50
16 05 22 34 h5 29 13 37 37 17 13 41 41 08 I0 41 43
17 15 24 -03 62 28 24 02 57 20 23 05 61 14 18 06 64
18 el 39 20 52 22 32 30 h7 08 31 32 51 09 26 32 5h
19 12 22 18 39 28 18 19 32 19 18 22 36 13 15 2h 39
20 31 18 46 29 52 09 35 14 42 12 43 21 35 Ii h7 25
21 17 46 33 2h 35 38 35 14 23 he 40 20 15 38 42 26
22 31 12 40 40 53 04 30 26 hh 06 37 32 36 oh 41 36
23 31 29 54 25 55 19 4h 09 43 21 52 16 35 21 57 22
2h 39 46 14 31 49 43 io 20 40 h6 18 27 34 h4 22 34
~a~ 343 292 268 236 559 242 196 142 431 260 26~ 186 350 244 308 236
j -js
of factor loadings, a test with comnmnality 0.6, for example, would tend to
influence the rotations four times as much as a test whose eommunality was
0.3. Thus, while the most obvious weights have been applied to the tests,
namely the square roots of their communalities, after the fact it seems that
there is probably a better set of weights--weights which would tend to
equalize to a greater extent the relative influence of each test during rotation.
There seems no rational basis for choosing among different weighting
schemes. Let us then make the agnostic confession of ignorance which pervades
any form of correlational analysis. For the purposes of rotation, weight
the tests equally, in the sense that the lengths of the common parts of the
test vectors have equal length. (The author is indebted to Dr. D. R. Saunders
for this suggestion.) The varimax criterion could then be rewritten as
(10) v = ~ {In ~ (a~Jh~) 2 - [~_~ (a~8/h~)]2]jn2},
, i
where h~ is the communality of the jth test. In contrast to (7) and (8), where
the variance of the squared correlations of the tests with a factor is maximized,
the variance of the squared correlations of the common parts of the tests
(the reflections of the tests onto the common-factor space) with a factor is
now being maximized. [Note from (10) that we are not advocating a permanent
weightia~g of the tests by a weight inversely as the square root of their com-
munalities. During rotation this weighting extends the common part of
each test vector to unit length, but after rotation each of these vectors is
shortened to its proper length by reweighting directly as the square root of
the test's communality.]
As will be seen in Table 2, under this modification the varimax criterio~
(the normal varimax, since rotation is with respect to normalized common
parts of tests) has effectively removed the small but disturbing bias in the
raw varimax solution of Holzinger and Harman's example. It also has been
shown in a number of other examples [61 that the normal varimax does not
seem to deviate systematically from what may be considered the best
orthogonal simple structure.*
Thus far, however, merely a numerical-intuitive basis for a weighting
procedure which leads to "prettier" results has been provided. Such a basis
is quite unsatisfactory theoretically. Indeed, this sort of ad hoc thinking
could conceivably lead to a different set of judgmentally determined weights
for any particular example--a situation as scientifically reprehensible as
the subjective graphical methods.
There is a more fundamental rationale for attempting to establish the
normal varimax criterion (10) as a mathematical definition for the rotation
*Professor Andrew Comrey has apparently reached the same conclusion in an exten-
sive application of the normal varimax criterion to interitem correlation matrices of the
MMPI (personal communication). A further example, available from the writer, is the
normal varimax solution of Thurstone's classic PMA study[ll] (dittoed).
194 PSYCHOMETRIK&
problem. Consider the situation illustrated in Fig. 1. There are two clusters
of tests, each of which is pure in the sense t h a t the reflections of the test
vectors of the cluster onto the two-dimensional, common-factor space are
eollinear. (While these clusters are drawn less t h a n 90 ° apart, the following
a r g u m e n t is perfectly general.)
"n"
='n-
Fzocm~ 1
Case for which a normal varimax solution is invariant under changes in the composition
of the test battery.
where
u+ = ( a i l / h + ) ~ - (a~2/h+) 2,
and
vi = 2 ( a ~ l / h i ) ( a i J h i ) .
Let nA (n~ > 1) be the n u m b e r of tests in the first cluster and nB ( n B > 1)
be the n u m b e r of tests in the second cluster (n = nA =k nB). I t is readily
a p p a r e n t t h a t all tests of the first cluster have the same values for u, and v~ .
HENRY F. KAISER 195
Let these values be UA and v , . Similarly let the values for the second cluster
bc uB and v, . In this case (11) reduces to
2nAns(u,VA + u~v8 -- naY, -- u,va)
(12) ¢ -- ~ a r c t a n n A n , ( u 2 + us2 -- vA2 -- vB2 -- 2 u a u , + 2vAv,) *
of rank two. Normal wu'imax does however give invariant results for two
such Spearman clusters simultaneously, and consequently the normal varimax
criterion is a two-dimensional generalization of the classic Spearman case.
Obviously, the next step would be a generalization along the same lines to
the r-dimensional ease; thus far, however, work on this problem suggests
virtually insuperable mathematical difficulties.
To investigate numerically the tendency of the criterion to give factoriatIy
invariant solutions for r larger than two, again consider Holzinger and
Harman's empirical data. Taking their centroid loadings, the first five tests
were rotated then the first six, etc., systematically until the analysis of
all 24 tests was reached, as in Table 2. The results of this application of the
normal varimax criterion are given in Table 3. There, the normal varimax
loadings for the four factors as a function of the changing number of tests
are given.
Note that factors A and C are essentially invariant from the outset;
the loading changes, while somewhat systematic, are negligible--24 appears
to be essentially infinite. On the other hand, factors B and D show more
movement before they become stable. The reason for this is readily apparent.
For both B and D, this movement occurs only while they are underdetermined,
i.e., only while they contain no appreciable Ioadings. However, once they
pick up a test or two with high loadings, their ambiguous definition abruptly
stops, and they settle down to exhibit a degree of invnriance similar to the
Table 3A
Normal Varimax Loading Changes for Holzinger and Harman's Factor A (n = 5, 6, ..., 24) a
Test 5 6 7 8 9 I0 ii 12 13 lh 15 16 17 18 19 20 21 22 23 2}~
i 19 19 18 19 18 16 16 16 16 16 15 16 15 16 16 15 15 15 14 lh
2 12 12 12 13 12 12 12 12 12 ii ii Ii Ii 12 ii Ii II Ii i0 i0
3 15 16 15 16 16 17 17 17 17 16 16 16 16 17 17 16 16 16 15 15
4 21 21 21 22 21 21 21 21 21 21 21 21 21 21 21 20 20 20 20 20
5 79 78 78 79 78 76 75 76 76 76 75 75 75 76 76 75 75 75 75 75
6 78 77 77 78 77 76 77 77 76 75 75 75 75 75 75 75 75 75 75
7 83 8h 83 82 82 82 82 82 82 82 82 83 82 82 82 82 82 82
8 59 58 55 55 55 55 55 55 55 55 55 55 55 55 55 54 5h
9 82 83 82 83 83 81 80 80 80 80 80 80 80 80 80 80
iO 18 15 17 17 16 16 16 15 16 16 16 16 15 16 15
II 19 21 21 19 18 18 18 18 18 18 18 18 18 17
12 03 02 03 03 03 03 03 03 03 03 03 O2 O2
13 I~ 19 19 20 20 20 20 19 19 19 18 18
14 23 21 22 21 22 22 22 22 22 22 22
15 ii 12 ii 12 12 12 12 12 12 12
16 09 09 iO 09 09 09 09 O9 O8
17 13 14 lh 14 lh lh lh 14
18 Ol Ol OO OO oo OO OO
19 13 13 13 13 13 13
20 35 35 35 35 35
21 16 16 16 15
22 36 36 36
23 35 35
2h
34
~ 0
O~
II,
I I I 0
~0000
Co
~o 0~
~ o ~ o
P
~
o
i
~ 0 ~ 0 0 0 ~
~ o ~ o o o ~
¢0
I
~ ~ o ~ w ~ o o ~
i
~o ~ o o ~ ~
o ~ ~ o ~ o
i
~ o ~ o o ~ ~
cl
~
o
i c~ 1
~o ~o ~ o o ~ ~ o ~ 0 ~ ~ ~ 0 ~ 0 0 0 ~ c~
~ ~ o ~ o ~
~ o ~ o ~ o o ~ ~
5" !
H
~ o ~o ~ o o ~ ~ I
~ ~ ~ o ~ o ~ o ~ o ~ o ~ o ~ o o 0 ~
~ ~ ~ 0 ~
~ o ~o ~ o o ~ ~ I
~ ~ ~ 0 ~ 0 ~ ' ~ P ~ o o ~ ~ ~ o ~ o o o ~
~ o ~ ~ ~ 0 ~ P
~ o ~o ~ o o ~ ~ 0 ~ o ..... o g ~ o S ~ o o 0 ~
~ ~ ~ o ~
~ o ~ o o ~ ~ o ~ o~ ~ o~
o g o~
!
198 FSYCHOMETRIKA
Table 3D
Nodal Varimm.x Loading Changes for Holzinger and Ha~man's Factor D (n = 5, 6 . . . . . 2L) a
;q
Test 5 6 7 8 9 i0 ii i2 13 lh 15 16 17 18 19 20 -~! 22 23 24
1 Ol -00 O0 Ol -07 Ol Ol Ol 02 08 13 i4 15 17 19 18 i~ i? 17 17
2 Ol Ol 02 Ol O0 02 02 02 Ol 05 08 09 09 l0 ii ii i0 lO i0 iO
3 -00 Ol Ol ~OO 05 -00 Ol -00 -01 02 06 07 07 08 09 09 O~ 08 08 08
-03 -03 -03 -Oh -01 -04 -03 -oh -Oh Ol 05 06 06 o7 09 08 o~ 08 o? o7
5 -00 -06 -0] -02 -09 -06 i06 -05 -05 07 12 12 lh lh 15 lh 1k ih lh 13
6 06 09 09 05 05 o6 06 05 17 21 21 22 21 22 22 22 22 22 21
7 -06 -06 -06 -i0 -I0 -09 -i0 03 07 O? O9 08 09 O9 O9 08 08 O8
8 -04 -lh -07 -08 -07 -07 05 iO iO 12 12 i4 13 13 i3 12 i2
9 15 II 12 12 Ii 22 26 26 27 25 26 26 26 26 25 25
I0 -00 -07 O0 02 15 19 19 23 24 25 25 25 2L 2h 2J,
L1 07 12 15 27 31 32 35 36 37 37 37 J? 37 36
12 -lh -ii -00 05 05 09 ii 13 12 i2 I_1 I_I ii
13 -17 -05 oo oi o4 07 08 08 07 07 07 o6
L,7 ~9 h9 50 50 50 50 50 50 50 50
15 L9 49 50 50 50 50 50 50 50 50
16 )~2 L2 L~ &5 L~4 L~ ~L /45 ~3
17 6h 65 6h 6£ 65 6h 6L 65
18 5L 55 55 5b 51~ ~ 94
19 ho &o }40 39 39 39
2O 26 26 26 25 25
21 27 26 26 26
22 36 36 36
23 22 22
2~ 3~
other two factors, which had high loadings from the beginning. For n = 24,
there appear to be good approximations to the domain normal varimax
factors.
it may be shown that in the orthogonal ease v = - 2c. This alternative form
of the normal varimax may then be used to obtain oblique factors. The
mathematical problem of minimizing (13) is exactly analogous to Kaiser's
[5] treatment for Carroll's criterion. Comput~tionally, the (iterative) solution
involves finding the latent vector associated with the smallest latent root of
a constantly changing symmetric matrix of order r.
HENRY F. KAISER 199
Computational Appendix
To compute an orthogonal normal varimax solution, the following
procedure is suggested. The first step is to normalize the rows of the arbitrary
reference factor matrix (e.g., principal axes or centroids) by dividing each
element by h;. Rotation to the direction of the normal varimax factors may
then be carried out with respect to these normalized loadings.
The criterion (10) will be applied to two factors at a time. For this purpose,
the following notation for an orthogonal rotation is convenient.
where x; and y~, the present normalized loadings, are constants, and X;
and Yi , the desired normalized loadings, are functions of ¢, the angle of
rotation.
It is immediately seen that
(14) X~ = x; cos¢ -b y~ sin ¢,
(15) Y; = - x t sin ¢ -{- yj cos ¢.
Thus,
(16) dXJd¢ = Y~,
(17) d Y~/d¢ = - X i .
According to (10), in this plane,
(18) n 2 v~ -- n ~ (X2) 2 -- ( ~ X2) 2 -~ n ~ (y2)2 _ ( ~ y~)2
should be a maximum. Differentiating (18) with respect to ¢, using (16) and
(17), and setting the derivative equal to zero,
(19) n ~ XY(X 2- y2) _ ~..Xy ~ (X ~ - Y~) = O.
To solve (19) for ¢ in terms of xi and y~, substitute the values of X; and
Y~ from (14) and (I5), consult a table of trigonometric identities, and, after
a good deal of algebraic manipulation,
(20) ¢ = ¼ arctan
2[~ ,, E (x ~ ~ y 2) ( x2 y ) - - E (x ~ y~) E (2xy)]
- -
0 ° to 0 ° to
sign of "~22½° --22½°
denominator +22½ ° -22½ °
to ~-45 ° to --45 °
T h e sign of n u m e r a t o r a n d d e n o m i n a t o r refer to t h e r i g h t - h a n d m e m b e r
of (20); t h e values in the cells refer to ¢.
These single-plane rotations are m a d e on factors 1 with 2, i with 3, • • • , 1
w i t h r, 2 with 3, • .- , 2 with r, • • • , (r -- 1) with r, 1 with 2, - . . iteratively
until r(r - 1)/2 successive rotations of ¢ = 0 are obtained, i.e., until the
process converges. (It was shown [6] t h a t v in (10) c a n n o t be greater t h a n
(r - 1)/r, a n d since each successive application of (20) can result 0nly in
a non-decrease of v, this iterative procedure m u s t converge.) A f t e r con-
vergence, each normalized test v e c t o r is restored to its proper length b y
m u l t i p l y i n g b y ha .
Since this article was accepted for publication, t h e a u t h o r has p r e p a r e d
,~ detailed outline for coding an electronic c o m p u t e r p r o g r a m for tlle v a r i n ~ x
criterion. This (dittoed) p a p e r is available f r o m t h e writer.
REFERENCES
[1] Carroll, J. B. An analytical solution for approximating simple structure in factor
analysis. Psychometrika, 1953, 18, 23-38.
[2] Ferguson, G. A. The concept of parsimony in factor analysis. Psychometrika, 1954,
19, 281-290.
[3] tIolzinger, K. J. and Harman, H. H. Factor analysis. Chicago: Univ. Chicago Press,
1941.
[4] Kaiser, H. F. An analytic rotational criterion for factor analysis. Amcr. Psychol-
ogist, 1955, 10~ 438. (Abstract)
[5] Kaiser, H. F. Note on Carroll's analytic simple structure. Psychometrika, 1956, 21,
89-92.
[6] Kaiser, H. F. The v~rimax method of factor analysis. Unpublished doctoral disserta-
tion, Univ. California, 1956.
[7] Neuhaus, J. 0. and Wrigley, C. The quartimax method: an analytical approach to
orthogonM simple structure. Brit. J. statist. Psychol., 1954, 7, 81-91.
[8] Saunders, D. R. An analytic method for rotation to orthogonal simple structure.
Princeton: Educational Testing Service Research Bulletin 53-10, 1953.
[9] Thomson, G. H. The factorial analysis of human ability. (Sth ed.) New York: Houghton
Mifflin, 1951.
[10] Thurstone, L. L. Theory of multiple factors. Ann Arbor: Edwards Bros., 1932.
[11] Thurstone, L. L. Primary mental abilities. Psychometric Monogr. No. 1, 1938.
[12] Thurstone~ L. L. Multiple-factor analysis. Chicago: Univ. Chicago Press, 1947.
Manuscript received 11/15/57
Revised manuscript received 1/9/58