Multivariate Statistics PCA
Multivariate Statistics PCA
Sudipta Das
Assistant Professor,
Department of Data Science,
Ramakrishna Mission Vivekananda University, Kolkata
Outline I
Formal definition
First principal component
is the linear combination of a1 0 X that maximizes Var (a1 0 X) subject to
a1 0 a1 = 1
Second principal component
is the linear combination of a2 0 X that maximizes Var (a2 0 X) subject to
a2 0 a2 = 1 and Cov (a2 0 X, a1 0 X) = 0
······
ith principal component
is the linear combination of ai 0 X that maximizes Var (ai 0 X) subject to
ai 0 ai = 1 and Cov (ai 0 X, ak 0 X) = 0 for all k < i
Sketch of proof:
To get the first principal component, we need
a0 Σa
0 0
max(Var (a X)) s. t. a a = 1 ⇒ max
a a a0 a
Thus (Lemma: Maximization of Quadratic Forms for Points on the Unit Sphere),
0
a Σa
max = λ1
a a0 a
Thus (Lemma: Maximization of Quadratic Forms for Points on the Unit Sphere),
0
a Σa
max = λi
a⊥e1 ,...,ei−1 a0 a
and maximum is attained at a = ei .
Hence, Yi = e0i X and Var (Yi ) = e0i Σei = λi
Also,
Cov (Yi , Yk ) = Cov (e0i X, e0k X) = e0i Σek = 0
λ1 + . . . + λk
λ1 + . . . + λk + . . . + λp
Sketch of proof:
σ11 ... 0
. .. ..
recall V = .. . . .
0 ... σpp
Note:
E(Z) = 0 = [0 .. . 0]0
1 ρ12 ... ρ1p
.. .. ..
Cov (Z) = ρ = . .
. .
ρ1p ρ2p ... 1
Yi = ei0 Z, for i = 1, 2, . . . , p.
Moreover,
p
X p
X
Var (Yi ) = Var (Zi ) = p.
i=1 i=1
λ1 + . . . + λk
p
where (λ̂1 , ê1 ), (λ̂2 , ê2 ), . . . , (λ̂p , êp ) are the eigenvalue-
eigenvector pairs for S, with λ̂1 ≥ λ̂2 ≥ · · · ≥ λ̂p ≥ 0. Also,
Var (Ŷi ) = λ̂i , for i = 1, 2, . . . , p
and
Cov (Ŷi , Ŷk ) = 0, for i 6= k .
In addition,
p
X p
X
Total Sample Variance = sii = λ̂i
i=1 i=1
where (λ̂1 , ê1 ), (λ̂2 , ê2 ), . . . , (λ̂p , êp ) are the eigenvalue-
eigenvector pairs for R, with λ̂1 ≥ λ̂2 ≥ · · · ≥ λ̂p ≥ 0. Also,
and
Cov (Ŷi , Ŷk ) = 0, for i 6= k .
In addition,
p
X
Total Sample Variance = λ̂i = p
i=1