Ch. 10 Principal Components Analysis (PCA)
This material is loosely related to Section 10B. I would encourage you to read the rest of Chapter 10, but very critically; factor analysis is very popular in social science but also very controversial.
Why use PCA? There are several uses (and abuses) for PCA. The most important use of PCA is probably in multiple regression. Suppose a response variable Y is to be regressed against a large number of covariates. Variable selection techniques are often not very effective, and there may be scientific interest in including information from most or all of the covariates. Retaining all covariates will likely lead to severe multicollinearity or non-identifiability of regression coefficients. Without remedy, standard errors will be unacceptably large, and predictions may be very inaccurate.
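For concreteness, severe multicollinearity can be diagnosed before turning to PCA by asking PROC REG for variance inflation factors. The following is only a minimal sketch, assuming a hypothetical data set FULL with covariates X1-X10 and response Y (these names are not from the notes):

PROC REG DATA=FULL;
  TITLE 'Checking for multicollinearity with variance inflation factors';
  MODEL Y = X1-X10 / VIF;   /* large VIFs (say, well above 10) signal inflated standard errors */
RUN;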
What does PCA do? Objectives:
1. To find a small set of linear combinations of the covariates which are uncorrelated with each other. This will avoid the multicollinearity problem.
2. To ensure that the linear combinations chosen have maximal variance. A good regression design chooses values of the covariates which are spread out.
Calculating Principal Components
Suppose $n$ independent observations are taken on $X_1, X_2, \ldots, X_k$, where the covariance between $X_i$ and $X_j$ is $\mathrm{Cov}(X_i, X_j) = \sigma_{ij} = \sigma^2 R_{ij}$ for $i, j = 1, 2, \ldots, k$; that is, the covariance matrix is $\Sigma = \sigma^2 R$, where $R$ is the correlation matrix. Let $\lambda_1 > \lambda_2 > \cdots > \lambda_k > 0$ be the eigenvalues of $R$ and let $z_1, z_2, \ldots, z_k$ be the corresponding eigenvectors, normalized so that $z_j^T z_j = 1$.
Calculating Principal Components
Define $W_1$ to be the first principal component. It will be a linear combination of the $X$s which has the largest possible variance:
$$W_1 = a_1^T X = \sum_{i=1}^{k} a_{1i} X_i, \qquad \text{where } a_1^T a_1 = 1.$$
$$\mathrm{Var}(W_1) = a_1^T \Sigma\, a_1 = \sigma^2 a_1^T R a_1.$$
Constrained maximization leads to the Lagrangian
$$L = a_1^T R a_1 + \lambda\,(1 - a_1^T a_1).$$
Calculating Principal Components
Solution: $a_1$ is a unit vector satisfying $R a_1 = \lambda a_1$, i.e. $a_1$ must be an eigenvector of $R$. Remember that we want to maximize
$$a_1^T R a_1 = \lambda\, a_1^T a_1 = \lambda,$$
where $\lambda$ is the eigenvalue which corresponds to the eigenvector $a_1$. Therefore, take $a_1 = z_1$. That is,
$$W_1 = z_1^T X = \sum_{i=1}^{k} z_{1i} X_i.$$
W1 has the largest variance among all linear combinations of the Xs.
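As a numerical illustration of this calculation, the SAS/IML sketch below extracts the eigenvalues and eigenvectors of a small made-up correlation matrix (the 3 x 3 matrix is hypothetical, not from the notes). CALL EIGEN returns the eigenvalues in descending order, so the first column of Z is $z_1$ and the first component has variance $\sigma^2 \lambda_1$:

PROC IML;
  /* hypothetical correlation matrix R for k = 3 covariates */
  R = {1.0 0.6 0.3,
       0.6 1.0 0.5,
       0.3 0.5 1.0};
  CALL EIGEN(lambda, Z, R);   /* eigenvalues (descending) in lambda, eigenvectors in the columns of Z */
  z1 = Z[, 1];                /* coefficients a1 = z1 of the first principal component W1 = z1` X */
  varW1 = lambda[1];          /* Var(W1) = sigma^2 * lambda1 (sigma^2 = 1 here) */
  PRINT lambda, z1, varW1;
QUIT;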
Calculating the Second Principal Component
Let $W_2$ be a second linear combination of the $X$s which has the largest possible variance,
$$W_2 = a_2^T X = \sum_{i=1}^{k} a_{2i} X_i,$$
but with $\mathrm{Corr}(W_1, W_2) = 0$, i.e. $a_2^T z_1 = 0$.
Now the constrained maximization leads to the Lagrangian
$$L = a_2^T R a_2 + \gamma_1\,(1 - a_2^T a_2) + \gamma_2\, a_2^T z_1.$$
Calculating the Second Principal Component
Solution:
$$(R - \gamma_1 I)\, a_2 + \tfrac{1}{2}\gamma_2 z_1 = 0, \qquad \text{with } a_2^T a_2 = 1,\; R z_1 = \lambda_1 z_1,\; a_2^T z_1 = 0.$$
Multiply through by $z_1^T$ to find that $\gamma_2 = 0$. Therefore $R a_2 = \gamma_1 a_2$, so $a_2$ is an eigenvector of $R$ (which cannot be $z_1$ because of the orthogonality condition). Taking $a_2 = z_2$ makes $\mathrm{Var}(W_2) = \sigma^2 \lambda_2$, which is as large as possible under the given constraints. That is,
$$W_2 = z_2^T X = \sum_{i=1}^{k} z_{2i} X_i.$$
W2 has the largest variance among all linear combinations of the Xs which are orthogonal to W1.
Calculating the Remaining Principal Components
The third, fourth, fifth, ... principal components follow from the same reasoning. $W_j$ is the linear combination of the $X$s which has the largest variance, subject to the constraint that $W_j$ is uncorrelated with $W_1, W_2, \ldots, W_{j-1}$. The constrained maximization problem is solved by setting
$$W_j = z_j^T X = \sum_{i=1}^{k} z_{ji} X_i.$$
The variance of $W_j$ is $\sigma^2 \lambda_j$.
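Continuing the hypothetical 3 x 3 example from the sketch above, the following verifies numerically that the eigenvectors diagonalize $R$: the covariance matrix of the component scores $W = Z^T X$ is $\sigma^2\,\mathrm{diag}(\lambda_1, \ldots, \lambda_k)$, so each $W_j$ has variance $\sigma^2 \lambda_j$ and distinct components are uncorrelated:

PROC IML;
  /* the same hypothetical correlation matrix as before */
  R = {1.0 0.6 0.3,
       0.6 1.0 0.5,
       0.3 0.5 1.0};
  CALL EIGEN(lambda, Z, R);
  covW = Z` * R * Z;   /* diagonal entries are the lambda_j; off-diagonal entries are zero up to rounding */
  PRINT covW, lambda;
QUIT;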
How many Principal Components should be used?
Remember that the objective is to use only the first few components. The usual technique is to look for where there is a sharp drop in the component variances. (Remember that a good regression design will have spread-out covariates, so the components with small variance, i.e. small eigenvalues, will be omitted.)
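A minimal sketch of this screening step, using made-up eigenvalues for k = 5 covariates (not values from any data set in these notes): each component accounts for the proportion $\lambda_j / k$ of the total variance, and the sharp drop after the second eigenvalue below would suggest retaining two components:

PROC IML;
  /* hypothetical eigenvalues of a 5 x 5 correlation matrix (they sum to k = 5) */
  lambda  = {2.9, 1.2, 0.5, 0.3, 0.1};
  prop    = lambda / sum(lambda);   /* proportion of total variance for each component */
  cumprop = cusum(prop);            /* cumulative proportion */
  PRINT lambda prop cumprop;
QUIT;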
Principal Components Regression
After identifying the principal components which account for most of the variance in $X_1, X_2, \ldots, X_k$ (often 2 to 4 of the components), these components can be used in regression.

e.g. Original data set:

x1  x2  x3  ...  xk   y
 2   3   1  ...   5   20
 4   3   3  ...   5   25
 .   .   .  ...   .    .
$$W_1 = \sum_{i=1}^{k} z_{1i} X_i, \qquad W_2 = \sum_{i=1}^{k} z_{2i} X_i, \qquad W_3 = \sum_{i=1}^{k} z_{3i} X_i$$
New data set:

x1  x2  x3  ...  xk   y    w1                      w2                      w3
 2   3   1  ...   5   20   2z11+3z12+...+5z1k      2z21+3z22+...+5z2k      ...
 4   3   3  ...   5   25   4z11+3z12+...+5z1k      4z21+3z22+...+5z2k      ...
 .   .   .  ...   .    .   ...                     ...                     ...
$$Y = \beta_0 + \sum_{i=1}^{3} \beta_i W_i + \epsilon$$
Advantage: $W_1$, $W_2$ and $W_3$ are orthogonal, so t-tests for coefficients are easy to interpret. No multicollinearity.
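One way to carry out this whole workflow in SAS is with PROC PRINCOMP, which writes component scores Prin1, Prin2, ... to an output data set that PROC REG can then use; the slides that follow use PROC FACTOR instead. This is only a rough sketch, assuming a hypothetical data set DAT with covariates X1-X5 and response Y:

PROC PRINCOMP DATA=DAT OUT=PCOUT;   /* adds the scores Prin1-Prin5 to PCOUT */
  VAR X1-X5;
RUN;

PROC REG DATA=PCOUT;
  TITLE 'Regression of Y on the first two principal components';
  MODEL Y = Prin1 Prin2;            /* orthogonal regressors, so no multicollinearity */
RUN;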
Using PROC FACTOR for PCA We will apply PCA to the data in swiss.txt which is a data set that can be found in the textbook Data Analysis and Regression: A Second Course in Statistics by F. Mosteller and J.W. Tukey (1977). These data were collected in about 1888 in the 47 French-speaking provinces of Switzerland. The variables are Fertility, Agriculture, Examination, Education, and Catholic as well as Infant.Mortality which we will treat as a response variable.
Using PROC FACTOR for PCA

DATA SWISS;
  INFILE 'swiss.txt' FIRSTOBS=2;
  INPUT FERT AGRI ARMYEXAM EDUC CATHOL INFMORT;
RUN;

PROC FACTOR DATA=SWISS PREPLOT PLOT ROTATE=VARIMAX NFACTORS=2 OUT=FACT SCREE;
  TITLE 'PCA for 1888 Swiss Data';
  VAR FERT AGRI ARMYEXAM EDUC CATHOL;
RUN;
Using PROC FACTOR for PCA
PREPLOT shows a factor plot before rotation; PLOT shows a factor plot after rotation. The components themselves are extracted by PROC FACTOR's default method (METHOD=PRINCIPAL), which is the calculation described above; ROTATE=VARIMAX then applies a varimax rotation to the retained components. NFACTORS controls the number of components that will be retained. SCREE gives a scree plot, which is useful for choosing the number of components: look for a sharp drop.
Principal Components Regression
Now regress $Y = \beta_0 + \beta_1 W_1 + \beta_2 W_2 + \epsilon$.

/* Since the data set FACT created by PROC FACTOR contains all the variables in DATA=SWISS plus the new variables Factor1 and Factor2, we can go straight to PROC REG. */
PROC REG DATA=FACT;
  TITLE 'Regression of INFMORT on Factor1 and Factor2';
  MODEL INFMORT = FACTOR1 FACTOR2;
RUN;