MBA 2020-21 Factor Analysis
MBA 2020-21 Factor Analysis
Note:
Each variables is first standardized in FA, so that its mean becomes 0 and S.D. (or variance)
becomes 1.
Only that factor which has information greater than that of a single variable is retained. This
information is the variance captured by that factor (also called as Eigen Value of that factor).
Only that factor which has information greater than that of a single variable is retained. This
information is the variance captured by that factor (also called as Eigen Value of that factor).
Since variance of a standardized variable is 1. the aforementioned gives us the criteria for
identifying the number of factors that one should retain i.e. Eigen Value > 1.
Note: Since we retain only factors with EV >1, the Exploratory Factor Analysis is also
considered as a data reduction technique.
Vishal Mishra (IBS, Hyderabad)
* (Adapted from Nargundkar, R. (2003). Marketing research-Text & cases 2E. Tata McGraw-Hill Education.)
Vishal Mishra (IBS, Hyderabad)
v1 v2 v3 v4 v5
v1 1.00000000 -0.18310493 0.54848220 0.07357736 0.17067213
v2 -0.18310493 1.00000000 -0.54780886 -0.08859109 -0.32965368
v3 0.54848220 -0.54780886 1.00000000 -0.14356106 0.00000000
v4 0.07357736 -0.08859109 -0.14356106 1.00000000 0.89801144
v5 0.17067213 -0.32965368 0.00000000 0.89801144 1.00000000
v6 0.16659801 0.06083144 -0.18330397 0.94361921 0.85244790
v7 0.19627719 -0.20748068 -0.04854521 0.91486619 0.95505454
v8 0.30923952 -0.32150944 0.43863109 -0.27727914 -0.14291756
v9 0.28267571 -0.25901360 0.40585385 -0.02709517 0.08796296
v10 0.47514388 0.02411144 0.31968303 0.08719866 0.12325736
Vishal Mishra (IBS, Hyderabad)
v6 v7 v8 v9 v10
v1 0.16659801 0.19627719 0.30923952 0.28267571 0.47514388
v2 0.06083144 -0.20748068 -0.32150944 -0.25901360 0.02411144
v3 -0.18330397 -0.04854521 0.43863109 0.40585385 0.31968303
v4 0.94361921 0.91486619 -0.27727914 -0.02709517 0.08719866
v5 0.85244790 0.95505454 -0.14291756 0.08796296 0.12325736
v6 1.00000000 0.88593769 -0.32790761 -0.04784146 0.07125659
v7 0.88593769 1.00000000 -0.22223154 0.02879561 0.16708878
v8 -0.32790761 -0.22223154 1.00000000 0.82085983 0.06003911
v9 -0.04784146 0.02879561 0.82085983 1.00000000 -0.09955402
v10 0.07125659 0.16708878 0.06003911 -0.09955402 1.00000000
Vishal Mishra (IBS, Hyderabad)
v6 v7 v8 v9 v10
v1 0.16659801 0.19627719 0.30923952 0.28267571 0.47514388
v2 0.06083144 -0.20748068 -0.32150944 -0.25901360 0.02411144
v3 -0.18330397 -0.04854521 0.43863109 0.40585385 0.31968303
v4 0.94361921 0.91486619 -0.27727914 -0.02709517 0.08719866
v5 0.85244790 0.95505454 -0.14291756 0.08796296 0.12325736
v6 1.00000000 0.88593769 -0.32790761 -0.04784146 0.07125659
v7 0.88593769 1.00000000 -0.22223154 0.02879561 0.16708878
v8 -0.32790761 -0.22223154 1.00000000 0.82085983 0.06003911
v9 -0.04784146 0.02879561 0.82085983 1.00000000 -0.09955402
v10 0.07125659 0.16708878 0.06003911 -0.09955402 1.00000000
Vishal Mishra (IBS, Hyderabad)
How many of the total correlations should satisfy this condition (of being greater than a specific
value) ?
Vishal Mishra (IBS, Hyderabad)
Example: With 3 variables A,B & C, the number of bi-variate correlations are (AB,BC,CA) 3.
The general formula for identifying the number of correlations is same as selecting 2 variables
out of 3 i.e. nC2
It gives the proportion of common variance in the data set (calculated using correlations).
It lies between 0 and 1 and a higher value indicates more common variance i.e. adequate
correlation between variables.
As a rule of thumb, a value of >= 0.5 indicates suitability of the data for factor analysis.
Vishal Mishra (IBS, Hyderabad)
Bartlett’s test of sphericity: Tests the null hypothesis that correlation matrix is an identity
matrix
Chi-square Test
Bartlett’s test of sphericity: Tests the null hypothesis that correlation matrix is an identity
matrix
Chi-square Test
$chisq
[1] 164.0985
H0: Correlation matrix is an Identity matrix
H1: Correlation matrix is not an Identity matrix $p.value
[1] 1.868594e-15
$df
[1] 45
Vishal Mishra (IBS, Hyderabad)
Eigen Value: Variance captured by a given factor from the total variance in the
data set
Once variables are standardized their mean becomes 0 and standard deviation
becomes 1
Importance of components:
Eigen Values Comp.1 Comp.2 Comp.3 Comp.4
Standard deviation 1.9704863 1.6664376 1.1724960 0.97207952
Comp.1 = 3.8828 Proportion of Variance 0.3882816 0.2777014 0.1374747 0.09449386
Comp.2 = 2.7770 Cumulative Proportion 0.3882816 0.6659830 0.8034577 0.89795157
(We use orthogonal rotation, Varimax: This ensures that the extracted factors are not
related/correlated to each other… it is then said that they are orthogonal to each other)
Vishal Mishra (IBS, Hyderabad)
$r.scores
RC1 RC2 RC3
RC1 1.000000e+00 2.109424e-15 -1.026956e-15
RC2 2.123302e-15 1.000000e+00 9.436896e-16
RC3 -1.016548e-15 9.159340e-16 1.000000e+00