0% found this document useful (0 votes)
27 views

Dimension Reduction Techniques: Dr. Gaurav Dixit

The document discusses dimension reduction techniques which are used to reduce the number of variables in datasets with many correlated variables. It covers domain knowledge, data exploration, data conversion, automated reduction techniques and references dimension reduction in data mining.

Uploaded by

Aniket Sujay
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
27 views

Dimension Reduction Techniques: Dr. Gaurav Dixit

The document discusses dimension reduction techniques which are used to reduce the number of variables in datasets with many correlated variables. It covers domain knowledge, data exploration, data conversion, automated reduction techniques and references dimension reduction in data mining.

Uploaded by

Aniket Sujay
Copyright
© © All Rights Reserved
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 8

DIMENSION REDUCTION TECHNIQUES

LECTURE 13

DR. GAURAV DIXIT


DEPARTMENT OF MANAGEMENT STUDIES

1
DIMENSION REDUCTION TECHNIQUES

• Large no. of variables


– Subsets of variables might be highly correlated
– Computational issues
– Costs of data preparation, exploration, and conditioning
– Dimensionality (Principle of Parsimony)
• Dimension Reduction is also called as factor selection or
feature extraction is some domains

2
DIMENSION REDUCTION TECHNIQUES

• Dimension Reduction Techniques


– Domain Knowledge
– Data Exploration Techniques
– Data Conversion Techniques
– Automated reduction Techniques
– Data Mining Techniques

3
DIMENSION REDUCTION TECHNIQUES

• Domain Knowledge
– Identifying key variables for the data mining task
– Removing redundant variables
– Identifying erroneous variables
– Measurement issues for variables

4
DIMENSION REDUCTION TECHNIQUES

• Data Exploration Techniques


– Descriptive statistics
• Summary statistics
• Pivot tables
• Correlation analysis
– Visualization Techniques

• Open RStudio

5
DIMENSION REDUCTION TECHNIQUES

• Data Conversion Techniques


– Combining categories
– Converting a categorical variable into a numerical variable

• RStudio

• Automated reduction Techniques


– Principal Component Analysis (PCA)

6
Key References

• Data Science and Big Data Analytics: Discovering, Analyzing,


Visualizing and Presenting Data by EMC Education Services
(2015)
• Data Mining for Business Intelligence: Concepts, Techniques,
and Applications in Microsoft Office Excel with XLMiner by
Shmueli, G., Patel, N. R., & Bruce, P. C. (2010)

7
Thanks…

You might also like