45B AIML Practical 06
Theory:
Dimensionality reduction techniques such as feature extraction and feature selection are crucial in many fields, including machine learning, data
analysis, and signal processing. Here's an overview of the theory behind the implementation of these techniques:
Feature Extraction: Feature extraction transforms the original high-dimensional data into a lower-dimensional space by creating new
features that capture the most relevant information. A common technique for feature extraction is Principal Component Analysis (PCA).
PCA extracts linear combinations of the original features, called principal components, that capture the maximum variance in the data.
The principal components are the eigenvectors of the covariance matrix of the data, ranked by their eigenvalues.
Implementation involves centering (standardizing) the data, computing the covariance matrix, performing an eigenvalue decomposition,
and projecting the data onto the top k eigenvectors.
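The steps above can be sketched end-to-end on synthetic data (the random array here is only an illustrative stand-in, not the Fish dataset used below):

```python
import numpy as np

rng = np.random.default_rng(0)
data = rng.normal(size=(100, 4))  # 100 samples, 4 features

# 1. Center the data (subtract the per-feature mean)
centred = data - data.mean(axis=0)

# 2. Covariance matrix of the centered data (rows of .T are features)
cov = np.cov(centred.T)

# 3. Eigenvalue decomposition (eigh suits symmetric matrices like covariances)
vals, vecs = np.linalg.eigh(cov)

# 4. Keep the top-k eigenvectors (eigh returns eigenvalues in ascending order)
k = 2
top_k = vecs[:, np.argsort(vals)[::-1][:k]]

# 5. Project the centered data onto the top-k components
projected = centred @ top_k
print(projected.shape)  # (100, 2)
```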
import numpy as np
import pandas as pd
import pprint
import matplotlib.pyplot as plt
from sklearn.preprocessing import StandardScaler  # used to standardize the features below
from sklearn.decomposition import PCA             # used for the scikit-learn PCA at the end
%matplotlib inline
%precision 3
np.set_printoptions(precision=3)
ahmed_df = pd.read_csv("/content/Fish.csv")
#feature_columns = ['priceUSD', 'transactions', 'size', 'sentbyaddress'] # Specify your feature columns
ahmed_df.head(20)
https://colab.research.google.com/drive/1gJ2K4xsmVl5LyYrPZmxSPTwDKBVvuABn#scrollTo=Htv5XQQTbrGf&printMode=true 1/4
4/2/24, 10:06 PM 45_AIML_Practical_06.ipynb - Colaboratory
feature_columns = ['Weight', 'Length1', 'Length2', 'Length3'] # Specify your feature columns
ahmed_X = ahmed_df[feature_columns]
ahmed_df.shape
(159, 8)
# Standardize the features to zero mean and unit variance before computing the covariance matrix
from sklearn.preprocessing import StandardScaler
X_std = StandardScaler().fit_transform(ahmed_X)
ahmed_X_covariance_matrix = np.cov(X_std.T)
ahmed_X_covariance_matrix
# Eigendecomposition of the covariance matrix
eig_vals, eig_vecs = np.linalg.eig(ahmed_X_covariance_matrix)
print("Eigenvectors \n", eig_vecs)
print("Eigenvalues \n", eig_vals)
Eigenvectors
[[-0.485 -0.873 -0.047 -0.005]
[-0.505 0.309 -0.482 -0.646]
[-0.505 0.292 -0.296 0.756]
[-0.505 0.237 0.823 -0.106]]
Eigenvalues
[3.897e+00 1.188e-01 8.993e-03 3.174e-04]
tot = sum(eig_vals)
var_exp = [(i / tot)*100 for i in sorted(eig_vals, reverse=True)]
cum_var_exp = np.cumsum(var_exp)
print("Variance captured by each component is \n", var_exp)
print(40 * '-')
print("Cumulative variance captured as we travel each component \n", cum_var_exp)
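As a sanity check, percentages computed this way must sum to 100. Plugging in the eigenvalues printed above (rounded to the precision shown, so treat the figures as illustrative):

```python
import numpy as np

# Eigenvalues as printed above (rounded to the precision shown)
eig_vals = np.array([3.897e+00, 1.188e-01, 8.993e-03, 3.174e-04])

tot = eig_vals.sum()
var_exp = [(i / tot) * 100 for i in sorted(eig_vals, reverse=True)]
cum_var_exp = np.cumsum(var_exp)

# The first component alone captures roughly 96.8% of the variance,
# and the cumulative total necessarily reaches 100%.
print(cum_var_exp)
```

This is why keeping only the first two components, as the notebook does next, loses very little information.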
# Pair each eigenvalue with its eigenvector and sort by eigenvalue, descending
ahmed_eig_pairs = [(np.abs(eig_vals[i]), eig_vecs[:, i]) for i in range(len(eig_vals))]
ahmed_eig_pairs.sort(key=lambda x: x[0], reverse=True)
print("All Eigen Values along with Eigen Vectors")
pprint.pprint(ahmed_eig_pairs)
print(40 * '-')
matrix_w = np.hstack((ahmed_eig_pairs[0][1].reshape(4,1),ahmed_eig_pairs[1][1].reshape(4,1)))
print ('Matrix W:\n', matrix_w)
ahmed_Y = X_std.dot(matrix_w)
print (ahmed_Y[0:5])
[[ 0.563 0.18 ]
[ 0.362 0.137]
[ 0.294 0.015]
[-0.081 0.151]
[-0.204 0.003]]
# Perform PCA with scikit-learn on the same standardized data for comparison
from sklearn.decomposition import PCA
pca = PCA(n_components=2)
X_pca = pca.fit_transform(X_std)
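On standardized data, scikit-learn's PCA and the manual eigendecomposition route should agree on the explained-variance ratios. A minimal check on synthetic data (the random matrix is an illustrative stand-in, not the Fish features):

```python
import numpy as np
from sklearn.decomposition import PCA
from sklearn.preprocessing import StandardScaler

rng = np.random.default_rng(1)
X = rng.normal(size=(50, 4))
X_std = StandardScaler().fit_transform(X)

# Manual route: eigenvalues of the covariance matrix, sorted descending
vals = np.linalg.eigh(np.cov(X_std.T))[0]
manual_ratio = np.sort(vals)[::-1] / vals.sum()

# scikit-learn route: keep all 4 components so the ratios are comparable
pca = PCA(n_components=4).fit(X_std)
print(np.allclose(manual_ratio, pca.explained_variance_ratio_))  # True
```

The agreement holds because scikit-learn's `explained_variance_` is computed as squared singular values divided by n − 1, which equals the eigenvalues of the sample covariance matrix.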