import numpy as np
import pandas as pd
# Step 1: Load the dataset
data = pd.read_csv('ANandi.csv')
# Step 2: Standardize the data (mean = 0, variance = 1)
# Exclude non-numeric columns if present
numeric_data = data.select_dtypes(include=[np.number])
mean = numeric_data.mean()
std = numeric_data.std()
standardized_data = (numeric_data - mean) / std
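# Quick sanity check (an addition, not part of the original pipeline): after z-scoring,
# every numeric column should have mean ~0 and standard deviation ~1. This assumes no
# column is constant; a zero std above would produce NaN/inf values here.
assert np.allclose(standardized_data.mean(), 0, atol=1e-8)
assert np.allclose(standardized_data.std(), 1)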
# Step 3: Compute the covariance matrix
cov_matrix = np.cov(standardized_data, rowvar=False)
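# Optional check (also an addition): for z-scored data the covariance matrix equals the
# correlation matrix of the original features, so the two results should agree up to
# floating-point error.
corr_matrix = np.corrcoef(numeric_data, rowvar=False)
print("Covariance of z-scores equals the correlation matrix:",
      np.allclose(cov_matrix, corr_matrix))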
# Step 4: Perform eigen decomposition
eigenvalues, eigenvectors = np.linalg.eigh(cov_matrix)
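# Spot check (illustrative only): np.linalg.eigh is appropriate here because the
# covariance matrix is symmetric; every eigenpair should satisfy C v = lambda v.
for i in range(len(eigenvalues)):
    assert np.allclose(cov_matrix @ eigenvectors[:, i],
                       eigenvalues[i] * eigenvectors[:, i])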
# Step 5: Sort eigenvalues and eigenvectors
sorted_indices = np.argsort(eigenvalues)[::-1]
eigenvalues = eigenvalues[sorted_indices]
eigenvectors = eigenvectors[:, sorted_indices]
# Select the first two principal components
top_eigenvectors = eigenvectors[:, :2]
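# How much variance do PC1 and PC2 retain? This ratio is a common diagnostic for whether
# two components are enough; the acceptable threshold depends on the dataset, so treat
# the printout as informational rather than a rule.
explained_variance_ratio = eigenvalues[:2] / eigenvalues.sum()
print(f"Variance explained by the first two components: {explained_variance_ratio.sum():.2%}")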
# Step 6: Transform the data
data_reduced = np.dot(standardized_data, top_eigenvectors)
# Step 7: Save the reduced data
data_reduced_df = pd.DataFrame(data_reduced, columns=['PC1', 'PC2'])
data_reduced_df.to_csv('data_reduced.csv', index=False)
print("PCA completed. The first two principal components are saved in 'data_reduced.csv'.")
# Observed output:
# PCA completed. The first two principal components are saved in 'data_reduced.csv'.
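# Optional cross-check, assuming scikit-learn is installed (it is not required anywhere
# else in this script) and that the top two eigenvalues are distinct. Principal components
# are only defined up to a sign flip, so the comparison matches columns up to sign.
from sklearn.decomposition import PCA

sk_reduced = PCA(n_components=2).fit_transform(standardized_data)
signs = np.sign((data_reduced * sk_reduced).sum(axis=0))
print("Matches scikit-learn's PCA (up to sign):",
      np.allclose(data_reduced, sk_reduced * signs))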
# Step 1: Load the reduced data
data = pd.read_csv('data_reduced.csv').to_numpy()
# Parameters
k = 3 # Number of clusters (adjust as needed)
max_iterations = 100 # Maximum number of iterations
tolerance = 1e-4 # Convergence threshold
# Step 2: Initialize the centers with k distinct data points chosen at random
np.random.seed(42)  # For reproducibility
centers = data[np.random.choice(data.shape[0], k, replace=False)]
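# Alternative seeding (a sketch, not used by the script above): k-means++-style
# initialization picks each subsequent center with probability proportional to the
# squared distance to the nearest center already chosen, which tends to give better
# starting points than purely random selection. The function name is illustrative.
def kmeans_plus_plus_init(points, n_clusters, rng):
    chosen = [points[rng.integers(points.shape[0])]]
    for _ in range(n_clusters - 1):
        # Squared distance of every point to its nearest already-chosen center
        d2 = np.min(np.linalg.norm(points[:, np.newaxis] - np.array(chosen), axis=2) ** 2, axis=1)
        chosen.append(points[rng.choice(points.shape[0], p=d2 / d2.sum())])
    return np.array(chosen)
# Example usage: centers = kmeans_plus_plus_init(data, k, np.random.default_rng(42))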
# Steps 3-5: K-Means algorithm (Lloyd's iterations)
for iteration in range(max_iterations):
    # Step 3: Assign each point to the closest center
    distances = np.linalg.norm(data[:, np.newaxis] - centers, axis=2)
    cluster_assignments = np.argmin(distances, axis=1)
    # Step 4: Update each center to the mean of its assigned points
    # (keep the previous center if a cluster receives no points, to avoid NaNs)
    new_centers = np.array([
        data[cluster_assignments == i].mean(axis=0)
        if np.any(cluster_assignments == i) else centers[i]
        for i in range(k)
    ])
    # Step 5: Check for convergence (small total movement of the centers)
    if np.linalg.norm(new_centers - centers) < tolerance:
        print(f"Converged after {iteration + 1} iterations.")
        break
    centers = new_centers
else:
    # The else clause of a for loop runs only if the loop finished without a break
    print("Reached the maximum number of iterations without converging.")
# Step 6: Save results
# Create a DataFrame with the cluster assignments
output_data = pd.DataFrame(data, columns=['PC1', 'PC2'])
output_data['Cluster'] = cluster_assignments
# Save cluster assignments and cluster centers
output_data.to_csv('cluster_assignments.csv', index=False)
centers_df = pd.DataFrame(centers, columns=['PC1', 'PC2'])
centers_df.to_csv('cluster_centers.csv', index=False)
print("Clustering completed.")
print("Cluster assignments saved to 'cluster_assignments.csv'.")
print("Cluster centers saved to 'cluster_centers.csv'.")
# Observed output:
# Converged after 9 iterations.
# Clustering completed.
# Cluster assignments saved to 'cluster_assignments.csv'.
# Cluster centers saved to 'cluster_centers.csv'.
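# A simple quality measure (added as an illustration): the within-cluster sum of
# squared distances, often called inertia. For a fixed k, lower is better; comparing
# inertia across several values of k is the usual "elbow" heuristic for choosing k.
inertia = sum(
    np.sum((data[cluster_assignments == i] - centers[i]) ** 2)
    for i in range(k)
)
print(f"Within-cluster sum of squares (inertia): {inertia:.4f}")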