0% found this document useful (0 votes)

2 views

06K_means_clustering

The document outlines a practical guide for implementing K-Means clustering using Python, including steps for importing packages, reading data, calculating distances, and plotting results. It demonstrates how to cluster height and weight data into three groups and visualize the clusters. The final output includes a DataFrame showing the height, weight, and assigned cluster for each data point.

Uploaded by

Pratham Dhiman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

2 views

06K_means_clustering

Uploaded by

Pratham Dhiman

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 4

4/22/24, 10:06 PM K_means_clustering

Practical - 6 : K-Means for Clustering

Table of Contents

Importing Needed Packages

Reading the data
Calculating the minimum distance
Plotting the data
Applying the Model
Plotting the Result

Importing Needed Packages

In [ ]: import pandas as pd
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeClassifier
import matplotlib.pyplot as plt
from sklearn.tree import plot_tree

from sklearn.cluster import KMeans

import seaborn as sns

Reading the data

In [ ]: htWtData = pd.read_csv("/BOOK.csv")
df = pd.DataFrame(htWtData)
df.head()

Out[ ]: Height Weight

0 185 72

1 170 56

2 168 60

3 179 68

4 182 72

Calculating the Minimum Distance

In [ ]: ht = df["Height"].tolist()
print(ht)
wt = df["Weight"].tolist()
print(wt)

ed, mnD = [], 10 ** 9

for i in range(0, len(ht)) :

for j in range(i+1, len(wt)) :
ds = ((ht[i] - ht[j]) ** 2) + ((wt[i] - wt[j]) ** 2)
ed.append(ds ** (0.5))
d = ds ** (0.5)
mnD = min(mnD, d)

localhost:8888/nbconvert/html/Desktop/ML/K_means_clustering.ipynb?download=false 1/4
4/22/24, 10:06 PM K_means_clustering
if len(ed) >= 5 : break

print("\n")
for d in ed :
print(d)

print("\n minimum distance : ", mnD)

[185, 170, 168, 179, 182, 188, 180, 180, 183, 180, 180, 177]
[72, 56, 60, 68, 72, 77, 71, 70, 84, 88, 67, 76]

21.93171219946131
20.808652046684813
7.211102550927978
3.0
5.830951894845301
5.0990195135927845
5.385164807134504
12.165525060596439
16.76305461424021
7.0710678118654755
8.94427190999916

minimum distance : 3.0

Plotting the Data

In [ ]: plt.figure(figsize=(8, 5))
# plt.show()

plt.scatter(htWtData['Weight'], htWtData['Height'])
plt.xlabel("weight")
plt.ylabel("height")
plt.show()

Applying the K-Means Model

localhost:8888/nbconvert/html/Desktop/ML/K_means_clustering.ipynb?download=false 2/4
4/22/24, 10:06 PM K_means_clustering

In [ ]: kmeans = KMeans(n_clusters = 3)
kmeans.fit(htWtData)

pdVals = kmeans.predict(htWtData)
print(pdVals)

f = pd.DataFrame(htWtData)
f["cluster"] = pdVals
print(f)

color = ["red", "blue","green"]

for k in range(0,3) :
final = f[f["cluster"] == k]
plt.scatter(final["Height"], final["Weight"], c = color[k])

plt.show()

[0 1 1 0 0 2 0 0 2 2 0 0]
/usr/local/lib/python3.10/dist-packages/sklearn/cluster/_kmeans.py:870: FutureWarn
ing: The default value of `n_init` will change from 10 to 'auto' in 1.4. Set the v
alue of `n_init` explicitly to suppress the warning
warnings.warn(

Plotting the Result

In [ ]: f = pd.DataFrame(htWtData)
f["cluster"] = pdVals
print(f)

color = ["red", "blue","green"]

for k in range(0,3) :
final = f[f["cluster"] == k]
plt.scatter(final["Height"], final["Weight"], c = color[k])

plt.show()

Height Weight cluster

0 185 72 0
1 170 56 1
2 168 60 1
3 179 68 0
4 182 72 0
5 188 77 2
6 180 71 0
7 180 70 0
8 183 84 2
9 180 88 2
10 180 67 0
11 177 76 0

localhost:8888/nbconvert/html/Desktop/ML/K_means_clustering.ipynb?download=false 3/4
4/22/24, 10:06 PM K_means_clustering

localhost:8888/nbconvert/html/Desktop/ML/K_means_clustering.ipynb?download=false 4/4

Applications of Machine Learning and Data Analytics Models in Maritime Transportation
No ratings yet
Applications of Machine Learning and Data Analytics Models in Maritime Transportation
319 pages
JAVIER KMeans Clustering Jupyter Notebook
No ratings yet
JAVIER KMeans Clustering Jupyter Notebook
7 pages
02.1 K-Means Example
No ratings yet
02.1 K-Means Example
12 pages
Day59 K Means Clustering 1701989733
No ratings yet
Day59 K Means Clustering 1701989733
5 pages
K-Means in Python - Solution
No ratings yet
K-Means in Python - Solution
6 pages
K-Means Algorithm
No ratings yet
K-Means Algorithm
29 pages
21BEC505 Exp2
No ratings yet
21BEC505 Exp2
7 pages
CSC649 Lecture 3 Unsupervised ML - KMeansClustering
No ratings yet
CSC649 Lecture 3 Unsupervised ML - KMeansClustering
22 pages
01 K Means - Merged
No ratings yet
01 K Means - Merged
26 pages
Lab Report6 - B21CI014
No ratings yet
Lab Report6 - B21CI014
8 pages
Department Of: Computer Science & Engineering
No ratings yet
Department Of: Computer Science & Engineering
4 pages
Avinash Tiwari 9
No ratings yet
Avinash Tiwari 9
4 pages
Kmeansclustering Sales Dataset
No ratings yet
Kmeansclustering Sales Dataset
6 pages
SE_KMeansClustering
No ratings yet
SE_KMeansClustering
21 pages
Unit_4 (1)
No ratings yet
Unit_4 (1)
63 pages
IDS Unit-3 L2
No ratings yet
IDS Unit-3 L2
26 pages
EXPERIMENT 9
No ratings yet
EXPERIMENT 9
10 pages
AAM 7th prac
No ratings yet
AAM 7th prac
4 pages
clustering R codes
No ratings yet
clustering R codes
2 pages
K Means
100% (2)
K Means
329 pages
K Means Clustering
No ratings yet
K Means Clustering
11 pages
SUMERA - Kmeans Clustering - Jupyter Notebook
No ratings yet
SUMERA - Kmeans Clustering - Jupyter Notebook
7 pages
K Means Clustering
No ratings yet
K Means Clustering
5 pages
Machine Learning K Means - Unsupervised
No ratings yet
Machine Learning K Means - Unsupervised
5 pages
K.means Clustering
No ratings yet
K.means Clustering
8 pages
Data Mining
No ratings yet
Data Mining
18 pages
Cluster Analysis Thesis Matlab Code PDF
100% (3)
Cluster Analysis Thesis Matlab Code PDF
7 pages
STAT452 Project1
No ratings yet
STAT452 Project1
13 pages
10.Lab Activity
No ratings yet
10.Lab Activity
11 pages
3.1 K - Means
No ratings yet
3.1 K - Means
16 pages
Pa66 ML Exp6
No ratings yet
Pa66 ML Exp6
9 pages
MS6711 Data Mining Homework 1: 1.1 Implement K-Means Manually (8 PTS)
No ratings yet
MS6711 Data Mining Homework 1: 1.1 Implement K-Means Manually (8 PTS)
6 pages
Practical 03
No ratings yet
Practical 03
3 pages
K-MEANS CLUSTERING ppt kpu
No ratings yet
K-MEANS CLUSTERING ppt kpu
4 pages
06_K Means Clustering
No ratings yet
06_K Means Clustering
36 pages
Unsupervisd Learning Algorithm
No ratings yet
Unsupervisd Learning Algorithm
6 pages
Lecture 11 K Means Clustering
No ratings yet
Lecture 11 K Means Clustering
8 pages
DSCI 100 Clustering Concept Cheat Sheet
No ratings yet
DSCI 100 Clustering Concept Cheat Sheet
4 pages
Ex No: Date: K-Means Clustering Using Python: Scatter
No ratings yet
Ex No: Date: K-Means Clustering Using Python: Scatter
10 pages
Cluster Analysis: Talha Farooq Faizan Ali Muhammad Abdul Basit
No ratings yet
Cluster Analysis: Talha Farooq Faizan Ali Muhammad Abdul Basit
16 pages
K Means Clustering - Experiment 12
No ratings yet
K Means Clustering - Experiment 12
3 pages
ML DSBA Lab7
No ratings yet
ML DSBA Lab7
6 pages
DWM_EXP4
No ratings yet
DWM_EXP4
9 pages
Data Science Analysis Final Project
No ratings yet
Data Science Analysis Final Project
10 pages
FullMarks - Clustering StudentSolution 2
No ratings yet
FullMarks - Clustering StudentSolution 2
13 pages
Elbow Method for Optimal Cluster Number in K-Means
No ratings yet
Elbow Method for Optimal Cluster Number in K-Means
8 pages
INTRO TO ML ASS
No ratings yet
INTRO TO ML ASS
3 pages
DA_EXP_10_66
No ratings yet
DA_EXP_10_66
6 pages
DA_EXP_10 (1)
No ratings yet
DA_EXP_10 (1)
6 pages
Facebook Live Seller
No ratings yet
Facebook Live Seller
8 pages
Unit 4 Aam
No ratings yet
Unit 4 Aam
26 pages
PGM 7
No ratings yet
PGM 7
3 pages
DWDM Lab All
No ratings yet
DWDM Lab All
20 pages
Presentation 1
No ratings yet
Presentation 1
47 pages
Kman 07
No ratings yet
Kman 07
9 pages
S27
No ratings yet
S27
30 pages
AI Week 11
No ratings yet
AI Week 11
21 pages
DA_EXP_10
No ratings yet
DA_EXP_10
6 pages
K-Means Clustering
No ratings yet
K-Means Clustering
5 pages
4 Clustering With K-Means - Kaggle
No ratings yet
4 Clustering With K-Means - Kaggle
9 pages
Computer Engineering Laboratory Solution Primer
From Everand
Computer Engineering Laboratory Solution Primer
Karan Bhandari
No ratings yet
Vmls Python Companion
No ratings yet
Vmls Python Companion
192 pages
Intelligent Systems Tutorial
No ratings yet
Intelligent Systems Tutorial
6 pages
Peyer2016 (Voluntary Simplifiers)
No ratings yet
Peyer2016 (Voluntary Simplifiers)
7 pages
L Moments
No ratings yet
L Moments
39 pages
Bat Algorithm Literature Review and Appl PDF
No ratings yet
Bat Algorithm Literature Review and Appl PDF
10 pages
Large Scale Analysis of Formations in Soccer Paper PDF
No ratings yet
Large Scale Analysis of Formations in Soccer Paper PDF
8 pages
07 OUTLIER DETECTION
No ratings yet
07 OUTLIER DETECTION
54 pages
Describe The Data Processing Chain: Business Understanding
No ratings yet
Describe The Data Processing Chain: Business Understanding
4 pages
Artificial Intelligence in Landscape Architecture - A Literature R
No ratings yet
Artificial Intelligence in Landscape Architecture - A Literature R
56 pages
MFDS - Test 1 Problems
No ratings yet
MFDS - Test 1 Problems
9 pages
9-0-SP1 Getting Started With WebMethods and Terracotta
100% (1)
9-0-SP1 Getting Started With WebMethods and Terracotta
86 pages
Objective: For One Dimensional Data Set (7,10,20,28,35), Perform Hierarchical Clustering
No ratings yet
Objective: For One Dimensional Data Set (7,10,20,28,35), Perform Hierarchical Clustering
13 pages
L18 K Means
No ratings yet
L18 K Means
27 pages
Data Vizualisation (Types of Charts)
No ratings yet
Data Vizualisation (Types of Charts)
159 pages
A Fair Load Sharing Approach Based On Microgrid Clusters and Transactive Energy Concept
No ratings yet
A Fair Load Sharing Approach Based On Microgrid Clusters and Transactive Energy Concept
4 pages
K Means Clustering Solved Numerical - 5 Minutes Engineering
No ratings yet
K Means Clustering Solved Numerical - 5 Minutes Engineering
8 pages
Krishna Data Scientist +1 (713) - 478-5282
No ratings yet
Krishna Data Scientist +1 (713) - 478-5282
5 pages
Exam SRM Sample Questions
No ratings yet
Exam SRM Sample Questions
69 pages
DWDM Bits
100% (1)
DWDM Bits
11 pages
Advances in Green Energy Systems and Smart Grid
No ratings yet
Advances in Green Energy Systems and Smart Grid
340 pages
2022 CS244 End Sem Soln
No ratings yet
2022 CS244 End Sem Soln
6 pages
Ai ML Research Paper-219311275
No ratings yet
Ai ML Research Paper-219311275
6 pages
1 s2.0 S187705092030644X Main
No ratings yet
1 s2.0 S187705092030644X Main
11 pages
Integrated Petrophysical Rock Classi¿ Cation in The McElroy Field, West Texas, USA
No ratings yet
Integrated Petrophysical Rock Classi¿ Cation in The McElroy Field, West Texas, USA
18 pages
Chinese Comments Sentiment Classification Based On Word2vec and SVM
No ratings yet
Chinese Comments Sentiment Classification Based On Word2vec and SVM
7 pages
ERP data warehousing in organiz 1st Edition by Gerald Grant ISBN - Download the full ebook now to never miss any detail
100% (8)
ERP data warehousing in organiz 1st Edition by Gerald Grant ISBN - Download the full ebook now to never miss any detail
76 pages
FRCCS24 Jason V1
No ratings yet
FRCCS24 Jason V1
5 pages
Crop and Yield Prediction Model
No ratings yet
Crop and Yield Prediction Model
6 pages
An Integrative Review of Computational Methods For
No ratings yet
An Integrative Review of Computational Methods For
15 pages