100% found this document useful (1 vote)

123 views62 pages

02 01 KMeans

Uploaded by

sahandakpou

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

100% found this document useful (1 vote)

123 views62 pages

02 01 KMeans

Uploaded by

sahandakpou

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 62

Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Machine Learning (CE 40477)

Fall 2024

Ali Shariﬁ-Zarchi

CE Department
Sharif University of Technology

October 15, 2024

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 1 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

1 Unsupervised Learning Overview

2 K-Means

3 Challenges in K-Means

4 Other Clustering Algorithms

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 2 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

1 Unsupervised Learning Overview

2 K-Means

3 Challenges in K-Means

4 Other Clustering Algorithms

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 3 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Unsupervised Learning

• Unsupervised Learning involves analyzing unlabeled data to uncover hidden

patterns or structures within the data

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 4 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Some Common Tasks

• Clustering: Grouping data points into clusters based on similarity.

• Dimensionality Reduction: Reducing the number of features under consideration
and keeping (perhaps approximately) the most informative features.
• Anomaly Detection: Identifying data points that deviate signiﬁcantly from the
norm (e.g., fraud detection).
• Generative Modeling: Learning the distribution of data to generate new, similar
instances.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 5 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Clustering

• Clustering organizes data points into groups of similar objects.

• Data points in a cluster are more similar to each other than to those in other
clusters.
• The notion of similarity depends on the task at hand (e.g., purchase behavior in
market segmentation).

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 6 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Some Applications of Clustering

• Customer Segmentation (Marketing)

• Image Segmentation and Object Detection (Computer Vision)
• Anomaly Detection (Cybersecurity, Finance)
• Genomics and Bioinformatics
• Social Network Analysis and Community Detection

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 7 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Clustering in Action: Music Recommendation Systems

• Music recommendation systems cluster songs based on similarity.

Adopted from machinelearninggeek.com

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 8 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Clustering in Action: Music Recommendation Systems

• When you like a song, the system suggests others from the same cluster.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 9 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Clustering in Action: Gene Expression Clustering

• Clustering can decipher hidden patterns in gene expression data, which can help
in understanding disease mechanisms or genetic variations.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 10 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Two Beginning Questions

• How to create ’good’ clusters?

• How many clusters do we need?

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 11 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

1 Unsupervised Learning Overview

2 K-Means

3 Challenges in K-Means

4 Other Clustering Algorithms

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 12 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means overview

• The most widely used clustering algorithm.

• Partitions data into K distinct groups based on feature similarity
• It works by iteratively assigning data points to the nearest centroid (mean of the
group) and then recalculating the centroids based on the new group memberships
• The process repeats until the assignments no longer change

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 13 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means in action

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 14 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means in action

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 15 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means in action (cont.)

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 16 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means in action (cont.)

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 17 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means in action (cont.)

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 18 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means in action (cont.)

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 19 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Algorithm

Algorithm 1 K-means Clustering

1: Input: K (number of clusters), D = {x (1) , . . . , x (N) } (data points)

2: Initialize: Select K random points as centroids {µ1 , . . . , µK }
3: repeat
4: Assign each point x (i) to nearest centroid f (x (i) ) = arg minj ∥x (i) − µj ∥
5: For each 1 ≤ j ≤ K set Cj = {x(i) |f (x(i) ) = j}
∑
6: Update centroids µj = |C1j | x(i) ∈Cj x (i)
7: until Centroids do not change
8: Output: Final clusters {C1 , C2 , . . . , CK }

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 20 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Problem deﬁnition

• Formally: We have Xtrain = {x(1) , x(2) , . . . , x(N) } ⊆ Rd

• K is the number of clusters.
• We are learning:
1 A function or mapping f : Rd → {1, 2, . . . , K } that assigns a cluster to each data point.
2 A set of K prototypes µ = {µ1 , µ2 , . . . , µK } ⊆ Rd as the cluster representatives, called
centeroids.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 21 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Objective Function

• We want samples in the same cluster to be similar.

• In K-Means, this is expressed as:

∑
K ∑
J= ||x(i) − µj ||2
j=1 x(i) ∈Cj

• Choose f and µ = {µ1 , µ2 , . . . , µK } to minimize this.

• This problem is NP-hard. K-Means is a heuristic solution, which is NOT guaranteed
to ﬁnd optimal solution.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 22 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means Process Example

Adopted from mlbhanuyerra.github.io

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 23 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Convergence

• How do we know K-Means will converge in a ﬁnite number of steps ?

• First we show in each step J will decrease, as long as we have not converged.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 24 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Convergence (cont.)

• We initially assigne each sample to the nearest centroid.

f (x) := argminj ||x − µj ||2

.
• Keep each sample’s assignment ﬁxed until a closer centriod is found.
• Each time a sample is reassigned. the total distance between samples and their
centroids decreases.
• The number of possible sample-to-centroid assignments is ﬁnite.
• The algorithm terminates when no sample changes its assigned centroid.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 25 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Convergence (cont.)

• In Updating step, with f (x) ﬁxed, J is a quadratic function of µj (like SSE) and by
taking derivative we can minimize it as:

∂J ∑ ( (i) )
= 0 =⇒ 2 x − µj = 0
∂µj x(i) ∈C j

• This means we should update each µj as the mean of cluster Cj :

∑ (i)
x(i) ∈Cj x
µj =
|Cj |

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 26 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Convergence (cont.)

• For each cluster, the mean of its samples minimizes squared distances.
∑ ∑
• For Cj if µ′ was the old centroid we have: x(i) ∈Cj ||x(i) − µ′ ||2 ≥ x(i) ∈Cj ||x(i) − µj ||. So
j j
Jnew ≤ Jold .

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 27 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Convergence (cont.)

• J is non-negative, and there are a ﬁnite number of partitions so there is a minimum

for J and we can’t decrease J forever.
• Therefore we must converge at some point.
• The convergence properties of the K-means algorithm were studied by MacQueen
(1967).

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 28 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

K-Means Convergence (cont.)

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 29 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Strengths

• Simple: easy to understand and to implement.

• Efﬁcient: Time complexity: O(tkn), where
• n is the number of data points,
• k is the number of clusters, and
• t is the number of iterations.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 30 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

1 Unsupervised Learning Overview

2 K-Means

3 Challenges in K-Means

4 Other Clustering Algorithms

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 31 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Initialization

• K-Means always converges. What could go wrong ?

• K-Means algorithm is a heuristic
• It requires initial centroids, and the choice is important as it could affect the t in
O(tkn).

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 32 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Local Optimum

• The algorithm ﬁnds a local minimum but there is no guarantee to ﬁnd global
minimum.
• Its result is highly affected by the initialization.
• Some suggestions are:
• Multiple runs with random initial centroids, then select the "best" result.
• Initialization heuristics (K-Means++ , Furthest Traversal).
• Initializing with the suggested results of another method.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 33 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Local Optimum

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 34 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Local optimum (cont.)

Optimal clustering Possible clustering

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 35 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Deﬁnition of Mean

• We assume x(i) ∈ Rd , which is not always the case. K-Means requires a space where
sample mean is deﬁned.
• Categorical data.
• A suggested solution: K-Mode - the centroid is the most frequent category (the mode)
in each cluster.
• Closest centroid is found by the Hamming Distance.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 36 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

How many clusters?

Adopted from
slides of Dr. Soleymani, Modern Information Retrieval Course, Sharif University of technology.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 37 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

How many clusters? (cont.)

• Number of clusters is usually given in advance in the problem of clustering.

However; ﬁnding the right number of clusters is also a problem.
• First we need to know how we can evaluate a clustering.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 38 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Clustering Evaluation

• Evaluating clusters involves two key aspects:

• Intra-cluster cohesion (compactness): How similar the data points are within a
cluster.
• Often measured by the within-cluster sum of squares (WCSS):

∑
K ∑
WCSS = ||x − µi ||2
i=1 x∈Ci

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 39 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Clustering Evaluation

• Inter-cluster separation (isolation): How different the data points are between
clusters.
• Single-link (Minimum Distance):
• Measures the **minimum distance** between any two points from different clusters.

dsingle (Ci , Cj ) = min d(x, y)

x∈Ci ,y∈Cj

• Complete-link (Maximum Distance):

• Measures the maximum distance between any two points from different clusters.

dcomplete (Ci , Cj ) = max d(x, y)

x∈Ci ,y∈Cj

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 40 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Clustering Evaluation

• Inter-cluster separation (isolation): How different the data points are between
clusters.
• Centroid (Wards Method):
• Measures the distance between the centroids of two clusters.

dcentroid (Ci , Cj ) = d(µi , µj )

• Average-link:
• Measures the average distance between all pairs of points from different clusters.

1 ∑ ∑
daverage (Ci , Cj ) = d(x, y)
|Ci | · |Cj | x∈Ci y∈Cj

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 41 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Elbow Method for Optimal K

• Finds the optimal number of clusters K by minimizing the within-cluster sum of

squares (WCSS).
• Elbow Point:
• Plot WCSS versus K .
• The point where the rate of decrease sharply slows down (resembles an "elbow") is
considered the optimal K .

CE Department (Sharif University of Technology) Machine Learning (CE 40477) Adopted from medium.com
October 15, 2024 42 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Silhouette Method for Cluster Evaluation

• Silhouette Score for a single point i:

b(i) − a(i)
S(i) =
max(a(i), b(i))
• where:
• a(i) is the average distance between i and all other points in the same cluster.
• b(i) is the average distance between i and points in the nearest neighboring cluster.
• Interpretation:
• S(i) ∈ [−1, 1]
• S(i) ≈ 1 : Well-clustered.
• S(i) ≈ 0 : On or near the decision boundary between clusters.
• S(i) ≈ −1 : Misclustered.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 43 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

How many Clusters? (cont.)

• There is a trade-off between having better focus within each cluster or having too
many clusters.
• Don’t want one-element clusters.
• Optimization problem: penalize having too many clusters

K ∗ = arg mink J(k) + λk

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 44 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Outliers

• The algorithm is sensitive to outliers

• Outliers are data points that are very far away from other data points.
• Outliers could be errors in data recording or unique data points with signiﬁcantly
different values.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 45 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Data Distribution

• There is a problems with how k-means deﬁnes clusters.

• K-means assumes clusters are spherical and separated by equal variance, which
limits its effectiveness on non-spherical or complex-shaped clusters.

Figure 1: example when k-means wont work

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 46 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

1 Unsupervised Learning Overview

2 K-Means

3 Challenges in K-Means

4 Other Clustering Algorithms

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 47 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Hard vs Soft Clustering

• Hard Clustering(Partitional): Each data

point belongs to exactly one cluster
• More common and easier to use.
• Soft Clustering(Bayesian)

Figure adapted from Machine Learning and

Pattern Recognition, Bishop

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 48 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Hard vs Soft Clustering (cont.)

• Hard Clustering(Partitional)
• Soft Clustering(Bayesian): Each sample is
assigned to different clusters with
probabilities, rather than {0, 1}.
• data point belongs to each cluster with a
probability

Figure adapted from Machine Learning and

Pattern Recognition, Bishop

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 49 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Hierarchical Clustering
• Hierarchical algorithms ﬁnd successive clusters using previously established
clusters. Two Types:
• Agglomerative (bottom-up): Start with individual points and merge clusters.
• Divisive (top-down): Start with all points and split clusters.
Result: A hierarchy of clusters represented by a dendrogram.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 50 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Agglomerative Clustering Algorithm

• Start with each point as its own cluster.

• Merge the "closest" clusters.
• Repeat until one cluster remains or desired number is reached.
• Closest cluster can be determined using inter-cluster separation measures

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 51 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Dendrogram and Cutting

• A dendrogram shows the hierarchy of merges.

• Cut the dendrogram at a desired level to form clusters.

Adopted from r-graph-gallery.com

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 52 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Hierarchical Algorithms

• Advantages:
• No need to specify the number of clusters.
• Produces a dendrogram for visualization.
• Works with arbitrary-shaped clusters.
• Disadvantages
• High computational cost.
• Sensitive to noise and outliers.
• Greedy: cannot undo merges.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 53 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

DBSCAN

DBSCAN (Density-Based Spatial Clustering of Applications with Noise):

• Groups points in high-density regions.
• Labels points in low-density regions as noise.
• Does not require specifying the number of clusters K .
Parameters:
• ϵ (epsilon): Maximum distance for neighbors.
• minPts: Minimum points to form a dense region.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 54 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Core Concepts in DBSCAN

DBSCAN deﬁnes three types of points:

• Core Point: A point with at least minPts neighbors within distance ϵ.
• Border Point: A point within ϵ of a core point but with fewer than minPts
neighbors.
• Noise: Points that are neither core points nor border points.

Adopted from ai.plainenglish.io

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 55 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Core Concepts in DBSCAN (cont.)

Deﬁnitions:
• A point xi is a core point if:

|{xj : d(xi , xj ) ≤ ϵ}| ≥ minPts

• A point is a border point if it is within distance ϵ of a core point, but not itself a core
point.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 56 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

DBSCAN Algorithm Steps

Algorithm Steps:
1 For each unvisited point xi :
• Mark xi as visited.
• Find all points within distance ϵ (neighborhood).
2 If xi is a core point:
• Create a new cluster and expand it by recursively adding all reachable core and
border points.
3 If xi is not a core point:
• Label it as noise if it does not belong to any cluster.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 57 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Advantages of DBSCAN

• Can ﬁnd clusters of arbitrary shape (non-spherical).

• Does not require specifying the number of clusters K in advance.
• Robust to noise and outliers.
• Works well with large datasets.

Adopted

CE Department (Sharif University of Technology)

from mrinalyadav7.medium.com
Machine Learning (CE 40477) October 15, 2024 58 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Limitations of DBSCAN

• DBSCAN struggles with datasets of varying densities.

• Sensitive to the selection of parameters ϵ and minPts.
• Does not perform well with high-dimensional data.

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 59 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Clustering Algorithms

• Each algorithm is suited for different kinds of patterns and information in data.

Adopted from
CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 60 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

Contributions

• This slide has been prepared thanks to:

• Hooman Zolfaghari

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 61 / 62
Unsupervised Learning Overview K-Means Challenges in K-Means Other Clustering Algorithms

CE Department (Sharif University of Technology) Machine Learning (CE 40477) October 15, 2024 62 / 62

Clustering K-Means
100% (2)
Clustering K-Means
28 pages
Unit 4
No ratings yet
Unit 4
125 pages
UNIT III Part-1
No ratings yet
UNIT III Part-1
69 pages
Module - 05 Machine Learning (BCS602) Search Creators
No ratings yet
Module - 05 Machine Learning (BCS602) Search Creators
47 pages
Machine Learning-4
No ratings yet
Machine Learning-4
73 pages
ML Lecture06 Unsupervised Learning
No ratings yet
ML Lecture06 Unsupervised Learning
87 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
78 pages
Sajjad DS
100% (2)
Sajjad DS
97 pages
ML Unit 4
No ratings yet
ML Unit 4
110 pages
Week 4 - Lecture Slides - K-Means, Mixture Models, & EM
No ratings yet
Week 4 - Lecture Slides - K-Means, Mixture Models, & EM
65 pages
Lecture Unsupervised (17!04!2024)
No ratings yet
Lecture Unsupervised (17!04!2024)
61 pages
Kmeans
No ratings yet
Kmeans
92 pages
WINSEM2023-24 BEEE410L TH VL2023240502246 2024-03-22 Reference-Material-I
No ratings yet
WINSEM2023-24 BEEE410L TH VL2023240502246 2024-03-22 Reference-Material-I
95 pages
04 - KMeans Clustering
No ratings yet
04 - KMeans Clustering
56 pages
ML 5
No ratings yet
ML 5
61 pages
L7 Clustering
No ratings yet
L7 Clustering
58 pages
Unit 4
No ratings yet
Unit 4
53 pages
Clustering and Dimensionality Reduction
No ratings yet
Clustering and Dimensionality Reduction
58 pages
ML UNIT 4 Sir
No ratings yet
ML UNIT 4 Sir
42 pages
ML CH 4
No ratings yet
ML CH 4
51 pages
Week 11
No ratings yet
Week 11
49 pages
Unit 4 Clustering - K-Means and Hierarchical
No ratings yet
Unit 4 Clustering - K-Means and Hierarchical
40 pages
9.1. Machine Learning Unsupervised Learning-1
No ratings yet
9.1. Machine Learning Unsupervised Learning-1
57 pages
Intro Data Science: Cluster Analysis
No ratings yet
Intro Data Science: Cluster Analysis
60 pages
04-FSSR DS610 2024 2025T1 Kmeans
No ratings yet
04-FSSR DS610 2024 2025T1 Kmeans
57 pages
Week 9
No ratings yet
Week 9
66 pages
K Means
No ratings yet
K Means
40 pages
Unit 4
No ratings yet
Unit 4
46 pages
Kmeans&Variants
No ratings yet
Kmeans&Variants
29 pages
K Means
No ratings yet
K Means
25 pages
P-3 1 2-Kmeans
No ratings yet
P-3 1 2-Kmeans
43 pages
ML Unit 4 V1
No ratings yet
ML Unit 4 V1
30 pages
Week 14 and 15 Machine Learning Unsupervised 2
No ratings yet
Week 14 and 15 Machine Learning Unsupervised 2
25 pages
K Means
No ratings yet
K Means
24 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
27 pages
Chapter 5. Clustering Algorithms-Stud
No ratings yet
Chapter 5. Clustering Algorithms-Stud
44 pages
ML Application in Signal Processing and Communication Engineering
No ratings yet
ML Application in Signal Processing and Communication Engineering
27 pages
Unit4 ML
No ratings yet
Unit4 ML
20 pages
Lecture - 10 Unsupervised Learning & K-Means Clustering
No ratings yet
Lecture - 10 Unsupervised Learning & K-Means Clustering
31 pages
Som New
No ratings yet
Som New
21 pages
6 - Into To Data Science Techniques and Clustering
No ratings yet
6 - Into To Data Science Techniques and Clustering
16 pages
ML Clustering2
No ratings yet
ML Clustering2
11 pages
Machine Learning Chapter 3
No ratings yet
Machine Learning Chapter 3
12 pages
Week 10
No ratings yet
Week 10
41 pages
20 - 1 - ML - Unsup - 01 - Partition Based - Kmeans
No ratings yet
20 - 1 - ML - Unsup - 01 - Partition Based - Kmeans
20 pages
Introduction To Unsupervised Learning:: Clustering
No ratings yet
Introduction To Unsupervised Learning:: Clustering
21 pages
EAI13
No ratings yet
EAI13
19 pages
Electronics 09 01295 v2
No ratings yet
Electronics 09 01295 v2
12 pages
K Mean
No ratings yet
K Mean
7 pages
Machine Learning & Data Mining: Understanding
No ratings yet
Machine Learning & Data Mining: Understanding
7 pages
Chapter 9
No ratings yet
Chapter 9
8 pages
ML DSBA Lab7
No ratings yet
ML DSBA Lab7
6 pages
10.lab Activity
No ratings yet
10.lab Activity
11 pages
K Means Final
No ratings yet
K Means Final
10 pages
Unsupervised Learning
No ratings yet
Unsupervised Learning
12 pages
Clustering Algorithm: An Unsupervised Learning Approach
No ratings yet
Clustering Algorithm: An Unsupervised Learning Approach
23 pages
K - Means Clustering
No ratings yet
K - Means Clustering
13 pages
K Means
No ratings yet
K Means
9 pages
K Mean
No ratings yet
K Mean
12 pages
13: Clustering: Unsupervised Learning - Introduction
No ratings yet
13: Clustering: Unsupervised Learning - Introduction
4 pages
WEKA Practical Protocol
No ratings yet
WEKA Practical Protocol
40 pages
Krishna Edx Machine Learning With Python
No ratings yet
Krishna Edx Machine Learning With Python
18 pages
Data Mining - UNIT-IV
No ratings yet
Data Mining - UNIT-IV
24 pages
PML Lab Exp 12
No ratings yet
PML Lab Exp 12
2 pages
DBSCAN Algorithm
No ratings yet
DBSCAN Algorithm
15 pages
L07 - Advance Analytical Theory and Methods - Clustering
No ratings yet
L07 - Advance Analytical Theory and Methods - Clustering
22 pages
DMDW Lab8 Kirtan
No ratings yet
DMDW Lab8 Kirtan
49 pages
Automated Open-Stope Design Otimization Through Machine Learning Methods
No ratings yet
Automated Open-Stope Design Otimization Through Machine Learning Methods
21 pages
Machine Learning in Medicine Cookbook Premium Download
100% (17)
Machine Learning in Medicine Cookbook Premium Download
17 pages
DB Scan
No ratings yet
DB Scan
7 pages
Cheat Sheet-Building Unsupervised Learning Models
No ratings yet
Cheat Sheet-Building Unsupervised Learning Models
3 pages
AISAR Artificial Intelligence-Based Student Assess
No ratings yet
AISAR Artificial Intelligence-Based Student Assess
22 pages
A Cluster-Based Optimization Framework For Vehicle Routing Problem With Workload Balance
No ratings yet
A Cluster-Based Optimization Framework For Vehicle Routing Problem With Workload Balance
14 pages
Image Clustering: Prof. Dr. Rafiqul Islam Department of CSE
No ratings yet
Image Clustering: Prof. Dr. Rafiqul Islam Department of CSE
26 pages
A Comprehensive Survey of Clustering Algorithms
No ratings yet
A Comprehensive Survey of Clustering Algorithms
30 pages
Clustering
No ratings yet
Clustering
11 pages
DWDM Unit Vi
No ratings yet
DWDM Unit Vi
23 pages
DBSCAN Algorithm Java Implementation
No ratings yet
DBSCAN Algorithm Java Implementation
12 pages
Section A: Ques. 1
No ratings yet
Section A: Ques. 1
31 pages
Techniques of Cluster Analysis: A Seminar On
No ratings yet
Techniques of Cluster Analysis: A Seminar On
25 pages
Strategies and Algorithms For Clustering Large Datasets: A Review
No ratings yet
Strategies and Algorithms For Clustering Large Datasets: A Review
20 pages
Harvard CS109B Syllabus Draft 20211216
No ratings yet
Harvard CS109B Syllabus Draft 20211216
6 pages
Clusters - Density-Based
No ratings yet
Clusters - Density-Based
12 pages
February 2024-: Top Read Articles in Computer Science & Information Technology
No ratings yet
February 2024-: Top Read Articles in Computer Science & Information Technology
35 pages
Singh2013 - 2 Metode PDF
No ratings yet
Singh2013 - 2 Metode PDF
5 pages
Fusing Concurrent Orthogonal Wide-Aperture Sonar Images For Dense Underwater 3D Reconstruction
No ratings yet
Fusing Concurrent Orthogonal Wide-Aperture Sonar Images For Dense Underwater 3D Reconstruction
8 pages
Ajith-Quiz 1 - K-Means, DBSCAN and Hierarchical Clustering - Machine Learning 3 - Olympus LMS
No ratings yet
Ajith-Quiz 1 - K-Means, DBSCAN and Hierarchical Clustering - Machine Learning 3 - Olympus LMS
7 pages
Front Page Ramesh
No ratings yet
Front Page Ramesh
7 pages
A Novel Density-Based Clustering Algorithm For Predicting Cardiovascular Disease
No ratings yet
A Novel Density-Based Clustering Algorithm For Predicting Cardiovascular Disease
12 pages
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
From Everand
IGNOU MCA Data Science and Big Data Previous Years Unsolved Papers MCS 226
Manish Soni
No ratings yet