Clustering: Unsupervised Learning

This document covers the k-means clustering algorithm. It opens with an introduction to clustering and unsupervised learning, then explains k-means: the algorithm takes the number of clusters k and a training set as input, randomly initializes the cluster centroids, and alternates between assigning each example to its closest centroid and moving each centroid to the mean of its assigned examples until convergence. It also covers non-separated clusters, notes that k-means can converge to a local optimum depending on the random initialization, and closes with ways to choose the number of clusters k, such as the elbow method.

Uploaded by

sourabh

Clustering: Unsupervised learning introduction

Machine Learning
Supervised learning

Training set: $\{(x^{(1)}, y^{(1)}), (x^{(2)}, y^{(2)}), \dots, (x^{(m)}, y^{(m)})\}$
Andrew Ng
Unsupervised learning

Training set: $\{x^{(1)}, x^{(2)}, \dots, x^{(m)}\}$
Applications of clustering

- Market segmentation
- Social network analysis
- Organize computing clusters
- Astronomical data analysis

Image credit: NASA/JPL-Caltech/E. Churchwell (Univ. of Wisconsin, Madison)

Clustering: K-means algorithm
K-means algorithm assumption

Input:
- $K$ (number of clusters)
- Training set $\{x^{(1)}, x^{(2)}, \dots, x^{(m)}\}$

$x^{(i)} \in \mathbb{R}^n$ (drop $x_0 = 1$ convention)

K-means algorithm

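Randomly initialize $K$ cluster centroids $\mu_1, \mu_2, \dots, \mu_K \in \mathbb{R}^n$

Repeat {
    for $i$ = 1 to $m$
        $c^{(i)}$ := index (from 1 to $K$) of cluster centroid closest to $x^{(i)}$
    for $k$ = 1 to $K$
        $\mu_k$ := average (mean) of points assigned to cluster $k$
}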
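The loop above can be sketched in plain Python. This is a minimal illustration rather than the lecture's own code: the names `k_means` and `squared_distance` are hypothetical, cluster indices run from 0 to k-1 instead of 1 to K, and two details the slide leaves open are filled in by assumption (the loop stops once no assignment changes, and an empty cluster keeps its previous centroid).

```python
import random


def squared_distance(a, b):
    """Squared Euclidean distance between two points given as tuples."""
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def k_means(points, k, max_iters=100, seed=0):
    """Return (assignments, centroids) for a list of equal-length tuples."""
    rng = random.Random(seed)
    # Initialization: use k training examples (at different positions in the list).
    centroids = rng.sample(points, k)
    assignments = None
    for _ in range(max_iters):
        # Cluster assignment step: index of the closest centroid for each point.
        new_assignments = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_assignments == assignments:
            break  # assumption: stop when no assignment changes
        assignments = new_assignments
        # Move centroid step: mean of the points assigned to each cluster.
        for j in range(k):
            members = [p for p, c in zip(points, assignments) if c == j]
            if members:  # assumption: an empty cluster keeps its old centroid
                centroids[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return assignments, centroids


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    labels, centers = k_means(data, k=2)
    print(labels, centers)
```

On the six points in the usage example, the first three and the last three end up in separate clusters; which of the two gets index 0 depends on the random initialization.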
K-means for non-separated clusters

T-shirt sizing

[Figure: plot of Weight against Height]

Clustering: Optimization objective
K-means optimization objective
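$c^{(i)}$ = index of cluster (1, 2, …, $K$) to which example $x^{(i)}$ is currently assigned
$\mu_k$ = cluster centroid $k$ ($\mu_k \in \mathbb{R}^n$)
$\mu_{c^{(i)}}$ = cluster centroid of cluster to which example $x^{(i)}$ has been assigned
Optimization objective:

$$J(c^{(1)}, \dots, c^{(m)}, \mu_1, \dots, \mu_K) = \frac{1}{m} \sum_{i=1}^{m} \left\lVert x^{(i)} - \mu_{c^{(i)}} \right\rVert^2$$

$$\min_{c^{(1)}, \dots, c^{(m)},\; \mu_1, \dots, \mu_K} J(c^{(1)}, \dots, c^{(m)}, \mu_1, \dots, \mu_K)$$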
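As a rough illustration, the K-means cost (the distortion) can be computed as the average squared Euclidean distance between each example and the centroid of the cluster it is assigned to. The function name `distortion` and the 0-based cluster indices are assumptions made for this sketch.

```python
def distortion(points, assignments, centroids):
    """Average squared distance from each point to its assigned centroid."""
    total = 0.0
    for p, c in zip(points, assignments):
        # Squared Euclidean distance between point p and centroid number c.
        total += sum((pi - mi) ** 2 for pi, mi in zip(p, centroids[c]))
    return total / len(points)


if __name__ == "__main__":
    points = [(0.0, 0.0), (2.0, 0.0), (10.0, 0.0)]
    assignments = [0, 0, 1]  # 0-based cluster index per point
    centroids = [(1.0, 0.0), (10.0, 0.0)]
    print(distortion(points, assignments, centroids))  # (1 + 1 + 0) / 3
```

Both steps of K-means can only lower this value or leave it unchanged: the assignment step minimizes it over the assignments with the centroids fixed, and the centroid update minimizes it over the centroids with the assignments fixed.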

Clustering: Random initialization
Random initialization
Should have $K < m$

Randomly pick $K$ training examples.

Set $\mu_1, \dots, \mu_K$ equal to these $K$ examples.
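A small sketch of this initialization in plain Python, assuming the training set is a list of tuples. The function name `init_centroids` is hypothetical; `random.sample` draws without replacement, so the chosen examples come from different positions in the training set.

```python
import random


def init_centroids(points, k, rng=random):
    """Pick k training examples at random and use them as initial centroids."""
    if not k < len(points):
        raise ValueError("need k < number of training examples")
    # Sampling without replacement: no training example is picked twice.
    return rng.sample(points, k)


if __name__ == "__main__":
    data = [(0, 0), (0, 1), (1, 0), (10, 10), (10, 11), (11, 10)]
    print(init_centroids(data, 2, random.Random(0)))
```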

Local optima

Depending on the initialization of the cluster centroids, K-means can produce different results, some of which are only local optima of the cost function.

Random initialization
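For i = 1 to 100 {
    Randomly initialize K-means.
    Run K-means. Get $c^{(1)}, \dots, c^{(m)}, \mu_1, \dots, \mu_K$.
    Compute cost function (distortion) $J(c^{(1)}, \dots, c^{(m)}, \mu_1, \dots, \mu_K)$
}

Pick clustering that gave lowest cost $J(c^{(1)}, \dots, c^{(m)}, \mu_1, \dots, \mu_K)$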
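The restart procedure might look like the following in plain Python. It is a sketch under stated assumptions, not the lecture's implementation: `run_k_means`, `distortion`, and `best_of_restarts` are hypothetical names, cluster indices are 0-based, each run stops when assignments no longer change, and an empty cluster keeps its previous centroid.

```python
import random


def sq_dist(a, b):
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def run_k_means(points, k, rng, max_iters=100):
    """One K-means run from a random initialization drawn with rng."""
    centroids = rng.sample(points, k)
    assignments = None
    for _ in range(max_iters):
        new = [min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points]
        if new == assignments:
            break
        assignments = new
        for j in range(k):
            members = [p for p, c in zip(points, assignments) if c == j]
            if members:  # assumption: an empty cluster keeps its old centroid
                centroids[j] = tuple(sum(col) / len(members) for col in zip(*members))
    return assignments, centroids


def distortion(points, assignments, centroids):
    """Average squared distance from each point to its assigned centroid."""
    return sum(sq_dist(p, centroids[c]) for p, c in zip(points, assignments)) / len(points)


def best_of_restarts(points, k, n_restarts=100, seed=0):
    """Run K-means n_restarts times and keep the lowest-cost clustering."""
    rng = random.Random(seed)
    best = None
    for _ in range(n_restarts):
        assignments, centroids = run_k_means(points, k, rng)
        cost = distortion(points, assignments, centroids)
        if best is None or cost < best[0]:
            best = (cost, assignments, centroids)
    return best


if __name__ == "__main__":
    data = [(0,), (1,), (10,), (11,), (20,), (21,)]
    cost, labels, centers = best_of_restarts(data, k=3)
    print(cost, labels, centers)
```

A single run on the usage data can stall with two of the pairs merged into one cluster; keeping the lowest-cost result over many restarts makes that outcome much less likely to be the one returned.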

Clustering: Choosing the number of clusters
What is the right value of K?

Choosing the value of K
Elbow method:

[Figure: two plots of the cost function $J$ against $K$ (no. of clusters), for $K$ = 1 to 8]
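To draw such a curve, one computes the lowest cost found for each candidate number of clusters and looks for the point where further increases stop paying off. The sketch below does this in plain Python; `cost_for_k` and `elbow_curve` are hypothetical names, the 20 restarts per value are an arbitrary choice, and the K-means loop uses 0-based indices and leaves an empty cluster's centroid unchanged.

```python
import random


def sq_dist(a, b):
    return sum((ai - bi) ** 2 for ai, bi in zip(a, b))


def cost_for_k(points, k, rng, n_restarts=20, max_iters=100):
    """Lowest distortion found over several random restarts for a given k."""
    best = None
    for _ in range(n_restarts):
        centroids = rng.sample(points, k)
        assignments = None
        for _ in range(max_iters):
            new = [min(range(k), key=lambda j: sq_dist(p, centroids[j])) for p in points]
            if new == assignments:
                break
            assignments = new
            for j in range(k):
                members = [p for p, c in zip(points, assignments) if c == j]
                if members:  # assumption: an empty cluster keeps its old centroid
                    centroids[j] = tuple(sum(col) / len(members) for col in zip(*members))
        cost = sum(sq_dist(p, centroids[c]) for p, c in zip(points, assignments)) / len(points)
        if best is None or cost < best:
            best = cost
    return best


def elbow_curve(points, k_values, seed=0):
    """Return a list of (k, cost) pairs to plot or inspect."""
    rng = random.Random(seed)
    return [(k, cost_for_k(points, k, rng)) for k in k_values]


if __name__ == "__main__":
    data = [(0,), (1,), (10,), (11,), (20,), (21,)]
    for k, cost in elbow_curve(data, range(1, 6)):
        print(k, round(cost, 4))
```

On the usage data, which consists of three tight pairs, the cost falls sharply up to three clusters and only slightly afterwards, which is the kind of bend the elbow method looks for. On real data the curve is often smoother and the elbow ambiguous.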

Choosing the value of K
Sometimes, you’re running K-means to get clusters to use for some
later/downstream purpose. Evaluate K-means based on a metric for
how well it performs for that later purpose.

E.g. T-shirt sizing

[Figure: two plots of Weight against Height]