Spectral approach for tabular and graph data clustering

Dr. Debasis Mohapatra
Assistant Professor
Department of Computer Science and Engineering,
PMEC, Berhampur, Odisha, India, 761003

National Seminar on "Emerging Applications of Artificial Intelligence
and Data Science", Berhampur University, 26/09/2023

National Seminar(BU) 2023

Presentation Overview

▶ Introduction
▶ Tabular and Graph data
▶ Spectral Clustering for Tabular data
▶ Analysis on Spectral Clustering Algorithm
▶ Spectral Clustering for Tabular data (Illustration)
▶ Spectral Clustering for Graph data (Illustration)
▶ Measuring the strength of clustering
▶ Computation of modularity
▶ Implementation of Spectral Clustering for graph data
▶ Conclusions


Introduction

▶ What is Clustering?
Clustering is an unsupervised machine-learning technique that
works on unlabeled data. It groups the objects based on their
similarity. The objects in a group are similar to each other and
dissimilar to the objects present in other groups.
▶ Why is clustering important?
It helps in understanding patterns or structure in a data set
that are not visible otherwise.
▶ Some popular clustering algorithms
  ▶ Tabular data: K-means clustering, hierarchical clustering,
    DBSCAN, etc.
  ▶ Graph data: Girvan-Newman, Louvain, Leiden, etc.


Tabular and Graph data
Tabular Data:
            Atr1    Atr2    ...    Atrn
Record1     ...
Record2     ...
:
:
Recordm     ...

Graph Data:


Spectral Clustering of Tabular data

1. Given a dataset of n points, the first step is to construct a
similarity matrix, where the entries of the matrix represent the
pairwise similarity between the data points. Common similarity
measures include Gaussian (RBF) kernel similarity and cosine
similarity; a Euclidean distance must first be converted into a
similarity, for example through the Gaussian kernel.
2. Next, the similarity matrix is transformed into a graph Laplacian
matrix, which encodes the connectivity between the data points.
There are different types of Laplacian matrices that can be used,
such as the unnormalized Laplacian L = D − W (W is the similarity
matrix and D the diagonal degree matrix), the random-walk normalized
Laplacian, and the symmetric normalized Laplacian.
3. The eigenvalues and eigenvectors of the Laplacian matrix are
then computed. Only the k eigenvectors corresponding to the k
smallest eigenvalues are kept; k is usually set to the desired
number of clusters and is a hyperparameter that needs to be tuned.


Spectral Clustering of Tabular data(Contd...)

4. The selected eigenvectors are arranged as the columns of a matrix,
and the rows of this matrix are used as the new feature
representations of the data points.
5. Finally, a clustering algorithm such as k-means is applied to the
new feature representations to obtain the final clustering.
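
The five steps above can be sketched in a few lines of NumPy. This is a minimal illustration, not the speaker's implementation: the function name `spectral_clustering`, the Gaussian-kernel width `sigma`, the choice of the unnormalized Laplacian, the farthest-point initialization of k-means, and the toy data are all assumptions made for the example.

```python
import numpy as np

def spectral_clustering(X, k, sigma=1.0, n_iter=50):
    """Cluster the rows of X into k groups (illustrative sketch)."""
    # Step 1: Gaussian-kernel similarity matrix W with a zero diagonal.
    sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    W = np.exp(-sq_dists / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)

    # Step 2: unnormalized graph Laplacian L = D - W.
    L = np.diag(W.sum(axis=1)) - W

    # Step 3: eigh handles symmetric matrices and returns the
    # eigenvalues (and matching eigenvectors) in ascending order.
    _, eigvecs = np.linalg.eigh(L)

    # Step 4: the rows of the first k eigenvectors are the new features.
    U = eigvecs[:, :k]

    # Step 5: plain k-means on the rows of U.
    # Assumed initialization: start from row 0, then repeatedly add the
    # row farthest from the centers chosen so far (deterministic).
    centers = [U[0]]
    for _ in range(1, k):
        d = np.min([((U - c) ** 2).sum(axis=1) for c in centers], axis=0)
        centers.append(U[d.argmax()])
    centers = np.array(centers)

    for _ in range(n_iter):
        dists = ((U[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):  # leave an empty cluster's center as is
                centers[j] = U[labels == j].mean(axis=0)
    return labels

# Toy data (hypothetical): two well-separated groups of four points each.
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1],
              [10, 10], [10, 11], [11, 10], [11, 11]], dtype=float)
print(spectral_clustering(X, k=2))
```

On this toy data the first four rows should share one label and the last four the other. For real data, `sigma` and the Laplacian variant usually need tuning, and a library routine would normally replace the hand-written k-means.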


Analysis on Spectral Clustering Algorithm

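The time complexity of the spectral clustering algorithm is O(n^3),
where n is the number of data points (the nodes of the similarity
graph). The cost is dominated by the eigendecomposition of the
n × n Laplacian matrix.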


Spectral Clustering of Tabular data(Illustration)


Spectral Clustering of Graph data(Illustration)


Measuring the quality/strength of clustering
Modularity: Modularity measures the strength of the division of a
network (graph) into modules (also called groups, clusters, or
communities). Networks with high modularity have dense connections
between the nodes within modules but sparse connections between
nodes in different modules.
$$
Q = \sum_{c=1}^{v} \left[ \frac{L_c}{m} - \gamma \left( \frac{K_c}{2m} \right)^{2} \right] \qquad (1)
$$
where the sum iterates over all v communities/clusters c, m is the
number of edges, L_c is the number of intra-cluster links for cluster
c, K_c is the sum of degrees of the nodes in cluster c, and γ is the
resolution parameter. The resolution parameter sets an arbitrary
tradeoff between intra-group edges and inter-group edges. It is very
common to simply use γ = 1.
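
As a quick sanity check on the definition (a worked illustration, not part of the original slides), consider two extreme partitions with γ = 1. If every node is placed in a single community, then L_1 = m and K_1 = 2m, so Q is zero. If every node i, with degree k_i, forms its own community, there are no intra-cluster links and Q is negative for any graph with at least one edge.

```latex
% All nodes in one community (v = 1, L_1 = m, K_1 = 2m):
Q = \frac{m}{m} - \left( \frac{2m}{2m} \right)^{2} = 1 - 1 = 0

% Every node i in its own community (L_c = 0, K_c = k_i):
Q = -\sum_{i} \left( \frac{k_i}{2m} \right)^{2} < 0
```

A useful clustering should therefore score above zero.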


Computation of modularity (Example)

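$$
Q = \sum_{c=1}^{v} \left[ \frac{L_c}{m} - \gamma \left( \frac{K_c}{2m} \right)^{2} \right]
$$

v = 2, γ = 1, m = 7

$$
\begin{aligned}
Q &= \left[ \frac{3}{7} - \left( \frac{7}{14} \right)^{2} \right]
   + \left[ \frac{3}{7} - \left( \frac{7}{14} \right)^{2} \right] \\
  &= (0.42857 - 0.25) \times 2 \\
  &= 0.35714
\end{aligned}
$$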
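
The arithmetic above can be reproduced with a short, dependency-free Python sketch. The example graph is an assumption: two triangles joined by a single edge, which is one graph consistent with the numbers used here (m = 7, and L_c = 3, K_c = 7 for each of the two clusters). The function name `modularity` and its signature are hypothetical and cover only undirected, unweighted graphs given as edge lists.

```python
def modularity(edges, communities, gamma=1.0):
    """Q = sum over clusters c of [ L_c/m - gamma * (K_c / (2m))^2 ]."""
    m = len(edges)
    q = 0.0
    for nodes in communities:
        nodes = set(nodes)
        # L_c: edges with both endpoints inside the cluster.
        l_c = sum(1 for u, v in edges if u in nodes and v in nodes)
        # K_c: sum of the degrees of the cluster's nodes
        # (each edge endpoint inside the cluster adds 1).
        k_c = sum((u in nodes) + (v in nodes) for u, v in edges)
        q += l_c / m - gamma * (k_c / (2 * m)) ** 2
    return q

# Assumed graph: triangles {0, 1, 2} and {3, 4, 5} joined by edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
communities = [{0, 1, 2}, {3, 4, 5}]
print(round(modularity(edges, communities), 5))  # prints 0.35714
```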


Spectral Clustering of Graph data (Implementation)

Implementation of Spectral Clustering of Graph data
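
A minimal sketch of spectral clustering applied directly to a graph follows; it is not the speaker's own implementation. It assumes an undirected, connected graph given as an adjacency matrix and only k = 2 clusters. In place of the final k-means step it uses a simpler rule: split the nodes by the sign of the eigenvector belonging to the second-smallest Laplacian eigenvalue (the Fiedler vector). The function name `spectral_bipartition` and the example graph (two triangles joined by one edge) are hypothetical.

```python
import numpy as np

def spectral_bipartition(adjacency):
    """Split a connected, undirected graph into two clusters (sketch)."""
    A = np.asarray(adjacency, dtype=float)
    # For graph data the adjacency matrix plays the role of the
    # similarity matrix, so the Laplacian is L = D - A.
    L = np.diag(A.sum(axis=1)) - A
    # eigh returns eigenvalues in ascending order; column 1 is the
    # eigenvector of the second-smallest eigenvalue (Fiedler vector).
    _, eigvecs = np.linalg.eigh(L)
    fiedler = eigvecs[:, 1]
    # Nodes with a positive entry get label 1, the others label 0.
    return (fiedler > 0).astype(int)

# Assumed graph: triangles {0, 1, 2} and {3, 4, 5} joined by edge (2, 3).
edges = [(0, 1), (0, 2), (1, 2), (2, 3), (3, 4), (3, 5), (4, 5)]
A = np.zeros((6, 6))
for u, v in edges:
    A[u, v] = A[v, u] = 1.0

print(spectral_bipartition(A))
```

The two triangles should receive different labels; which triangle gets label 0 depends on the arbitrary sign of the eigenvector. For more than two clusters, the first k eigenvectors followed by k-means, as in the tabular case, is the usual route.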


Conclusion

▶ Because of their O(n^3) time complexity, standard spectral
clustering algorithms do not scale well to large datasets.

▶ The idea is applicable in various fields. In computer science, the
concept of graph clustering is used in social network analysis,
image processing, natural language processing, etc. In biological
science, it is helpful in finding out the closeness present
among the various biological entities like organisms, cells,
proteins, etc.
▶ Apart from these two fields, this concept is very much applicable
to economics, sociology, political science, etc.




Thank You
