Social Network Analysis Unit-3

Uploaded by

Guribilli Varaprasad

SOCIAL NETWORK ANALYSIS

UNIT – III
CONTENTS
• Introduction
• Communities in Context
• Core Methods
• Quality Functions
• The Kernighan-Lin(KL) algorithm
• Agglomerative/Divisive
Algorithms
• Spectral Algorithms
• Multi-level Graph Partitioning
• Markov Clustering
Example networks (figure captions):
• Protein-protein interaction network based on confidence score (0.7) for each chronic stress related lifestyle disease
• Clinical Trials Dataset – Efficacy and Toxicity
• Network of genes having z score > 1 involved in the top five diseases
• Food Web of the Great Barrier Reef
• Multiple language editions of Wikipedia – Linguistic Network
• The social network of Zachary's karate club. Red dots denote the supporters of the instructor and blue squares denote the supporters of the president
Introduction
• The Objective
• Uncover and understand important
network/sub-network structures at
multiple topological and temporal scales.
• Extract community structures and
leverage them to predict the emergent,
critical and causal nature of dynamic
networks.
• The complexity
• Topological properties are uncertain and
existing techniques are not suitable.
• Requirements of directed and dynamic
networks are different.
• Scalability
Definition
• Social networks exhibit a strongly modular nature, or
community structure.
• A community is a collection of people sharing similar
interests or having the same characteristics.
• A network community is a set of nodes which have dense
connections within the group, and sparse connections
outside the group.
• A clique is in some sense a stronger version of a
community. A set of nodes forms a clique (a complete
subgraph) if all possible connections between nodes exist.
• Community detection aims at grouping nodes in accordance
with the relationships among them to form strongly linked
subgraphs from the entire graph.
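The definitions above can be checked mechanically on a small graph. The sketch below is only an illustration: the toy graph (two triangles joined by one edge) and the helper names `is_clique` and `internal_external_edges` are assumptions, not part of the slides.

```python
from itertools import combinations

def is_clique(adj, nodes):
    """True if every pair of distinct nodes in `nodes` is connected."""
    return all(v in adj[u] for u, v in combinations(nodes, 2))

def internal_external_edges(adj, nodes):
    """Count edges inside the group and edges leaving the group."""
    group = set(nodes)
    # Each internal edge is seen from both endpoints, hence the // 2.
    internal = sum(1 for u in group for v in adj[u] if v in group) // 2
    external = sum(1 for u in group for v in adj[u] if v not in group)
    return internal, external

# Hypothetical toy graph: two triangles joined by the single edge 2-3.
adj = {
    0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
    3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4},
}

print(is_clique(adj, [0, 1, 2]))                # a triangle is a clique
print(internal_external_edges(adj, [0, 1, 2]))  # dense inside, sparse outside
```

Here {0, 1, 2} is both a clique and a community: three internal edges against a single edge leaving the group.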
Communities in Context
• Social groups could be revealed by suitably
rearranging the rows and the columns of
matrices describing social ties, until they take an
approximate block-diagonal form.
• Identify bridging nodes and use them to separate
out community structure.
• Community discovery
• can facilitate understanding of a social system.
• allows the interactions within a
network to be summarized concisely.
• lends itself to actionable pattern discovery.
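The block-diagonal idea can be illustrated with a short sketch. It assumes the community labels are already known (in practice they are what we are trying to find), and the graph, the labels, and the function name `reorder_adjacency` are hypothetical.

```python
def reorder_adjacency(matrix, labels):
    """Permute rows and columns so nodes with the same label are adjacent."""
    order = sorted(range(len(labels)), key=lambda i: labels[i])
    reordered = [[matrix[i][j] for j in order] for i in order]
    return reordered, order

# Hypothetical graph: even nodes form one triangle, odd nodes another,
# and the edge 4-1 bridges the two groups.
edges = [(0, 2), (0, 4), (2, 4), (1, 3), (1, 5), (3, 5), (4, 1)]
n = 6
matrix = [[0] * n for _ in range(n)]
for u, v in edges:
    matrix[u][v] = matrix[v][u] = 1

labels = [0, 1, 0, 1, 0, 1]  # assumed community of each node
reordered, order = reorder_adjacency(matrix, labels)

for row in reordered:
    print(row)
```

After reordering, the two 3x3 diagonal blocks hold the intra-community ties, and the off-diagonal blocks contain only the single bridging tie.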

Real World Communities
• Zachary’s Karate club
• Loans among financial institutions
• Social Behaviour of animals
• Proxy caches
• Link farms
• MANETs (mobile ad hoc networks)
• Personalized recommendation engines
• Telecommunication Networks

Quality Metrics
• Quality function quantifies the goodness of a given
division of the network into communities.
• Normalized cutThe sum of weights of the edges
that connect S to the rest of the graph, normalized
by the total edge weight of S and that of the rest of
the graphS.

• Conductance

• Modularity it is independent of the number of clusters

that the graph is divided into.
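The formulas for these metrics did not survive in the slide text, so the sketch below uses the standard textbook definitions as an assumption: $\mathrm{Ncut}(S) = \frac{\mathrm{cut}(S,\bar{S})}{\mathrm{vol}(S)} + \frac{\mathrm{cut}(S,\bar{S})}{\mathrm{vol}(\bar{S})}$, conductance $= \frac{\mathrm{cut}(S,\bar{S})}{\min(\mathrm{vol}(S),\,\mathrm{vol}(\bar{S}))}$, and modularity $Q = \sum_c \left[\frac{e_c}{m} - \left(\frac{d_c}{2m}\right)^2\right]$, where vol is the sum of degrees in a group, $e_c$ the edge weight inside community c, $d_c$ its total degree, and m the total edge weight. The toy graph and function names are hypothetical.

```python
def cut_weight(adj, S):
    """Total weight of edges with exactly one endpoint in S."""
    S = set(S)
    return sum(w for u in S for v, w in adj[u].items() if v not in S)

def volume(adj, S):
    """Sum of weighted degrees of the nodes in S."""
    return sum(sum(adj[u].values()) for u in S)

def normalized_cut(adj, S):
    S = set(S)
    rest = set(adj) - S
    c = cut_weight(adj, S)
    return c / volume(adj, S) + c / volume(adj, rest)

def conductance(adj, S):
    S = set(S)
    rest = set(adj) - S
    return cut_weight(adj, S) / min(volume(adj, S), volume(adj, rest))

def modularity(adj, communities):
    two_m = volume(adj, adj)  # sum of all degrees equals 2m
    q = 0.0
    for C in communities:
        C = set(C)
        # Each internal edge is counted from both endpoints (2 * e_c).
        internal = sum(w for u in C for v, w in adj[u].items() if v in C)
        q += internal / two_m - (volume(adj, C) / two_m) ** 2
    return q

# Hypothetical toy graph: two triangles joined by the edge 2-3, unit weights.
edges = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
adj = {v: {} for v in range(6)}
for u, v in edges:
    adj[u][v] = adj[v][u] = 1.0

S = {0, 1, 2}
print(normalized_cut(adj, S), conductance(adj, S))
print(modularity(adj, [{0, 1, 2}, {3, 4, 5}]))
```

Lower normalized cut and conductance indicate a better-separated group, while higher modularity indicates a better division.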

Core Methods
• Spectral methodsmeant for bi-partitioning the
network, can also be used to recursively subdivide
the network into as many communities as desired.
• Kernighan-Lin Algorithm
• Flow based post processing

• Clustering methodsallow the user to indirectly

control the granularity of the output communities
• Markov clustering
• Clustering via shingling

Kernighan-Lin(KL) Algorithm
• A graph partitioning algorithm that optimizes the KL
objective function, i.e., minimizes the edge cut while keeping
the cluster sizes balanced.
• At each iteration, the algorithm searches for a subset of
vertices from each part of the graph such that swapping
them will lead to a reduction in the edge cut.
• The gain g_v of a vertex v is the reduction in edge cut if v
is moved from its current partition to the other partition.
• Repeatedly select from the larger partition the vertex with
the largest gain and move it to the other partition.
• A vertex is not considered for moving again if it has already
been moved in the current iteration.
• After a vertex has been moved, the gains for its neighbouring
vertices will be updated in order to reflect the new
assignment of vertices to partitions.
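The move-and-lock procedure described above can be sketched as a single refinement pass. This is a minimal illustration under several assumptions: the graph is unweighted, the starting partition is balanced, gains are recomputed from scratch rather than updated incrementally, and the pass returns the best balanced state it visited (a common convention that the slides do not spell out). The names `edge_cut`, `gain` and `kl_pass` are hypothetical.

```python
def edge_cut(adj, side):
    """Number of edges whose endpoints lie in different partitions."""
    return sum(1 for u in adj for v in adj[u] if u < v and side[u] != side[v])

def gain(adj, side, v):
    """Reduction in edge cut if v moves to the other partition."""
    external = sum(1 for u in adj[v] if side[u] != side[v])
    internal = len(adj[v]) - external
    return external - internal

def kl_pass(adj, side):
    """One pass: move unlocked vertices one at a time, keep the best state."""
    side = dict(side)
    locked = set()
    best_side, best_cut = dict(side), edge_cut(adj, side)
    for _ in range(len(adj)):
        sizes = [0, 0]
        for s in side.values():
            sizes[s] += 1
        larger = 0 if sizes[0] >= sizes[1] else 1
        # Only unlocked vertices in the larger partition may move.
        candidates = [v for v in adj if v not in locked and side[v] == larger]
        if not candidates:
            break
        v = max(candidates, key=lambda x: gain(adj, side, x))
        side[v] = 1 - side[v]
        locked.add(v)  # a moved vertex is not considered again in this pass
        sizes[larger] -= 1
        sizes[1 - larger] += 1
        cut = edge_cut(adj, side)
        if abs(sizes[0] - sizes[1]) <= 1 and cut < best_cut:
            best_side, best_cut = dict(side), cut
    return best_side, best_cut

# Hypothetical toy graph: two triangles joined by the edge 2-3.
adj = {
    0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
    3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4},
}
start = {0: 0, 1: 0, 3: 0, 2: 1, 4: 1, 5: 1}  # deliberately poor split
result, cut = kl_pass(adj, start)
print(result, cut)
```

Starting from a split that cuts five edges, the pass first moves vertex 3 and then vertex 2, which recovers the two triangles with a single cut edge.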
Agglomerative/Divisive Algorithms
• Agglomerative:
• Begin with each node in the social network in
its own community.
• At each step merge communities that are
deemed to be sufficiently similar
• Continue until either the desired number of
communities is obtained or the remaining
communities are found to be too dissimilar to
merge any further.
• Divisive
• Begin with the entire network as one
community
• At each step, choose a certain community and
split it into two parts.
Agglomerative/Divisive Algorithms
• Both kinds of hierarchical clustering algorithms often
output a dendrogram, which is a binary tree where the
leaves are nodes of the network, and each internal node
is a community.
• In divisive algorithms, a parent-child relationship
indicates that the community represented by the parent
node was divided to obtain the communities
represented by the child nodes.
• In agglomerative algorithms, a parent-child relationship
in the dendrogram indicates that the communities
represented by the child nodes were agglomerated (or
merged) to obtain the community represented by the
parent node.
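An agglomerative run and the dendrogram it produces can be sketched as follows. The slides do not say how similarity between communities is measured, so this example assumes modularity gain as the merge criterion, $\Delta Q = \frac{e_{ab}}{m} - \frac{d_a d_b}{2m^2}$, and stops when no merge improves modularity. The function name and toy graph are hypothetical.

```python
from itertools import combinations

def greedy_agglomerate(adj):
    """Merge the pair of communities with the largest positive modularity gain."""
    m = sum(len(nbrs) for nbrs in adj.values()) / 2  # number of edges
    communities = [frozenset([v]) for v in adj]      # every node starts alone
    merges = []                                      # dendrogram: (child, child, parent)
    while len(communities) > 1:
        best_gain, best_pair = 0.0, None
        for a, b in combinations(communities, 2):
            e_ab = sum(1 for u in a for v in adj[u] if v in b)
            d_a = sum(len(adj[u]) for u in a)
            d_b = sum(len(adj[u]) for u in b)
            delta_q = e_ab / m - d_a * d_b / (2 * m * m)
            if delta_q > best_gain + 1e-12:
                best_gain, best_pair = delta_q, (a, b)
        if best_pair is None:
            break  # remaining communities are too dissimilar to merge
        a, b = best_pair
        communities = [c for c in communities if c not in (a, b)] + [a | b]
        merges.append((a, b, a | b))
    return communities, merges

# Hypothetical toy graph: two triangles joined by the edge 2-3.
adj = {
    0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
    3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4},
}
communities, merges = greedy_agglomerate(adj)
for child_a, child_b, parent in merges:
    print(sorted(child_a), "+", sorted(child_b), "->", sorted(parent))
```

Each printed line is one internal node of the dendrogram, with the two merged children on the left and the parent community on the right.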

• Edge betweenness measures are defined in a way that
edges with high betweenness scores are more likely to be
the edges that connect different communities.
• Inter-community edges are designed to have higher edge
betweenness scores than intra-community edges do.
• By identifying and discarding such edges with high
betweenness scores, one can disconnect the social network
into its constituent communities.
• Shortest path betweenness
• Random-walk betweenness
• Disadvantage: high computational cost. Computing the
betweenness for all edges takes O(|V||E|) time, and the
entire algorithm requires O(|V|^3) time on sparse graphs.
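The divisive step based on shortest path betweenness can be sketched in plain Python. This is an illustration under assumptions: the graph is unweighted and undirected, betweenness is computed with a breadth-first accumulation in the style of Brandes' algorithm, and the function names are hypothetical.

```python
from collections import deque

def edge_betweenness(adj):
    """Shortest path betweenness of every edge in an unweighted graph."""
    bet = {frozenset((u, v)): 0.0 for u in adj for v in adj[u]}
    for s in adj:
        dist = {s: 0}
        sigma = {v: 0 for v in adj}   # number of shortest paths from s
        sigma[s] = 1
        preds = {v: [] for v in adj}
        order = []
        queue = deque([s])
        while queue:
            v = queue.popleft()
            order.append(v)
            for w in adj[v]:
                if w not in dist:
                    dist[w] = dist[v] + 1
                    queue.append(w)
                if dist[w] == dist[v] + 1:
                    sigma[w] += sigma[v]
                    preds[w].append(v)
        delta = {v: 0.0 for v in adj}
        for w in reversed(order):     # accumulate from the farthest nodes back
            for v in preds[w]:
                share = sigma[v] / sigma[w] * (1 + delta[w])
                bet[frozenset((v, w))] += share
                delta[v] += share
    # Every pair (s, t) was counted from both ends.
    return {e: b / 2 for e, b in bet.items()}

def components(adj):
    seen, comps = set(), []
    for s in adj:
        if s in seen:
            continue
        comp, queue = {s}, deque([s])
        while queue:
            v = queue.popleft()
            for w in adj[v]:
                if w not in comp:
                    comp.add(w)
                    queue.append(w)
        seen |= comp
        comps.append(comp)
    return comps

def split_once(adj):
    """Remove highest-betweenness edges until the graph falls apart."""
    adj = {v: set(nbrs) for v, nbrs in adj.items()}  # work on a copy
    removed = []
    while len(components(adj)) == 1:
        bet = edge_betweenness(adj)  # recomputed after every removal
        if not bet:
            break
        edge = max(bet, key=bet.get)
        u, v = edge
        adj[u].discard(v)
        adj[v].discard(u)
        removed.append(edge)
    return components(adj), removed

# Hypothetical toy graph: two triangles joined by the edge 2-3.
adj = {
    0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
    3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4},
}
bet = edge_betweenness(adj)
parts, removed = split_once(adj)
print(bet[frozenset((2, 3))], parts)
```

All nine shortest paths between the two triangles use the edge 2-3, so it has the highest score and is removed first. Recomputing betweenness after each removal is what makes the full algorithm expensive.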

Spectral Algorithms
• Assign nodes to communities based on the
eigenvectors of matrices.
• The top k eigenvectors define an
embedding of the nodes of the network as
points in a k-dimensional space.
• Classical data clustering techniques such as
K-means clustering are applied to derive the
final assignment of nodes to clusters.
• Spectral clustering can be shown to solve
real relaxations of different weighted graph
cut problems.
Spectral Algorithms
• A is the adjacency matrix of the network.
• D is the diagonal matrix with the degrees of the
nodes along the diagonal.
• Un-normalized Laplacian: $L = D - A$
• Normalized Laplacian: $\mathcal{L} = D^{-1/2}(D - A)D^{-1/2}$
• Both $L$ and $\mathcal{L}$ are symmetric and positive semi-definite,
and therefore have real and non-negative eigenvalues
(the smallest eigenvalue is 0).
• The eigenvector corresponding to the smallest
non-zero eigenvalue of L (the second-smallest eigenvalue
when the graph is connected) is known as the Fiedler
vector and forms the basis for bi-partitioning the
graph.
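Bi-partitioning with the Fiedler vector can be sketched without any linear algebra library. The example below is only an illustration: it assumes a connected, unweighted graph with at least two nodes, and approximates the Fiedler vector by power iteration on the shifted matrix cI − L (with c = 2 × maximum degree, an upper bound on the eigenvalues of L) while projecting out the constant eigenvector. In practice an eigensolver from a numerical library would normally be used, and a random start vector is the usual choice. The function name is hypothetical.

```python
import math

def fiedler_vector(adj, iterations=500):
    """Approximate the Fiedler vector of L = D - A by shifted power iteration."""
    nodes = sorted(adj)
    n = len(nodes)
    index = {v: i for i, v in enumerate(nodes)}
    deg = [len(adj[v]) for v in nodes]
    shift = 2 * max(deg)  # all eigenvalues of L lie in [0, 2 * max degree]

    def normalize(x):
        mean = sum(x) / n
        x = [xi - mean for xi in x]  # remove the constant (eigenvalue 0) part
        norm = math.sqrt(sum(xi * xi for xi in x))
        return [xi / norm for xi in x]

    # Deterministic start vector, chosen for reproducibility.
    x = normalize([float(i) for i in range(n)])
    for _ in range(iterations):
        # y = (shift * I - L) x, using (L x)_i = deg_i * x_i - sum of neighbours
        y = [
            (shift - deg[i]) * x[i] + sum(x[index[w]] for w in adj[v])
            for i, v in enumerate(nodes)
        ]
        x = normalize(y)
    return dict(zip(nodes, x))

# Hypothetical toy graph: two triangles joined by the edge 2-3.
adj = {
    0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
    3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4},
}
fv = fiedler_vector(adj)
part_a = sorted(v for v in adj if fv[v] * fv[0] > 0)
part_b = sorted(v for v in adj if fv[v] * fv[0] <= 0)
print(part_a, part_b)
```

Splitting the nodes by the sign of their Fiedler vector entry separates the two triangles. For k > 2 communities, the slides' recipe of embedding with k eigenvectors followed by K-means would replace the sign test.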
Multi-level Graph Partitioning
• Provides a powerful framework for fast and high-quality
graph partitioning.
• Coarsening: Produce a smaller graph that is similar
to the original graph. Construct a matching on the
graph, where a matching is defined as a set of edges
no two of which are incident on the same vertex.
• Initial partitioning: Partition the coarsest graph
using spectral partitioning.
• Uncoarsening: The partition on the current graph is used
to initialize a partition on the finer (original) graph.
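One coarsening level and the matching projection step can be sketched as below. The slides do not say how the matching is chosen, so this example assumes a simple greedy maximal matching that visits vertices in sorted order. The function names `greedy_matching`, `coarsen` and `project` are hypothetical, and the input graph is assumed to be unweighted with no self-loops.

```python
def greedy_matching(adj):
    """Pick edges greedily so that no two share a vertex."""
    matched, matching = set(), []
    for u in sorted(adj):
        if u in matched:
            continue
        for v in sorted(adj[u]):
            if v not in matched:
                matching.append((u, v))
                matched.update((u, v))
                break
    return matching

def coarsen(adj, matching):
    """Collapse each matched pair into one super-node with weighted edges."""
    super_of = {}
    for k, (u, v) in enumerate(matching):
        super_of[u] = super_of[v] = k
    next_id = len(matching)
    for v in sorted(adj):  # unmatched vertices become their own super-node
        if v not in super_of:
            super_of[v] = next_id
            next_id += 1
    coarse = {s: {} for s in range(next_id)}
    for u in adj:
        for v in adj[u]:
            a, b = super_of[u], super_of[v]
            if a != b:
                # Weight = number of original edges between the super-nodes.
                coarse[a][b] = coarse[a].get(b, 0) + 1
    return coarse, super_of

def project(coarse_partition, super_of):
    """Uncoarsening: each vertex inherits the part of its super-node."""
    return {v: coarse_partition[s] for v, s in super_of.items()}

# Hypothetical toy graph: two triangles joined by the edge 2-3.
adj = {
    0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
    3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4},
}
matching = greedy_matching(adj)
coarse, super_of = coarsen(adj, matching)
fine_partition = project({0: 0, 1: 0, 2: 1}, super_of)
print(matching, coarse, fine_partition)
```

The projected partition is only a starting point: the greedy matching here merges vertices 2 and 3 across the bridge, so a refinement step (for example a KL-style pass) on the finer graph is what would repair the boundary.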
Markov Clustering
• Clusters graphs via manipulation of the
stochastic matrix (transition probability matrix)
corresponding to the graph.
• The MCL process consists of two operations on
stochastic matrices, Expand and Inflate.
• Expand(M) is simply M ∗ M.
• Inflate(M, r) raises each entry in the matrix M
to the power of the inflation parameter r (r > 1, typically set to 2),
then re-normalizes the columns to sum to 1.
• These two operators are applied alternately
until convergence, starting with the
initial transition probability matrix.
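The Expand and Inflate loop can be sketched with plain nested lists. Two choices here are assumptions rather than something the slides state: a self-loop is added to every node before normalizing (a common practice in MCL implementations), and each node is assigned to the row with the largest entry in its column once the matrix stops changing. The function names are hypothetical.

```python
def normalize_columns(M):
    """Rescale every column so that it sums to 1."""
    n = len(M)
    sums = [sum(M[i][j] for i in range(n)) for j in range(n)]
    return [[M[i][j] / sums[j] for j in range(n)] for i in range(n)]

def expand(M):
    """Expand(M) = M * M (matrix product)."""
    n = len(M)
    return [[sum(M[i][k] * M[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def inflate(M, r):
    """Raise each entry to the power r, then re-normalize the columns."""
    return normalize_columns([[x ** r for x in row] for row in M])

def mcl(adj, r=2.0, max_iter=100, tol=1e-9):
    nodes = sorted(adj)
    n = len(nodes)
    index = {v: i for i, v in enumerate(nodes)}
    M = [[0.0] * n for _ in range(n)]
    for v in nodes:
        M[index[v]][index[v]] = 1.0      # self-loop (assumption, see lead-in)
        for w in adj[v]:
            M[index[w]][index[v]] = 1.0
    M = normalize_columns(M)             # initial transition probability matrix
    for _ in range(max_iter):
        new = inflate(expand(M), r)
        change = max(abs(new[i][j] - M[i][j])
                     for i in range(n) for j in range(n))
        M = new
        if change < tol:
            break
    clusters = {}
    for j, v in enumerate(nodes):
        attractor = max(range(n), key=lambda i: M[i][j])
        clusters.setdefault(attractor, set()).add(v)
    return list(clusters.values())

# Hypothetical toy graph: two triangles joined by the edge 2-3.
adj = {
    0: {1, 2}, 1: {0, 2}, 2: {0, 1, 3},
    3: {2, 4, 5}, 4: {3, 5}, 5: {3, 4},
}
clusters = mcl(adj)
print(clusters)
```

A larger inflation parameter r tends to produce more, smaller clusters, which is how the user indirectly controls the granularity of the output.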

• Course MOODLE
• http://sna-cse.moodlecloud.com/
• http://sna-it.moodlecloud.com/
– Username: regdno, password: student
• NPTEL Online Courses – 12 week course
– https://onlinecourses.nptel.ac.in/noc18_cs56/preview
