0% found this document useful (0 votes)
10 views

Introduction To Social Network Analysis

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
10 views

Introduction To Social Network Analysis

Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 28

Introduction to Social

Network Analysis

Saeed Roshani
Assistant Professor of Technology Management
Types of Networks
Directed Networks
A network where relationships (edges)
between nodes have a direction. Each
edge points from one node to another,
representing asymmetric relationships.
Use Cases: Communication patterns,
authority structures, citation networks.
Undirected Networks
A network where edges between nodes do
not have a direction. Connections are
bidirectional, representing symmetric
relationships.
Use Cases: Collaboration networks,
mutual relationships, co-authorship
networks.
Weighted Networks
A network where edges carry a weight
representing the strength, frequency, or
capacity of the relationship between
nodes.
Use Cases: Financial networks
(transaction amounts), supply chain
analysis (goods volume), social
interaction intensity.
Bipartite Networks
A network with two distinct sets of nodes
where edges only connect nodes from
different sets.
Example: Movie recommendation
networks (one set of nodes = users, the
other set = movies, with edges
representing who has watched what).

Use Cases: Financial networks


(transaction amounts), supply chain
analysis (goods volume), social
interaction intensity.
Multilayer Networks
A network where nodes can participate in
multiple layers or types of relationships,
often represented by multiple types of
edges.
Example: Social media networks where
users interact through different platforms
(e.g., Facebook, Instagram, LinkedIn).

Use Cases: Complex systems analysis,


social influence spread across various
platforms, global supply chains that
involve multiple kinds of relationships
(e.g., suppliers, distributors, regulators).
Homophily vs Heterophily in Networks
Homophily Networks: Networks where nodes with
similar attributes are more likely to be connected.
Example: People with similar interests or professions
forming tightly-knit communities.

Heterophily Networks: Networks where nodes with


different attributes connect.
Example: Interdisciplinary research collaborations
between scientists from different fields.
Static vs Dynamic Networks
Networks where nodes and edges evolve
over time, capturing changing
relationships.

Use Cases: Longitudinal studies, tracking


social movements, or communication
patterns over time.
Random vs Scale-Free Networks

Random Networks: Networks where edges between nodes are


created randomly. Degree distribution is typically normal,
meaning most nodes have a similar number of connections.
• Example: A hypothetical network generated for
theoretical studies (e.g., Erdős–Rényi model).
• Use Cases: Theoretical modeling and comparison with
real-world networks.
Scale-Free Networks: Networks where a few nodes (hubs)
have a disproportionately large number of connections,
following a power-law distribution.
• Example: The internet, where a few websites receive
most of the traffic (e.g., Google, Facebook).
• Use Cases: Understanding the robustness of networks,
identifying key players in systems like airline routes or
communication platforms.
Level of Analysis
Macro-Level Network Analysis
Network Density
Density is the ratio of the number
of actual edges in a network to the
maximum possible number of
edges. It measures how
interconnected the network is.
Degree Distribution

Degree distribution describes the frequency of nodes with a certain


number of connections (degree) within a network.
Average Path Length
The average path length is the average
number of steps along the shortest paths
for all possible pairs of network nodes.

Interpretation
• Small average path lengths indicate that nodes are
generally close to each other, often a feature of "small-
Steps to use the formula: world" networks.
1.Identify all pairs of nodes i and j. • A long average path length suggests that
2.Find the shortest path d(i,j) between each pair of communication or interaction between nodes is more
difficult or slower.
nodes.
3.Sum all the shortest path distances.
4.Divide the total sum by the number of unique pairs
of nodes, which is N(N−1).
Connected Components
A connected component is a subgraph in
which any two nodes are connected by a
path, and which is connected to no
additional nodes in the network.

Interpretation:
The number and size of connected components give insights into the network's
overall structure. A network can be fragmented into several small components,
or it can have one large "giant component."

A network with many small disconnected components might indicate isolation


between groups or inefficiency in communication.
Assortativity
Assortativity is a measure of the tendency for
nodes to connect to other nodes that are similar in
some way (e.g., degree, attribute).

Types:
Degree Assortativity:
Do high-degree nodes tend to connect to other high-degree nodes?
Attribute Assortativity: What about:
Do nodes with similar attributes (e.g., age, profession) tend to be connected? Robustness?
Information Spread?
Interpretation:
• Positive assortativity indicates that similar nodes tend to be connected (e.g., in social networks,
people may tend to form connections with others who are similar in status or wealth).
• Negative assortativity means dissimilar nodes tend to connect (e.g., hierarchical structures where
leaders connect to followers).
Example:
Social networks are often assortative by degree (highly connected individuals connect with other highly
connected individuals).
Clustering Coefficient
The clustering coefficient measures the likelihood that two
neighbors of a node are also connected, reflecting the network's
tendency to form tightly knit groups.

• Local Clustering Coefficient: Measures how likely it is for the


neighbors of a particular node to be connected.

• Global Clustering Coefficient (Transitivity): An overall


measure of the network's tendency to form triangles (closed
triplets of nodes).
Micro-Level Network Analysis
Highest
t
Betweenness
Centrality Highest
c
Closeness
Centrality
s
e
b
q
j
d
r
g
u I
k
a
n
f L
Highest
Highest h Degree
Eigenvector m Centrality
Centrality
Degree Centrality
Degree centrality measures the number of direct
connections (edges) a node has. It quantifies the immediate
influence or popularity of a node in the network.

Interpretation:
Nodes with high degree centrality have more direct connections and
are often considered key players in spreading information or
maintaining communication in the network.
Example:
In a social network, a person with many friends or contacts has a high
degree centrality. In a collaboration network, a researcher with many
co-authors has a high degree centrality.
Betweenness Centrality
Betweenness centrality measures how often a
node appears on the shortest path between
other pairs of nodes. It quantifies a node’s role
as a bridge or gatekeeper in the network.

Interpretation:
Nodes with high betweenness centrality control information flow in the
network. They act as intermediaries between otherwise disconnected parts
of the network, giving them strategic importance.
Example:
In a corporate hierarchy, a manager who coordinates between different
departments has high betweenness centrality. In a communication network,
a node that links different subgroups is central in terms of betweenness.
Closeness Centrality
Closeness centrality measures how close a node
is to all other nodes in the network. It is the
inverse of the average distance from the node to
all other nodes.

Is node 4 is more central than node 3?

Interpretation:
A node with high closeness centrality can quickly reach all other
nodes in the network, making it efficient for spreading information.
Such nodes are central in terms of their proximity to others.

Example:
In a company's communication network, an employee who can reach
everyone in a few steps (perhaps through email or other
communication tools) has high closeness centrality.
Eigenvector Centrality
Eigenvector centrality measures a node’s influence
based on the importance of its neighbors. A node
has a high eigenvector centrality if it is connected
to many highly central nodes.

λ is a constant (the largest eigenvalue of the adjacency matrix).

Interpretation:
Nodes with high eigenvector centrality are not only well-connected
but are connected to other influential nodes. It extends the idea of
degree centrality by accounting for the quality of connections, not just
the quantity.
Example:
In a social network, a person who is well-connected to influential
people (e.g., celebrities or political leaders) has high eigenvector
centrality, even if they don't have many direct connections themselves.
Centrality Measure Definition Example Interpretation
Measures the number of
Degree Centrality direct connections a node A person with many friends in a social network. High values indicate many direct connections.
has.
Measures how often a
node lies on the shortest A manager who connects different departments High values indicate strategic positions for controlling
Betweenness Centrality
path between other in a company. information flow.
nodes.

Measures how close a An employee who can quickly reach others in a High values indicate the ability to quickly reach other
Closeness Centrality
node is to all other nodes. communication network. nodes.

Measures influence based


A person connected to many influential people in High values indicate connection to influential or well-
Eigenvector Centrality on the importance of a
a social network. connected nodes.
node's neighbors.
Measures influence
A person with few direct but many indirect High values indicate influence through both direct and
Katz Centrality accounting for direct and
influential connections. indirect connections.
indirect connections.
Ranks nodes based on
High values indicate importance based on backlinks
PageRank Centrality incoming links from A website with many important backlinks.
from influential nodes.
important nodes.
Sums the reciprocals of
A node in a transportation network that is High values indicate closeness to many nodes, even in
Harmonic Centrality distances to all other
centrally located. disconnected networks.
nodes.
Measures importance
A node that spreads a disease or information in High values indicate key roles in spreading processes
Percolation Centrality based on contribution to
an epidemiological model. like disease or information.
spreading processes.
Measures importance
Flow Betweenness A router facilitating a significant portion of High values indicate key roles in managing the flow of
based on contribution to
Centrality network traffic. information or resources.
the flow of information.

You might also like