0% found this document useful (0 votes)
3 views

Social Network Analysis

Uploaded by

Akash Singhania
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
3 views

Social Network Analysis

Uploaded by

Akash Singhania
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 56

Social

Network
Analysis
Social Networks and Social Network Analysis (SNA)
• Social media platforms allow members/visitors to connect to each other, to brands, to groups:
• Such interconnections on these platforms give rise to networks that may consist of individuals, groups, or
brands/companies
• It is important for the marketer to study the network of connections in order to uncover the role of an
individual/brand page

• Social Network Analysis (SNA) is the process of investigating social structures through the use of network
and graph theories:
• A typical network consists of nodes (represented through circles) and edges/connections (represented through
lines), technically referred to as a sociogram
• Sociograms were developed to analyze choices or preferences within a group and represent the structure and
patterns of group interactions through a diagram
• It can be drawn on the basis of many different criteria: friend requests, interactions with comments, memes
spread, information circulation, business networks, knowledge networks, and likes on posts, among many others
• These visualizations provide a means of qualitatively assessing networks by varying the visual representation of
their nodes and edges to reflect attributes of interest
Social Networks and Social Network Analysis (SNA)

• The advantages of SNA are twofold

• Firstly, it can process a large amount of relational data and describe the
overall relational network structure

• SNA context and choose which parameters to define the “center”


according to the characteristics of the network
Applications of Social Networks
Social networks can be used to create reports about community health, comparisons of subgroups, and identification of
important individuals, as well as in applications that rank, sort, compare, and search for content and experts

Traditional participation statistics can provide important insights into the volume of engagement of a community but
can say little about the structure of the connections between community members. Network analysis can help explain
important social phenomena such as group formation, group cohesion, social roles, personal influence, and overall
community health. Combining traditional participation metrics with network metrics provides the best of worlds and
allows you to answer important questions such as the following:
• What kinds of social roles are being performed within a social media collection? Does a community have enough people
filling the important roles?
• Which individuals play important social roles within a group or collection? Who would make a good administrator based
on that person’s network position?
• What subgroups exist? Do connections between subgroups exist? Who plays the bridge roles that connect otherwise
unconnected groups?
• How do new ideas propagate through a network? Who are the influencers that spark the spread of ideas?
• How do the overall structures of a social network change after a particular event (e.g., a company social, a round of new
hires or layoffs, a product launch or recall)?
Applications of Social Networks
This network graph visualization
paints a picture of the social
relationships among the Twitter
accounts of members of the
United States Senate in 2018

The size of each Twitter user’s


profile image is determined by
the user’s total number of
followers as reported by the
Twitter Application Programmer
Interface (API), which gives
software access to extended
details about each user’s profile
and message data

A line, or edge, exists between


two people when one user
account “mentions” or “replies
to” another
Nodes and Edges

• Nodes/Vertex:
• Nodes are the entities in the network
• Often, they represent people or social structures such as workgroups, teams, organizations, institutions, states,
or even countries
• At other times they represent content such as web pages, keywords, or videos

• Edges:
• Edges are the connections between the nodes. Edges can be any relationship that the marketer wishes to
capture
• Edges can represent many different types of relationships like proximity, collaboration, kinship, friendship,
trade partnership, literature citation, investment, hyperlink, transaction, or any shared attribute (e.g., people
who attended the same University)
• An edge can be said to exist if it has some official status, is recognized by the participants, or is observed by
exchange or interaction between them
Directed and Undirected Networks

Networks can either be directed or undirected

• Directed Networks:
• Directed edges (also known as asymmetric edges) have a clear origin and destination
• For example, money is lent from one person to another, a Twitter user follows another user, an email is
sent from an author to a recipient, or a web page links to another web page
• They are represented on a graph as a line with an arrow pointing from the source node to the recipient
node

• Undirected Networks:
• An undirected edge (also known as a symmetric or mutual edge) simply exists between two people or
things: a couple is married, two Facebook users are friends, or two people are members of the same
organization
• No origin or destination is clear in these mutual relationships
• They cannot exist unless they are reciprocated. Undirected edges are represented on a graph as a line
connecting two vertices with no arrows
Directed and Undirected Networks
• For example, the marketer may be interested in learning more about a network where specific individuals
ask questions to others, while a specific set of individuals answer these questions. The researcher can draw
up to two separate directed networks:

• One in which these individuals who pose questions have an arrow directed from them to the one to
whom the question is being asked
• Another in which the direction of the arrow points from the answer giver to the one who is asking
questions

• In an undirected network, only connections are captured and the direction is not important
Unweighted and Weighted Networks
• Unweighted Networks:
• An unweighted edge or binary edge, only indicates if an edge exists or not
• For example, a friendship tie between Facebook users either exists or does not

• Weighted Networks:
• A weighted edge includes values associated with each edge that indicate the strength or frequency of a tie
• For example, a weighted edge between two Facebook users may indicate the number of photo comments
exchanged or the duration since the creation of a friendship
• Weighted edges are often represented visually as thicker or darker or as more or less opaque lines
• Including weighted edge data in a network dataset is preferable because this provides additional
information about each tie

However, many social network analysis metrics are designed for unweighted networks
Fortunately, any weighted network can be converted to an unweighted one by choosing a cutoff point. For example, an
unweighted edge could be shown between individuals who exchanged at least 10 email messages, with no edge between
people who exchanged fewer than 10 messages
Shortest Path

• The shortest path between two people is called the “geodesic distance”

• The geodesic distance is the length of the shortest path between two people

• If you think of the edges as roads and the vertices as houses, the geodesic distance would be the number of
roads someone must take to get from one house to another, assuming that the person is traveling on the
shortest path possible

• The maximum geodesic distance, or diameter of a network, is the largest geodesic distance in the network,
or the distance between the two vertices that are farthest from each other
Social Network Analysis: SNA Metrics

A network represented as a matrix


Ann Bob Carol
Ann 0 1 1
Bob 0 0 0
Carol 1 0 0

The table describes a directed network because not all connections are reciprocated. For example, Ann “points to” Bob
as shown in row 1, but Bob does not “point to” Ann as shown in row 2. If it were an undirected network, it would be a
symmetric matrix; if Ann points to Bob, then Bob must necessarily point to Ann

This network is a binary network because it only includes 1s and 0s, where a 1 indicates that there is a connection and a 0
indicates that there is no connection

Allowing additional values would create a weighted network. For example, the 1s could be replaced with the number of
email messages sent or phone calls made to the other person
Social Network Analysis: SNA Metrics

A network represented as an edge list


Vertex1 Vertex2
Ann Bob
Ann Carol
Carol Ann

An alternative to the matrix data format that is a more efficient representation of a network is called an “edge list”

It is simply a list of all edges in the network. Individuals in the Vertex1 column “point to” those in the Vertex2 column

Unless data describing the value of each edge are provided in additional columns, the network is implied to be a binary one
Social Network Analysis: Network
Types of Networks
Social networks range in size from a handful of people to national and planetary populations. They also differ in
the types of vertices they include, the nature of the edges that connect them, and the ways in which they are
formed

• Egocentric (Ego) Network

It is often useful to consider social networks from an individual member’s point of view. Network analysts call
the individual that is the focus of attention “ego” and the people he or she is connected to “alters”
Some networks, called egocentric networks, only include individuals who are connected to a specified ego
For example, a network of your personal Facebook friends would be an egocentric network because you are, by
definition, connected to all other vertices
Types of Networks
• Unimodal, Multimodal, and Affiliation Networks
The standard networks are called unimodal networks because they include one type (i.e., mode) of vertex. They connect
users to users, or they connect documents to documents, but they don’t include both users and documents

However, networks can include different types of vertices creating multimodal networks. For example, connecting
Marvel Movies to Characters in those movies. Rich sets of intersecting networks often form in social media environments
composed of connections between people, photos, videos, messages, documents, groups, organizations, locations, and
services. In many cases, these multimodal networks have to be transformed into simpler unimodal networks to perform
meaningful network analysis, as most network metrics are designed for unimodal networks

A common type of multimodal network is a bimodal network with exactly two types of vertices. Data for these networks often
include individuals and some event, activity, or content with which they are affiliated, creating an affiliation network. For
example, an affiliation network may connect users with the wiki pages they have edited

People are affiliated with pages. In this network, no two users would directly connect to each other. Likewise, no two pages
would directly connect to each other. Pages only link to people (i.e., editors)

Bimodal affiliation networks can be transformed into two separate unimodal networks: a “user edits page” network can be
converted into a user-to-user network and a page to page network
Social Network Analysis: SNA Metrics
• These quantitative network metrics allow analysts to systematically inspect the patterns of
connection within the social world, creating a basis on which to compare networks, track
changes in a network over time, and determine the relative position of individuals and
clusters within a network

• Centrality measures are a set of metrics that describe the position that an individual
occupies in a network. These metrics describe how a particular node can be said to be in
the ‘middle’ of the network. Leading measures of centrality include:
• Degree Centrality
• Betweenness Centrality
• Closeness Centrality
• Eigenvector/PageRank Centrality
It is more common to normalize this score so that it represents the average length of the shortest paths rather than their sum. This adjustment
allows comparisons of the closeness centrality of nodes of graphs of different sizes
Degree Centrality
Degree Centrality: It is a simple count of the total number of connections linked to a node.
In other words, it measures the number of neighbors of the node

• It can be thought of as a kind of popularity measure, but a crude one that does
not recognize a difference between quantity and quality. Degree centrality does
not differentiate between a link to the CEO of a big company and a link to its most
recent trainee hire
• For directed networks where relationships have an origin and a destination, In-
degree is the number of directed edges incident on a node. Out-degree is the
number of directed edges that originate at a node

The node with the higher degree is more central


The node with higher outdegree is more central (choices made)
The node with higher indegree is more prestigious (choices received)
Degree Centrality
Degree centrality starts with the assumption that the person with the most connections
(edges) is the most important

Rather than returning a count, it is the degree of the node divided by the total possible
number of edges that the node could have

For the case of the directed graph, the degree of the incoming vertices and outgoing vertices
would likely be treated separately

Degree Centrality of a node v = No. of Edges from the Node v


N-1
Where N is the number of nodes in the network
• A Degree Centrality of 1 means the node is directly connected to every other node
• A Degree Centrality of 0 means the node isn’t connected to any other node in the network
Undirected Network: Degree Centrality

Node No. of Edges Degree = No. of Edges from a Node/N-1


A 1 0.143
B 2 0.286
C 2 0.286
D 3 0.429
E 4 0.571
F 2 0.286
G 3 0.429
H 3 0.429
N = no. of nodes in the network = 8
N-1 = 7
Betweenness Centrality
Betweenness Centrality: It captures brokerage. It measures how often a given node lies on the shortest
paths between two other nodes. The nodes with high betweenness centrality play a significant role in the
communication/information flow within the network

• This can be thought of as a kind of “bridge” score, a measure of how much removing a person
would disrupt the connections between other people in the network

• The idea of brokering is often captured in the measure of betweenness centrality

• A “structural hole” is a term for recognizing a missing bridge. Wherever two or more groups
fail to connect, one can argue that there is a structural hole, a missing gap waiting to be filled

Social network analysis has many strategic applications for people in an organization to analyze their position and the
position of others. Managers and leaders can recognize gaps or disconnections within organizations and devote resources to
bridging the divide. People may be able to apply social network analysis to identify locations in which a gap exists and elect
to fill them, recognizing the value they can generate as brokers between two otherwise separate groups
Betweenness Centrality

Here,
σst(v) is used to indicate the number of those paths that pass through v
σst is used to indicate the total number of shortest paths from node s to node t

For standardization, the summation is divided by (N-1)(N-2)/2 for undirected networks and (N-1)(N-2) for directed
networks

N is the total number of nodes in the network

Betweenness Centrality for the starting and end nodes is always 0


Undirected Network: Betweenness Centrality of Node D
Betweenness Centrality
No. of Shortest Paths
Path (σst) (σst)ν
AB 1 0 0
BC 1 0 0
AC 1 0 0
AE 1 1 1
BE 1 1 1
CE 1 1 1
AH 1 1 1
BH 1 1 1
CH 1 1 1
AG 2 1 0.5
AF 1 1 1
Betweenness Centrality = BG 2 1 0.5
BF 1 1 1
(0+0+0+1+1+1+1+1+1+0.5+1+
CG 2 1 0.5
0.5+1+0.5+1+0+0+0+0+0) =
CF 1 1 1
10.5/(7*6/2) = 0.5 EF 1 0 0
EG 1 0 0
EH 1 0 0
FG 1 0 0
GH 1 0 0
Closeness Centrality
Closeness Centrality: It is the average distance between a node and every other node in the network, i.e., how
close a node is to other nodes. The ‘farness’ of a node is defined as the sum of its distances to all other nodes.
Closeness is defined as the inverse of farness. A centrally located node has a low distance from all the other
nodes
• Assuming that vertices can only pass messages to or influence their existing connections, a low
closeness centrality means that a person is directly connected or “just a hop away” from most others
in the network
• In contrast, vertices in very peripheral locations may have high closeness centrality scores,
indicating the high number of hops or connections they need to take to connect to distant others in
the network
• Think of closeness, paradoxically, as a “distance” score. Some people are just a few miles from the
big city, others must drive for hours: similarly, people with high “closeness” centrality scores have
many miles or rather personal connections that they must travel to reach many other people in the
network
The more central a node is, the lower its total distance to all other nodes
Closeness can be regarded as a measure of how long it will take to spread information from a node to all other nodes
sequentially
Closeness Centrality

This measure assumes that the node that is closest to all the other nodes is the most important. It is the ratio of the highest
possible degree for the node to the sum of the shortest paths to the other nodes.

Closeness Centrality of a node v = N-1 p


Sum of Shortest Paths of v
Undirected Network: Closeness Centrality of Node A

Node Shortest Path from A


B 1
C 2
D 3
E 4
F 5
G 5
H 4

Closeness Centrality = 7/(1+2+3+4+5+5+4) = 0.29


Undirected Network: Closeness Centrality of Node D

Node Shortest Path from D


A 3
B 2
C 1
E 1
F 2
G 2
H 1

Closeness Centrality: 7 /(3+2+1+1+2+2+1) = 0.58


Eigenvector/PageRank Centrality
Eigenvector/PageRank Centrality: It is not just how many individuals one is connected to, but the type of
people one is connected with that can decide the importance of a node. Eigenvector centrality is a measure of
how important a node is by accounting for the fact of how well it is connected to other important nodes

Intuitively, it considers not just “how many people you know,” but also “who you know”

• There is a variety of software for undertaking social network analysis including free and
paid versions:
• Some of the leading SNA software include UCINET, Gephi, NodeXL, and Pajek

PageRank is a variant of Eigenvector Centrality that is used for directed networks


Centrality
Undirected Network: Centrality Example
The following network represents a friendship network. Each node represents a person, and
each edge represents the friendship between the persons at the ends. If you are interested in
finding the most popular person in the network, which centrality measure is the most
appropriate? Answer with reasons why it is the most appropriate
Undirected Network: Centrality Example
The following network represents a friendship network. Each node represents a person, and
each edge represents the friendship between the persons at the ends. If you are interested in
finding the most popular person in the network, which centrality measure is the most
appropriate? Answer with reasons why it is the most appropriate

The most popular person should have the highest number of friends. Thus, degree centrality is the most
appropriate measure
Undirected Network: Centrality Example
Suppose the following network refers to the information flow network of an organization. If
you are interested in finding the section that can most frequently control information
flow in the network, which centrality measure is the most appropriate? Answer with
reasons why it is the most appropriate
Undirected Network: Centrality Example
Suppose the following network refers to the information flow network of an organization. If
you are interested in finding the section that can most frequently control information
flow in the network, which centrality measure is the most appropriate? Answer with
reasons why it is the most appropriate

To control information flow, a node should be between other nodes because the node can interrupt information flow
between them. Thus, betweenness centrality is the most appropriate measure
Undirected Network: Centrality Example
Suppose the following network refers to the information flow network of an
organization. Each node represents a section in the organization, and each edge represents a
possible information exchange between the sections at the ends. If you are interested in
finding the section that can most efficiently obtain information from every other
section, which centrality measure is the most appropriate? Answer with reasons why it is the
most appropriate
Undirected Network: Centrality Example
Suppose the following network refers to the information flow network of an
organization. Each node represents a section in the organization, and each edge represents a
possible information exchange between the sections at the ends. If you are interested in
finding the section that can most efficiently obtain information from every other
section, which centrality measure is the most appropriate? Answer with reasons why it is the
most appropriate

To obtain information, one should be near everyone. In this sense, the node in the nearest position on average can most
efficiently obtain information. Thus, closeness centrality is the most appropriate
Undirected Networks: Centrality Exercises

Calculate:

Degree Centrality
Betweenness Centrality
Closeness Centrality

Also, find the most important nodes based on different centrality measures
Directed Networks: Degree and Betweenness Centrality

Degrees: Node C has the highest degree of centrality measure: 4 Degrees, 1 in-degree, 3 out-degrees
Node B: 2 degrees, 1 in-degree, 1 out-degree

Betweenness: Calculated in the same way as an undirected network. The sum is divided by (N-1)(N-2) as the direction is
known
Directed Networks: Closeness Centrality

Closeness Centrality of a node v = Jv / (N-1) p


Sum of Shortest Paths of v/ Jv

Jv is the number of nodes that can be reached by v

Node A B C D E F Closeness Centrality


A 0 1 2 1 3 3 (5/5)/(10/5) = 0.5
B 0 0 1 2 2 2 (4/5)/(7/4) = 0.45713
C 0 0 0 1 1 1 (3/5)/(3/3) = 0.6 Highest Closeness
Centrality
D 0 0 0 0 0 0 0
E 0 0 0 1 0 0 (1/5)/(1/1) = 0.2
F 0 0 0 0 0 0 0
Directed Networks: Centrality Exercises

Calculate:

Degree Centrality
Betweenness Centrality
Closeness Centrality

Also, find the most important nodes based on different centrality measures
Social Network Analysis: SNA Metrics

A visualization of a dataset based on closeness centrality


Implications of SNA for Marketers

• Community Detection:
• Community detection algorithms in SNA will allow an individual to be included in more than
one group, therefore giving a more realistic evaluation of the groups to which that person may
belong to

• Using Results Obtained from SNA in Traditional Analytics:


• An SNA procedure calculates centrality metrics for each node in the network
• The network characteristics of an individual may be used with traditional metrics (e.g.,
demographic, psychographic, individual choice behaviour) in order to undertake analytics
Grouping and Filtering
• Most real-world social media networks are large and messy

• Visualizing and making sense of large networks can be challenging, particularly if they are
densely connected

• One strategy for understanding large networks is to filter out information. Many
criteria can be used to filter out vertices and/or edges. For example, vertices with low
centrality scores can be filtered out, leaving only those most important in the network.
Other data associated with vertices, such as age, country of origin, time zone, or the number
of Twitter followers, can be used to filter vertices

• Filtering can also be applied to edges. For example, if edges represent the number of email
messages exchanged between two people, a network of “strongly” connected individuals
may filter out those who have sent less than 10 messages to one another
Grouping and Filtering

• Social network analysis provides a set of tools for identifying and


understanding groups, also called clusters or communities, by
network researchers

• In the language of network analysis, clusters are pockets of densely


connected vertices that are only sparsely connected to other pockets

• For example, the figure shows a network consisting of three densely


connected pockets (within the dotted circles) that are loosely
connected to each other by only a few ties
Network Density
• Network Density is defined as the ratio of the actual number of edges (links) to the maximum
possible number of edges (links) in the network or, in other words, the number of connections
a participant has divided by the total possible connections a participant could have

• On a scale of 0 to 1, a 0 would mean that there are no connections at all, and a 1 would
indicate that all possible edges are present (a perfectly connected network)

• A high value of density represents that the network is more dense and nodes are more
cohesive. On the other hand, a low-density value represents a less connected (sparse) network.
In a dense network, the information can flow easily and faster than in a sparse network

• Modularity is a measure of relative density in a network: a community (called a module or


modularity class) has high density relative to other nodes within its module but low density
with that outside
Community Detection
• A key aspect of network analysis is community detection. In a social network, this is the
idea that a large network can be broken down into smaller communities. For example, if the
network represents the social relationships of all the students at a school, a community
would be a friendship group

• There are many ways to approach community detection in networks. The modularity
optimization method is one of the widely used methods. Modularity is a measure of the
extent to which like is connected to like in a network
Community Detection
• Many algorithms are used for community detection. However, the two widely used
algorithms are

• Girvan-Newman Algorithm
• Louvain Algorithm

• The Girvan-Newman algorithm splits communities by removing edges, essentially creating


a new graph for each iteration

• On the other hand, Louvain simply compares node modularity and does not actually
change the original graph's setup
Girvan-Newman Algorithm
Repeat these steps until no edges are left:

• Compute edge betweenness for every edge in the graph


• Remove the edge with the highest edge betweenness
• Calculate edge betweenness for remaining edges
• Connected components are communities
Louvain Algorithm
• The Louvain algorithm is a fast implementation of community detection. It can be used to
analyze a network of 2 million nodes in only 2 minutes

• It is a hierarchical clustering algorithm that involves two phases: modularity optimization


and community aggregation

• The Louvain method of community detection uses a greedy approach and initially assigns
each node to an individual community
Greedy Modularity Communities
Greedy modularity maximization begins with each node in its own community and repeatedly joins the
pair of communities that lead to the largest modularity until no further increase in modularity is possible (a
maximum)

Q = Modularity

m = Weights of all edges in the graph (if unweighted graph, then count of all edges in the graph)

Aij = Weight of edge between node i and j if weighted graph


else 1 if edge between i and j exists or 0 otherwise in case of unweighted graph

Ki = Degree (total connections) of node i

Ꟙ(ci, cj) = 1 if both nodes i and j are in the same community else 0
Louvain Algorithm Steps

• Initially assign each node a unique community such that total nodes = total unique
communities

• Now, in an iterative manner, assign every node i to its neighboring node j community and
recalculate the modularity of the graph. If modularity improves as compared to when node i
wasn’t in the j node’s community, we will assign i to community j else not

• This will be repeated for multiple iterations till no further gain is observed in modularity by
moving any node to its neighbor’s community, and this way, maxima of modularity are
reached
Facebook Case Study
• This dataset consists of ‘circles’ (or ‘friends lists’) from Facebook. The dataset includes
node features (profiles), circles, and ego networks

• Facebook data has been anonymized by replacing the Facebook-internal ids for each user
with a new value. Also, while feature vectors from this dataset have been provided, the
interpretation of those features has been obscured. For instance, where the original dataset
may have contained a feature “political=Democratic Party”, the new data would simply
contain “political=anonymized feature 1”. Thus, using the anonymized data it is possible
to determine whether two users have the same political affiliations, but not what their
individual political affiliations represent
Summary

Centrality Measures
Degree Centrality
• Definition: Degree centrality assigns an importance score based simply on the number of
links held by each node

• What it tells us: How many direct, ‘one hop’ connections each node has to other nodes in
the network

• When to use it: For finding very connected individuals, popular individuals, individuals
who are likely to hold the most information, or individuals who can quickly connect with
the wider network

• A bit more detail: Degree centrality is the simplest measure of node connectivity.
Sometimes it’s useful to look at in-degree (number of inbound links) and out-degree
(number of outbound links) as distinct measures, for example when looking at transactional
data or account activity
Betweenness Centrality
• Definition: Betweenness centrality measures the number of times a node lies on the
shortest path between other nodes

• What it tells us: This measure shows which nodes are ‘bridges’ between nodes in a
network. It does this by identifying all the shortest paths and then counting how many
times each node falls on one

• When to use it: For finding the individuals who influence the flow around a system

• A bit more detail: Betweenness is useful for analyzing communication dynamics but
should be used with care. A high betweenness count could indicate someone holds
authority over disparate clusters in a network, or just that they are on the periphery of both
clusters
Closeness Centrality
• Definition: Closeness centrality scores each node based on its ‘closeness’ to all
other nodes in the network

• What it tells us: This measure calculates the shortest paths between all nodes,
then assigns each node a score based on its sum of shortest paths

• When to use it: For finding the individuals who are best placed to influence the
entire network most quickly

• A bit more detail: Closeness centrality can help find good ‘broadcasters’, but in
a highly-connected network, you will often find all nodes have a similar score.
What may be more useful is using Closeness to find influencers in a single
Eigenvector Centrality
• Definition: Like degree centrality, Eigenvector Centrality measures a node’s influence based
on the number of links it has to other nodes in the network. Eigenvector Centrality then goes
a step further by also taking into account how well-connected a node is, and how many links
their connections have, and so on through the network

• What it tells us: By calculating the extended connections of a node, Eigenvector Centrality
can identify nodes with influence over the whole network, not just those directly connected to
it

• When to use it: Eigenvector Centrality is a good ‘all-round’ SNA score, handy for
understanding human social networks, but also for understanding networks like malware
propagation

• A bit more detail: Each node’s Eigenvector Centrality is calculated by converging on an


PageRank Centrality
• Definition: PageRank is a variant of Eigenvector Centrality, also assigning nodes a score
based on their connections, and their connections’ connections. The difference is that
PageRank also takes link direction and weight into account – so links can only pass
influence in one direction, and pass different amounts of influence

• What it tells us: This measure uncovers nodes whose influence extends beyond their direct
connections into the wider network

• When to use it: Because it takes into account direction and connection weight, PageRank
can be helpful for understanding citations and authority

• A bit more detail: PageRank is famously one of the ranking algorithms behind the original
Google search engine (the ‘Page’ part of its name comes from creator and Google founder,
Larry Page)

You might also like