CSC 263 Lecture 11

September 13, 2006

19 Kruskal’s Algorithm for MCST

Kruskal’s algorithm uses a Union-Find ADT. We need to define this before proceeding with the
algorithm.

19.1 The Disjoint Set ADT (also called the Union-Find ADT)
Two sets A and B are disjoint if their intersection is empty: A ∩ B = ∅. In other words, if there
is no element in both sets, then the sets are disjoint. The following abstract data type, called
“Disjoint Set” or “Union-Find,” deals with a group of sets where each set is disjoint from every
other set (i.e. they are pairwise disjoint).
Object: A collection of nonempty, pairwise disjoint sets: S_1, ..., S_k. Each set contains a special
element called its representative.
Operations:

• MAKE-SET(x): Takes an element x that is not in any of the current sets, and adds the set
{x} to the collection. The representative of this new set is x.

• FIND-SET(x): Given an element x, return the representative of the set that contains x (or
NIL if x does not belong to any set).

• UNION(x,y): Given two distinct elements x and y, let S_i be the set that contains x and
S_j be the set that contains y. This operation adds the set S_i ∪ S_j to the collection and it
removes S_i and S_j (since all the sets must be disjoint). It also picks a representative for the
new set (how it chooses the representative is implementation dependent). Note: if x and y
originally belong to the same set, then UNION(x,y) has no effect.

The Union-Find ADT provides us with an easy method for testing whether an undirected graph
is connected:

For all v in V do
    MAKE-SET(v)
For all (u,v) in E do
    UNION(u,v)

Now we can test whether there is a path between u and v by testing FIND-SET(u) = FIND-SET(v).
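
The two loops above can be sketched in Python as follows. This is a minimal sketch, assuming a naive parent-pointer forest as the Union-Find implementation; the names DisjointSets, build_components and is_connected are illustrative and do not come from the lecture.

```python
class DisjointSets:
    """Naive Union-Find (illustrative): roots point to themselves."""

    def __init__(self):
        self.parent = {}

    def make_set(self, x):
        # MAKE-SET(x): x becomes the representative of the new set {x}.
        self.parent[x] = x

    def find_set(self, x):
        # FIND-SET(x): return the representative, or None (NIL) if x is unknown.
        if x not in self.parent:
            return None
        while self.parent[x] != x:
            x = self.parent[x]
        return x

    def union(self, x, y):
        # UNION(x, y): merge the two sets; no effect if they are already equal.
        root_x, root_y = self.find_set(x), self.find_set(y)
        if root_x is None or root_y is None:
            raise KeyError("both elements must already belong to a set")
        if root_x != root_y:
            self.parent[root_y] = root_x


def build_components(vertices, edges):
    """Run the two loops from the text and return the resulting structure."""
    sets = DisjointSets()
    for v in vertices:
        sets.make_set(v)
    for u, v in edges:
        sets.union(u, v)
    return sets


def is_connected(vertices, edges):
    # The graph is connected when all vertices share one representative.
    sets = build_components(vertices, edges)
    return len({sets.find_set(v) for v in vertices}) <= 1
```

After build_components has run, two vertices u and v are joined by a path exactly when sets.find_set(u) == sets.find_set(v).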

19.2 Pseudocode for Kruskal
KRUSKAL-MST(G=(V,E),w:E->Z)
    A := {};
    sort edges so w(e_1) <= w(e_2) <= ... <= w(e_m);
    for each vertex v in V, MAKE-SET(v);
    for i := 1 to m do
        (let (u_i,v_i) = e_i)
        if FIND-SET(u_i) != FIND-SET(v_i) then
            UNION(u_i,v_i);
            A := A U {e_i};
        end if
    end for
END KRUSKAL-MST

Intuitively, Kruskal’s algorithm grows an MCST A by repeatedly adding the “lightest” edge
from E that won’t create a cycle.
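
As a rough illustration, the pseudocode might be written in Python as below. This is a sketch under stated assumptions: edges are given as (weight, u, v) tuples, the Union-Find structure is a plain parent-pointer forest with no balancing, and the function name kruskal_mst is hypothetical.

```python
def kruskal_mst(vertices, edges):
    """Return (tree_edges, total_weight) for a list of (weight, u, v) edges."""
    parent = {v: v for v in vertices}      # MAKE-SET for every vertex

    def find_set(x):
        # Follow parent pointers up to the representative.
        while parent[x] != x:
            x = parent[x]
        return x

    tree = []                              # the edge set A
    total = 0
    # Process edges from lightest to heaviest.
    for w, u, v in sorted(edges, key=lambda e: e[0]):
        root_u, root_v = find_set(u), find_set(v)
        if root_u != root_v:               # the edge does not close a cycle
            parent[root_v] = root_u        # UNION(u, v)
            tree.append((u, v, w))
            total += w
    return tree, total


# Small example graph (made up for illustration).
example_edges = [(3, "a", "c"), (1, "a", "b"), (5, "b", "d"),
                 (2, "b", "c"), (4, "c", "d")]
example_tree, example_weight = kruskal_mst("abcd", example_edges)
```

If the input graph is not connected, this sketch returns a minimum spanning forest rather than a tree.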

19.3 Correctness
We can argue correctness in a similar way to the way we proved correctness for Prim’s algorithm.

Theorem. If G = (V, E) is a connected, undirected, weighted graph, A is a subset of the edges of
some MCST T of G, and e is an edge of minimum weight among the edges not in A that do not
create a cycle with A, then A ∪ {e} is a subset of some MCST of G.

Proof. We use a similar argument as before. If e is part of T, then we are finished. If not, then
adding e to T creates a cycle. This cycle must contain some other edge e′ that is in T but not in A
(because e does not form a cycle with A). Also, e′ cannot form a cycle with A, because A ∪ {e′} ⊆ T
and T contains no cycle. So e′ was also a candidate when e was chosen, and by assumption
w(e) ≤ w(e′). Let T′ = (T ∪ {e}) − {e′}. Then, as before, T′ is a spanning tree with
w(T′) ≤ w(T), so T′ is an MCST, and A ∪ {e} ⊆ T′.

19.4 Data Structures for Union-Find

1. Linked lists: Represent each set by a linked list, where each node is an element. The
representative element is the head of the list. Each node contains a pointer back to the head.
The head also contains a pointer to the tail. We can implement the operations as follows
(list_x is the list containing x and list_y is the list containing y):

• MAKE-SET(x): Just create a list of one node containing x. Time: O(1).

• FIND-SET(x): Just follow x’s pointer back to the head and return the head. Time:
O(1).
• UNION(x,y): Append list_y to the end of list_x. Since we can find the head of list_y
and the tail of list_x in constant time, this takes O(1) time. The representative of this
combined list is the head of list_x, but the nodes of list_y still point to the head of list_y.
To update them to point to the head of list_x, it takes time Θ(length of list_y).

The worst-case sequence complexity for m of these operations is certainly O(m²): no list will
contain more than m elements since we can’t call MAKE-SET more than m times. The most
expensive operation is UNION; if we call this m times on lists of length m, it will take time
O(m²). Obviously this is an overestimate of the time, since we can’t call both MAKE-SET and
UNION m times.
We can show, however, that the worst-case sequence complexity of m operations is Ω(m²).
To do this, we have to give a sequence that will take time Ω(m²): start by calling MAKE-SET
m/2 + 1 times on elements x_1, x_2, ..., x_(m/2+1). Now do the loop:

for i = 2 to m/2 do
    UNION(x_i, x_1)

This will create a longer and longer list that keeps getting appended to a single element.
UNION(x_i, x_1) has to update the i − 1 head pointers of the list containing x_1, so the
execution of the loop takes time Θ(m²).

2. Linked lists with union-by-weight: Everything remains the same except we will store the
length of each linked list at the head. Whenever we do a UNION, we will take the shorter list
and append it to the longer list. So, UNION(x,y) will no longer take O(length of list_y), but
rather O(min{length(list_x), length(list_y)}). This type of union is called “union-by-weight”
(where “weight” just refers to the length of the list).
It might seem like union-by-weight doesn’t make much of a difference, but it greatly affects
the worst-case sequence complexity. Consider a sequence of m operations and let n be the
number of MAKE-SET operations in the sequence (so there are never more than n elements in
total). UNION is the only expensive operation and it’s expensive because of the number of
times we might have to update pointers to the head of the list. For some arbitrary element
x, we want to prove an upper bound on the number of times that x’s head pointer can be
updated during the sequence of m operations. Note that this happens only when list_x is
unioned with a list that is no shorter (because we update pointers only for the shorter list).
This means that each time x’s head pointer is updated, x’s new list is at least twice the
size of its old list. But the length of list_x can double only log n times before it has length
greater than n (which it can’t have because there are only n elements). So we update x’s
head pointer at most log n times. Since x could be any of n possible elements, we do a total of
at most n log n pointer updates. So the cost for all the UNIONs in the sequence is O(n log n).
The other operations can cost at most O(m) so the total worst-case sequence complexity is
O(m + n log n).
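
The pointer-update count in this argument can be checked with a small Python sketch. It makes some assumptions: each linked list is modelled by a Python list whose first entry is the representative, a dictionary named owner stands in for the head pointers, and the names WeightedListSets and count_updates_balanced are illustrative.

```python
class WeightedListSets:
    """List-based Union-Find with union-by-weight (illustrative model).

    owner[x] is the list containing x; its first entry is the representative.
    """

    def __init__(self):
        self.owner = {}
        self.pointer_updates = 0   # counts head-pointer rewrites in UNION

    def make_set(self, x):
        self.owner[x] = [x]

    def find_set(self, x):
        return self.owner[x][0]

    def union(self, x, y):
        long_list, short_list = self.owner[x], self.owner[y]
        if long_list is short_list:
            return                          # already in the same set
        if len(long_list) < len(short_list):
            long_list, short_list = short_list, long_list
        for item in short_list:             # only the shorter list is relabelled
            self.owner[item] = long_list
            self.pointer_updates += 1
        long_list.extend(short_list)


def count_updates_balanced(n):
    """Merge n singletons in rounds of equal-sized pairs; return update count."""
    sets = WeightedListSets()
    for i in range(n):
        sets.make_set(i)
    step = 1
    while step < n:
        for i in range(0, n - step, 2 * step):
            sets.union(i, i + step)
        step *= 2
    return sets.pointer_updates
```

Merging equal-sized lists in rounds is the expensive case for union-by-weight: for n = 1024 this sketch performs (n/2) log n = 5120 pointer updates, which is within the n log n = 10240 bound derived above.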

3. Trees: Represent each set by a tree, where each element points to its parent and the root
points back to itself. The representative of a set is the root. Note that the trees are not
necessarily binary trees: the number of children of a node can be arbitrarily large (or small).

• MAKE-SET(x): Just create a tree with a single node x. Time: O(1).

• FIND-SET(x): Follow the parent pointers from x until you reach the root. Return
root. Time: Θ( height of tree ).
• UNION(x,y): Let root_x be the root of the tree containing x, tree_x, and let root_y be the
root of the tree containing y, tree_y. We can find root_x and root_y using FIND-SET(x) and
FIND-SET(y). Then make root_y a child of root_x. Since we have to do both FIND-SETs,
the running time is Θ(max{height(tree_x), height(tree_y)}).
[Figure: UNION(x,y) makes root_y, the root of the tree containing y, a child of root_x, the root of the tree containing x.]

The worst-case sequence complexity for m operations is just like the linked list case, since we
can create a tree which is just a list:

for i = 1 to m/4 do
    MAKE-SET(x_i)
for i = 1 to m/4 - 1 do
    UNION(x_(i+1), x_i)

[Figure: the tree after UNION(x_(m/4), x_(m/4-1)): a chain with x_(m/4) at the root, x_(m/4-1) below it, and so on down to x_1, of height m/4 - 1.]

Creating this tree takes m/4 MAKE-SET operations and m/4 − 1 UNION operations. The running
time for m/2 + 1 FIND-SET operations on x_1 now is (m/4)(m/2 + 1) = Θ(m²).

Exercise. How do we know there is not a sequence of operations that takes longer than
Θ(m²)?

4. Trees with union-by-rank: We improved the performance of the linked-list implementation
by using “weight” or “size” information during UNION. We will do the same thing for
trees, using “rank” information. The rank of a tree is an integer that will be stored at the
root:

• MAKE-SET(x): Same as before. Set rank = 0.

• UNION(x,y): If rank(tree_x) ≥ rank(tree_y) then make root_y a child of root_x. Otherwise,
make root_x a child of root_y. The rank of the combined tree is rank(tree_x) + 1 if
rank(tree_x) = rank(tree_y), and max{rank(tree_x), rank(tree_y)} otherwise. The running
time is still Θ(max{height(tree_x), height(tree_y)}).
• FIND-SET(x): Same as before.

We can prove two things about union-by-rank:

(a) The rank of any tree created by a sequence of these operations is equal to its height.
(b) The rank of any tree created by a sequence of these operations is O(log n), where n is
the number of MAKE-SETs in the sequence.

These two facts imply that the running times of FIND-SET and UNION are O(log n), so the
worst-case sequence complexity of m operations is O(m log n).
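
A minimal Python sketch of the tree implementation with union-by-rank is given below. It assumes dictionaries for the parent pointers and ranks; the class name RankedForest and the helper depth are illustrative additions used only to inspect the result.

```python
class RankedForest:
    """Union-Find forest with union-by-rank and no path compression."""

    def __init__(self):
        self.parent = {}
        self.rank = {}

    def make_set(self, x):
        self.parent[x] = x
        self.rank[x] = 0

    def find_set(self, x):
        while self.parent[x] != x:
            x = self.parent[x]
        return x

    def union(self, x, y):
        root_x, root_y = self.find_set(x), self.find_set(y)
        if root_x == root_y:
            return
        if self.rank[root_x] < self.rank[root_y]:
            root_x, root_y = root_y, root_x   # root_x now has the larger rank
        self.parent[root_y] = root_x
        if self.rank[root_x] == self.rank[root_y]:
            self.rank[root_x] += 1            # equal ranks: height grows by one

    def depth(self, x):
        # Number of parent pointers followed from x to its root (helper).
        steps = 0
        while self.parent[x] != x:
            x = self.parent[x]
            steps += 1
        return steps


# The UNION sequence that produced a chain for plain trees now stays flat.
forest = RankedForest()
for i in range(64):
    forest.make_set(i)
for i in range(63):
    forest.union(i + 1, i)
```

On this sequence every element ends up at depth at most 1, because a single node of rank 0 is always attached below the existing root of rank 1.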

5. Trees with union-by-rank and path compression: In addition to doing union-by-rank,
there is another way to improve the tree implementation of Union-Find: When performing
FIND-SET(x), keep track of the nodes visited on the path from x to root_x (in a stack or queue),
and once the root is found, update the parent pointers of each of these nodes to point directly
to the root. This at most doubles the running time of the current FIND-SET operation, but
it can speed up future FIND-SETs. This technique is called “path compression.”
This is the state-of-the-art data structure for Union-Find. Its worst-case sequence complexity
is O(m log* n) (see section 22.4 of the text for a proof). The function log* n is a very slowly
growing function; it is equal to the number of times you need to apply log to n before the
answer is at most 1. For example, if n = 15, then 3 < log n < 4, so 1 < log log n < 2 and
log log log n < 1. So log* n = 3. Also, if n = 2^65536 = 2^(2^(2^(2^2))), then log* n = 5.
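
Path compression and the iterated logarithm can both be sketched in a few lines of Python. The names CompressedForest and log_star are illustrative. The sketch assumes logarithms base 2 and counts applications of log until the value is at most 1, which reproduces both examples in the text.

```python
import math


def log_star(n):
    """Count how many times log2 must be applied to n to reach a value <= 1."""
    count = 0
    while n > 1:
        n = math.log2(n)
        count += 1
    return count


class CompressedForest:
    """Union-Find forest with union-by-rank and path compression."""

    def __init__(self):
        self.parent = {}
        self.rank = {}

    def make_set(self, x):
        self.parent[x] = x
        self.rank[x] = 0

    def find_set(self, x):
        path = []                       # nodes visited on the way to the root
        while self.parent[x] != x:
            path.append(x)
            x = self.parent[x]
        for node in path:               # second pass: point them at the root
            self.parent[node] = x
        return x

    def union(self, x, y):
        root_x, root_y = self.find_set(x), self.find_set(y)
        if root_x == root_y:
            return
        if self.rank[root_x] < self.rank[root_y]:
            root_x, root_y = root_y, root_x
        self.parent[root_y] = root_x
        if self.rank[root_x] == self.rank[root_y]:
            self.rank[root_x] += 1
```

Once paths are compressed, the stored rank is only an upper bound on the height of a tree, since FIND-SET can shorten a tree without changing any rank.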

19.5 Complexity of Kruskal’s Algorithm

Let’s assume that m, the number of edges, is at least n − 1, where n is the number of vertices,
otherwise G is not connected and there is no spanning tree. Sorting the edges can be done in
time O(m log m) using mergesort, for example. Let’s also assume that we implement Union-Find
using linked lists with union-by-weight. We do n MAKE-SETs, at most 2m FIND-SETs and at most
m UNIONs. The first two take time O(n) and O(m), respectively. The last can take time at most
O(n log n) since in that amount of time we would have built up the set of all vertices. Hence, the
running time of Kruskal is O(m log m + n + m + n log n) = O(m log m), where the last step uses
n ≤ m + 1.
