Disjoint Set Algorithms Explained

The disjoint set data structure partitions a set of items into mutually exclusive sets, where each item belongs to exactly one set. It supports two main operations: Find, which determines which set an item belongs to; and Union, which combines two sets. The disjoint set can be represented using a parent pointer array, where each item points to its parent representative in the set. Improved algorithms like path compression and union by rank can perform Find and Union operations in nearly constant time on average. Disjoint sets have applications in problems that involve dynamically merging sets, such as computing the connected components of a graph.

Uploaded by

Swagata Rana

Disjoint Set Data Structure

Suppose we have n items (student records, bank account records, whatever) each
with unique keys from 1..n. We want to keep the items in a collection of sets
(disjoint sets) such that an item must occur in exactly one of those sets. For
example, we want to partition a set of students into "students with GPA >= 2.0"
and "students with GPA < 2.0." Or we might want the collection to have many sets,
e.g. different income levels of people. The point is, the sets are mutually exclusive
and include all the items.
In disjoint sets, each set is identified by a representative, which is some member of
the set. For convenience we could choose the element with the smallest key, but any
consistent choice of representative will do. It will soon become convenient to think
of the representative as the parent of the other items in the set, as in a tree.
In many situations in computer science, problems involving disjoint sets naturally
arise such that the sets grow dynamically (i.e., during the course of an algorithm,
sets change by merging) and two important operations are:

Find (x) - determine which set an item with key x is in, i.e., return the key of
the representative of the set x is in. Using this operation, one can tell whether
two elements are in the same set: you just do a Find on both of them and
compare the return values. If they are the same, then the two items are in the
same set.
Union (x, y) - unite the sets containing x and y. (Note: union is a C keyword;
don't write a function called union in your C or C++ program!)

Here is an example of an algorithm that uses these operations. It computes the
connected components of an undirected graph G = (V, E). Recall that the connected
components of a graph are the maximal subgraphs in which every pair of vertices is
connected by a path:
Connected-Components (G)
    for each vertex v in V do
        // each vertex is initially in its own component
        make v a singleton set with a unique key from 1..|V|
    end for
    for each edge (u, v) in E do
        // if u and v are connected by an edge, then
        // everything in u's component is connected
        // to everything in v's component, so Union the sets
        if Find (u) != Find (v) then Union (u, v)
    end for

Now, if we want to find out whether two vertices are in the same connected
component, we just call Find on both vertices and see if the result is the same. If
so, then they are connected.
There is an easy algorithm that implements the disjoint set operations. The idea is
that we have an array p[1..n] with an element for each item that is in some set. If
the item is a representative (parent) of some set, then the value of its element is its
own index; otherwise the value is the index of another item in the array, giving rise
to a linked list eventually ending in the parent item. For example, suppose we have
the following disjoint sets (indicated by their unique 1..n keys):
{ 6 14 1 } { 2 3 13 } { 5 12 7 8 10 } { 9 11 4 15 }

Then we might have the following lists giving the set relationships:
1 <-- 6 <-- 14
2 <-- 3 <-- 13
5 <-- 12 <-- 7 <-- 8 <-- 10
9 <-- 11 <-- 4 <-- 15

To find the representative of an element, we just traverse the list until we reach the
parent (which points to itself).
Of course, nothing (except for ASCII graphics) prevents us from having tree-like
structures with this representation, where more than one item points to the same
parent item, e.g. (each edge points from a child up to its parent):

      1            2           5              9
     / \          / \          |            / | \
    6   14      13   3        12          11  4  15
                            / | \
                           7  8  10
An array representing this forest would look like this:

    i    :  1  2  3  4  5  6  7  8  9 10 11 12 13 14 15
    p[i] :  1  2  2  9  5  1 12 12  9 12  9  5  2  1  9

Let's see algorithms to implement this linked-list Union/Find. We'll assume that
p[1..n] is initialized to p[i] = i so each item is in its own singleton set and is its own
set's representative.

// return the key of the representative of this set
Find (x)
    if p[x] != x then
        return Find (p[x])
    end if
    return x

// join two sets containing items x and y together
Union (x, y)
    a = Find (x)   // a is x's representative
    b = Find (y)   // b is y's representative
    p[a] = b       // now b is a's parent, and Find (x) would return b

These algorithms are very simple. In the worst case, Find takes O(n)
operations; this worst case occurs when one big set is represented as a
long linked list and we Find the representative of the last item. Union is bounded
by the same worst-case time, since it calls Find only a constant number of times.
One obvious improvement would be to have all set members point directly to their
representatives, instead of to arbitrary other members of the set. One way to do this
is called path compression: each time we do a Find on a set element, we make its
parent the representative element. We "compress" the path from leaf to root; after a
Find, instead of many levels between a leaf and the root, there is just one. And,
every item along the path from leaf to root is also directly connected to the root:
Find (x)
    if p[x] != x then
        p[x] = Find (p[x])
    end if
    return p[x]

The next time a Find is done on any element along this path, the parent will be
returned in O(1) time. This yields very good performance in practice, since any
long chain will likely be broken quickly. However, we still have to worry about
Union: each time we do a Union, we push some tree down a level. This means that
the next find done on a member of this subtree will take a little longer than it
would have before the Union (although the path is compressed during the Find, the
damage is already done with the time spent doing the Find). We can minimize the
impact of this situation using a heuristic called union by rank. With this method,
we keep track of how many elements are in each subtree, and make the smaller subtree
the child of the larger one. This ensures that most of the items in the new tree
are unaffected by the Union in terms of how long it takes to find the representative.
With each item x we associate a count count[x] that contains the number of items
in the tree rooted at x. When the sets first start out as singletons, they each have a
count of one.
Union (x, y)
    a = Find (x)
    b = Find (y)
    if count[a] > count[b] then
        // a has more kids; make b its child
        p[b] = a
        // and update the count of a to include b's kids
        count[a] += count[b]
    else
        // or vice-versa
        p[a] = b
        count[b] += count[a]
    end if

(Your book uses a different, approximate method in order to make the analysis of
the algorithm easier.)
Here is an example of doing some Union operations on sets with indices 1..6 (c[i]
is count[i]):

Initially:
    i    : 1  2  3  4  5  6
    p[i] : 1  2  3  4  5  6
    c[i] : 1  1  1  1  1  1

Union (3, 5)
    i    : 1  2  3  4  5  6
    p[i] : 1  2  5  4  5  6
    c[i] : 1  1  1  1  2  1

Union (4, 2)
    i    : 1  2  3  4  5  6
    p[i] : 1  2  5  2  5  6
    c[i] : 1  2  1  1  2  1

Union (2, 6)
    i    : 1  2  3  4  5  6
    p[i] : 1  2  5  2  5  2
    c[i] : 1  3  1  1  2  1

Union (1, 4)
    i    : 1  2  3  4  5  6
    p[i] : 2  2  5  2  5  2
    c[i] : 1  4  1  1  2  1

Union (3, 6)
    i    : 1  2  3  4  5  6
    p[i] : 2  2  5  2  2  2
    c[i] : 1  6  1  1  2  1

(We'll see the trees on the board in class.)

The analysis of this Union/Find algorithm is beyond the scope of an undergraduate
class. However, the results are pretty amazing: a Union or Find operation takes
O(lg* n) time amortized over all the operations (i.e., one particular instance may
take longer, but overall, each one averages out to O(lg* n)). This lg* is the iterated
logarithm function: the number of times you can take the log base 2 of a
number before the result drops to 1 or below. It grows very slowly; lg* of 10^20 is
only 5. You will probably never need to do Union/Find on sets with that many
elements; indeed, there aren't even that many bytes in all the computers in the
world. So for all practical purposes, Union/Find can be considered to run in
constant time.
