Disjoint Set, String Matching, NP Problem

The document discusses various algorithms related to disjoint sets, string matching, NP-complete problems, and approximation algorithms. It explains the operations of the disjoint set data structure, string matching techniques like Naive, Rabin Karp, and KMP algorithms, and the definitions and differences between NP, NP-Hard, and NP-Complete problems. Additionally, it introduces approximation algorithms aimed at finding near-optimal solutions for optimization problems.

Uploaded by

farwajavaid19


Design and Analysis of Algorithm

Disjoint Set (Union-Find Algorithm)

Two sets are called disjoint sets if they have no element in common, that is, their intersection is the empty set.

A data structure that stores non-overlapping (disjoint) subsets of elements is called a disjoint set data structure. The disjoint set data structure supports the following operations:

• Adding new sets to the disjoint set.

• Merging disjoint sets into a single disjoint set using the Union operation.

• Finding the representative of a disjoint set using the Find operation.

• Checking whether two sets are disjoint or not.

Operations on Disjoint Set Data Structures:

1. Find

2. Union

1. Find:

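Find can be implemented by recursively following the parent array from the given element until we reach a node that is its own parent; that node is the representative of the set.

Time complexity: This naive approach is inefficient and can take O(n) time in the worst case, because the tree may degenerate into a chain of n nodes.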

2. Union:

Union takes two elements as input, finds the representatives of their sets using the Find operation, and then puts one of the trees (each representing a set) under the root node of the other tree.

Time complexity: Without any balancing, this approach is inefficient and can produce a tree of height O(n) in the worst case.
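The two operations described above can be sketched as follows. This is a minimal illustration of the naive parent-array version, not a tuned implementation; the class name `DisjointSet` and the helper `same_set` are names chosen here for illustration, and elements are assumed to be the integers 0..n-1.

```python
class DisjointSet:
    """Naive disjoint set: parent[i] is the parent of element i."""

    def __init__(self, n):
        # Each element starts as the root (representative) of its own set.
        self.parent = list(range(n))

    def find(self, x):
        # Follow parent links until reaching a node that is its own parent.
        if self.parent[x] == x:
            return x
        return self.find(self.parent[x])

    def union(self, a, b):
        # Put the tree containing a under the root of the tree containing b.
        root_a, root_b = self.find(a), self.find(b)
        if root_a != root_b:
            self.parent[root_a] = root_b

    def same_set(self, a, b):
        # Two elements are in the same set iff they share a representative.
        return self.find(a) == self.find(b)


ds = DisjointSet(5)
ds.union(0, 1)
ds.union(3, 4)
```

Because `union` always hangs the first root under the second, a sequence such as `union(0, 1)`, `union(1, 2)`, `union(2, 3)` builds a chain, which is the O(n) worst case mentioned above. The usual remedies, union by rank and path compression, are not shown here.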

String Matching Introduction

String matching algorithms are also called "string searching algorithms." They form a vital class of string algorithms whose task is to find the places where one or several strings (the patterns) occur within a larger string (the text).

Applications of String Matching Algorithms:

Plagiarism Detection: The documents to be compared are decomposed into string tokens and compared using string matching algorithms. These algorithms detect similarities between the documents and help decide whether the work is plagiarized or original.

Bioinformatics and DNA Sequencing: Bioinformatics applies information technology and computer science to problems involving genetic sequences. String matching algorithms are used in DNA analysis to find occurrences of a given set of patterns in a sequence.

String Matching Algorithms

• Naive Algorithm: It slides the pattern over the text one position at a time and checks for a match at each position. After a match is found, it slides by 1 again to look for subsequent matches.

Pros: • Very simple to understand and implement. • Works well when the pattern is short and the text is not too large.

Cons: • Can be very slow: for a text of length n and a pattern of length m it makes O(n·m) character comparisons in the worst case, which occurs when many positions match most of the pattern before a mismatch.
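A minimal sketch of the naive approach, assuming the text and pattern are ordinary Python strings; the function name `naive_search` is chosen here for illustration.

```python
def naive_search(text, pattern):
    """Return every index at which pattern starts in text."""
    n, m = len(text), len(pattern)
    matches = []
    for shift in range(n - m + 1):
        # Compare the pattern with the window of text starting at this shift.
        if text[shift:shift + m] == pattern:
            matches.append(shift)
    return matches


print(naive_search("AABAACAADAABAABA", "AABA"))  # prints [0, 9, 12]
```

Each of the n - m + 1 shifts may compare up to m characters, which is where the O(n·m) worst case comes from.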

• Rabin Karp Algorithm: It uses hashing to find occurrences of a pattern (or of a set of patterns). Instead of checking all characters of the pattern at every position (like the naive algorithm), it first compares a hash value of the pattern with a hash value of the current window of text.

• Think of it like looking for a specific page in a book by its unique code instead of by reading every word. If the code (hash) matches, you then check to make sure it really is the page you are looking for.

Pros: • Faster than the naive approach on average. • Very efficient for searching for multiple patterns at once.

Cons: • Requires a good hash function to avoid frequent spurious hits (hash matches that are not real matches). • The worst-case time complexity can be as bad as that of the naive algorithm if many hash collisions occur.
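The hashing idea can be sketched with a rolling polynomial hash. This is one common way to realize it, not necessarily the variant the notes have in mind; the function name `rabin_karp` and the default `base` and `mod` values are assumptions chosen for illustration.

```python
def rabin_karp(text, pattern, base=256, mod=1_000_000_007):
    """Return every index at which pattern starts in text."""
    n, m = len(text), len(pattern)
    if m == 0 or m > n:
        return []
    high = pow(base, m - 1, mod)  # weight of the leading character
    p_hash = t_hash = 0
    for i in range(m):
        p_hash = (p_hash * base + ord(pattern[i])) % mod
        t_hash = (t_hash * base + ord(text[i])) % mod
    matches = []
    for shift in range(n - m + 1):
        # Equal hashes are only a hint; compare the characters to rule
        # out a spurious hit.
        if p_hash == t_hash and text[shift:shift + m] == pattern:
            matches.append(shift)
        if shift < n - m:
            # Roll the hash: drop text[shift], append text[shift + m].
            t_hash = ((t_hash - ord(text[shift]) * high) * base
                      + ord(text[shift + m])) % mod
    return matches
```

Updating the hash takes constant time per shift, so the character-by-character check only runs when the hashes agree. With a deliberately small modulus such as `mod=7` the function still returns correct results, but it performs more of these verification checks, which illustrates the worst case described above.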

• KMP (Knuth Morris Pratt) Algorithm: The KMP algorithm is smarter. It pre-processes the pattern to understand its structure (which prefixes of the pattern are also suffixes) and uses this to skip unnecessary comparisons when a mismatch occurs.

Pros: • The position in the text never moves backward, so matching takes O(n + m) time for a text of length n and a pattern of length m. • No backtracking in the text is needed, so it is more efficient than the naive approach.

Cons: • The pre-processing step requires additional time and memory. • The algorithm is more complex to understand and implement.
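The pre-processing step is commonly implemented as a "longest proper prefix that is also a suffix" table. The sketch below follows that standard formulation; the names `build_lps` and `kmp_search` are chosen here for illustration.

```python
def build_lps(pattern):
    """lps[i] is the length of the longest proper prefix of
    pattern[:i + 1] that is also a suffix of it."""
    lps = [0] * len(pattern)
    length = 0
    for i in range(1, len(pattern)):
        # Fall back to shorter borders until the next character fits.
        while length > 0 and pattern[i] != pattern[length]:
            length = lps[length - 1]
        if pattern[i] == pattern[length]:
            length += 1
        lps[i] = length
    return lps


def kmp_search(text, pattern):
    """Return every index at which pattern starts in text."""
    if not pattern:
        return []
    lps = build_lps(pattern)
    matches = []
    matched = 0  # number of pattern characters currently matched
    for i, ch in enumerate(text):
        # On a mismatch, reuse the table instead of moving back in text.
        while matched > 0 and ch != pattern[matched]:
            matched = lps[matched - 1]
        if ch == pattern[matched]:
            matched += 1
        if matched == len(pattern):
            matches.append(i - matched + 1)
            matched = lps[matched - 1]
    return matches
```

The loop index `i` over the text only ever increases, which is the "no backtracking" property. The extra memory mentioned under Cons is the `lps` table, of size m.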

NP Complete Problem

NP Problem:
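NP is the set of decision problems for which a proposed solution can be verified in polynomial time. Equivalently, these are the problems that a non-deterministic machine can solve in polynomial time. For many problems in NP, no polynomial-time method for finding a solution is known.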
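To make "easy to verify" concrete, here is a sketch of a polynomial-time verifier for the Hamiltonian cycle problem. The function name, the undirected-graph assumption, and the representation (vertices 0..n-1, edges as pairs) are choices made here for illustration.

```python
def verify_hamiltonian_cycle(n, edges, cycle):
    """Check a proposed certificate: cycle must list each of the n
    vertices exactly once, with consecutive vertices (and the last
    back to the first) joined by an edge."""
    if sorted(cycle) != list(range(n)):
        return False
    edge_set = {frozenset(e) for e in edges}
    return all(
        frozenset((cycle[i], cycle[(i + 1) % n])) in edge_set
        for i in range(n)
    )


square = [(0, 1), (1, 2), (2, 3), (3, 0)]
print(verify_hamiltonian_cycle(4, square, [0, 1, 2, 3]))  # prints True
print(verify_hamiltonian_cycle(4, square, [0, 2, 1, 3]))  # prints False
```

Checking a given cycle takes time polynomial in the size of the graph, whereas no polynomial-time algorithm is known for finding such a cycle.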

NP-Hard Problem:
A problem X is NP-Hard if there is an NP-Complete problem Y such that Y is reducible to X in polynomial time. Equivalently, X is NP-Hard if every problem in NP can be reduced to it in polynomial time. NP-Hard problems are at least as hard as NP-Complete problems, and an NP-Hard problem need not be in NP.

In practice, a problem is usually shown to be NP-Hard by taking a problem already known to be NP-Hard and reducing it to the problem in question.

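Examples:

1. Hamiltonian cycle.

2. Optimization versions of NP-Complete problems, such as finding a minimum vertex cover.

3. Shortest-tour problems such as the Travelling Salesman Problem. (The ordinary shortest path problem with non-negative edge weights is solvable in polynomial time.)

NP-Complete Problem: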

A problem X is NP-Complete if X is in NP and every problem Y in NP is reducible to X in polynomial time. In other words, a problem is NP-Complete if it is both in NP and NP-Hard. NP-Complete problems are the hardest problems in NP. A non-deterministic Turing machine can solve an NP-Complete problem in polynomial time.

Because NP-Complete problems are in NP, their solutions can be verified in polynomial time.


Difference between NP-Hard and NP-Complete:

| NP-Hard | NP-Complete |
|---|---|
| A problem X is NP-Hard if some NP-Complete problem Y is reducible to X in polynomial time. | Can be solved by a non-deterministic algorithm/Turing machine in polynomial time. |
| Does not have to be in NP. | Must be both in NP and NP-Hard. |
| No polynomial bound on the time needed to verify a solution is guaranteed. | A proposed solution can be verified in polynomial time. |
| Need not be a decision problem; it is often an optimization problem. | Always a decision problem. |
| Not all NP-Hard problems are NP-Complete. | All NP-Complete problems are NP-Hard. |

Approximation Algorithm

An approximation algorithm is a way of dealing with NP-completeness for optimization problems. This technique does not guarantee the best solution. The goal of an approximation algorithm is to come as close as possible to the optimum value in a reasonable amount of time, which is at most polynomial time. Such algorithms are called approximation algorithms or, more loosely, heuristic algorithms.

Performance Ratios

The performance ratio of an approximation algorithm, also called its approximation ratio, measures how close the approximate solution is to the optimal solution.

The approximation ratio is written ρ(n), where n is the input size, C is the cost of the near-optimal solution obtained by the algorithm, and C* is the cost of an optimal solution. The algorithm has an approximation ratio of ρ(n) if and only if, for every input of size n:

$$\max\left(\frac{C}{C^*}, \frac{C^*}{C}\right) \le \rho(n)$$
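As a small worked illustration with hypothetical numbers (not taken from the notes): suppose that on some minimization instance the algorithm returns a solution of cost C = 4 while the optimal cost is C* = 2. Then

$$\max\left(\frac{C}{C^*}, \frac{C^*}{C}\right) = \max\left(\frac{4}{2}, \frac{2}{4}\right) = 2,$$

so this instance is consistent with any ratio ρ(n) ≥ 2. Taking the maximum of both fractions lets the same definition cover minimization problems (where C ≥ C*) and maximization problems (where C ≤ C*).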

A few popular examples of approximation algorithms are:

• Vertex Cover Algorithm

• Set Cover Problem

• Travelling Salesman Problem (Approximation Approach)

• The Subset Sum Problem
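As an illustration of the first item, the sketch below shows the well-known edge-picking approximation for vertex cover, which has an approximation ratio of 2. The function name `approx_vertex_cover` and the edge-list representation are assumptions made here; the notes do not specify an implementation.

```python
def approx_vertex_cover(edges):
    """Return a vertex cover at most twice the size of an optimal one."""
    cover = set()
    for u, v in edges:
        # If neither endpoint is covered yet, take both endpoints.
        if u not in cover and v not in cover:
            cover.add(u)
            cover.add(v)
    return cover


path = [(0, 1), (1, 2), (2, 3)]
cover = approx_vertex_cover(path)
print(sorted(cover))  # prints [0, 1, 2, 3]
```

The edges that trigger an addition share no endpoints, and any vertex cover must contain at least one endpoint of each of them, so the returned cover is at most twice the optimum. On the path above the optimal cover is {1, 2}, so C = 4 and C* = 2, which meets the bound exactly.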
