0% found this document useful (0 votes)

68 views

Distributed Deadlock Detection

Distributed deadlock detection in distributed systems is complicated because no single site has an accurate view of the global state. Three common approaches are centralized, distributed, and hierarchical control. Centralized detection maintains a global wait-for graph at a single controller node but has single point of failure issues. Distributed detection spreads the graph across all nodes but resolution can be difficult. The Ho-Ramamoorthy algorithms use a centralized controller that periodically collects local status tables to construct the global graph and detect cycles, addressing issues like false deadlocks.

Uploaded by

manyamlakshmiprasanna

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

68 views

Distributed Deadlock Detection

Uploaded by

manyamlakshmiprasanna

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 18

Distributed Deadlock Detection

Introduction:

Deadlocks are a fundamental problem in distributed systems.

A process may request resources in any order, which may not be known a priori
and a process can request resource while holding others.

If the sequence of the allocations of resources to the processes is not controlled,

deadlocks can occur.

A deadlock is a state where a set of processes request resources that are held by
other processes in the set.

A distributed program is composed of a set of n asynchronous processes p1, p2, . .

. , pi , . . . , pn that communicates by message passing over the communication
network.

Without loss of generality we assume that each process is running on a different

processor.

The processors do not share a common global memory and communicate solely
by passing messages over the communication network.

There is no physical global clock in the system to which processes have

instantaneous access.

The communication medium may deliver messages out of order, messages may
be lost garbled or duplicated due to timeout and retransmission, processors may
fail and communication links may go down.

We make the following assumptions:

The systems have only reusable resources.

Processes are allowed to make only exclusive access to resources.

There is only one copy of each resource.

A process can be in two states: running or blocked.

In the running state (also called active state), a process has all the needed
resources and is either executing or is ready for execution.

In the blocked state, a process is waiting to acquire some resource.

Wait-For-Graph:

The state of the system can be modeled by directed graph, called a wait for
graph(WFG).

In a WFG , nodes are processes and there is a directed edge from node P1 to
mode P2 if P1 is blocked and is waiting for P2 to release some resource.

A system is deadlocked if and only if there exists a directed cycle or knot in the
WFG

Deadlock Handling Strategies

There are three strategies for handling deadlocks, viz.,

deadlock prevention,

deadlock avoidance, and

deadlock detection.

Handling of deadlock becomes highly complicated in distributed systems because

no site has accurate knowledge of the current state of the system and because
every inter-site communication involves a finite and unpredictable delay.
Deadlock prevention:It is commonly achieved either by having a process acquire
all the needed resources simultaneously before it begins executing or by
preempting a process which holds the needed resource.

• A method that might work is to order the resources and require processes to
acquire them in strictly increasing order. This approach means that a process can
never hold a high resource and ask for a low one, thus making cycles impossible.

• With global timing and transactions in distributed systems, two other methods
are possible ‐‐ both based on the idea of assigning each transaction a global
timestamp at the moment it starts.

• When one process is about to block waiting for a resource that another process
is using, a check is made to see which has a larger timestamp.

• We can then allow the wait only if the waiting process has a lower timestamp.

• The timestamp is always increasing if we follow any chain of waiting processes,

so cycles are impossible ‐‐‐ we can used decreasing order if we like.

• It is wiser to give priority to old processes because

– they have run longer so the system have larger investment on these processes.
– they are likely to hold more resources.

– A young process that is killed off will eventually age until it is the oldest one in
the system, and that eliminates starvation.

This approach is highly inefficient and impractical in distributed systems.

Deadlock avoidance:In this approach to distributed systems, a resource is granted

to a process if the resulting global system state is safe (note that a global state
includes all the processes and resources of the distributed system).

However, due to several problems, deadlock avoidance is impractical in

distributed systems.
Deadlock detection requires examination of the status of process-resource
interactions for presence of cyclic wait.

Deadlock detection in distributed systems seems to be the best approach to

handle deadlocks in distributed systems.

Issues in Deadlock Detection:

Deadlock handling using the approach of deadlock detection entails addressing

two basic issues: First, detection of existing deadlocks and second resolution of
detected deadlocks.

Detection of deadlocks involves addressing two issues:

Maintenance of the WFG and Searching of the WFG for the presence of cycles(or
knots).

Correctness Criteria:

A deadlock detection algorithm must satisfy the following 2 conditions:

I) Progress(No Undetected deadlocks):

The algorithm must detect all existing deadlocks in finite time.
In other words, after all wait-for dependencies for deadlocks
Have formed, the algorithm should not wait for any more events to
occur to detect the deadlocks.
II) Safety(No false deadlocks):
The algorithm should not report deadlocks which do not exist(called
phantom or false deadlocks).

Since a global state in a distributed system is put together by

communicating messages, a global WFG may include out-of-date arcs
that appear to denote a cycle in the system.
It is hard to create algorithms that are not confused by this.
Resolution of a Detected Deadlock:
Deadlock resolution involves breaking existing wait-for dependencies
between the processes to resolve the deadlock.
It involves rolling back one or more deadlocked processes and assigning
their resources to blocked processes so that they can resume execution.
Killing a process in the cycle(s);
Preempting the resources from a process in the cycle(s);
Rolling back a process in the cycle(s).

The resources of this process (or processes) are then released and may be
acquired by other processes.

Control Organization for Distributed Deadlock Detection Algorithms

Algorithms for detecting distributed deadlock can be handled in three
different ways:
 Centralized
 Distributed
 Hierarchical
Assume that the network supports reliable communication.
Centralized:
One central site sets up a global WFG and searchs for cycles.
All decisions are made by the central control node.
 It must maintain the global WFG constantly or
 Periodically reconstruct it.
The main advantage is that this permits the use of relatively simple
algorithms.
The disadvantages include the following:
 There is one, single point of failure.
 There can be a communication bottleneck around the site due to all the
WFG information messages.
 Furthermore, this traffic is independent of the formation of any deadlock.
Distributed:
In a distributed control organization,
 All sites have an equal amount of information.
 All sites make decisions based on local information.
 All sites bear equal responsibility for the final decision in detecting
deadlock.
 All sites expend equal effort to the final decision.
 The global WFG is spread across the sites.
 Deadlock detection is initiated whenever a process thinks there might be a
problem.
 Several sites can initiate the detection at the same time.
The advantages include the following:
o There is no central point of failure.
o A single node failure cannot cause a crash.
o There is no one site with heavy traffic due to the detection algorithm.
o The algorithm is only initiated when process(es) feel there might be a
problem.
o The algorithm is not run periodically, only when needed.
The main disadvantage is that resolution may be difficult, as not all sites
may be aware of the processes involved in the deadlock.
The proof of correctness for this type of algorithm may be difficult.

Centralized Deadlock Detection:

We use a centralized deadlock detection algorithm and try to imitate the

non‐distributed algorithm.

– Each machine maintains the resource graph for its own processes and
resources
. – A centralized coordinator maintain the resource graph for the entire
system
– When the coordinator detect a cycle, it kills off one process to break the
deadlock.
– In updating the coordinator’s graph, messages have to be passed.
• Method 1) Whenever an arc is added or deleted from the resource
graph, a message have to be sent to the coordinator.
• Method 2) Periodically, every process can send a list of arcs added and
deleted since previous update.
• Method 3) Coordinator ask for information when it needs it.

False Deadlocks:

One possible way to prevent false deadlock is to use the Lamport’s

algorithm to provide global timing for the distributed systems.
• When the coordinator gets a message that leads to a suspect deadlock:
– It send everybody a message saying “I just received a message with a
timestamp T which leads to deadlock. If anyone has a message for me
with an earlier timestamp, please send it immediately”
– When every machine has replied, positively or negatively, the
coordinator will see that the deadlock has really occurred or not.

Centralized Deadlock Detection Algorithms

• The Ho‐Ramamoorthy Algorithms
– The Two‐Phase Algorithm
– The One‐phase Algorithm
Ho‐Ramamoorthy 2‐phase Algorithm –
Each site maintains a status table of all processes initiated at that site:
includes all resources locked & all resources being waited on.
– Controller requests (periodically) the status table from each site.
– Controller then constructs WFG from these tables, searches for cycle(s).
– If no cycles, no deadlocks.
– Otherwise, (cycle exists): Request for state tables again. – Construct
WFG based only on common transactions in the 2 tables.
– If the same cycle is detected again, system is in deadlock.
– Later proved: cycles in 2 consecutive reports need not result in a
deadlock. Hence, this algorithm detects false deadlocks.

Ho‐Ramamoorthy 1‐phase Algorithm

– Each site maintains 2 status tables: resource status table and
process status table.
– Resource table: transactions that have locked or are waiting for
resources.
– Process table: resources locked by or waited on by transactions.
– Controller periodically collects these tables from each site.
– Constructs a WFG from transactions common to both the tables.
– No cycle, no deadlocks.
– A cycle means a deadlock.

Distributed Deadlock‐Detection Algorithms

• A Path‐Pushing Algorithm
– The site waits for deadlock‐related information from
other sites
– The site combines the received information with its local
TWF graph to build an updated TWF graph
– For all cycles ‘EX ‐> T1 ‐> T2 ‐> Ex’ which contains the
node ‘Ex’, the site transmits them in string form ‘Ex, T1,
T2, Ex’ to all other sites where a sub‐transaction of T2 is
waiting to receive a message from the sub‐transaction of
T2 at that site.

Edge‐Chasing Algorithm
• Chandy‐Misra‐Haas’s Algorithm:
– A probe(i, j, k) is used by a deadlock detection process Pi. This
probe is sent by the home site of Pj to Pk.
– This probe message is circulated via the edges of the graph. Probe
returning to Pi implies deadlock detection.
– Terms used:
• Pj is dependent on Pk, if a sequence of Pj, Pi1,.., Pim, Pk exists.
• Pj is locally dependent on Pk, if above condition + Pj,Pk on
same site.
• Each process maintains an array dependenti: dependenti(j) is
true if Pi knows that Pj is dependent on it. (initially set to false
for all i & j).

Chandy‐Misra‐Haas’s Algorithm
Sending the probe:
if Pi is locally dependent on itself then deadlock.
else for all Pj and Pk such that
(a) Pi is locally dependent upon Pj, and
(b) Pj is waiting on Pk, and
(c ) Pj and Pk are on different sites, send probe(i,j,k) to the home
site of Pk.
Receiving the probe:
if (d) Pk is blocked, and
(e) dependentk(i) is false, and
(f) Pk has not replied to all requests of Pj,
then begin
dependentk(i) := true;
if k = i then Pi is deadlocked
else ...
Receiving the probe:
…….
else for all Pm and Pn such that
(a’) Pk is locally dependent upon Pm, and
(b’) Pm is waiting on Pn, and
(c’) Pm and Pn are on different sites, send probe(i,m,n)
to the home site of Pn.
end.
Performance:
For a deadlock that spans m processes over n sites, m(n-1)/2 messages
are needed.
Size of the message 3 words.
Delay in deadlock detection O(n).

Chandy‐Misra‐Haas Algorithm

• There are several ways to break the deadlock:

– The process that initiates commit suicide ‐‐ this is overkilling because
several process might initiates a probe and they will all commit suicide in
fact only one of them is needed to be killed.
– Each process append its id onto the probe, when the probe come back,
the originator can kill the process which has the highest number by
sending hima message. (Even for several probes, they will all choose the
same guy)

Hierarchical:
The sites (nodes) are logically connected in a hierarchical structure (such as
a tree).
A site can detect deadlock in its descendants.
This type of algorithm has the best of both the centralized and the
distributed deadlock detection algorithms.
For efficiency purposes, it is best to keep clusters of interacting processes
together in the hierarchy.
• Follows Ho-Ramamoorthy’s 1-phase algorithm. More than 1 control site
organized in hierarchical manner.
• Each control site applies 1-phase algorithm to detect (intracluster)
deadlocks.
• Central site collects info from control sites, applies 1-phase algorithm to
detect intracluster deadlocks.
Menasce and Muntz Hirarchical deadlock dectection:

Sites (called controllers) are organized in a tree

Leaf controllers manage resources
Each maintains a local WFG concerned only about its own resources
Interior controllers are responsible for deadlock detection
Each maintains a global WFG that is the union of the WFGs of its children
Detects deadlock among its children
changes are propagated upward either continuously or periodically

Ho and Ramamoorthy’s hierarchical deadlock detection:

Sites are grouped into disjoint clusters
Periodically, a site is chosen as a central control site
Central control site chooses a control site for each cluster
Control site collects status tables from its cluster, and uses the
Ho and Ramamoorthy one-phase centralized deadlock detection algorithm
to detect deadlock in that cluster
All control sites then forward their status information and WFGs to the
central control site, which combines that information into a global WFG and
searches it for cycles
Control sites detect deadlock in clusters
Central control site detects deadlock between clusters
Resource Management, Distributed Environment, Peer-to-Peer
I. INTRODUCTION

Resource Management in Distributed Environment is a management system of

resources like files other data over the distributed system whose main aim is to make
sure that a user/client can access the remote resources with as much ease as it can
access local resources. The basis of resource management is also resource sharing. Since
a computer can request a service or file from another computer by sending an
appropriate request to it over the communication network. Hardware and software
resources can be shared among autonomous computers. This communication can also
be referred to as peer-to-peer communication mechanism which is also the basis of
distributed system rather than the centralized-server and client mechanism. The peer-
to-peer communication.
Mechanism is much more efficient, flexible, convenient and faster than the centralized-
server and client’s mechanism. In this architecture all the process involved in a task like
resource management play similar roles, interacting co-operatively as peers without any
distinction between client and server processes or the computers they run on. The aim
of the peer-to-peer architecture is to exploit the resources in a large number of
participating computers for the fulfilment of a given task. Organizing the interaction
between each computer is of prime importance. In order to be able to use the widest
possible range and types of computers, the protocol or communication channel should
not contain or misuse that may not be misunderstood by certain machines. Special care
must also be taken that messages are indeed delivered correctly and that invalid
messages are rejected which would otherwise bring down the system and perhaps the
rest of the network. Another important factor is the ability to send software to another
computer in a portable way so that it may execute and interact with the existing
network. This may not always be possible or practical when using different hardware
and resources, in which case other methods must be used such as cross-compiling or
manually porting this software.

Apache Cassandra Administrator Associate - Exam Practice Tests
From Everand
Apache Cassandra Administrator Associate - Exam Practice Tests
Cristian Scutaru
No ratings yet
Catalogue PAI - 2017 Water Pumps
100% (4)
Catalogue PAI - 2017 Water Pumps
531 pages
Reading Comprehension: Level J
No ratings yet
Reading Comprehension: Level J
67 pages
Elen C-Series-User-Manual
100% (1)
Elen C-Series-User-Manual
68 pages
Oracle 11g Streams Implementer's Guide
From Everand
Oracle 11g Streams Implementer's Guide
Ann L. R. McKinnell
No ratings yet
Distributed Mutual Exclusion
No ratings yet
Distributed Mutual Exclusion
23 pages
Machine Learning-4
No ratings yet
Machine Learning-4
18 pages
Distributed Mutual Xclusion Algorithms
No ratings yet
Distributed Mutual Xclusion Algorithms
5 pages
Aos Unit-1
No ratings yet
Aos Unit-1
39 pages
Deadlock-System Model Notes
No ratings yet
Deadlock-System Model Notes
2 pages
Concurrency Control in Distributed Transactions (1)
No ratings yet
Concurrency Control in Distributed Transactions (1)
17 pages
Chapter 3 - Old PPT - Deadlock
100% (1)
Chapter 3 - Old PPT - Deadlock
40 pages
Load Scheduling
100% (1)
Load Scheduling
10 pages
Literature Survey On A Load Balancing Model Based
No ratings yet
Literature Survey On A Load Balancing Model Based
12 pages
Daa
No ratings yet
Daa
113 pages
Module-2 Lecture 7
100% (1)
Module-2 Lecture 7
21 pages
Expert System Architecture
No ratings yet
Expert System Architecture
11 pages
Distributed Database
No ratings yet
Distributed Database
22 pages
MC4202 - Adavanced Database Technology
No ratings yet
MC4202 - Adavanced Database Technology
159 pages
Jntu Kakinada - M.tech - Mathematical Foundations of Computer Science Sup FR 28
No ratings yet
Jntu Kakinada - M.tech - Mathematical Foundations of Computer Science Sup FR 28
2 pages
Concurrency Control in Distributed Database Systems
No ratings yet
Concurrency Control in Distributed Database Systems
5 pages
HighPerformanceComputing DS
No ratings yet
HighPerformanceComputing DS
2 pages
Introduction To Distributed Database Presentation
100% (1)
Introduction To Distributed Database Presentation
67 pages
Unit II Cloud Computing
100% (1)
Unit II Cloud Computing
9 pages
Solar Cell - Working Principle & Construction
No ratings yet
Solar Cell - Working Principle & Construction
14 pages
Ad3451 ML Unit 4 Notes Eduengg
No ratings yet
Ad3451 ML Unit 4 Notes Eduengg
36 pages
Classical Problems of Synchronization
No ratings yet
Classical Problems of Synchronization
10 pages
Distributed Mutual Exclusion
No ratings yet
Distributed Mutual Exclusion
28 pages
CST 402 DC QB
No ratings yet
CST 402 DC QB
6 pages
Concurrency Control in Distributed Databases
100% (1)
Concurrency Control in Distributed Databases
12 pages
Concurrent Process and Programming: Processs and Threads Processes
No ratings yet
Concurrent Process and Programming: Processs and Threads Processes
11 pages
Process Synchronization: Critical Section Problem
No ratings yet
Process Synchronization: Critical Section Problem
8 pages
Distributed-Computing Notes
No ratings yet
Distributed-Computing Notes
108 pages
Aos Unit-1 Notes
No ratings yet
Aos Unit-1 Notes
29 pages
Introduction To Parallel Databases
No ratings yet
Introduction To Parallel Databases
24 pages
Note On Operating System and Kernel
No ratings yet
Note On Operating System and Kernel
3 pages
Superpipelining
No ratings yet
Superpipelining
7 pages
Distributed Shared Memory
No ratings yet
Distributed Shared Memory
30 pages
ADF Syllabus
No ratings yet
ADF Syllabus
8 pages
Knowledge Representation & Reasoning: By: Irum Naz Sodhar Lecturer IT, SBBU-SBA Main Campus
100% (1)
Knowledge Representation & Reasoning: By: Irum Naz Sodhar Lecturer IT, SBBU-SBA Main Campus
22 pages
Openmp Tutorial: Seung-Jai Min
No ratings yet
Openmp Tutorial: Seung-Jai Min
30 pages
Parallel Computing
No ratings yet
Parallel Computing
57 pages
Unit - 5 DBMS Kca 204
No ratings yet
Unit - 5 DBMS Kca 204
19 pages
Knowledge Representation and Reasoning Unit 5
No ratings yet
Knowledge Representation and Reasoning Unit 5
72 pages
Transaction in DDB
100% (1)
Transaction in DDB
9 pages
DSA RTU 2022 Paper
No ratings yet
DSA RTU 2022 Paper
15 pages
Lec01 Conceptlearning
100% (1)
Lec01 Conceptlearning
49 pages
CS09 607 (P) - DBMS Lab Manual PDF
100% (1)
CS09 607 (P) - DBMS Lab Manual PDF
94 pages
DLunit 4
No ratings yet
DLunit 4
16 pages
Snort 2
No ratings yet
Snort 2
50 pages
Interprocess Communication and Synchronization
No ratings yet
Interprocess Communication and Synchronization
9 pages
Multiprocessor and Multicomputers
No ratings yet
Multiprocessor and Multicomputers
5 pages
Disk Management in Operating System
100% (1)
Disk Management in Operating System
8 pages
Overview of Ad-Hoc Routing Protocols
No ratings yet
Overview of Ad-Hoc Routing Protocols
4 pages
Cse330 Agent-Based-Intelligent-Systems TH 1.00 Ac26
0% (1)
Cse330 Agent-Based-Intelligent-Systems TH 1.00 Ac26
2 pages
Case Study On Linux
No ratings yet
Case Study On Linux
40 pages
6CS5 DS Unit-4
No ratings yet
6CS5 DS Unit-4
64 pages
Chapter 3 - Recovery Techniques
100% (1)
Chapter 3 - Recovery Techniques
22 pages
Distributed File System - File Service Architecture
No ratings yet
Distributed File System - File Service Architecture
51 pages
Distributed Database Systems: January 2002
No ratings yet
Distributed Database Systems: January 2002
25 pages
Distributed Database
No ratings yet
Distributed Database
7 pages
Standard Template Library
No ratings yet
Standard Template Library
24 pages
6.0 Introduction To Real-Time Operating Systems (Rtos)
No ratings yet
6.0 Introduction To Real-Time Operating Systems (Rtos)
35 pages
CISSP Domain2 - 2024
No ratings yet
CISSP Domain2 - 2024
49 pages
Stack Class XII
No ratings yet
Stack Class XII
16 pages
Sequential Calibration of Options
No ratings yet
Sequential Calibration of Options
15 pages
DLL 17. Eapp Restaurant Review Nov7
No ratings yet
DLL 17. Eapp Restaurant Review Nov7
2 pages
BSL Method Statememt For Lifting Thermal Tank - 1
No ratings yet
BSL Method Statememt For Lifting Thermal Tank - 1
4 pages
Basic Operation of A Lathe
No ratings yet
Basic Operation of A Lathe
6 pages
Evisa Series
No ratings yet
Evisa Series
40 pages
Finding Nemo Thesis Statement
100% (3)
Finding Nemo Thesis Statement
5 pages
Similes Metaphors Activities
No ratings yet
Similes Metaphors Activities
4 pages
Blept Reviewer
No ratings yet
Blept Reviewer
7 pages
Compo Boxlsx
No ratings yet
Compo Boxlsx
4 pages
Chapter (7) Beams: Revision
No ratings yet
Chapter (7) Beams: Revision
12 pages
DIASS WEEK 6 MODULE - Communication
No ratings yet
DIASS WEEK 6 MODULE - Communication
3 pages
Badri Engineering Corporation: One Stop Shop For All Your Industrial Needs
No ratings yet
Badri Engineering Corporation: One Stop Shop For All Your Industrial Needs
2 pages
CHN Lecture - 2
No ratings yet
CHN Lecture - 2
13 pages
Annexure 6
No ratings yet
Annexure 6
1 page
MASONRY Handout
100% (1)
MASONRY Handout
20 pages
Adding and Subtracting Algebraic Terms
No ratings yet
Adding and Subtracting Algebraic Terms
3 pages
To Identify Specific Information in A Text
No ratings yet
To Identify Specific Information in A Text
7 pages
Radex Radiator Heaters
No ratings yet
Radex Radiator Heaters
8 pages
ORGANELLE SPEED DATING With Profile Template (Shared)
No ratings yet
ORGANELLE SPEED DATING With Profile Template (Shared)
3 pages
Original Sin 2001 - Google Search
No ratings yet
Original Sin 2001 - Google Search
1 page
Steps To Create Process - Control in GENTRAN - DIRECTOR
No ratings yet
Steps To Create Process - Control in GENTRAN - DIRECTOR
8 pages
Heading Hints B KLT 2001
No ratings yet
Heading Hints B KLT 2001
63 pages
1647-Article Text-3369-1-10-20220320
No ratings yet
1647-Article Text-3369-1-10-20220320
5 pages
Smart Meter Display User Guide
No ratings yet
Smart Meter Display User Guide
11 pages
Krystal Crystallizer
83% (6)
Krystal Crystallizer
24 pages