Distributed Deadlock Detection

This document discusses distributed deadlock detection in distributed systems. It covers key topics such as: 1) Defining deadlock and the four necessary conditions for deadlock to occur. 2) Modeling process-resource interactions using resource allocation graphs and wait-for graphs to represent system state and detect cycles that indicate deadlock. 3) Approaches to handling deadlocks including prevention, avoidance, and detection with recovery by breaking dependency cycles. 4) Centralized, distributed, and hierarchical control organizations for maintaining wait-for graph information and detecting cycles across a distributed system.

Uploaded by

comp.enginer

Synchronization:

Distributed Deadlock Detection

DDD: Introduction
• In distributed systems, a process can request and
release resources (local or remote) in any order, and
that order may not be known a priori. A process can also
request some resources while holding others.
• If the sequence of the allocation of resources to
processes is not controlled in such environments,
deadlocks can occur.
• The problem of deadlocks has been generally studied in
distributed systems under the following model:
– The systems have only reusable resources.
– Processes are allowed only exclusive access to resources.
– There is only one copy of each resource.
Introduction (contd.)
• A process can be in two states:
– running or blocked
• In the running state (also called the active state), a
process has all the needed resources and is either
executing or is ready for execution.
• In the blocked state, a process is waiting to acquire
some resources.
• Deadlock is a situation in which every process in a set is
blocked, waiting for another process in the set to release
a resource
• The following four conditions must hold simultaneously for
deadlock to occur:
1. Mutual exclusion
2. No preemption
3. Hold and wait
4. Circular wait
DDD: Resource vs. Communication Deadlocks
• Two types of deadlocks have been discussed:
– resource deadlock
• Processes can simultaneously wait for several resources and
cannot proceed until they have acquired all those resources
• A set of processes is resource-deadlocked if each process in the set
requests resources held by another process in the set and it must
receive all of the requested resources before it can become
unblocked
– communication deadlock
• Processes wait to communicate with other processes among a set
of processes
• A waiting process can unblock on receiving a communication from
any one of these processes
• A set of processes is communication deadlocked if each process in
the set is waiting to communicate with another process in the set
and no process in the set ever initiates any further communication
until it receives the communication for which it is waiting
DDD: A Graph-Theoretic Model
– The state of process-resource interaction in distributed systems
can be modeled by a bi-partite directed graph called a resource
allocation graph.
– The nodes of this graph are processes and resources of a
system, and the edges of the graph depict assignments or
pending requests.
– A pending request is represented by a request edge directed
from the node of a requesting process to the node of the
requested resource.
– A resource assignment is represented by an assignment edge
directed from the node of an assigned resource to the node of
the assigned process.
– A system is deadlocked if its resource allocation graph contains
a directed cycle or a knot.
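DDD: Resource Allocation Graph
[Figure: example resource allocation graph with processes P1 to P4 and resources R1 to R4, connected by assignment edges (resource to process) and request edges (process to resource).]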
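The relationship between the two graph models can be sketched in code. The following is a minimal Python sketch, assuming the single-copy, exclusive-access model stated earlier; the function name `rag_to_wfg` and the dictionary layout are illustrative choices, not part of the original slides. It collapses a resource allocation graph into a wait-for graph by replacing each "process requests resource" edge with an edge to the process that currently holds that resource.

```python
from collections import defaultdict


def rag_to_wfg(assignments, requests):
    """Collapse a resource allocation graph into a wait-for graph.

    assignments: dict mapping resource -> process holding it
                 (assignment edges; one holder per resource is assumed).
    requests:    iterable of (process, resource) pairs (request edges).
    Returns a dict mapping each waiting process to the set of processes
    it is waiting for.
    """
    wfg = defaultdict(set)
    for proc, res in requests:
        holder = assignments.get(res)
        # A request for a free resource creates no wait-for edge.
        if holder is not None and holder != proc:
            wfg[proc].add(holder)
    return dict(wfg)


# Hypothetical state: P1 holds R1 and wants R2; P2 holds R2 and wants R1.
assignments = {"R1": "P1", "R2": "P2"}
requests = [("P1", "R2"), ("P2", "R1")]
wfg = rag_to_wfg(assignments, requests)
print(wfg)
```

In this hypothetical state each process ends up waiting for the other, which is the two-process cycle that the wait-for graph model is meant to expose.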
DDD: Wait-For Graph
• Wait-For Graphs:
– In distributed systems, the system state can be
modeled or represented by a directed graph, called a
wait-for graph (WFG)
– In a WFG, nodes are processes and there is a
directed edge from node P1 to node P2 if P1 is
blocked and is waiting for P2 to release some
resource
– A system is deadlocked if and only if there is a
directed cycle or knot (depending upon the underlying
model) in the WFG
DDD: Wait-For Graph (example)
[Figure: example wait-for graph over processes P1, P2, P3, and P4.]
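Searching a WFG for a directed cycle can be sketched with a depth-first search. This is a minimal Python sketch, not the slides' own procedure; `find_cycle` and the example graph are assumptions made for illustration. It covers the cycle criterion only; under models where a knot is the deadlock condition, a different check would be needed.

```python
def find_cycle(wfg):
    """Return a list of processes forming a directed cycle, or None.

    wfg: dict mapping a process to an iterable of processes it waits for.
    """
    WHITE, GREY, BLACK = 0, 1, 2  # unvisited, on current DFS path, finished
    nodes = set(wfg) | {q for targets in wfg.values() for q in targets}
    color = {p: WHITE for p in nodes}
    path = []

    def visit(p):
        color[p] = GREY
        path.append(p)
        for q in wfg.get(p, ()):
            if color[q] == GREY:
                # q is on the current path, so path[q:] closes a cycle.
                return path[path.index(q):]
            if color[q] == WHITE:
                cycle = visit(q)
                if cycle:
                    return cycle
        path.pop()
        color[p] = BLACK
        return None

    for p in sorted(nodes):
        if color[p] == WHITE:
            cycle = visit(p)
            if cycle:
                return cycle
    return None


# Hypothetical WFG: P1 -> P2 -> P3 -> P1, with P4 waiting on P1.
example = {"P1": ["P2"], "P2": ["P3"], "P3": ["P1"], "P4": ["P1"]}
print(find_cycle(example))
```

Note that P4 is blocked by the cycle but is not itself part of it, so it does not appear in the returned list.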
DDD: Deadlock Handling Strategies in DS
• There are three strategies to handle deadlock
– Deadlock Prevention
– Deadlock Avoidance
– Deadlock Detection and Recovery
• Deadlock handling is complicated to implement
in DS because no one site has accurate
knowledge of the current state of the system and
because every inter-site communication involves
a finite and unpredictable delay
DDD: Deadlock Prevention
• Deadlock can be prevented in a DS through the following approaches:
– Impose a linear ordering on the resources and require processes to request
them in that order; this is simple to implement and has low overhead
– Have a process acquire all the needed resources simultaneously before it
begins execution, or preempt a process that holds the needed resources
– Use timestamps and priorities together with resource preemption
– To control preemption, each process is assigned a unique priority
– The priorities decide whether a process Pi that requests a resource held by Pj
waits or is rolled back: Pi waits only if it has a higher priority than Pj;
otherwise Pi is rolled back
– This prevents deadlock because for any edge Pi → Pj in the wait-for graph,
Pi has a higher priority than Pj, so a cycle cannot exist
– The drawback is that a low-priority process may be rolled back again and again
– This can be avoided by using timestamps as priorities; two schemes can be used,
as follows:
Deadlock Prevention (contd.)
• The Wait-Die Scheme:
– A non-preemptive approach
– When process Pi requests a resource currently held
by Pj, Pi is allowed to wait only if it has a smaller
timestamp than does Pj (Pi is older than Pj)
– Otherwise Pi is rolled back
• The Wound-Wait scheme:
– A preemptive approach
– When process Pi requests a resource currently held
by Pj, Pi is allowed to wait only if it has a larger
timestamp than Pj (Pi is younger than Pj)
– Otherwise Pj is rolled back (Pj is wounded by Pi)
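The two timestamp rules can be written as small decision functions. The following Python sketch follows the rules exactly as stated above; the function names and the returned strings are illustrative assumptions, and a smaller timestamp is taken to mean an older process.

```python
def wait_die(ts_requester, ts_holder):
    """Wait-Die (non-preemptive): an older requester waits, a younger one dies."""
    if ts_requester < ts_holder:      # requester Pi is older than holder Pj
        return "requester waits"
    return "requester rolled back"    # Pi "dies"; Pj keeps the resource


def wound_wait(ts_requester, ts_holder):
    """Wound-Wait (preemptive): a younger requester waits, an older one wounds."""
    if ts_requester > ts_holder:      # requester Pi is younger than holder Pj
        return "requester waits"
    return "holder rolled back"       # Pj is "wounded"; Pi takes the resource


# Hypothetical timestamps: 5 is older than 9.
print(wait_die(5, 9), "|", wait_die(9, 5))
print(wound_wait(5, 9), "|", wound_wait(9, 5))
```

In both schemes every wait-for edge points in one direction of the timestamp order (old to young in Wait-Die, young to old in Wound-Wait), which is why no cycle can form.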
DDD: Deadlock Avoidance
• A resource is granted to a process if the resulting global
state is safe ( a global state includes all the processes
and resources of the DS)
• Deadlock avoidance is practically impossible to implement in a DS because
– Every site has to maintain information on the global state of the
system, which translates into huge storage requirements and
extensive communication costs
– The process of checking for a safe global state must be mutually
exclusive, because if several sites concurrently perform checks
for a safe state they may all find the state safe but the net global
state may not be safe
– Due to the large number of processes and resources it will be
computationally expensive to check for a safe state
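To make the idea of a "safe state" concrete, here is a minimal Python sketch of a Banker's-style safety check. The slides do not name a specific algorithm, so this choice, the function name `is_safe`, and the vector layout are all assumptions. A state is treated as safe if the processes can be ordered so that each one can obtain its remaining maximum need from what is free plus what earlier processes release.

```python
def is_safe(available, allocation, max_need):
    """Banker's-style safety check (an assumed technique, not from the slides).

    available:  list of free units per resource type.
    allocation: dict process -> list of units currently held.
    max_need:   dict process -> list of maximum units it may ever need.
    """
    work = list(available)
    finished = {p: False for p in allocation}
    progress = True
    while progress:
        progress = False
        for p in allocation:
            if finished[p]:
                continue
            need = [m - a for m, a in zip(max_need[p], allocation[p])]
            if all(n <= w for n, w in zip(need, work)):
                # p can run to completion and then release what it holds.
                work = [w + a for w, a in zip(work, allocation[p])]
                finished[p] = True
                progress = True
    return all(finished.values())


# Hypothetical system with two single-copy resources R1 and R2.
max_need = {"P1": [1, 1], "P2": [1, 1]}
before = is_safe([0, 1], {"P1": [1, 0], "P2": [0, 0]}, max_need)
after_grant = is_safe([0, 0], {"P1": [1, 0], "P2": [0, 1]}, max_need)
print(before, after_grant)
```

Granting R2 to P2 in this example leads to an unsafe state, so an avoidance scheme would refuse the request. Running such a check in a DS requires exactly the consistent global state that the points above describe as expensive to obtain.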
DDD: Deadlock Detection and Recovery
• Requires an examination of the status of
process-resource interaction for the presence
of a cyclical wait
• Two favorable conditions hold in a DS:
– Once a cycle is formed in the WFG, it persists
until it is detected and broken, and
– Cycle detection can proceed concurrently with the
normal activities of a system
• We'll study the techniques to detect deadlock in a
DS instead of trying to prevent or avoid it
Detection and Recovery (contd.)
• Deadlock detection and resolution entails addressing two
basic issues:
– first, detection of existing deadlocks, and
– second, resolution of detected deadlocks.
• The detection of deadlocks involves two issues:
– maintenance of the WFG and
– search of the WFG for the presence of cycles (or knots)
• In distributed systems, a cycle may involve several sites,
so the search for cycles greatly depends upon how the
WFG of the system is represented across the system
• Depending upon the manner in which WFG information
is maintained and the search for cycles is carried out,
there are centralized, distributed, and hierarchical
algorithms for deadlock detection in distributed systems.
Detection and Recovery (contd.)
• Deadlock resolution involves breaking existing
wait-for dependencies in the system WFG to
resolve the deadlock
• It involves rolling back one or more processes
that are deadlocked and assigning their
resources to blocked processes in the deadlock
so that they can resume execution
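Resolution can be sketched as choosing a victim on the detected cycle and removing its wait-for edges. The following Python sketch assumes a "roll back the youngest process" policy; that policy, the function name `resolve`, and the data layout are illustrative assumptions, since the slides do not prescribe how the victim is chosen.

```python
def resolve(wfg, cycle, timestamps):
    """Break a deadlock cycle by rolling back one victim.

    wfg:        dict process -> set of processes it waits for.
    cycle:      list of processes on a detected cycle.
    timestamps: dict process -> start timestamp (larger means younger).
    Returns (victim, new_wfg) with every edge to or from the victim removed.
    """
    # Assumed policy: abort the youngest process on the cycle.
    victim = max(cycle, key=lambda p: timestamps[p])
    new_wfg = {
        p: {q for q in targets if q != victim}
        for p, targets in wfg.items()
        if p != victim
    }
    return victim, new_wfg


# Hypothetical three-process cycle.
wfg = {"P1": {"P2"}, "P2": {"P3"}, "P3": {"P1"}}
timestamps = {"P1": 10, "P2": 20, "P3": 30}
victim, remaining = resolve(wfg, ["P1", "P2", "P3"], timestamps)
print(victim, remaining)
```

After the victim's resources are released, P2 no longer waits for anyone and can resume, which in turn lets P1 proceed.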
DDD: Control Organizations
• Centralized Control
– A designated control site is responsible for constructing the global WFG
and searching it for cycles
– However, the single point of failure, congested links around the control
site, and continuous message generation for deadlock detection are demerits.
• Distributed Control
– Responsibility for detecting a global deadlock is shared equally among all sites
– Deadlock detection is initiated only when a waiting process is
suspected to be a part of a deadlock cycle.
– However, such systems are difficult to design, several sites may
initiate detection for the same deadlock, proof of correctness is
difficult, and deadlock resolution is cumbersome.
• Hierarchical Control
– A site detects deadlocks involving only its descendant sites
– Exploits access patterns local to a cluster of sites to efficiently detect
deadlocks.
– However, the control is defeated if most deadlocks span several
clusters.
Centralized Deadlock Detection
• The Completely Centralized Algorithm
• The Ho-Ramamoorthy Algorithms
The Completely Centralized Algorithm
• A designated site called the Control Site is
provided which maintains the WFG of the entire
system
– It checks the WFG for the existence of deadlock cycles
whenever a request edge is added to the WFG.
• Sites request and release resources, whether local
or remote, by sending Request and Release messages
to the control site.
• However, it is highly inefficient due to
concentration of all messages.
– It imposes larger delays, large communication
overhead, and the congestion of communication links.
– Moreover, the reliability is poor due to single point of
failure.
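The behavior of the control site can be sketched as a small class. This Python sketch is a simplification under stated assumptions: one copy of each resource, at most one outstanding request per process, and no hand-over of a released resource to waiting processes. The class name `ControlSite` and its methods are hypothetical.

```python
class ControlSite:
    """Toy control site that tracks the global state and checks for cycles."""

    def __init__(self):
        self.holder = {}    # resource -> process currently holding it
        self.waiting = {}   # process -> resource it is blocked on (assumed: one)

    def request(self, proc, res):
        """Handle a Request message; check for a cycle when an edge is added."""
        owner = self.holder.get(res)
        if owner is None:
            self.holder[res] = proc
            return "granted"
        self.waiting[proc] = res          # adds a request edge to the graph
        return "deadlock" if self._on_cycle(proc) else "blocked"

    def release(self, proc, res):
        """Handle a Release message (hand-over to waiters is omitted)."""
        if self.holder.get(res) == proc:
            del self.holder[res]

    def _on_cycle(self, start):
        # Each process has at most one outgoing wait-for edge, so following
        # the chain of holders either returns to start or ends.
        seen = set()
        p = start
        while p in self.waiting:
            p = self.holder.get(self.waiting[p])
            if p is None:
                return False
            if p == start:
                return True
            if p in seen:
                return False
            seen.add(p)
        return False


cs = ControlSite()
results = [
    cs.request("P1", "R1"),
    cs.request("P2", "R2"),
    cs.request("P1", "R2"),
    cs.request("P2", "R1"),
]
print(results)
```

Every request and release in the system passes through this one object, which illustrates both the simplicity of the scheme and why it becomes a bottleneck and a single point of failure.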
The Ho-Ramamoorthy Algorithms
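• Two proposals to address the problems of the completely centralized
algorithm
– The Two-Phase Algorithm
• Every site maintains a status table containing the status of all the
processes initiated at that site (resources locked and resources awaited).
• Periodically, a designated site requests the status table from all sites,
constructs a WFG from the reports, and searches it for cycles.
• If a cycle is found, the designated site requests the status tables again
and builds a WFG from only the entries common to both reports; if the
same cycle appears again, it declares a deadlock.
• The drawback is that it may still report a false deadlock.
– The One-Phase Algorithm
• It requests only one status report from each site; however, each site
maintains two status tables: a resource table and a process table.
• The resource table keeps track of the transactions that have locked or are
waiting for resources at that site, whereas the process table keeps track of
the resources locked by or awaited by the transactions at that site.
• Periodically, a designated site requests both tables from every
site and constructs a WFG using only the information on which both tables agree.
• If no cycle is found, then the system is not deadlocked; otherwise a
deadlock is declared.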
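The filtering idea behind the two-phase algorithm can be sketched as follows. This Python sketch assumes each report has already been reduced to a set of (waiter, holder) wait-for edges, which is a simplification of the status tables; `has_cycle` and `two_phase_detect` are hypothetical names.

```python
def has_cycle(edges):
    """True if the directed graph given by (waiter, holder) edges has a cycle.

    Repeatedly discards edges that point at a process which waits for nobody;
    if nothing can be discarded and edges remain, a cycle must exist.
    """
    edges = set(edges)
    while edges:
        waiters = {a for a, _ in edges}
        pruned = {(a, b) for a, b in edges if b in waiters}
        if pruned == edges:
            return True
        edges = pruned
    return False


def two_phase_detect(first_report, second_report):
    """Declare deadlock only if a cycle survives in both reports."""
    if not has_cycle(first_report):
        return False
    persistent = set(first_report) & set(second_report)
    return has_cycle(persistent)


# Hypothetical reports: the edge T3 -> T1 disappears before the second round.
first = {("T1", "T2"), ("T2", "T3"), ("T3", "T1")}
second = {("T1", "T2"), ("T2", "T3")}
print(two_phase_detect(first, second), two_phase_detect(first, first))
```

Keeping only edges present in both rounds filters out wait-for edges that were already gone by the second round. As noted above, this does not rule out every false deadlock.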
Distributed Deadlock Detection Algorithms
• A Path-Pushing Algorithm
• An Edge-Chasing Algorithm
• A Diffusion Computation Based Algorithm
• A Global State Detection Based Algorithm
An Overview
• All sites collectively cooperate to detect a cycle in
the state graph that is likely to be distributed over
several sites of the system.
• The algorithm can be initiated whenever a
process is forced to wait.
• The algorithm can be initiated either by the local
site of the process or by the site where the
process waits.
• These algorithms can be divided into four classes:
– Path-pushing algorithm
– Edge-chasing algorithm
– Diffusion computation algorithm
– Global state detection algorithm
An Edge-Chasing Algorithm
• To determine whether a blocked process Pi is deadlocked:
• If Pi is locally dependent on itself, then declare a deadlock
• Else, for all Pj and Pk such that
– Pi is locally dependent upon Pj, and
– Pj is waiting on Pk, and
– Pj and Pk are on different sites,
send probe(i, j, k) to the home site of Pk
• On receipt of probe(i, j, k), the site takes the following actions:
• If
– Pk is blocked, and
– dependent_k(i) is false, and
– Pk has not replied to all requests of Pj,
• then
begin
    dependent_k(i) = true;
    if k = i then declare that Pi is deadlocked
    else for all Pm and Pn such that
        Pk is locally dependent upon Pm, and
        Pm is waiting on Pn, and
        Pm and Pn are on different sites,
        send probe(i, m, n) to the home site of Pn
end
A Pictorial Example
[Figure: edge-chasing example with processes P1 to P10 spread over sites S1, S2, and S3. The probes shown are Probe (1,3,4), Probe (1,6,8), Probe (1,7,10), and Probe (1,9,1).]
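The probe rules can be simulated in a single process to see which probes are generated. This is a minimal Python sketch under stated assumptions: message passing is replaced by a queue, "Pk has not replied to all requests of Pj" is modeled by the edge Pj → Pk still being present, and a process that itself waits on a remote process may send the probe. The wait-for edges below are an assumed topology chosen to be consistent with the probe labels in the example; the intra-site edges in particular are guesses, not read from the figure.

```python
from collections import deque


def local_reach(p, waits_for, site_of):
    """Processes reachable from p through one or more same-site edges."""
    seen, todo = set(), [p]
    while todo:
        cur = todo.pop()
        for nxt in waits_for.get(cur, ()):
            if site_of[nxt] == site_of[cur] and nxt not in seen:
                seen.add(nxt)
                todo.append(nxt)
    return seen


def outgoing_probes(i, start, waits_for, site_of):
    """Probes (i, j, k) for every inter-site edge Pj -> Pk reachable locally."""
    senders = local_reach(start, waits_for, site_of) | {start}
    return [(i, j, k)
            for j in sorted(senders)
            for k in sorted(waits_for.get(j, ()))
            if site_of[j] != site_of[k]]


def detect(i, waits_for, site_of):
    """Return (deadlocked, probes_sent) for initiator Pi."""
    if i in local_reach(i, waits_for, site_of):
        return True, []                      # purely local deadlock
    dependent = {}                           # k -> initiators with dependent_k(i) set
    sent = []
    queue = deque(outgoing_probes(i, i, waits_for, site_of))
    while queue:
        probe = queue.popleft()
        sent.append(probe)
        _, _, k = probe
        blocked = bool(waits_for.get(k))
        if blocked and i not in dependent.setdefault(k, set()):
            dependent[k].add(i)
            if k == i:
                return True, sent            # the probe came back to Pi
            queue.extend(outgoing_probes(i, k, waits_for, site_of))
    return False, sent


# Assumed topology: S1 = {1, 2, 3}, S2 = {4, 5, 6, 7}, S3 = {8, 9, 10}.
site_of = {1: "S1", 2: "S1", 3: "S1", 4: "S2", 5: "S2",
           6: "S2", 7: "S2", 8: "S3", 9: "S3", 10: "S3"}
waits_for = {1: {2}, 2: {3}, 3: {4}, 4: {5}, 5: {6, 7},
             6: {8}, 7: {10}, 8: {9}, 10: {9}, 9: {1}}
deadlocked, probes = detect(1, waits_for, site_of)
print(deadlocked, probes)
```

Under this assumed topology the simulation produces the same four probe labels as the example, and P1 is declared deadlocked when probe (1, 9, 1) arrives. The `dependent` table ensures each process forwards probes for a given initiator at most once.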
Hierarchical Deadlock Detection Algorithms
• The Menasce-Muntz Algorithm
• The Ho-Ramamoorthy Algorithm
An Overview
• In hierarchical deadlock detection algorithms, sites are
arranged in a hierarchical fashion, and a site detects
deadlocks involving only its descendant sites
• Hierarchical algorithms exploit access patterns local to a
cluster of sites to efficiently detect deadlocks
• However, hierarchical deadlock detection algorithms
require special care while arranging the sites in a
hierarchy
• For efficiency, most deadlocks should be localized to as
few clusters as possible - the objective of hierarchical
control is defeated if most deadlocks span several clusters
• Two hierarchical algorithms are:
– The Menasce-Muntz Algorithm
– The Ho-Ramamoorthy Algorithm
The Ho-Ramamoorthy Algorithm
• Sites are grouped into several disjoint clusters.
• Periodically, a site is chosen as a central control site,
which dynamically chooses a control site for each cluster.
• The central control site requests from every control site
their intercluster transaction status information and wait-for
relation.
• A control site collects status tables from all the sites in its
cluster and applies the one-phase deadlock detection
algorithm.
• It then sends status information and wait-for relation to the
central control site.
• Finally, the central site constructs a system WFG and
searches it for cycles.
A Pictorial Example
[Figure: a central site connected to three cluster control sites.]
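The two-level search can be sketched as follows. This Python sketch makes several simplifying assumptions: each site's report is already a set of (waiter, holder) edges, a cluster control site forwards its whole wait-for relation to the central site, and the choice of control sites is not modeled. The names `has_cycle` and `hierarchical_detect` are hypothetical.

```python
def has_cycle(edges):
    """True if the directed graph given by (waiter, holder) edges has a cycle."""
    edges = set(edges)
    while edges:
        waiters = {a for a, _ in edges}
        # Drop edges that point at a transaction which waits for nobody.
        pruned = {(a, b) for a, b in edges if b in waiters}
        if pruned == edges:
            return True
        edges = pruned
    return False


def hierarchical_detect(site_edges, cluster_of):
    """Search for cycles per cluster first, then at the central site.

    site_edges: dict site -> set of (waiter, holder) edges reported by it.
    cluster_of: dict site -> cluster identifier.
    """
    per_cluster = {}
    for site, edges in site_edges.items():
        per_cluster.setdefault(cluster_of[site], set()).update(edges)
    # Each cluster control site searches only its own cluster's relation.
    local = sorted(c for c, e in per_cluster.items() if has_cycle(e))
    # The central site searches the system WFG built from all clusters.
    system_wfg = set()
    for edges in per_cluster.values():
        system_wfg |= edges
    return {"cluster_deadlocks": local, "global_deadlock": has_cycle(system_wfg)}


# Hypothetical system: cluster A = {S1, S2}, cluster B = {S3}.
cluster_of = {"S1": "A", "S2": "A", "S3": "B"}
site_edges = {
    "S1": {("T1", "T2")},
    "S2": {("T2", "T3")},
    "S3": {("T3", "T1")},
}
result = hierarchical_detect(site_edges, cluster_of)
print(result)
```

In this example neither cluster sees a cycle on its own, and only the central site finds the deadlock. This is the case the overview warns about: the benefit of the hierarchy is lost when most deadlocks span several clusters.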

Algorithms: Perspectives
• Theory of Correctness
– A formal proof of correctness is nontrivial
• The transaction wait-for (TWF) graph and its deadlock cycles can form in
innumerable ways, which are difficult to enumerate
• Deadlock is very sensitive to the timing of requests
• Message delays are unpredictable
• Performance
– The number of messages exchanged may not be a true indicator of
performance, as it varies across algorithms
– The persistence of deadlocks results in wasteful utilization of
resources; therefore, the average time a deadlock persists can be
an important measure
– Other measures include storage overhead, processing overhead,
resource holding time, etc.
• Deadlock Resolution
– A deadlock is resolved by aborting at least one process and
granting the released resources to other processes.
