Possible Types of Failure

Possible types of failure in a distributed system include: 1. Site failures which can cause loss of volatile storage or non-volatile storage. 2. Communication failures such as lost messages or network partitions that divide the network into disconnected subnetworks. 3. Different communication structures like centralized, hierarchical, linear, or distributed can be used between sites.

Uploaded by

Amandeep Singh

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views

Possible Types of Failure

Uploaded by

Amandeep Singh

Available Formats

Download as PPT, PDF, TXT or read online on Scribd

You are on page 1/ 16

Possible types of failure

1. Site failure
2. Communication failures
1. Lost messages
2. Network partitions
Possible types of failure
1. Site failure
2. 5.1.2
3. 6.1
4. 7.1.2.4
5. 9.1.2
1.Site faliures are those failures which can occur at each
site ..the are calssified as .
a. Failures without loss of information-all the information
stored in memory is available for recovery eg.division
by zero error.
b. Failure with loss of volatile storage—the content of
main memory is lost,but information recoded on disk is
not affected.eg.system crash.
c. Failure with non volatile storage—the content of disk
storage is not lost .eg. Head crash
d. Failure with loss of stable storage—some info stored in
stable storage is lost because of several ,simultaneous
of third type
• Failure can also occur in the communication between the sites
• When a message is sent from site x to site why we require from
the comunication network th follwing behaviour
1. X receives a positive acknowledgement after a delay which is less
then same maximum dealay dmax
2. Message is delivered at y in proper sequence with respect to
other x y messages
3. The message is correct.

If after a delay of dmax ,site x has not received an acknoledgement

then may the message is not lost or acknoledgement is lost
• There can also be a network partition—
In it the network is partition intoo two or more
completely disconnected sub networks, one
including x and one including y. all the oprational
sites which belong to the same network can
communicate with each other, how ever they
can not communicate with sites which belong to
a differnet subnetwork untill the partition is
repaired.
Differnet communication structure
for two pc
• 1. centerlized communiication structure—
• The communication is always performed
between the coordinator dtm-agent and
the participants .but not betweenn
participants directly
2.Hierarchical communication structure—
The coordinator is the dtm agent at the root of the
tree.the communication between the coordinator
and participants is performed not by directly
broadcast , but by propagating he messages up
and down the tree. Each dtm agents which is
internal node of the communication tree gets the
message from its son and broadcast messages
to them
Col
3.Linear communication structure—
In leanear protocol an ordering of the sites is
defined, so that each site excpt the first
abd the last one has a predessor and
successor .instead of broadcasting a
message from the cordonator to all other
participants ,the message is passed from
each participants to its successsor.
• Distributed communication structure –
• It requires tht each dtm-agent communicate with
each other participants .the no of messages
which are needed by a distributed protocol is
much more greater thn the no. of messages
which is required by centerlised or heiraichical
structure..these protocols are suitable for those
network which are cheAP like local network.
Check pointing reduces the
overhead of log based recovery
• When a failure with loss of voltile stoahe occurs ,
a recovery proocedure reads the log file and
perform the following operation
1-- determine all non committed transaction tht hav
to be undone ie which hav a begin_transaction
record in the log file, without having a commit or
abort record
2 Determine all the which need to be redone .
This is all transaction which hav a commit record
in log files.to distiguish transactions which need
to be redone frm those which do not,checkpoints
are used
• Undo the transaction determine at step 1
and redo the transaction determined at
step 2.
• Checkpoints are operation which are
predically perform in order to simplify the
first two steps of the recovery
procedure.performing the check points
require the following operation.
• Writing to stable storage all log records
and all database updates which still in
volitile storage
• Writing to stable storge a ccheck point
record .it is an indication of transaction
which are active at the time whn
checkpoint is done.\
• Step1 and step 2 of recovery procedure
are now substitute by the following
• Find and read the last check point record
• Put all transaction written in the checkpt.
Record into the undo set.which contains
the transaction to be undone.
• Read the log file strting frm the checkpoint
record untill its end.
• If a begin_transaction is found it put into
undo set,if a commit record is found it put
into redoset.
Diff between availability and
reliability
• One of the advantage of distributed
datbase is increase the reliability and
availability
• Reliability-is defined as the probability tht
the system is running (not down) at a
certain time point.
• Availability-is the probabilty tht the system
is continusly available during a time
interval.
• Increased reliability and availibility ensures
gracefull degradation property..when the
data and dbms s/w are distributed over
serval sites,one site may failure while
other site countinue to operate.only the
data and s/w of failed site can not be
accessed .further improvement is
achieved by judiciously replicating data
and s/w at more then one site.

100+ ChatGPT Prompts For Software Developers - by Aruva - Empowering Ideas - Medium
No ratings yet
100+ ChatGPT Prompts For Software Developers - by Aruva - Empowering Ideas - Medium
20 pages
Octavian Jackpot Controller - User Manual
100% (3)
Octavian Jackpot Controller - User Manual
45 pages
Unit 4 - DSRM
No ratings yet
Unit 4 - DSRM
5 pages
6CS5_DS_Unit-5
No ratings yet
6CS5_DS_Unit-5
34 pages
1904050001
No ratings yet
1904050001
119 pages
Unit 4_Deadlock Handling & Recovery Techniques & Failuere Classification
No ratings yet
Unit 4_Deadlock Handling & Recovery Techniques & Failuere Classification
55 pages
Lecture 7 PDC
No ratings yet
Lecture 7 PDC
8 pages
CS 194: Distributed Systems
No ratings yet
CS 194: Distributed Systems
15 pages
DS CH7 - Fault Tolerance
No ratings yet
DS CH7 - Fault Tolerance
17 pages
System Recovery
No ratings yet
System Recovery
38 pages
DS UNIT-3 Saqs Laqs (Complete)
No ratings yet
DS UNIT-3 Saqs Laqs (Complete)
16 pages
Unit-3 Part2
No ratings yet
Unit-3 Part2
74 pages
Research Paper
No ratings yet
Research Paper
63 pages
DC Unit 4 Important
No ratings yet
DC Unit 4 Important
6 pages
Distributed Recovery Management: UNIT-4
No ratings yet
Distributed Recovery Management: UNIT-4
31 pages
Assignment 4 - 044
No ratings yet
Assignment 4 - 044
4 pages
4th Unit Topics Recovery
No ratings yet
4th Unit Topics Recovery
73 pages
Ds chapter 7 (2)
No ratings yet
Ds chapter 7 (2)
21 pages
Chapter 7
No ratings yet
Chapter 7
26 pages
Distributed Systems - Fault Tolerance
No ratings yet
Distributed Systems - Fault Tolerance
21 pages
6CS5 DS Unit-5
No ratings yet
6CS5 DS Unit-5
34 pages
Trust Based Node Recovery and Checkpointing Techniques in Manets
No ratings yet
Trust Based Node Recovery and Checkpointing Techniques in Manets
6 pages
Chapter 3
No ratings yet
Chapter 3
40 pages
Fault Tolerance:-: Introduction, Process Resilience, Distributed Commit, Recovery
No ratings yet
Fault Tolerance:-: Introduction, Process Resilience, Distributed Commit, Recovery
52 pages
Distributed Dbms Advanced Concepts
No ratings yet
Distributed Dbms Advanced Concepts
70 pages
15-440 Distributed Systems: Fault Tolerance, Logging and Recovery Thursday Oct 8, 2015
No ratings yet
15-440 Distributed Systems: Fault Tolerance, Logging and Recovery Thursday Oct 8, 2015
30 pages
Distributed Computing QB Answers
No ratings yet
Distributed Computing QB Answers
15 pages
DisSys Lec7
No ratings yet
DisSys Lec7
48 pages
Unit 5
No ratings yet
Unit 5
12 pages
Intro To DS Chapter 6
No ratings yet
Intro To DS Chapter 6
51 pages
Chapter 8-Fault Tolerance
No ratings yet
Chapter 8-Fault Tolerance
30 pages
Failure Recovery in Distributed Systems
No ratings yet
Failure Recovery in Distributed Systems
24 pages
Ddbs Checkpointing ... Ddbs Checkpointing ... : Phase 1 at Css Phase 2 at CC
No ratings yet
Ddbs Checkpointing ... Ddbs Checkpointing ... : Phase 1 at Css Phase 2 at CC
9 pages
Fault Tolerance: Click To Add Text Dealing Successfully With Partial System. Key Technique: Redundancy
No ratings yet
Fault Tolerance: Click To Add Text Dealing Successfully With Partial System. Key Technique: Redundancy
48 pages
unit 4
No ratings yet
unit 4
94 pages
Distributed Computing: Farhad Muhammad Riaz
No ratings yet
Distributed Computing: Farhad Muhammad Riaz
18 pages
Presentation On Consistent Checkpoints & Recovery in Distributed System
100% (1)
Presentation On Consistent Checkpoints & Recovery in Distributed System
26 pages
5 Chapter Five
No ratings yet
5 Chapter Five
29 pages
A Review On Fault Tolerance in Distributed Database
No ratings yet
A Review On Fault Tolerance in Distributed Database
4 pages
Consensus
No ratings yet
Consensus
77 pages
Chapter_8-Fault_Tolerance (1)
No ratings yet
Chapter_8-Fault_Tolerance (1)
37 pages
2nd Hrlydistributed Database New
No ratings yet
2nd Hrlydistributed Database New
27 pages
CST402-SCHEME
No ratings yet
CST402-SCHEME
9 pages
Unit 3-1
No ratings yet
Unit 3-1
26 pages
CN-2 Conv
No ratings yet
CN-2 Conv
16 pages
CSE446 Lecture 4
No ratings yet
CSE446 Lecture 4
32 pages
dis sys
No ratings yet
dis sys
16 pages
unit 4
No ratings yet
unit 4
24 pages
Distributed 5
No ratings yet
Distributed 5
5 pages
Distributed DBMS Reliability Unit IV
100% (1)
Distributed DBMS Reliability Unit IV
27 pages
DS unit_4
No ratings yet
DS unit_4
20 pages
Unit 4 Part 2
No ratings yet
Unit 4 Part 2
21 pages
Distributed Failure Recovery
No ratings yet
Distributed Failure Recovery
30 pages
TOC and CN UT
No ratings yet
TOC and CN UT
1 page
DS IAT 3 Answer Key
No ratings yet
DS IAT 3 Answer Key
9 pages
Unit IV 2 Marks With Answer
No ratings yet
Unit IV 2 Marks With Answer
2 pages
Distributed Deadlocks Recovery Techniques
No ratings yet
Distributed Deadlocks Recovery Techniques
18 pages
Aos Assignment 2
No ratings yet
Aos Assignment 2
14 pages
Chapter 8-Fault Tolerance
100% (1)
Chapter 8-Fault Tolerance
71 pages
DS Chapter V8.0fault Tolerance
No ratings yet
DS Chapter V8.0fault Tolerance
23 pages
Dc-3551 Unit IV Notes
No ratings yet
Dc-3551 Unit IV Notes
32 pages
Breaking the Availability Barrier Ii: Achieving Century Uptimes with Active/Active Systems
From Everand
Breaking the Availability Barrier Ii: Achieving Century Uptimes with Active/Active Systems
Dr. Bruce Holenstein
No ratings yet
Characteristics of MIS
100% (13)
Characteristics of MIS
6 pages
MAXIMO Instructions 1-08v8
No ratings yet
MAXIMO Instructions 1-08v8
287 pages
2 Chapter02 PDF
No ratings yet
2 Chapter02 PDF
54 pages
Open ImS Core
No ratings yet
Open ImS Core
69 pages
JDE927 Server Manager Guide
No ratings yet
JDE927 Server Manager Guide
540 pages
VSP 4x DB Calculator
No ratings yet
VSP 4x DB Calculator
5 pages
Genesys PureEngage Solution Overview
No ratings yet
Genesys PureEngage Solution Overview
69 pages
Cloud Digital Leader Learning Path Quiz Solutions
No ratings yet
Cloud Digital Leader Learning Path Quiz Solutions
9 pages
Python-Django Report
No ratings yet
Python-Django Report
38 pages
FastReport Studio Programmers Manual EN
No ratings yet
FastReport Studio Programmers Manual EN
69 pages
20-07-28 - Doc99 Cv3074 Joint Case Management Statement
No ratings yet
20-07-28 - Doc99 Cv3074 Joint Case Management Statement
12 pages
Green Building Tools
No ratings yet
Green Building Tools
92 pages
FSBH
No ratings yet
FSBH
78 pages
Sap Basis
No ratings yet
Sap Basis
112 pages
General BAPI Interview Questions
No ratings yet
General BAPI Interview Questions
7 pages
Chapter 01: Types of Digital Data
No ratings yet
Chapter 01: Types of Digital Data
79 pages
Colegio de Sta. Teresa de Avila
No ratings yet
Colegio de Sta. Teresa de Avila
4 pages
Active Data Guard Reporting For Oracle E-Business Suite Release 12.1 Using Oracle 11g
No ratings yet
Active Data Guard Reporting For Oracle E-Business Suite Release 12.1 Using Oracle 11g
17 pages
Water-Utility Network Whitepaper
No ratings yet
Water-Utility Network Whitepaper
13 pages
Tushar Phadke: 2200 Waterview Parkway Apt#1938, Richardson, TX 75080 214-500-3474
No ratings yet
Tushar Phadke: 2200 Waterview Parkway Apt#1938, Richardson, TX 75080 214-500-3474
2 pages
Network 1sm
No ratings yet
Network 1sm
29 pages
Sample Questions - Tech Cons 4-6 17-06-2000
No ratings yet
Sample Questions - Tech Cons 4-6 17-06-2000
40 pages
Handling Third Party Vendor
No ratings yet
Handling Third Party Vendor
31 pages
Fresh Corner MADpdf
No ratings yet
Fresh Corner MADpdf
32 pages
PVC Specs
No ratings yet
PVC Specs
2 pages
Useful Netezza Queries and Tips
No ratings yet
Useful Netezza Queries and Tips
6 pages
Compare and Contrast File System With Database System.: Application Programmer
No ratings yet
Compare and Contrast File System With Database System.: Application Programmer
10 pages
Ghouse Moinuddin Mohammad Email: - Phone: (872) 228-9552 Professional Summary
No ratings yet
Ghouse Moinuddin Mohammad Email: - Phone: (872) 228-9552 Professional Summary
9 pages