[Slide figure: two clients issuing get, set, and cas (compare-and-swap) operations against a linearizable distributed system; a cas(x, expected, new) succeeds only when x currently holds the expected value.]
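To make the register semantics in these examples concrete, the following is a minimal single-process Python sketch (names are illustrative) of a register supporting get, set, and an atomic compare-and-swap. It captures only the sequential behaviour that a linearizable distributed implementation must preserve; it is not a replication protocol.

class Register:
    # Single-process sketch of the get/set/cas interface (not distributed).

    def __init__(self, value=None):
        self.value = value

    def get(self):
        return self.value

    def set(self, new_value):
        self.value = new_value

    def cas(self, expected, new_value):
        # Atomically update the value only if it currently equals `expected`.
        if self.value == expected:
            self.value = new_value
            return True
        return False

x = Register(1)
assert x.get() == 1
assert x.cas(1, 2) is True    # current value is 1, so the CAS succeeds
assert x.cas(0, 3) is False   # current value is 2, not 0, so the CAS fails
assert x.get() == 2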
Linearizability advantages:
▶ Makes a distributed system behave as if it were non-distributed
▶ Simple for applications to use
Downsides:
▶ Performance cost: lots of messages and waiting for responses
▶ Scalability limits: leader can be a bottleneck
▶ Availability problems: if you can’t contact a quorum of nodes, you can’t process any operations
Slide 137
As an example, consider the calendar app that you can find on most phones, tablets, and computers.
We would like the appointments and entries in this app to sync across all of our devices; in other words,
we want it to be replicated such that each device is a replica. Moreover, we would like to be able to
view, modify, and add calendar events even while a device is offline (e.g. due to poor mobile network
coverage). If the calendar app’s replication protocol was linearizable, this would not be possible, since
an offline device cannot communicate with a quorum of replicas.
Slide 138
Instead, calendar apps allow the user to read and write events in their calendar even while a device is
offline, and they sync any updates between devices sometime later, in the background, when an internet
connection is available. The video of this lecture includes a demonstration of offline updates to a calendar.
This trade-off is known as the CAP theorem (named after consistency, availability, and partition
tolerance), which states that if there is a network partition in a system, we must choose one of
the following options [Gilbert and Lynch, 2002]:
1. We can have linearizable consistency, but in this case, some replicas will not be able to respond to
requests because they cannot communicate with a quorum. Not being able to respond to requests
makes those nodes effectively unavailable.
2. We can allow replicas to respond to requests even if they cannot communicate with other replicas.
In this case, they continue to be available, but we cannot guarantee linearizability.
Sometimes the CAP theorem is formulated as a choice of “pick 2 out of 3”, but that framing is misleading.
A system can be both linearizable and available as long as there is no network partition, and the choice
is forced only in the presence of a partition [Kleppmann, 2015].
This trade-off is illustrated on Slide 139, where node C is unable to communicate with nodes A and
B. On A and B’s side of the partition, linearizable operations can continue as normal, because A and
B constitute a quorum. However, if C wants to read the value of x, it must either wait (potentially
indefinitely) until the network partition is repaired, or it must return its local value of x, which does not
reflect the value previously written by A on the other side of the partition.
The CAP theorem
A system can be either strongly Consistent (linearizable) or
Available in the presence of a network Partition
[Diagram: node A performs set(x, v1); nodes A and B, on one side of a network partition, subsequently read get(x) → v1; node C, on the other side of the partition, reads get(x) → v0.]
C must either wait indefinitely for the network to recover, or
return a potentially stale value
Slide 139
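To make this choice concrete, the following Python sketch contrasts the two options for a replica on the minority side of a partition. The quorum size, timeout, and replica interface are assumptions made for the sake of the example, not part of any real protocol.

import time

class PartitionedReplica:
    # Hedged sketch of the choice node C faces on Slide 139.

    def __init__(self, local_value, reachable_replicas, quorum_size):
        self.local_value = local_value        # last value this replica has seen
        self.reachable = reachable_replicas   # replicas it can currently contact
        self.quorum_size = quorum_size

    def linearizable_get(self, timeout_seconds):
        # Option 1: stay consistent. Block until a quorum is reachable.
        deadline = time.monotonic() + timeout_seconds
        while len(self.reachable) + 1 < self.quorum_size:   # +1 counts this node
            if time.monotonic() > deadline:
                raise TimeoutError("no quorum reachable; cannot serve the read")
            time.sleep(0.1)   # in reality: wait for the partition to heal
        return self._read_from_quorum()

    def eventually_consistent_get(self):
        # Option 2: stay available. Return the local, possibly stale, value.
        return self.local_value

    def _read_from_quorum(self):
        # Placeholder for a real quorum read (e.g. the ABD algorithm).
        return self.local_value

c = PartitionedReplica(local_value="v0", reachable_replicas=[], quorum_size=2)
print(c.eventually_consistent_get())      # returns "v0" immediately, but it may be stale
# c.linearizable_get(timeout_seconds=1)   # would raise TimeoutError while partitioned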
The calendar app chooses option 2: it forgoes linearizability in favour of allowing the user to continue
performing operations while a device is offline. Many other systems similarly make this choice for various
reasons.
The approach of allowing each replica to process both reads and writes based only on its local state,
and without waiting for communication with other replicas, is called optimistic replication. A variety of
consistency models have been proposed for optimistically replicated systems, with the best-known being
eventual consistency.
Eventual consistency is defined as: “if no new updates are made to an object, eventually all reads will
return the last updated value” [Vogels, 2009]. This is a very weak definition: what if the updates to an
object never stop, so the premise of the statement is never true? A slightly stronger consistency model
called strong eventual consistency, defined on Slide 140, is often more appropriate [Shapiro et al., 2011].
It is based on the idea that as two replicas communicate, they converge towards the same state.
Eventual consistency
Replicas process operations based only on their local state.
If there are no more updates, eventually all replicas will be in
the same state. (No guarantees how long it might take.)
Strong eventual consistency:
▶ Eventual delivery: every update made to one non-faulty replica is eventually processed by every non-faulty replica.
▶ Convergence: any two replicas that have processed the same set of updates are in the same state (even if updates were processed in a different order).
Properties:
▶ Does not require waiting for network communication
▶ Causal broadcast (or weaker) can disseminate updates
▶ Concurrent updates ⇒ conflicts need to be resolved
Slide 140
In both eventual consistency and strong eventual consistency, there is the possibility of different nodes
concurrently updating the same object, leading to conflicts (as previously discussed on Slide 95). Various
algorithms have been developed to resolve those conflicts automatically [Shapiro et al., 2011].
The lecture video shows an example of a conflict in the eventually consistent calendar app: on one
device, I update the time of an event, while concurrently on another device, I update the title of the
same event. After the two devices synchronise, the update of the time is applied to both devices, while
the update of the title is discarded. The state of the two devices therefore converges – at the cost of
a small amount of data loss. This is the last writer wins approach to conflict resolution that we have
seen on Slide 95 (assuming the update to the time is the “last” update in this example). A more refined
approach might merge the updates to the time and the title, as shown on Slide 143.
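The last-writer-wins behaviour seen in this demo can be sketched as follows; the integer timestamps and the event fields are illustrative (a real system might use Lamport timestamps). Whichever update carries the greater timestamp replaces the entire event, so the replicas converge regardless of the order in which they merge, but the losing update is discarded.

def lww_merge(update_a, update_b):
    # Keep the update with the greater (timestamp, node) pair; ties are broken
    # by node identifier so that all replicas pick the same winner.
    return max(update_a, update_b, key=lambda u: (u["ts"], u["node"]))

update_time  = {"ts": 2, "node": "A",
                "event": {"title": "Lecture", "time": "10:00"}}    # time changed
update_title = {"ts": 1, "node": "B",
                "event": {"title": "Lecture 1", "time": "12:00"}}  # title changed

# Both merge orders pick the same winner, so the replicas converge...
assert lww_merge(update_time, update_title) == lww_merge(update_title, update_time)
# ...but the title change from node B is lost:
print(lww_merge(update_time, update_title)["event"])   # {'title': 'Lecture', 'time': '10:00'}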
This brings us to the end of our discussion of consistency models. Slide 141 summarises some of the
key properties of the models we have seen, in descending order of the minimum strength of assumptions
that they must make about the system model.
                                     waits for communication with    timing assumptions
atomic commit                        all participating nodes         partially synchronous
consensus, total order broadcast,
  linearizable CAS                   quorum                          partially synchronous
linearizable get/set                 quorum                          asynchronous
eventual consistency, causal
  broadcast, FIFO broadcast          local replica only              asynchronous

(rows are listed in decreasing order of the strength of assumptions they require)
Slide 141
Atomic commit makes the strongest assumptions, since it must wait for communication with all nodes
participating in a transaction (potentially all of the nodes in the system) in order to complete successfully.
Consensus, total order broadcast, and linearizable algorithms make weaker assumptions since they only
require waiting for communication with a quorum, so they can tolerate some unavailable nodes. The FLP
result (Slide 107) showed us that consensus and total order broadcast require partial synchrony. It can be
shown that a linearizable CAS operation is equivalent to consensus [Herlihy, 1991], and thus also requires
partial synchrony. On the other hand, the ABD algorithm for linearizable get/set is asynchronous, since
it does not require any clocks or timeouts. Finally, eventual consistency and strong eventual consistency
make the weakest assumptions: operations can be processed without waiting for any communication
with other nodes, and without any timing assumptions. Similarly, in causal broadcast and weaker forms
of broadcast (FIFO, reliable, etc.), a node broadcasting a message can immediately deliver it to itself
without waiting for communication with other nodes, as discussed in Section 4.2; this corresponds to a
replica immediately processing its own operations without waiting for communication with other replicas.
This hierarchy has some similarities to the concept of complexity classes of algorithms – for example,
comparison-based sorting requires Ω(n log n) comparisons – in the sense that it captures the unavoidable minimum communication
and synchrony requirements for a range of common problems in distributed systems.
8 Case studies
In this last lecture we will look at a couple of examples of distributed systems that need to manage
concurrent access to data. In particular, we will include some case studies of practical, real-world systems
that need to deal with concurrency, and which build upon the concepts from the rest of this course.
For better performance and better robustness to network interruptions, most collaboration software uses
optimistic replication that provides strong eventual consistency (Slide 140).
Families of algorithms:
▶ Conflict-free Replicated Data Types (CRDTs)
  ▶ Operation-based
  ▶ State-based
▶ Operational Transformation (OT)
Slide 142
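As a small illustration of the state-based family (this particular example is not from the lecture), consider a grow-only counter: each replica increments only its own entry, and two replica states are merged by taking an element-wise maximum. Because that merge function is commutative, associative, and idempotent, any two replicas that have exchanged states converge.

class GCounter:
    # Illustrative state-based CRDT: a grow-only counter.

    def __init__(self, node_id):
        self.node_id = node_id
        self.counts = {}            # node id -> number of increments seen from it

    def increment(self):
        self.counts[self.node_id] = self.counts.get(self.node_id, 0) + 1

    def value(self):
        return sum(self.counts.values())

    def merge(self, other):
        # Element-wise maximum: commutative, associative, idempotent.
        for node, count in other.counts.items():
            self.counts[node] = max(self.counts.get(node, 0), count)

a, b = GCounter("A"), GCounter("B")
a.increment(); a.increment()          # A has incremented twice
b.increment()                         # B has incremented once
a.merge(b); b.merge(a)                # exchange states in either order
assert a.value() == b.value() == 3    # both replicas converge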
In this section we will look at some algorithms that are used for this kind of collaboration. As an example,
consider the calendar sync demo in the lecture recording of Section 7.3. Two nodes initially start with the
same calendar entry. On node A, the title is changed from “Lecture” to “Lecture 1”, and concurrently
on node B the time is changed from 12:00 to 10:00. These two updates happen while the two nodes are
temporarily unable to communicate, but eventually connectivity is restored and the two nodes sync their
changes. In the outcome shown on Slide 143, the final calendar entry reflects both the change to the title
and the change to the time.
Converged state on both node A and node B:

{
  "title": "Lecture 1",
  "date": "2020-11-05",
  "time": "10:00"
}
Slide 143
This scenario is an example of conflict resolution, which occurs whenever several concurrent writes to
the same object need to be integrated into a single final state (see also Slide 95). Conflict-free replicated
data types, or CRDTs for short, are a family of algorithms that perform such conflict resolution [Shapiro
et al., 2011]. A CRDT is a replicated object that an application accesses through the object-oriented
interface of an abstract datatype, such as a set, list, map, tree, graph, counter, etc.
Slide 144 shows an example of a CRDT that provides a map from keys to values. The application
can invoke two types of operation: reading the value for a given key, and setting the value for a given
key (which adds the key if it is not already present).
The local state at each node consists of the set values containing (timestamp, key, value) triples.
Reading the value for a given key is a purely local operation that only inspects values on the current node,
and performs no network communication. The algorithm preserves the invariant that values contains at
most one element for any given key. Therefore, when reading the value for a key, the value is unique if
it exists.
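Based only on the description above (the full algorithm appears on Slide 144), a sketch of such a map might look like the following; the operation-based design, the broadcast layer, and the timestamp format are assumptions rather than the exact algorithm from the slide. A set request is broadcast to all replicas (including the sender), and each replica keeps, for every key, only the entry with the greatest timestamp.

class LWWMap:
    # Hedged sketch of a map CRDT whose state is a set of (timestamp, key, value)
    # triples, with at most one triple per key.

    def __init__(self, broadcast):
        self.values = set()          # set of (timestamp, key, value) triples
        self.broadcast = broadcast   # assumed reliable (e.g. causal) broadcast

    def get(self, key):
        # Purely local read; the value is unique if present, by the invariant.
        for (_, k, v) in self.values:
            if k == key:
                return v
        return None

    def set(self, key, value, timestamp):
        # Application request: disseminate the update to every replica, including
        # this one; `timestamp` is assumed to come from a logical clock.
        self.broadcast((timestamp, key, value))

    def on_deliver(self, message):
        # Called when the broadcast delivers an update to this replica.
        t, k, v = message
        previous = {(t2, k2, v2) for (t2, k2, v2) in self.values if k2 == k}
        # Keep the incoming triple only if it is newer than anything held for this
        # key, preserving the at-most-one-triple-per-key invariant.
        if all(t2 < t for (t2, _, _) in previous):
            self.values = (self.values - previous) | {(t, k, v)}

# Trivial single-replica check (a real deployment would broadcast to all replicas):
sent = []
m = LWWMap(broadcast=sent.append)
m.set("title", "Lecture 1", timestamp=2)
m.on_deliver(sent.pop())
assert m.get("title") == "Lecture 1"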