0% found this document useful (0 votes)

31 views28 pages

Lecture 04 - Cloud Storage

storage

Uploaded by

idc.cupons

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

31 views28 pages

Lecture 04 - Cloud Storage

storage

Uploaded by

idc.cupons

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 28

Cloud

Computing
Lecture 4

Storage – CAP
RDBMS

Dan Amiga
[email protected]
Stateless Instances

http://yourapp.cloudapp.net
Putting It All Together

Web role Worker role

Web role Worker role
Web role Worker role
LB

Storage
Stateless compute
+ Durable storage
-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐-‐
= Scalable application
Scale-up And Scale-out

Volume
Volume

WWW
$10,000
machine DNS

$1000
machine

$500 $500 $500 $500 $500

machine machine machine machine machine

# Machines
Scale Up Scale Out
Scale Up vs Scale Out
• Scale Up
– Easier (?)
– Bounded
– Expensive and proprietary
– Sometimes a must (?)
• Scale Out
– Harder (?)
– Slower when you start…
– Maintain Session (sticky vs regular)
– Unbounded, Cheaper, Always a must
• Storage is key for scaling out
On Premise / Traditional Storage Choices

• SAN, NAS, DAS

• Databases
• Offline Archival
• RAID Architecture on top of the above
Application Design Patterns

• Scale out for capacity

• Scale out for redundancy
• Asynchronous communication
• Short time outs with retries
• Idempotent operations
• Stateless with durable external storage
RAID (redundant array of independent disks)

• Storage technology that combines multiple

disk drive components into a logical unit.
• Data is distributed across the drives in one of
several ways called "RAID levels", depending
on the level of redundancy and performance
required.
Storage

• Simple, essential storage abstractions:

– Large items of data: Blobs, file streams, …
– Service state: Simple tables, caches, …
– Service communication: Queues, locks, …
• With an emphasis on:
– Massive scale, availability and durability
– Geo-location and geo-replication
• This is not a relational database in the cloud
Durable Storage

Blobs Tables Queues

…

• Three replicas of everything

• REST API
Storage
Blobs
Queues
AWS Storage Options

• Ephemeral Storage
• Elastic Block Storage (EBS)
• S3
• SQS
• NoSQL – Simple / Dynamo
• Relational Database Storage
• Storage Gateway
http://www.slideshare.net/AmazonWebServic
es/aws-storage-options
Amazon S3

• https://www.dropbox.com/help/7
• http://aws.amazon.com/s3/
• 1kb to 5TB of unlimited number
• You can choose a Region to optimize for
latency, minimize costs, or address
regulatory requirements.
• http://aws.amazon.com/s3-sla/
CAP Theorem
• Consistency (Atomic data objects)
– any read operation that begins after a write operation
completes must return that value, or the result of a
later write operation.
– E.g. if A writes 1 then 2 to location X, client B cannot
read 2 followed by 1.
• Available Data Objects
– even when severe (network? storage?) failures occur,
every request must terminate + minimal latency.
– Easier – all operations return successfully
• Partition Tolerance
– No set of failures less than total network failure is
allowed to cause the system to respond incorrectly.
– Easier – if the network stop delivering messages
between two sets of servers, the system will still
continue to work.
Simplified Proof
CAP Transactional Analysis

• You want consistency?

– Give up availability
– Or give up partition tolerance
Tradeoff

• Consistency give up
– DNS; Inconsistency;
• Availability give up
– Bad idea… Use retries
• Partition Tolerance
– VLDB/Clusters; Synchronous 2-phase commit
CAP In the real world

• AP: You are guaranteed get back responses

promptly (even with network partitions), but
you aren’t guaranteed anything about the
value/contents of the response.
• CP: You are guaranteed that any response you
get (even with network partitions) has a
consistent result. But you might not get any
responses whatsoever.
• CA: If the network never fails (and nodes never
crash, as they postulated earlier), then,
unsurprisingly, life is good. But if messages
could be dropped, all guarantees are off.
Consistency Models

• In an ideal world there would only be one consistency model;

when an update is made all observers will see that update

• Tradeoff to get a consistency update:

– Time
– Partition Tolerance or Availability

• An important observation is that in larger distributed scale

systems, network partitions are a given and as such consistency
and availability cannot be achieved at the same time. This
means that one has two choices on what to drop; relaxing
consistency will allow the system to remain highly available
under the partition conditions and prioritizing consistency
means that under certain conditions the system will not be
available.

• http://www.allthingsdistributed.com/2007/12/eventually_consis
tent.html
Eventual Consistency

• Different nodes keep replicas and each update is

“eventually” propagated to each replica
– And eventually, there is agreement on which
update is the latest
• As the consistency achieved is eventual, the
system has to resolve conflicts.
– Read repair: The correction is done when a read
finds an inconsistency. This slows down the read
operation.
– Write repair: The correction takes place during a
write operation, if an inconsistency has been
found, slowing down the write operation.
– Asynchronous repair: The correction is not part of
a read or write operation.
AWS S3 and Azure Storage Consistency
• Amazon S3 buckets in the US West (Oregon), US West
(Northern California), EU (Ireland), Asia Pacific (Singapore), Asia
Pacific (Tokyo), Asia Pacific (Sydney) and South America (Sao
Paulo) Regions provide read-after-write consistency for PUTS of
new objects and eventual consistency for overwrite PUTS and
DELETES. Amazon S3 buckets in the US Standard Region
provide eventual consistency.
• Azure storage is Consistent, Available, and Partition Tolerance.
How?
Storage Usage Comparison in Azure

CAP Theorem in Blockchain
No ratings yet
CAP Theorem in Blockchain
4 pages
Methods of Organizing Data
No ratings yet
Methods of Organizing Data
8 pages
Blazing Zebra Source Material
No ratings yet
Blazing Zebra Source Material
7 pages
Spatial Data Mining: Theory and Application
No ratings yet
Spatial Data Mining: Theory and Application
337 pages
Case Study Projects 1 Solution
0% (1)
Case Study Projects 1 Solution
1 page
7. Consistency Replication
No ratings yet
7. Consistency Replication
49 pages
Lecture 5 Distributed Storage Systems
No ratings yet
Lecture 5 Distributed Storage Systems
26 pages
REPEAT_1_Designing_for_failure_Architecting_resilient_systems_on_AWS_ARC335-R1
No ratings yet
REPEAT_1_Designing_for_failure_Architecting_resilient_systems_on_AWS_ARC335-R1
91 pages
AWS 3
No ratings yet
AWS 3
35 pages
Chapter One (Crime)
No ratings yet
Chapter One (Crime)
5 pages
CAP Theorem in Blockchain
No ratings yet
CAP Theorem in Blockchain
6 pages
Serverless, FAAS and Event-Driven Architecture
100% (1)
Serverless, FAAS and Event-Driven Architecture
63 pages
Amazon_Aurora_storage_demystified_DAT401
No ratings yet
Amazon_Aurora_storage_demystified_DAT401
30 pages
Chapter_4_3d6b7fe08203468c915d52f43c8757c0_1712934164766
No ratings yet
Chapter_4_3d6b7fe08203468c915d52f43c8757c0_1712934164766
28 pages
SAP BO Tutorial - SAP Business Objects Training Tutorials
No ratings yet
SAP BO Tutorial - SAP Business Objects Training Tutorials
2 pages
(Rahman) Assignment#1
No ratings yet
(Rahman) Assignment#1
9 pages
Unit 5
No ratings yet
Unit 5
21 pages
QMST 2300.251 ANGELOW D
No ratings yet
QMST 2300.251 ANGELOW D
7 pages
Ccaws Unit 5
No ratings yet
Ccaws Unit 5
17 pages
Oup Accepted Manuscript 2018
No ratings yet
Oup Accepted Manuscript 2018
18 pages
Module 2.3
No ratings yet
Module 2.3
25 pages
Lec 14
No ratings yet
Lec 14
13 pages
Random Af
No ratings yet
Random Af
15 pages
Intrepforalteryx
No ratings yet
Intrepforalteryx
15 pages
Cutting A Cube
No ratings yet
Cutting A Cube
15 pages
Homework 5 Solutions
No ratings yet
Homework 5 Solutions
6 pages
Lecture 7 Chapter 5 Part 3 Big Data Storage Concepts (3)
No ratings yet
Lecture 7 Chapter 5 Part 3 Big Data Storage Concepts (3)
11 pages
Crafting Sustainability A Study of Traditional Craft Practices in Central China
No ratings yet
Crafting Sustainability A Study of Traditional Craft Practices in Central China
242 pages
Amazon Dynamo DB - Presentation
100% (1)
Amazon Dynamo DB - Presentation
30 pages
DSM - CAP Theorem
No ratings yet
DSM - CAP Theorem
7 pages
Swilliams Lesson4
100% (1)
Swilliams Lesson4
3 pages
CAP Theorem Lect 2
No ratings yet
CAP Theorem Lect 2
77 pages
CISAS_Edited 13122024
No ratings yet
CISAS_Edited 13122024
12 pages
A Critique of The CAP Theorem-Martin Kleppmann
No ratings yet
A Critique of The CAP Theorem-Martin Kleppmann
14 pages
DBMS CHAP-4
No ratings yet
DBMS CHAP-4
20 pages
Ayushi Verma Market Research Project Finall
No ratings yet
Ayushi Verma Market Research Project Finall
69 pages
Writing A Research
No ratings yet
Writing A Research
52 pages
Overcoming CAP With Consistent Soft-State Replication: 1. Abstract
No ratings yet
Overcoming CAP With Consistent Soft-State Replication: 1. Abstract
7 pages
Data Engineering Unit 3
No ratings yet
Data Engineering Unit 3
4 pages
Aminotes - NTCC Project BIG DATA
No ratings yet
Aminotes - NTCC Project BIG DATA
22 pages
Unit-4_Cloud Storage and Database Services
No ratings yet
Unit-4_Cloud Storage and Database Services
88 pages
NoSQL - Unit 2
No ratings yet
NoSQL - Unit 2
11 pages
ch07-consistency-replication (1)
No ratings yet
ch07-consistency-replication (1)
30 pages
cloud unit-4-2
No ratings yet
cloud unit-4-2
32 pages
Big Data Analytics Lecture 3A
No ratings yet
Big Data Analytics Lecture 3A
27 pages
Unit 2.1.1 - AWS
No ratings yet
Unit 2.1.1 - AWS
20 pages
Phillips - Automated Aggregation of Data For Asset Health Analysis
No ratings yet
Phillips - Automated Aggregation of Data For Asset Health Analysis
11 pages
AWS Certified Solutions Architect Associate Exam Prep Regions, Availability Zones, and Edge Locations
No ratings yet
AWS Certified Solutions Architect Associate Exam Prep Regions, Availability Zones, and Edge Locations
90 pages
Sample Exam ITC 423
No ratings yet
Sample Exam ITC 423
8 pages
AWS Certified Cloud Practitioner Cheat Sheet Guide
100% (7)
AWS Certified Cloud Practitioner Cheat Sheet Guide
12 pages
Introduction To Market Research
No ratings yet
Introduction To Market Research
3 pages
R Code Snippets
No ratings yet
R Code Snippets
10 pages
Slides
No ratings yet
Slides
31 pages
FIDIC 1999 - Unforseeable Physical Conditions
No ratings yet
FIDIC 1999 - Unforseeable Physical Conditions
9 pages
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
No ratings yet
Big Data Management and Nosql Databases: Doc. Rndr. Irena Holubova, PH.D
27 pages
Nosql Systems: Sharding, Replication and Consistency: Riccardo Torlone Università Roma Tre
No ratings yet
Nosql Systems: Sharding, Replication and Consistency: Riccardo Torlone Università Roma Tre
28 pages
Introduction to digital forensics
No ratings yet
Introduction to digital forensics
24 pages
Designing
No ratings yet
Designing
161 pages
Cap Critique
No ratings yet
Cap Critique
14 pages
Ebook - Cracking The System Design Interview Course
100% (1)
Ebook - Cracking The System Design Interview Course
91 pages
Tailoring Unit - Final
No ratings yet
Tailoring Unit - Final
26 pages
The New Age of Data-Intensive Applications
No ratings yet
The New Age of Data-Intensive Applications
7 pages
Cloud Computing
No ratings yet
Cloud Computing
68 pages
KT AWS
No ratings yet
KT AWS
16 pages
The CAP Theorem and The Design of Large Scale Distributed Systems: Part I
No ratings yet
The CAP Theorem and The Design of Large Scale Distributed Systems: Part I
44 pages
SQL Practicals (1)
No ratings yet
SQL Practicals (1)
7 pages
Distributed Shared Memory: Introduction & Thisis
No ratings yet
Distributed Shared Memory: Introduction & Thisis
22 pages
Business Analytics Notes
No ratings yet
Business Analytics Notes
6 pages
Durability & Availability: Durability Can Be Described As The Probability That You Will Eventually Be
No ratings yet
Durability & Availability: Durability Can Be Described As The Probability That You Will Eventually Be
12 pages
Week 4 Media and Information Litercay Lesson
No ratings yet
Week 4 Media and Information Litercay Lesson
6 pages
Module No.2.0: Office Application Unit No.2.3: Database Application Element 2.3.1: Selecting Database
No ratings yet
Module No.2.0: Office Application Unit No.2.3: Database Application Element 2.3.1: Selecting Database
16 pages
The Complete Future Trait Guide
From Everand
The Complete Future Trait Guide
Hamze Ghalebi
No ratings yet
The Architecture of Storage Networks
From Everand
The Architecture of Storage Networks
Pasquale De Marco
No ratings yet
Distributed Caching & Data Management: Mastering Redis, Memcached, And Apache Ignite Caching
From Everand
Distributed Caching & Data Management: Mastering Redis, Memcached, And Apache Ignite Caching
Rob Botwright
No ratings yet
AWS in Action Part -2: Real-world Solutions for Cloud Professionals
From Everand
AWS in Action Part -2: Real-world Solutions for Cloud Professionals
Poonam Devi
No ratings yet
Fast Data Processing Systems with SMACK Stack
From Everand
Fast Data Processing Systems with SMACK Stack
Raúl Estrada
No ratings yet
Learn Cassandra in 24 Hours
From Everand
Learn Cassandra in 24 Hours
Alex Nordeen
No ratings yet
Storage Area Networks For Dummies
From Everand
Storage Area Networks For Dummies
Christopher Poelker
3.5/5 (2)
AWS Certified Solutions Architect - Associate Exam Prep kit
From Everand
AWS Certified Solutions Architect - Associate Exam Prep kit
SUJAN
No ratings yet
Oracle 11g Streams Implementer's Guide
From Everand
Oracle 11g Streams Implementer's Guide
Ann L. R. McKinnell
No ratings yet
Introduction to Microsoft SQL Server
From Everand
Introduction to Microsoft SQL Server
Eric Frick
No ratings yet
All My IT Tech Posts
From Everand
All My IT Tech Posts
Stephen Edwards
No ratings yet
Oracle Recovery Appliance Handbook: An Insider’S Insight
From Everand
Oracle Recovery Appliance Handbook: An Insider’S Insight
Ramesh Raghav
No ratings yet
Build Your Own Distributed Compilation Cluster - A Practical Walkthrough
From Everand
Build Your Own Distributed Compilation Cluster - A Practical Walkthrough
Hunter Davis
No ratings yet
Elements of Android Room
From Everand
Elements of Android Room
Mark Murphy
No ratings yet
AWS Certified Solutions Architect - Professional
From Everand
AWS Certified Solutions Architect - Professional
VB Dev
No ratings yet
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
From Everand
Exploring Hadoop Ecosystem (Volume 2): Stream Processing
Wei Liu
No ratings yet
Information Technology HandBook
From Everand
Information Technology HandBook
Duong Tran
3/5 (1)
SAS Interview Questions You'll Most Likely Be Asked
From Everand
SAS Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
From Everand
SAS Programming Guidelines Interview Questions You'll Most Likely Be Asked
Vibrant Publishers
No ratings yet

Lecture 04 - Cloud Storage

Uploaded by

Lecture 04 - Cloud Storage

Uploaded by

Cloud

Web role Worker role

$500 $500 $500 $500 $500

• SAN, NAS, DAS

• Scale out for capacity

• Storage technology that combines multiple

• Simple, essential storage abstractions:

Blobs Tables Queues

• Three replicas of everything

• You want consistency?

• AP: You are guaranteed get back responses

• In an ideal world there would only be one consistency model;

• Tradeoff to get a consistency update:

• An important observation is that in larger distributed scale

• Different nodes keep replicas and each update is

You might also like