Module_1

The document provides an introduction to NoSQL databases, discussing their background, key features, advantages, and disadvantages compared to traditional relational databases. It covers the CAP theorem, various NoSQL categories such as key-value, document-based, column-family, and graph databases, and highlights the challenges of scaling relational databases. The document emphasizes the growing need for NoSQL solutions due to the limitations of relational databases in handling large datasets and complex data structures.

Uploaded by

belal Abdullah


INTRODUCTION TO
NOSQL DATABASES
Prepared By:
Madhuri J
Assistant Professor
Department of Computer Science and Engineering
Bangalore Institute of Technology

Outline
• Background
• What is NOSQL?
• Who is using it?
• 3 major papers for NOSQL
• CAP theorem
• NOSQL categories
• Conclusion
• References

Background
• Relational databases have long been the mainstay of business
• Web-based applications caused spikes in load:
• explosion of social media sites (Facebook, Twitter) with large data
needs
• rise of cloud-based solutions such as Amazon S3 (Simple Storage
Service)
• Hooking an RDBMS to a web-based application becomes
troublesome

Issues with scaling up

• The best way to provide ACID and a rich query model is to keep
the dataset on a single machine
• Scaling up (vertical scaling: making a “single” machine more
powerful) has limits: eventually the dataset is just too big!
• Scaling out (horizontal scaling: adding more
smaller/cheaper servers) is a better choice
• Different approaches for horizontal scaling (multi-node
database):
• Master/Slave
• Sharding (partitioning)

Scaling out RDBMS: Master/Slave

• Master/Slave
• All writes are written to the master
• All reads performed against the replicated slave databases
• Critical reads may be incorrect as writes may not have been
propagated down
• Large datasets can pose problems as master needs to
duplicate data to slaves

Scaling out RDBMS: Sharding

• Sharding (Partitioning)
• Scales well for both reads and writes
• Not transparent, application needs to be partition-aware
• Can no longer have relationships/joins across partitions
• Loss of referential integrity across shards
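
The sharding points above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the names `shard_for` and `ShardedTable` are hypothetical, each shard is a plain in-memory dict, and rows are routed by hashing the key and taking it modulo the number of shards.

```python
import hashlib


def shard_for(key, num_shards):
    """Map a key to a shard index by hashing it (hypothetical routing rule)."""
    digest = hashlib.sha256(key.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big") % num_shards


class ShardedTable:
    """Toy partitioned table: every shard is an independent dict."""

    def __init__(self, num_shards):
        self.shards = [dict() for _ in range(num_shards)]

    def put(self, key, row):
        # The application (not the database) decides where the row lives.
        self.shards[shard_for(key, len(self.shards))][key] = row

    def get(self, key):
        # A lookup by key touches exactly one shard.
        return self.shards[shard_for(key, len(self.shards))].get(key)

    def scan(self, predicate):
        # Anything that is not a key lookup has to visit every shard.
        return [row for shard in self.shards
                for row in shard.values() if predicate(row)]
```

Key lookups stay cheap because they touch one shard, while `scan` shows why joins and other cross-partition queries become the application's problem.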

Other ways to scale out RDBMS

• Multi-Master replication
• INSERT only, not UPDATES/DELETES
• No JOINs, thereby reducing query time
• This involves de-normalizing data
• In-memory databases

What is NOSQL?
• The Name:
• Stands for Not Only SQL
• The term NOSQL was introduced by Carlo Strozzi in 1998 to name
his file-based database
• It was re-introduced by Eric Evans in 2009, when an event was
organized to discuss open source distributed databases
• Eric states that “… but the whole point of seeking alternatives is
that you need to solve a problem that relational databases are a
bad fit for. …”

What is NOSQL?
• Key features (advantages):
• non-relational
• doesn’t require a schema
• data are replicated to multiple
nodes (so, identical & fault-tolerant)
and can be partitioned:
• down nodes easily replaced
• no single point of failure
• horizontally scalable
• cheap, easy to implement
(open-source)
• massive write performance
• fast key-value access

What is NOSQL?
• Disadvantages:
• Doesn’t fully support relational features
• no join, group by, order by operations (except within partitions)
• no referential integrity constraints across partitions
• No declarative query language (e.g., SQL), so more
programming is needed
• Relaxed ACID (see CAP theorem), so fewer guarantees
• No easy integration with other applications that support
SQL

Who is using them?

3 major papers for NOSQL

• Three major papers were the “seeds” of the NOSQL
movement:
• BigTable (Google)
• Dynamo (Amazon), which contributed:
• Ring partition and replication
• Gossip protocol (discovery and error detection)
• Distributed key-value data stores
• Eventual consistency
• CAP Theorem
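
As a rough illustration of the “ring partition and replication” idea listed above, the following Python sketch places nodes on a hash ring and stores each key on the next few nodes clockwise. It is a simplification, not the algorithm as published: the names `ring_position`, `Ring` and `preference_list` are hypothetical, and virtual nodes are left out.

```python
import bisect
import hashlib


def ring_position(value):
    """Hash a node name or key to a position on the ring."""
    digest = hashlib.sha256(value.encode("utf-8")).digest()
    return int.from_bytes(digest[:8], "big")


class Ring:
    """Toy hash ring: each key lives on the next `replicas` nodes clockwise."""

    def __init__(self, nodes, replicas=2):
        self.replicas = replicas
        # Sorted (position, node) pairs form the ring.
        self.points = sorted((ring_position(n), n) for n in nodes)

    def preference_list(self, key):
        positions = [p for p, _ in self.points]
        # First node clockwise from the key, wrapping around at the end.
        start = bisect.bisect_right(positions, ring_position(key)) % len(self.points)
        count = min(self.replicas, len(self.points))
        return [self.points[(start + i) % len(self.points)][1]
                for i in range(count)]
```

One property worth noticing: removing a node that does not hold a given key leaves that key's placement untouched, which is what makes it easy to replace nodes that are down.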

The Perfect Storm

• Large datasets, acceptance of alternatives, and
dynamically-typed data have come together in a “perfect
storm”
• Not a backlash against RDBMS
• SQL is a rich query language that cannot be rivaled by the
current list of NOSQL offerings

CAP Theorem
• ACID
• A DBMS is expected to support “ACID transactions,” processes
that are:
• Atomicity: either the whole process is done or none of it is
• Consistency: only valid data are written
• Isolation: concurrent transactions do not interfere; the result is
as if they ran one at a time
• Durability: once committed, it stays that way

• CAP
• Consistency: all nodes in the cluster see the same copy of the data
• Availability: cluster always accepts reads and writes
• Partition tolerance: guaranteed properties are maintained even
when network failures prevent some machines from
communicating with others
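
To make atomicity and consistency from the ACID list concrete, here is a minimal sketch using Python's built-in `sqlite3` module. The `account` table, the `transfer` function and the balances are invented for the example; the point is only that a failed transfer leaves no partial update behind.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# The CHECK constraint is the "only valid data are written" rule.
conn.execute(
    "CREATE TABLE account (name TEXT PRIMARY KEY, "
    "balance INTEGER CHECK (balance >= 0))"
)
conn.execute("INSERT INTO account VALUES ('alice', 100), ('bob', 0)")
conn.commit()


def transfer(conn, src, dst, amount):
    """Move money in one transaction: both updates happen or neither does."""
    try:
        with conn:  # commits on success, rolls back if an exception escapes
            conn.execute(
                "UPDATE account SET balance = balance + ? WHERE name = ?",
                (amount, dst),
            )
            conn.execute(
                "UPDATE account SET balance = balance - ? WHERE name = ?",
                (amount, src),
            )
        return True
    except sqlite3.IntegrityError:
        return False
```

Calling `transfer(conn, "alice", "bob", 150)` credits bob first, then fails on alice's negative balance, and the rollback undoes the credit as well.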

CAP Theorem
• Brewer’s CAP Theorem:
• For any system sharing data, it is “impossible” to guarantee
simultaneously all of these three properties
• You can have at most two of these three properties for any shared-
data system
• Very large systems will “partition” at some point:
• That leaves either C or A to choose from (a traditional DBMS prefers
C over A and P)
• In almost all cases, you would choose A over C (except in specific
applications such as order processing)

CAP Theorem
• Consistency
• 2 types of consistency:
1. Strong consistency – ACID (Atomicity, Consistency,
Isolation, Durability)
2. Weak consistency – BASE (Basically Available,
Soft-state, Eventual consistency)

CAP Theorem
• A consistency model determines rules for visibility and
apparent order of updates
• Example:
• Row X is replicated on nodes M and N
• Client A writes row X to node N
• Some period of time t elapses
• Client B reads row X from node M
• Does client B see the write from client A?
• Consistency is a continuum with tradeoffs
• For NOSQL, the answer would be: “maybe”
• CAP theorem states: “strong consistency can't be achieved at the
same time as availability and partition-tolerance”
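
The row X example above can be replayed with a small simulation. This is a sketch under simplifying assumptions: the names `Replica` and `Cluster` are hypothetical, there are exactly two nodes, and propagation is an explicit `propagate()` call standing in for the elapsed time t.

```python
class Replica:
    """One node holding its own copy of the rows."""

    def __init__(self, name):
        self.name = name
        self.rows = {}


class Cluster:
    """Two replicas, M and N, with asynchronous propagation of writes."""

    def __init__(self):
        self.m = Replica("M")
        self.n = Replica("N")
        self.pending = []  # writes accepted by one node, not yet copied

    def write(self, node, key, value):
        node.rows[key] = value
        other = self.m if node is self.n else self.n
        self.pending.append((other, key, value))

    def propagate(self):
        # Stands in for "some period of time t elapses".
        for node, key, value in self.pending:
            node.rows[key] = value
        self.pending.clear()

    def read(self, node, key):
        return node.rows.get(key)
```

If client B reads X from M before `propagate()` runs, it does not see client A's write; afterwards it does. That is the “maybe” in the slide.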

CAP Theorem
• Eventual consistency
• When no updates occur for a long period of time, eventually all
updates will propagate through the system and all the nodes will
be consistent
• Cloud computing
• ACID is hard to achieve; moreover, it is not always required, e.g.
for blogs, status updates, product listings, etc.

Impedance mismatch
• Difference between the relational model and in-memory data
structures.
• The relational data model organizes data into a structure of tables
and rows (more formally, relations and tuples)
• Tuple: set of name-value pairs (a single record)
• Relation: set of tuples
• Values in a relational tuple have to be simple and cannot contain
structures, such as a nested record.
• In-memory data structures can take much richer forms
• An in-memory data structure has to be translated into a relational
representation to store it on disk.
• Having two representations that require translation is the IMPEDANCE
MISMATCH
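
A small sketch of that translation step, using Python's built-in `sqlite3` module. The order data, the table names `orders` and `order_lines`, and the helper names `save` and `load` are all made up for the illustration.

```python
import sqlite3

# Rich in-memory structure: a record with a nested list of records.
order = {
    "id": 1,
    "customer": "Ann",
    "lines": [{"product": "pen", "qty": 2}, {"product": "ink", "qty": 1}],
}

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER PRIMARY KEY, customer TEXT)")
conn.execute("CREATE TABLE order_lines (order_id INTEGER, product TEXT, qty INTEGER)")


def save(conn, order):
    """Flatten the nested object into simple tuples in two relations."""
    with conn:
        conn.execute("INSERT INTO orders VALUES (?, ?)",
                     (order["id"], order["customer"]))
        conn.executemany(
            "INSERT INTO order_lines VALUES (?, ?, ?)",
            [(order["id"], l["product"], l["qty"]) for l in order["lines"]],
        )


def load(conn, order_id):
    """Reassemble the nested object from the flat rows."""
    oid, customer = conn.execute(
        "SELECT id, customer FROM orders WHERE id = ?", (order_id,)
    ).fetchone()
    lines = [
        {"product": p, "qty": q}
        for p, q in conn.execute(
            "SELECT product, qty FROM order_lines "
            "WHERE order_id = ? ORDER BY rowid", (order_id,))
    ]
    return {"id": oid, "customer": customer, "lines": lines}
```

The `save` and `load` functions are exactly the mapping code that the slide calls the impedance mismatch: one object in memory, two tables on disk.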

Impedance mismatch Example

• Integration Database
• Multiple applications developed by separate teams,
storing their data in a common database.
• Improves communication as all applications are
operating on the same persistent data.

• Disadvantages
• When one application changes the data storage, it has
to co-ordinate with all the others
• A structure integrating many applications becomes
complex.
• An update made for one application may cause problems for
another application.

• Application Database
• Accessed by a single application codebase that’s looked
after by a single team.
• Only the team using the application needs to know
about the database structure.

Aggregates
• Data as atomic units that have a complex structure
• more structure than just a set of tuples
• example:
• complex record with: simple fields, arrays, records nested
inside
• Aggregate in Domain-Driven Design
• a collection of related objects that we treat as a unit
• a unit for data manipulation and management of consistency
• Advantages of aggregates:
• easier for application programmers to work with
• easier for database systems to handle operating on a cluster

Relational implementation

A possible aggregation

Aggregate representation

Aggregate implementation

Another possible aggregation

Aggregate representation (2)

Aggregate implementation (2)

Why NOSQL databases

• Application development productivity
• Less effort in mapping data between in-memory data
structures and a relational database.

• Large scale data
• Designed explicitly to run on clusters.

NOSQL categories
1. Key-value
• Example: DynamoDB, Voldemort, Scalaris
2. Document-based
• Example: MongoDB, CouchDB
3. Column-based
• Example: BigTable, Cassandra, HBase
4. Graph-based
• Example: Neo4J, InfoGrid
• “No-schema” is a common characteristic of most
NOSQL storage systems
• Provide “flexible” data types

Key-Value Database
• Strongly aggregate-oriented
• Lots of aggregates
• Each aggregate has a key
• Data model
• A set of <key, value> pairs
• Value: an aggregate instance
• The aggregate is opaque to the database
• just a big blob of mostly meaningless bits

• Access to an aggregate: lookup based on its key

Key-value
• Focus on scaling to huge amounts of data
• Designed to handle massive load
• Based on Amazon’s Dynamo paper
• Data model: (global) collection of key-value pairs
• Dynamo ring partitioning and replication
• Example: (DynamoDB)
• items having one or more attributes (name, value)
• An attribute can be single-valued or multi-valued, like a set.
• items are combined into a table

Key-Values Databases: Example

Key-value
• Basic API access:
• get(key): extract the value given a key
• put(key, value): create or update the value given its key
• delete(key): remove the key and its associated value
• execute(key, operation, parameters): invoke an operation on the
value (given its key), which is a special data structure (e.g. List, Set,
Map, etc.)
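
The four operations above map naturally onto a small class. The following is an in-memory sketch only: the class name `KeyValueStore` is hypothetical, and `execute` is interpreted here as "call the named method on the stored Python value", which is one possible reading of the slide rather than any particular product's API.

```python
class KeyValueStore:
    """Toy key-value store exposing get / put / delete / execute."""

    def __init__(self):
        self._data = {}

    def get(self, key):
        # Extract the value for a key (None when the key is absent).
        return self._data.get(key)

    def put(self, key, value):
        # Create or update the value stored under the key.
        self._data[key] = value

    def delete(self, key):
        # Remove the key and its value; a missing key is ignored.
        self._data.pop(key, None)

    def execute(self, key, operation, *parameters):
        # Invoke an operation on a structured value (list, set, dict, ...).
        value = self._data[key]
        return getattr(value, operation)(*parameters)
```

For example, after `put("cart:1", [])`, a call to `execute("cart:1", "append", "pen")` changes the stored list in place without the client fetching and rewriting the whole value.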

Key-value
Pros:
• very fast
• very scalable (horizontally distributed to nodes based on key)
• simple data model
• eventual consistency
• fault-tolerance

Cons:
- Can’t model more complex data structure such as objects

Key-value
Name | Producer | Data model | Querying
SimpleDB | Amazon | set of couples (key, {attribute}), where attribute is a couple (name, value) | restricted SQL; select, delete, GetAttributes, and PutAttributes operations
Redis | Salvatore Sanfilippo | set of couples (key, value), where value is simple typed value, list, ordered (according to ranking) or unordered set, hash value | primitive operations for each value type
Dynamo | Amazon | like SimpleDB | simple get operation and put in a context
Voldemort | LinkedIn | like SimpleDB | similar to Dynamo

Popular key-value databases

Document databases
• Strongly aggregate-oriented
• Lots of aggregates
• Each aggregate has a key
• Data model
• A set of <key, document > pairs
• Document: an aggregate instance
• Structure of the aggregate visible
• limits on what we can place in it
• Access to an aggregate:
• queries based on the fields in the aggregate

Document Data model- Example

Document-based
• Can model more complex objects
• Data model: collection of documents
• Document: JSON (JavaScript Object Notation, a data model of
key-value pairs that supports objects, records, structs, lists,
arrays, maps, dates, and Booleans, with nesting), XML, or other
semi-structured formats.

Document-based
• Example: (MongoDB) document
• {Name: "Jaroslav",
Address: "Malostranske nám. 25, 118 00 Praha 1",
Grandchildren: {Claire: "7", Barbara: "6", Magda: "3", Kirsten: "1",
Otis: "3", Richard: "1"},
Phones: ["123-456-7890", "234-567-8963"]
}
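
Because the structure of a document is visible to the database, it can be queried by field, including nested fields. The sketch below imitates that with plain Python dicts; it does not use any MongoDB driver. The helpers `get_path` and `find`, the dotted-path convention, and the second document ("Eva") are assumptions made for the illustration.

```python
people = [
    {
        "Name": "Jaroslav",
        "Address": "Malostranske nám. 25, 118 00 Praha 1",
        "Grandchildren": {"Claire": "7", "Barbara": "6", "Magda": "3",
                          "Kirsten": "1", "Otis": "3", "Richard": "1"},
        "Phones": ["123-456-7890", "234-567-8963"],
    },
    # Hypothetical second document with a different set of fields.
    {"Name": "Eva", "Phones": []},
]


def get_path(doc, path):
    """Follow a dotted path such as 'Grandchildren.Claire' into a document."""
    current = doc
    for part in path.split("."):
        if not isinstance(current, dict) or part not in current:
            return None
        current = current[part]
    return current


def find(collection, criteria):
    """Return documents whose fields equal every value in `criteria`."""
    return [doc for doc in collection
            if all(get_path(doc, path) == value
                   for path, value in criteria.items())]
```

A query such as `find(people, {"Grandchildren.Claire": "7"})` reaches inside the aggregate, which a pure key-value store cannot do because the value is opaque to it.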

Name | Producer | Data model | Querying
MongoDB | 10gen | object-structured documents stored in collections; each object has a primary key called ObjectId | manipulations with objects in collections (find object or objects via simple selections and logical expressions, delete, update)
Couchbase | Couchbase | document as a list of named (structured) items (JSON document) | by key and key range, views via Javascript and MapReduce

Column(-Family) Store

Properties of Column-Family Stores

Cassandra

Column-based
• Based on Google’s BigTable paper
• Like column-oriented relational databases (store data in column order) but
with a twist
• Tables similar to an RDBMS, but handle semi-structured data
• Data model:
• Collection of Column Families
• Column family = (key, value) where value = set of related columns (standard, super)
• indexed by row key, column key and timestamp
• allow key-value pairs to be stored (and retrieved on key) in a massively parallel
system
• storing principle: big hashed distributed tables
• properties: partitioning (horizontally and/or vertically), high availability etc.
• completely transparent to application

* Better: extendible records

Column-based
• One column family can have a variable
number of columns
• Cells within a column family are sorted “physically”
• Very sparse, most cells have null values
• Comparison: RDBMS vs column-based NOSQL
• Query on multiple tables
• RDBMS: must fetch data from several places on disk and glue them together
• Column-based NOSQL: only fetch the column families of those columns
that are required by a query (all columns in a column family are stored
together on disk, so multiple rows can be retrieved in one read
operation, i.e. data locality)

Column-based
• Example: (Cassandra column family; timestamps
removed for simplicity)
UserProfile = {
Cassandra = { emailAddress: "[email protected]", age: "20" }
TerryCho = { emailAddress: "[email protected]", gender: "male" }
Cath = { emailAddress: "[email protected]",
age: "20", gender: "female", address: "Seoul" }
}
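
A column family like the one above can be pictured as a map from row key to a sparse map of columns, where each cell carries a timestamp. The Python sketch below is a toy model under that assumption; the class name `ColumnFamily` and the rule "the newest timestamp wins" are illustrative choices, not Cassandra's actual storage engine.

```python
class ColumnFamily:
    """Toy column family: row key -> {column name -> (value, timestamp)}."""

    def __init__(self, name):
        self.name = name
        self.rows = {}

    def insert(self, row_key, column, value, timestamp):
        columns = self.rows.setdefault(row_key, {})
        current = columns.get(column)
        # Keep the cell with the newest timestamp.
        if current is None or timestamp >= current[1]:
            columns[column] = (value, timestamp)

    def get(self, row_key, column=None):
        columns = self.rows.get(row_key, {})
        if column is None:
            # Whole row, columns in sorted order, timestamps hidden.
            return {name: value
                    for name, (value, _) in sorted(columns.items())}
        cell = columns.get(column)
        return cell[0] if cell else None
```

Rows only store the columns they actually have, so `TerryCho` can carry `gender` without `age` and `Cath` can carry both plus `address`, with no null cells kept for the missing ones.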

Column-based
Name | Producer | Data model | Querying
BigTable | Google | set of couples (key, {value}) | selection (by combination of row, column, and time stamp ranges)
HBase | Apache | groups of columns (a BigTable clone) | JRUBY IRB-based shell (similar to SQL)
Hypertable | Hypertable | like BigTable | HQL (Hypertable Query Language)
Cassandra | Apache (originally Facebook) | columns, groups of columns corresponding to a key (supercolumns) | simple selections on key, range queries, column or columns ranges
PNUTS | Yahoo | (hashed or ordered) tables, typed arrays, flexible schema | selection and projection from a single table (retrieve an arbitrary single record by primary key, range queries, complex predicates, ordering, top-k)

Popular Column-Family Stores

Graph-based
• Focus on modeling the structure of data (interconnectivity)
• Scales to the complexity of data
• Graph databases are motivated by small records with
complex interconnections
• we have a web of information whose nodes are very small
(nothing more than a name) but there is a rich structure of
interconnections between them.
• Example:
• Neo4j, FlockDB, Pregel, InfoGrid …

Graph Database
• A graph database is a database that uses graph structures with
nodes, edges, and properties to represent and store data
• A management system for graph databases offers Create,
Read, Update, and Delete (CRUD) methods to access and
manipulate data
• Graph databases can be used for both OLAP (since they are
naturally multidimensional structures) and OLTP
• Systems tailored to OLTP (e.g., Neo4j) are generally optimized
for transactional performance, and tend to guarantee ACID
properties

Graph Database: Relationships

• Graph databases are particularly suited to model situations in
which the information is somehow “natively” in the form of a
graph.
• Most of the time you find data by navigating through the
network of edges, with queries such as “tell me all the things
that both Anna and Barbara like.”
• The emphasis on relationships makes graph databases very
different from aggregate-oriented databases.
• The real world provides us with a lot of application domains:
social networks, recommendation systems, geospatial
applications, computer network and data center management,
authorization and access control, etc.
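
The “things that both Anna and Barbara like” query reduces to following labelled edges from two nodes and intersecting the results. Here is a minimal Python sketch using adjacency sets; the `Graph` class, the `likes` edge label and the sample items are hypothetical, and a real graph database would use its own query language instead.

```python
from collections import defaultdict


class Graph:
    """Toy property-less graph: labelled, directed edges between named nodes."""

    def __init__(self):
        self.edges = defaultdict(set)  # (source, label) -> set of targets

    def add_edge(self, source, label, target):
        self.edges[(source, label)].add(target)

    def neighbors(self, source, label):
        # Navigate outgoing edges with the given label.
        return self.edges.get((source, label), set())


def liked_by_both(graph, first, second):
    """Things reachable over a 'likes' edge from both people."""
    return graph.neighbors(first, "likes") & graph.neighbors(second, "likes")
```

The query never scans records; it starts at two nodes and walks their edges, which is why the cost depends on how connected those nodes are rather than on the size of the whole dataset.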

Schemaless Databases
• NoSQL databases are schemaless:
• A key-value store allows you to store any data you like under a
key
• A document database effectively does the same thing, since it
makes no restrictions on the structure of the documents you
store
• Column-family databases allow you to store any data under any
column you like
• Graph databases allow you to freely add new edges and freely
add properties to nodes and edges as you wish

Schemaless Databases
• This has various advantages:
• Without a schema binding you, you can easily store whatever
you need, and change your data storage as you learn more
about your project
• You can easily add new things as you discover them
• A schemaless store also makes it easier to deal with nonuniform
data: data where each record has a different set of fields
(limiting sparse data storage)

Schemaless Databases
• And also some problems
• Indeed, whenever we write a program that accesses data, that program
almost always relies on some form of implicit schema: it will assume
that certain field names are present and carry data with a certain
meaning, and assume something about the type of data stored within
that field
• Having the implicit schema in the application means that in order to
understand what data is present you have to dig into the application
code
• Furthermore, the database remains ignorant of the schema: it cannot
use the schema to support the decision on how to store and retrieve
data efficiently.
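
The implicit schema described above can be made visible by writing it down in the application. The following Python sketch does that for a hypothetical product record; the field names, the expected types and the helper `check_implicit_schema` are assumptions for the example, not part of any database.

```python
# The schema the application silently relies on (hypothetical fields).
EXPECTED_FIELDS = {"name": str, "price": float}


def check_implicit_schema(record, expected=EXPECTED_FIELDS):
    """List the ways a stored record violates what the code assumes."""
    problems = []
    for field, expected_type in expected.items():
        if field not in record:
            problems.append(f"missing field: {field}")
        elif not isinstance(record[field], expected_type):
            problems.append(
                f"{field}: expected {expected_type.__name__}, "
                f"got {type(record[field]).__name__}"
            )
    return problems
```

A schemaless store will happily accept `{"name": "pen", "price": "1.5"}`; only the application notices that the price is a string, which is the cost of moving the schema out of the database.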

Materialized views

• Views provide a mechanism to hide from the client whether
data is derived data or base data, but can’t avoid the fact that
some views are expensive to compute.
• Materialized views are views that are computed in advance
and cached on disk.
• Materialized views are effective for data that is read heavily
but can stand being somewhat stale.

• There are two rough strategies to building a materialized view.

1. Eager approach, where you update the materialized view at the
same time you update the base data for it.
• In this case, adding an order would also update the purchase history
aggregates for each product.
• This approach is good when you have more frequent reads of the
materialized view than you have writes and you want the materialized
views to be as fresh as possible.
2. Batch approach, where batch jobs update the materialized views at
regular intervals.
• Materialized views can also be used within the same aggregate.
• An order document might include an order summary element that
provides summary information about the order.
• An advantage of doing this is that it allows you to update the
materialized view within the same atomic operation.
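
The eager approach can be sketched as a store that updates a per-product total every time an order is added. This is an in-memory illustration only; the class `OrderStore`, the `units_sold` view and the order layout (a list of `(product, quantity)` pairs) are invented for the example.

```python
from collections import defaultdict


class OrderStore:
    """Base data (orders) plus an eagerly maintained materialized view."""

    def __init__(self):
        self.orders = {}                    # order id -> list of (product, qty)
        self.units_sold = defaultdict(int)  # the view: product -> total qty

    def add_order(self, order_id, lines):
        # Eager strategy: update the view in the same step as the base data.
        self.orders[order_id] = lines
        for product, qty in lines:
            self.units_sold[product] += qty

    def recompute_units_sold(self):
        # What the view would cost if computed from scratch on every read.
        totals = defaultdict(int)
        for lines in self.orders.values():
            for product, qty in lines:
                totals[product] += qty
        return dict(totals)
```

Reads of `units_sold` are a single lookup, paid for by a little extra work on every write; `recompute_units_sold` shows the full scan that the view avoids.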

MODELING FOR DATA ACCESS

• As mentioned earlier, when modeling data aggregates we need to
consider how the data is going to be read as well as what the side
effects are on data related to those aggregates.

• Consider a model where all the data for the customer is embedded
using a key-value store.

• In this scenario, the application can read the customer’s information
and all the related data by using the key.

• If the requirements are to read the orders or the products sold in each
order, the whole object has to be read and then parsed on the client
side to build the results.

• When references are needed, we could switch to document stores
and then query inside the documents, or even change the data for the
key-value store to split the value object into Customer and Order
objects and then maintain these objects’ references to each other.
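
The second option, splitting the value into Customer and Order objects that reference each other, might look like the following Python sketch. A plain dict stands in for the key-value store, values are JSON strings so they stay opaque to it, and the key scheme (`customer:<id>`, `order:<id>`) and helper names are assumptions for the illustration.

```python
import json

store = {}  # stand-in for a key-value store: key -> opaque JSON string


def save_split(store, customer_id, name, orders):
    """Store the customer and each order under separate keys."""
    store[f"customer:{customer_id}"] = json.dumps(
        {"name": name, "order_ids": [o["id"] for o in orders]}
    )
    for order in orders:
        # Each order keeps a reference back to its customer.
        store[f"order:{order['id']}"] = json.dumps(
            {**order, "customer_id": customer_id}
        )


def orders_for(store, customer_id):
    """Follow the references from a customer to its orders."""
    customer = json.loads(store[f"customer:{customer_id}"])
    return [json.loads(store[f"order:{oid}"])
            for oid in customer["order_ids"]]
```

An order can now be read by its own key without loading the whole customer, at the price of the application maintaining the references on both sides.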

• Aggregates can also be used to obtain analytics; for example, an
aggregate update may fill in information on which Orders have a
given Product in them.
• This denormalization of the data allows for fast access to the data
we are interested in and is the basis for Real Time BI or Real Time
Analytics

Conceptual view into column data store

Graph model of e-commerce data
