0% found this document useful (0 votes)

85 views35 pages

Big Nosql Data: Mike Carey

The document discusses the evolution of database management systems from pre-relational hierarchical and network models to modern NoSQL databases. It covers the relational era beginning in 1970 with Codd's paper introducing the relational model. Object-oriented databases emerged in the 1980s seeking to model rich object schemas but had tight programming language bindings. Object-relational databases of the 1990s added object features to relational systems for emerging applications while loosely coupling with programming languages. XML databases of the 2000s stored data in XML format. The document uses the AsterixDB system as an example of modern NoSQL databases for big data.

Uploaded by

Suraj Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

85 views35 pages

Big Nosql Data: Mike Carey

Uploaded by

Suraj Patil

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 35

Big NoSQL Data

Mike Carey
[email protected]

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 0

Announcements
• Keep watching the course wiki page (especially its
attachments):
• https://grape.ics.uci.edu/wiki/asterix/wiki/stats170ab-2018
• Ditto for the Piazza page (for Q&A):
• http://piazza.com/uci/winter2018/stats170a/home
• HW #4 is underway:
• Expect getting 10K Tweets to take ~1 hour
• Leave time for wrangling and rate-limiting “fun”
• Id data type options: bigint or varchar (varchar may be easier)
• Suggestion: get list of Tweets, then go straight to PostgreSQL
• Any initial questions? (Don’t start this one late!)

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 1

Plan for Today’s Lecture
• The pre-relational and relational eras
• Moving beyond rows and columns (?)
1. The object-oriented DB era
2. The object-relational DB era
3. The XML DB era
4. The NoSQL DB era
• AsterixDB as a Big NoSQL Data exemplar
• User model (DDL, DML, etc.)
• System architecture and internal highlights
• Conclusions and class participation (J)

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 2

The Birth of Today’s DBMS Field

• In the beginning was the Word, and the Word was

with Codd, and the Word was Codd...
• 1970 CACM paper: “A relational model of data for large
shared data banks”
• Many refer to this as the first generation of (real?)
database management systems
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 3
The First Decade B.C.

• The need for a data management library, or a database

management system, had actually been well recognized
• Hierarchical DB systems (e.g., IMS from IBM)
• Network DB systems (most notably CODASYL)
• These systems provided navigational APIs
• Systems provided files, records, pointers, indexes
• Programmers had to (carefully!) scan or search for records,
follow parent/child structures or pointers, and maintain code
when anything physical changed

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 4

The First Decade B.C. (cont.)

Order (id, custName, custCity, total) Product (sku, name, listPrice, size, power)

Item-Order Item-Product

Item (ino, qty, price)

(Parent child record sets)

Order
123 Fred LA 25.97 Product
401 Garfield T-Shirt 9.99 XL -
Item-Order
Item Item Product
1 2 9.99 2 1 3.99 544 USB Charger 5.99 - 115V
Item-Product
Item-Product 5
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018
Enter the Relational DB Era
Order (id, custName, custCity, total)
Product (sku, name, listPrice, size, power)
123 Fred LA 25.97
401 Garfield T-Shirt 9.99 XL null
Item (order-id, ino, product-sku, qty, price) 544 USB Charger 5.99 null 115V
123 1 401 2 9.99

123 2 544 1 3.99

• Be sure to notice that

• Everything’s now (logical) rows and columns
• The world is flat; columns are atomic (1NF)
• Data is now connected via keys (foreign/primary)
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 6
As the Relational Era Unfolded
• The Spartan simplicity of the relational data model
made it possible to start tackling the opportunities
and challenges of a logical data model
• Declarative queries (Rel Alg/Calc, Quel, QBE, SQL, ...)
• Transparent indexing (physical data independence)
• Query optimization and execution
• Views, constraints, referential integrity, security, ...
• Scalable (shared-nothing) parallel processing
• Today’s multi-$B industry was slowly born
• Commercial adoption took ~10-15 years
• Parallel DB systems took ~5 more years
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 7
Enter the Object-Oriented DB Era
Order
123 Fred LA 25.97 { } Product
401 Garfield T-Shirt 9.99 XL -

Item Product
1 2 9.99 Item 544 USB Charger 5.99 - 115V
2 1 3.99

• Notice that:
• Data model contains objects and pointers (OIDs)
• The world is no longer flat – the Order and Product
schemas now have set(Item) and Product in them,
respectively
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 8
What OODBs Sought to Offer

• Motivated largely by late 1980’s CAx applications (e.g.,

mechanical CAD, VLSI CAD, software CAD, ...)
• Rich schemas with inheritance, complex objects, object
identity, references, ...
• Methods (“behavior”) as well as data in the DBMS
• Tight bindings with (OO) programming languages
• Fast navigation, some declarative querying
• Ex: Gemstone, Ontos, Objectivity, Versant, Object
Design, O2, also DASDBS (sort of)

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 9

Why OODBs “Fell Flat”
• Too soon for another (radical) DB technology
• Also technically immature relative to RDBMSs
• Tight PL bindings were a double-edged sword
• Data shared, outlives programming languages
• Bindings led to significant system heterogeneity
• Also made schema evolution a major challenge
• Systems “overfitted” in some dimensions
• Inheritance, version management, ...
• Focused on thick clients (e.g., CAD workstations)

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 10

Enter the Object-Relational DB Era
Product (sku, name, listPrice)
Order (id, customer, total) ClothingProduct (size) under Product
123 Fred LA 25.97 ElectricProduct (power) under Product

Item (order-id, ino, product-sku, qty, price) 401 Garfield T-Shirt 9.99 XL

(123) 1 (401) 2 9.99

544 USB Charger 5.99 115V
(123) 2 (544) 1 3.99

• Be sure to notice:
• “One size fits all!”
• UDTs/UDFs, table hierarchies, references, ...
• But the world got flatter again...
(Timing lagged OODBs by just a few years)

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 11

What O-R DBs Sought to Offer
• Motivated by newly emerging application
opportunities (multimedia, spatial, text, ...)
• User-defined functions (UDTs/UDFs) & aggregates
• Data blades (UDTs/UDFs + indexing support)
• OO goodies for tables: row types, references, ...
• Nested tables (well, at least Oracle added these)
• Back to a model where applications were loosely
bound to the DBMS (e.g., ODBC/JDBC)
• Ex: ADT-Ingres, Postgres, Starburst, UniSQL, Illustra,
DB2, Oracle

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 12

Why O-R DBs “Fell Flat”
• Significant differences across DB vendors
• SQL standardization lagged somewhat
• Didn’t include details of UDT/UDF extensions
• Tough to extend the innards (for indexing)
• Application issues (and multiple platforms)
• Least common denominator vs. coolest features
• Tools (e.g., DB design tools, ORM layers, ...)
• Also still probably a bit too much too soon
• IT departments still rolling in RDBMSs and creating
relational data warehouses

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 13

Then Came the XML DB Era
<Order id=”123”> <Product sku=”401”>
<Customer> <name>Garfield T-Shirt</name>
<custName>Fred</custName> <listPrice>9.99</listPrice>
<custCity>LA</custCity> <size>XL</size>
</Customer> </Product>
<total>25.97</total>
<Items> <Product sku=”544”>
<Item ino=”1”> <name>USB Charger</name>
<product-sku>401</product-sku> <listPrice>5.99</listPrice>
<qty>2</qty> <power>115V</power>
<price>9.99</price> </Product>
</Item>
<Item ino=“2”>
< product-sku>544</product-sku>
<qty>1</qty> Note that
<price>3.99</price> - The world’s less flat again
</Item ino=”2”> - We’re now in the 2000’s
</Items>
</Order> Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 14
What XML DBs Sought to Offer
• One <flexible/> data model fits all (XML)
• Origins in document markup (SGML)
• Nested data
• Schema variety/optionality
• New declarative query language (XQuery)
• Designed both for querying and transformation
• Early standardization effort (W3C)
• Two different DB-related use cases, in reality
• Data storage: Lore (pre-XML), Natix, Timber, Ipedo,
MarkLogic, BaseX; also DB2, Oracle, SQL Server
• Data integration: Nimble Technology, BEA Liquid Data (from
Enosys), BEA AquaLogic Data Services Platform
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 15
Why XML DBs “Fell Flat” Too
• Document-centric origins (vs. data use cases) of XML
Schema and XQuery made a mess of things
• W3C XPATH legacy (K)
• Document identity, document order, ...
• Attributes vs. elements, nulls, ...
• Mixed content (overkill for non-document data)
• Two other external trends also played a role
• SOA and Web services came but then went
• JSON (and RESTful services) appeared on the scene
• Note: Likely still an important niche market...
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 16
Now the “NoSQL” DB Era

• Not from the DB world!

• Distributed systems folks
• Also various startups
• From caches à K/V use cases
• Needed massive scale-out
• OLTP (vs. parallel DB) apps
• Simple, low-latency API
• Need a key K, but want no schema for V
• Record-level atomicity, replica consistency varies

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 17

NoSQL Data (JSON-based)
Collection(Order) Collection(Product)

{“id”: “123”, {“sku”: 401,

“Customer”: “name”: “Garfield T-Shirt”,
{ “custName”: “Fred”, “listPrice”: 9.99,
“custCity”: “LA” } “size”: “XL” }
“total”: 25.97,
“Items”: [ {“sku”: 544,
{“product-sku”: 401, “name”: “USB Charger”,
“qty”: 2, “listPrice”: 5.99,
“price”: 9.99 }, “power”: “115V” }
{“product-sku”: 544,
“qty”: 1,
“price”: 3.99 } Note that
] - The world’s not flat, but it’s less <messy/>
} - We’re now in the 2010’s, timing-wise

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 18

Current NoSQL Trends
• Popular examples: MongoDB, Couchbase
• Coveting the benefits of many DB goodies
• Secondary indexing and non-key access
• Declarative queries
• Aggregates and now (initially small) joins
• Seem to be heading towards...
• BDMS (think scalable, OLTP-aimed, parallel DBMS)
• Declarative queries and query optimization, but applied to
schema-less data
• Return of (some, optional!) schema information

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 19

Our Answer: Apache AsterixDB

http://asterixdb.apache.org/
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 20
Just How Big is “Big Data”?

Cores

Main
Memory This
is big
Disks data!
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 21
AsterixDB: “One Size Fits a Bunch”
Semistructured BDMS Desiderata:
Data Management
• Able to manage data
• Flexible data model
• Full query capability
• Continuous data
ingestion
• Efficient and robust
parallel runtime
• Cost proportional to task
at hand
• Support “Big Data data
types”
Parallel 1st Generation •
•
Database Systems “Big Data” Systems •

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 22

ASTERIX Data Model (ADM)
CREATE DATAVERSE TinySocial; CREATE TYPE EmploymentType AS {
USE TinySocial; organizationName: string,
startDate: date,
CREATE TYPE GleambookUserType AS { endDate: date?
id: int, };
alias: string, CREATE DATASET GleambookUsers
name: string, (GleambookUserType)
userSince: datetime, PRIMARY KEY id;
friendIds: {{ int }},
employment: [EmploymentType]
};

Highlights include:
• JSON++ based data model
• Rich type support (spatial, temporal, …)
• Records, lists, bags
• Open vs. closed types
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 24
ASTERIX Data Model (ADM)
CREATE DATAVERSE TinySocial; CREATE TYPE EmploymentType AS {
USE TinySocial; organizationName: string,
startDate: date,
CREATE TYPE GleambookUserType AS { endDate: date?
id: int };
}; CREATE DATASET GleambookUsers
(GleambookUserType)
CREATE TYPE GleambookMessageType AS { PRIMARY KEY id;
messageId: int,
authorId: int, CREATE DATASET GleambookMessages
inResponseTo: int?, (GleambookMessageType)
senderLocation: point?, PRIMARY KEY messageId;
message: string
}; Highlights include:
• JSON++ based data model
• Rich type support (spatial, temporal, …)
• Records, lists, bags
• Open vs. closed types
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 25
Ex: GleambookUsers Data
{"id”:1, "alias":"Margarita", "name":"MargaritaStoddard", "nickname":"Mags”,
"userSince":datetime("2012-08-20T10:10:00"), "friendIds":{{2,3,6,10}},
"employment": [ {"organizationName":"Codetechno”, "startDate":date("2006-08-06")},
{"organizationName":"geomedia" , "startDate":date("2010-06-17"),
"endDate":date("2010-01-26")} ],
"gender":"F”
},

{"id":2, "alias":"Isbel”, "name":"IsbelDull", "nickname":"Izzy",

"userSince":datetime("2011-01-22T10:10:00"), "friendIds":{{1,4}},
"employment": [ {"organizationName":"Hexviafind", "startDate":date("2010-04-27")} ]
},

{"id":3, "alias":"Emory", "name":"EmoryUnk”,

"userSince":datetime("2012-07-10T10:10:00"), "friendIds":{{1,5,8,9}},
"employment": [ {"organizationName":"geomedia”, "startDate":date("2010-06-17"),
"endDate":date("2010-01-26")} ]
},
.....

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 26

Other DDL Features
CREATE INDEX gbUserSinceIdx ON GleambookUsers(userSince);
CREATE INDEX gbAuthorIdx ON GleambookMessages(authorId) TYPE BTREE;
CREATE INDEX gbSenderLocIndex ON GleambookMessages(senderLocation) TYPE RTREE;
CREATE INDEX gbMessageIdx ONGleambookMessages(message) TYPE KEYWORD;
//--------------------- and also ------------------------------------------------------------------------------------
CREATE TYPE AccessLogType AS CLOSED
{ ip: string, time: string, user: string, verb: string, `path`: string, stat: int32, size: int32 };
CREATE EXTERNAL DATASET AccessLog(AccessLogType) USING localfs
(("path"="localhost:///Users/mikejcarey/extdemo/accesses.txt"),
("format"="delimited-text"), ("delimiter"="|"));
CREATE FEED myMsgFeed USING socket_adapter
(("sockets"="127.0.0.1:10001"), ("address-type"="IP"),
("type-name"="GleambookMessageType"), ("format"="adm"));
CONNECT FEED myMsgFeed TO DATASET GleambookMessages;
START FEED myMsgFeed;

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 27

ASTERIX Queries (SQL++ or AQL)
• Q1: List the user names and messages sent by
Gleambook social network users with less than 3
friends:
SELECT user.name AS uname,
(SELECT VALUE msg.message
FROM GleambookMessages msg
WHERE msg.authorId = user.id) AS messages
FROM GleambookUsers user
WHERE COLL_COUNT(user.friendIds) < 3;
{ "uname": "NilaMilliron", "messages": [ ] }
{ "uname": "WoodrowNehling", "messages": [ " love acast its 3G is good:)" ] }
{ "uname": "IsbelDull", "messages": [ " like product-y the plan is amazing", " like
product-z its platform is mind-blowing" ] }
...
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 28
SQL++ (cont.)
• Q2: Identify active users (last 30 days) and group and count them
by their numbers of friends:
WITH endTime AS current_datetime(),
startTime AS endTime - duration("P30D")
SELECT nf AS numFriends, COUNT(user) AS activeUsers
FROM GleambookUsers user
LET nf = COLL_COUNT(user.friendIds)
WHERE SOME logrec IN AccessLog SATISFIES
{ "numFriends": 2, "activeUsers": 1 }
user.alias = logrec.user { "numFriends": 4, "activeUsers": 2 }
...
AND datetime(logrec.time) >= startTime
AND datetime(logrec.time) <= endTime SQL++ highlights:
GROUP BY nf; • Many features (see docs)
• Spatial & text predicates
• Set-similarity matching
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 29
Updates and Transactions
• Q3: Add a new user to Gleambook.com:
UPSERT INTO GleambookUsers ( • Key-value store-like
{"id":667,"alias":”dfrump", transactions
(w/record-level
"name":"DonaldFrump", atomicity)
"nickname":"Frumpkin", • Insert, delete, and
"userSince":datetime("2017-01-01T00:00:00"), upsert ops; index-
consistent
"friendIds":{{ }},
• 2PL concurrency
"employment":[{"organizationName":"USA",
• WAL no-steal, no-force
"startDate":date("2017-01-20")}], with LSM shadowing
"gender":"M"}
);

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 30

Example AsterixDB Use Cases
• Potential use case areas include
• Behavioral science
• Cell phone event analytics
• Social data analytics
• Public health
• Cluster management log analytics
• Power usage monitoring
• IoT data storage and querying
• ....

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 31

Project Status
• 4 year initial NSF project (250+ KLOC), started 2009
• Now officially Apache AsterixDB…
• Semistructured “NoSQL” style data model
• Declarative queries, inserts, deletes, upserts (SQL++)
• Scalable parallel query execution
• Data storage/indexing (primary & secondary, LSM-based)
• Internal and external datasets both supported
• Rich set of data types (including text, time, location)
• Fuzzy and spatial query processing
• NoSQL-like transactions (for inserts/deletes)
• Data feeds and indexes for external datasets
• ....
Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 32
Commercial Use: Big Data Analytics
Couchbase Data Platform
ü Service-Centric Clustered Data System
ü Multi-process Architecture
ü Dynamic Distribution of Facilities
ü Cluster Map Distribution
ü Automatic Failover
ü Enterprise Monitoring/Management
ü Security
ü Offline Mobile Data Integration
ü Streaming REST API
ü SQL-like Query Engine for JSON
ü Clustered* Global Indexes
ü Lowest Latency Key-Value API
ü Active-Active Inter-DC Replication
ü Local Aggregate Indexes
ü Full-Text Search*

ü Operational Analytics (currently DP)

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 33
For More Information

• Asterix project UCI/UCR research home

• http://asterix.ics.uci.edu/
• Apache AsterixDB home
• http://asterixdb.apache.org/
• SQL++ Primer
• http://asterixdb.apache.org/docs/0.9.2/index.html

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 34

THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE: "THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE"
From Everand
THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE: "THE STEP BY STEP GUIDE FOR SUCCESSFUL IMPLEMENTATION OF DATA LAKE-LAKEHOUSE-DATA WAREHOUSE"
AJIT DASH
2/5 (2)
An Introduction To Databases With Web Applications - Martyn Prigmore
No ratings yet
An Introduction To Databases With Web Applications - Martyn Prigmore
713 pages
Database Evolution
No ratings yet
Database Evolution
8 pages
PPT_203105251-6
No ratings yet
PPT_203105251-6
54 pages
BDII
No ratings yet
BDII
10 pages
History of Databases
No ratings yet
History of Databases
5 pages
Block Introduction: Unit 1 Introduction To Object Oriented Database Management System Relational Model
No ratings yet
Block Introduction: Unit 1 Introduction To Object Oriented Database Management System Relational Model
50 pages
Database Assignment
No ratings yet
Database Assignment
5 pages
Crossword Puzzle - Question Words
No ratings yet
Crossword Puzzle - Question Words
10 pages
Advanced Database
No ratings yet
Advanced Database
29 pages
Chapter1-Overview of Database Concepts
No ratings yet
Chapter1-Overview of Database Concepts
19 pages
A New Technology Has in Which Relational and Object-Oriented Concepts Have Been Combined or Merged. These Systems Are Called
No ratings yet
A New Technology Has in Which Relational and Object-Oriented Concepts Have Been Combined or Merged. These Systems Are Called
18 pages
AI&ML-Unit1_DBMS
No ratings yet
AI&ML-Unit1_DBMS
18 pages
4_2 Oodbms and Ddbms
No ratings yet
4_2 Oodbms and Ddbms
48 pages
DBMS - Intro 2023 - v2
No ratings yet
DBMS - Intro 2023 - v2
66 pages
ADMS 2018 - Chapter Three
No ratings yet
ADMS 2018 - Chapter Three
34 pages
KSK ADS Unit 2
No ratings yet
KSK ADS Unit 2
121 pages
Unit I: Introduction
No ratings yet
Unit I: Introduction
41 pages
Chapter 1 - Concept of Object Oriented Database
No ratings yet
Chapter 1 - Concept of Object Oriented Database
23 pages
Woodger Computing Inc. - Architecture: Object-Oriented Databases
No ratings yet
Woodger Computing Inc. - Architecture: Object-Oriented Databases
5 pages
SQL Material
No ratings yet
SQL Material
47 pages
Unit 1
No ratings yet
Unit 1
45 pages
Advanced Database - Allchapters
No ratings yet
Advanced Database - Allchapters
306 pages
7th DBMS
No ratings yet
7th DBMS
65 pages
1 - The Databases Revolutions
No ratings yet
1 - The Databases Revolutions
46 pages
Ordb
No ratings yet
Ordb
46 pages
Ordb
No ratings yet
Ordb
46 pages
ADVDB or DD Parallel Aau
No ratings yet
ADVDB or DD Parallel Aau
261 pages
CSE202 Database Management Systems: Lecture #6
No ratings yet
CSE202 Database Management Systems: Lecture #6
82 pages
Lecture-01-Introduction
No ratings yet
Lecture-01-Introduction
51 pages
Slide 3
No ratings yet
Slide 3
35 pages
The Context of Database Management
No ratings yet
The Context of Database Management
23 pages
CS4416 Lectures 1-7
No ratings yet
CS4416 Lectures 1-7
148 pages
Database
No ratings yet
Database
20 pages
Databases I - 01 Intro and Conceptual Modeling
No ratings yet
Databases I - 01 Intro and Conceptual Modeling
45 pages
Object Relational
No ratings yet
Object Relational
27 pages
RDBMS Concepts
100% (1)
RDBMS Concepts
73 pages
ADVDB-OR-DD-Parallel-aau
No ratings yet
ADVDB-OR-DD-Parallel-aau
258 pages
SC4x W3L1 TopicsInDatabases v2
No ratings yet
SC4x W3L1 TopicsInDatabases v2
37 pages
Object Oriented Databases
100% (1)
Object Oriented Databases
26 pages
Session 33_Data Independence
No ratings yet
Session 33_Data Independence
19 pages
A Brief History of Database Systems
100% (1)
A Brief History of Database Systems
4 pages
Dbms
No ratings yet
Dbms
34 pages
Course01 - Introduction in Databases
No ratings yet
Course01 - Introduction in Databases
31 pages
Adv Dbms Lecture Note - Chapter 3
No ratings yet
Adv Dbms Lecture Note - Chapter 3
3 pages
Class 6
No ratings yet
Class 6
29 pages
DBMS Unit 1
No ratings yet
DBMS Unit 1
31 pages
U1 - History of Database Systems
No ratings yet
U1 - History of Database Systems
14 pages
Iii. Current Trends: Object-Relational Dbmss
No ratings yet
Iii. Current Trends: Object-Relational Dbmss
24 pages
Database Systems The Complete Book PDF
No ratings yet
Database Systems The Complete Book PDF
20 pages
Database Design & Administration: ISOM3260
No ratings yet
Database Design & Administration: ISOM3260
35 pages
Data Models: Preface XV
No ratings yet
Data Models: Preface XV
8 pages
The Revolution in Database Architecture: Jim Gray Microsoft Research
No ratings yet
The Revolution in Database Architecture: Jim Gray Microsoft Research
5 pages
Database Trends - Past, Present, Future
No ratings yet
Database Trends - Past, Present, Future
59 pages
ADVDB-OR-DD-Parallel-MM-GIS - HiLCoE
No ratings yet
ADVDB-OR-DD-Parallel-MM-GIS - HiLCoE
337 pages
CH - 1 Relational Database Design Updated
No ratings yet
CH - 1 Relational Database Design Updated
80 pages
IT3306 - Data Management Systems - AllInOne
No ratings yet
IT3306 - Data Management Systems - AllInOne
670 pages
DBMS Case Study
No ratings yet
DBMS Case Study
14 pages
102-COPIES-ADV-LESSON-1
No ratings yet
102-COPIES-ADV-LESSON-1
5 pages
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
From Everand
Data Engineering with Scala and Spark: Build streaming and batch pipelines that process massive amounts of data using Scala
Eric Tome
No ratings yet
Dblog: A Watermark Based Change-Data-Capture Framework: Andreas Andreakis Ioannis Papapanagiotou
No ratings yet
Dblog: A Watermark Based Change-Data-Capture Framework: Andreas Andreakis Ioannis Papapanagiotou
6 pages
Machine Learning and Data Mining: Prof. Alexander Ihler
No ratings yet
Machine Learning and Data Mining: Prof. Alexander Ihler
51 pages
Fairness Frameworks Packages: An Overview of Some Useful &
No ratings yet
Fairness Frameworks Packages: An Overview of Some Useful &
9 pages
AI Landscape
No ratings yet
AI Landscape
111 pages
Container Workloads
No ratings yet
Container Workloads
11 pages
Feature Engineering and Selection: CS 294: Practical Machine Learning October 1, 2009 Alexandre Bouchard-Côté
No ratings yet
Feature Engineering and Selection: CS 294: Practical Machine Learning October 1, 2009 Alexandre Bouchard-Côté
94 pages
2019 State of Devops Report Puppet Circleci Splunk - SML 1 1 PDF
No ratings yet
2019 State of Devops Report Puppet Circleci Splunk - SML 1 1 PDF
86 pages
Spark 2.12 Issues
No ratings yet
Spark 2.12 Issues
10 pages
The American Healthcare System Explained - Suraj Patil - Medium
No ratings yet
The American Healthcare System Explained - Suraj Patil - Medium
12 pages
Feature Engineering
100% (2)
Feature Engineering
76 pages
Coe PDF
No ratings yet
Coe PDF
16 pages
Waves
No ratings yet
Waves
52 pages
C - Manuals - Access - Access 2013 Part 2 PDF
No ratings yet
C - Manuals - Access - Access 2013 Part 2 PDF
142 pages
Lecture 1 Addition What Is Data?
No ratings yet
Lecture 1 Addition What Is Data?
22 pages
Domain - Relational Calculus
No ratings yet
Domain - Relational Calculus
4 pages
Database Systems Handbook
No ratings yet
Database Systems Handbook
293 pages
Assignment 3
No ratings yet
Assignment 3
2 pages
Unit 3 Social Computing
No ratings yet
Unit 3 Social Computing
19 pages
1904504-GIS - Notes Unit 3
No ratings yet
1904504-GIS - Notes Unit 3
45 pages
Acknowledgment: Car Rental Dbms
No ratings yet
Acknowledgment: Car Rental Dbms
19 pages
CSCE 4523 Introduction To Database Management Systems
No ratings yet
CSCE 4523 Introduction To Database Management Systems
9 pages
DBMS
No ratings yet
DBMS
1 page
Database Concepts Final
No ratings yet
Database Concepts Final
18 pages
2024 Model Answer Paper
No ratings yet
2024 Model Answer Paper
20 pages
PPT Lecture 2.1 and 2.2 DataModels(1)
No ratings yet
PPT Lecture 2.1 and 2.2 DataModels(1)
105 pages
Chapter06 Relational Algebra
No ratings yet
Chapter06 Relational Algebra
61 pages
COMP5320 2025 Wk2 L1 RelationalAlgebra - Tagged
No ratings yet
COMP5320 2025 Wk2 L1 RelationalAlgebra - Tagged
71 pages
Part 01 Introduction
No ratings yet
Part 01 Introduction
19 pages
Subject Code & Name: 18UITC51-RDBMS: Class: III B.SC (IT)
No ratings yet
Subject Code & Name: 18UITC51-RDBMS: Class: III B.SC (IT)
95 pages
Peter Norton's Introduction To Computers: Productivity Software
No ratings yet
Peter Norton's Introduction To Computers: Productivity Software
24 pages
Ch10-Functional Dependencies and Normalization For Relational Databases
No ratings yet
Ch10-Functional Dependencies and Normalization For Relational Databases
31 pages
Keywords: INSC 20263 (BIS) Terry Gray 1 October 4, 2020
No ratings yet
Keywords: INSC 20263 (BIS) Terry Gray 1 October 4, 2020
19 pages
DBMS (EE604A) Syllabus R18
No ratings yet
DBMS (EE604A) Syllabus R18
2 pages
06 Additional Relational Operation - Relational Algebra and Calculus
No ratings yet
06 Additional Relational Operation - Relational Algebra and Calculus
34 pages
Assignment Distributed Database System
No ratings yet
Assignment Distributed Database System
6 pages
Concepts of Database Management, 4 Edition, Pratt & Adamski
No ratings yet
Concepts of Database Management, 4 Edition, Pratt & Adamski
34 pages
Ch-5 (DBMS) Relational Data Model & RDB Constraints
No ratings yet
Ch-5 (DBMS) Relational Data Model & RDB Constraints
13 pages
Database Management Concepts
No ratings yet
Database Management Concepts
16 pages
DB Lecture Note All in ONE
No ratings yet
DB Lecture Note All in ONE
229 pages
Syllabus-MCA-2022
No ratings yet
Syllabus-MCA-2022
101 pages
Topic 1 Msbte Questions and Answers
No ratings yet
Topic 1 Msbte Questions and Answers
14 pages
Dbms Unit
No ratings yet
Dbms Unit
42 pages

Big Nosql Data: Mike Carey

Uploaded by

Big Nosql Data: Mike Carey

Uploaded by

Big NoSQL Data

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 0

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 1

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 2

• In the beginning was the Word, and the Word was

• The need for a data management library, or a database

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 4

Item (ino, qty, price)

123 2 544 1 3.99

• Be sure to notice that

• Motivated largely by late 1980’s CAx applications (e.g.,

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 9

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 10

(123) 1 (401) 2 9.99

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 11

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 12

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 13

• Not from the DB world!

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 17

{“id”: “123”, {“sku”: 401,

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 18

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 19

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 22

{"id":2, "alias":"Isbel”, "name":"IsbelDull", "nickname":"Izzy",

{"id":3, "alias":"Emory", "name":"EmoryUnk”,

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 26

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 27

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 30

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 31

ü Operational Analytics (currently DP)

• Asterix project UCI/UCR research home

Michael Carey/Padhraic Smyth, UC Irvine: Stats 170A/B, Winter 2018 34

You might also like