Lecture 1 Introduction Chapter 1
Lecture 1 Introduction Chapter 1
www.uh.edu/webct
Learn
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke
Your UnderGrad TAs, Shannon Furr & Elizabeth Pham & Duy Ta.
Office Hours:
Shannon M&W 11:30 to 12:30
Elizabeth M&W 1:15 to 2:15
Duy T&Th 10 to 11
563 PGH (Computer LAB)
10
11
12
14
15
16
17
18
19
COSC 3380
Lecture 1
DataBase Management
Systems
(DBMS)
Chapter 1
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke
20
Key Terms
Data
Information
Data Model
Schema and Conceptual Schema and Physical
Schema
Transaction
Integrity constraints
Database Management System
Query
21
What Is a
Database?
University Systems: A university
22
What is a Relational
Database?
23
What is a Relational
Database?
24
What Is a DBMS?
A DataBase Management System (DBMS)
is a software package designed to store and manage
Databases.
25
What is a
DBMS?
26
27
28
Several Areas
29
30
Crash Recovery
31
Why use a
DBMS?
32
Why study
Databases??
33
Data
models
A Data model
Data.
34
Levels of
Abstraction
Many Views,
Schema
single Conceptual (Logical) Schema - Conceptual
WHAT
single Physical Schema - HOW.
Physical Schema
35
Example: University
Database
36
Data
Independence *
WHAT
HOW
Database Management
Systems 3ed, R. Ramakrishnan and J. Gehrke
Concurrency
Controlexecution of user programs is essential for
Concurrent
Transactio
n
Scheduling Concurrent
DBMS ensures that execution of {T1, ... , Tn} is
Transactions
equivalent to some serial execution T1, ... Tn.
40
Ensuring
Atomicity
DBMS ensures Atomicity (all-or-nothing
41
The
Log
Structure of a
DBMS
These layers
must consider
Concurrency Control and
Recovery
Query Optimization
and Execution
Relational Operators
Files and Access Methods
Buffer Management
Disk Space Management
DB
43
DB application programmers
44
45
Entity
Generalized category representing person, place, thing on which we store
and maintain information
e.g. SUPPLIER, PART
Attributes:
Specific characteristics of each Entity, e.g.:
SUPPLIER name, address
PART description, unit price, supplier
46
Cannot be duplicated
Provides unique identifier for all information in any Row
47
48
Data for the Entity PART have their own separate Table. Part_Number is the
primary key and Supplier_Number is the foreign key, enabling users to find
related information from the SUPPLIER Table about the Supplier for each Part.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke
49
50
A Simple Entity-Relationship
Diagram (ERD) WHAT
Entity-
This Diagram shows the Relationship between the Entities SUPPLIER and PART.
51
52
The shaded areas show which Data came from the SUPPLIER, LINE_ITEM, and
ORDER Tables. The Database does not maintain Data on Extended Price or
Order Total because they can be derived from other Data in the Tables.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke
53
The final design of the Database for Suppliers, Parts, and Orders has four Tables.
The LINE_ITEM Table is a join Table that eliminates the many-to-many Relationship
between ORDER and PART.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke
54
55
DBMS
Examples of DBMS:
56
57
Join:
Project:
58
The select, project, and join operations enable Data from two different Tables to
be combined and only selected attributes to be displayed.
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke
59
Automated or manual file storing definitions of Data elements and their characteristics
60
Illustrated here are the SQL statements for a query to select suppliers for parts 137 or 150.
62
Illustrated here is how the previous query would be constructed using Microsoft
Access query-building tools. It shows the Tables, fields, and selection criteria
used for the query.
63
64
65
Data Warehouses
Data Warehouse
Database that stores current and historical data that may be of interest to
decision makers
Consolidates and standardizes data from many systems, operational and
transactional Databases
Data can be accessed but not altered
66
The Data Warehouse extracts current and historical data from multiple
operational systems inside the organization. These Data are combined with Data
from external sources and reorganized into a central Database designed for
management Reporting and Analysis. The information directory provides Users
Database
Management
Systems
Ramakrishnan
and
J. Gehrke
67
with information
about
the 3ed,
DataR.available
in the
Warehouse.
Data Mining
e.g. Harrahs Entertainment gathers and analyzes customer data to
create gambling profiles and identify most profitable customers
68
Business Intelligence
A series of Analytical Tools works with Data stored in Databases to find patterns
and insights for helping managers and employees make better Decisions to
DatabaseOrganizational
Management Systems
3ed, R. Ramakrishnan and J. Gehrke
69
improve
Performance.
70
The view that is showing is product versus region. If you rotate the cube 90 degrees,
the face that will show is product versus actual and projected sales. If you rotate the
cube 90 degrees again, you will see region versus actual and projected sales.
Other
views
are possible.
Database
Management
Systems 3ed,
71
Data Mining
72
Data Mining
One popular use of Data Mining: Analyzing patterns in customer data for
one-to-one marketing campaigns or for identifying profitable customers
Predictive analysis:
73
74
Class Participation!
75
handle the sites massive data volume, and is running 9,000 instances of
memcached in order to keep up with the number of transactions the
database must serve.
The oft-quoted statistic from 2008 is that the site had 1,800 servers
dedicated to MySQL and 805 servers dedicated to memcached, although
multiple MySQL shards and memcached instances can run on a single
server.
Database Sharding can be simply defined as a "shared-nothing" partitioning scheme for large databases
across a number of servers, enabling new levels of database performance and scalability achievable. If you
think of broken glass, you can get the concept of sharding - breaking your database down into smaller
chunks called shards and spreading those across a number of distributed servers.
The term "sharding" was coined by Google engineers, and popularized through their publication of the Big
Table architecture. However, the concept of "shared-nothing" database partitioning has been around for a
decade or more and there have been many implementations over this period, especially high profile in-house
built solutions by Internet leaders such as eBay, Amazon, Digg, Flickr, Skype, YouTube, Facebook,
Friendster, and Wikipedia.
Class Participation!
http://gigaom.com/cloud/facebook-trapped-in-mysql-fate-worse-than-death/
Database Management Systems 3ed, R. Ramakrishnan and J. Gehrke
http://www.codefutures.com/database-sharding/
76
77
http://highscalability.com/myspace-architecture
http://www.communitymx.com/content/article.cfm?cid=A1A63
78
79
Summary
THE END
81