Chapter 1 - Databases for an Information-Rich and Connected World
We rarely think about the database until there is an outage or security breach.
The Ubiquitous Information Provider
§ Searching to buy a specific item (e.g. an appliance, a service, clothing, a book...)
§ Researching more in-depth before purchasing a major item such as a car
§ Planning a vacation or trip
§ Learning about a particular hobby or sport that you enjoy such as skiing, boating, or coin collecting
§ Researching a particular company before investing in it
§ Researching a particular medical condition or medical problem
§ Solving a technical problem or learning a new technical skill such as administering a Linux server
§ Writing a research paper or researching for a school project
§ Wikipedia
Interesting Public Databases
Ø Annotated Human Genome Data
Ø Federal Reserve Economic Data
Ø US Labor, Economic, and Census Data
Ø Freebase Dataset
Ø Historical Weather Datasets
Ø Twitter Archive at the Library of Congress
In the universe…
The properties of a thing have values
Color: Red
Style: Windsor
Material: Maple wood
Paint type: Oil-based
Weight: 2.6 kg
Seat height: 17.5 inches
Overall height: 34.3 inches
Maximum front-leg diameter: 1 inch
Maximum splat spindle diameter: 0.675 inches
…
Data
Values of the properties of a thing are data
Color: Red
Style: Windsor
Material: Maple wood
Paint type: Oil-based
Weight: 2.6 kg
Seat height: 17.5 inches
Overall height: 34.3 inches
Maximum front-leg diameter: 1 inch
Maximum splat spindle diameter: 0.675 inches
…
In the universe, things have many properties
PERSON
§ IQ (Intelligence Quotient)
§ Height
§ Phone Number
§ Strength
§ Food Preference
§ Address
§ Education
§ Girth
§ Skills
§ …
Attributes
Properties we track in our system
§ IQ
§ Name
§ Phone Number
§ Address
§ Education
§ Skills
§ Sex
The properties we don’t track are still just properties
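The distinction between properties and attributes can be sketched in a few lines of Python. This is only an illustration: the function name `to_record`, the snake_case attribute names, and the sample values for strength and food preference are assumptions made for the example, not part of the slides.

```python
# Attributes: the subset of a person's properties that our system tracks.
# The names mirror the slide list, rewritten in snake_case (an assumption).
TRACKED_ATTRIBUTES = (
    "iq", "name", "phone_number", "address", "education", "skills", "sex",
)


def to_record(properties):
    """Keep only the tracked attributes; untracked properties are ignored.

    A tracked attribute with no known value is stored as None.
    """
    return {attr: properties.get(attr) for attr in TRACKED_ATTRIBUTES}


if __name__ == "__main__":
    # Illustrative property values; strength and food_preference are made up.
    person = {
        "name": "Robin Noble",
        "iq": 180,
        "height": "5’ 7”",
        "strength": "high",
        "food_preference": "vegetarian",
    }
    print(to_record(person))
```

Height, strength, and food preference remain properties of the person in the real world; the record simply does not carry them.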
In a database…
Attributes of Objects (Entities) have values
Those values are DATA
Name: Robin Noble (DATA)
IQ: 180 (DATA)
Height: 5’ 7” (DATA)
SQL (pronounced "ess-que-el") stands for Structured Query Language. SQL is used to communicate with a database.
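As a minimal sketch of what "communicating with a database" looks like, the example below uses Python's built-in `sqlite3` module to send SQL statements to an in-memory database. The table name `person`, its column types, and the helper function names are assumptions chosen for illustration; the values are the ones from the Robin Noble example.

```python
import sqlite3


def build_person_db():
    """Create an in-memory database with one PERSON entity (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    # Each column is an attribute we chose to track.
    conn.execute("CREATE TABLE person (name TEXT, iq INTEGER, height TEXT)")
    # The values stored for those attributes are the data.
    conn.execute(
        "INSERT INTO person (name, iq, height) VALUES (?, ?, ?)",
        ("Robin Noble", 180, "5’ 7”"),
    )
    conn.commit()
    return conn


def find_person(conn, name):
    """Ask the database a question in SQL; return the matching row or None."""
    cur = conn.execute(
        "SELECT name, iq, height FROM person WHERE name = ?", (name,)
    )
    return cur.fetchone()


if __name__ == "__main__":
    conn = build_person_db()
    print(find_person(conn, "Robin Noble"))
```

The `?` placeholders pass values separately from the SQL text, which is the usual way to avoid building queries by string concatenation.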
Database Architectures: Mainframe
Mainly used by big organizations to quickly, continually, and securely process and store huge amounts of data.
Advantages
• Large number of concurrent users
• Able to handle very large volume of transactions
• Suitable for slow network environments
• Central location of both application and DBMS makes maintenance relatively simple
Disadvantages
• Does not take advantage of local computer processing power
• Hardware is relatively expensive
• Character-based interface
Database Architectures: Desktop
Advantages
• Inexpensive to acquire
• Easy to configure
• Requires few resources to operate
Disadvantages
• Only available to a single user at a time
• Only available at a single computer
• May impose limits on how much data can be stored
Database Architectures: File Server
Advantages
• Inexpensive to acquire
• Easy to configure
• Low-cost option for multi-user concurrency
Disadvantages
• Only viable for a limited number of users, perhaps 10 or fewer
• Query performance declines quickly as the amount of data increases
• Multiple installations of both application software and DBMS can lead to version control issues for both
File servers are especially popular as a central storage place for internal company files, whether those files are relevant to individual users or shared by multiple users.
Database Architectures: Client/Server
Advantages
• Makes good use of processing power of multiple computers
• Handles large amounts of both data and transactions
• Can support thousands of concurrent users
Disadvantages
• Application software is distributed, possibly leading to version control issues
• DBMS software is relatively expensive compared to the File Server approach
The client/server architecture is especially effective when clients and the server each have distinct tasks that they routinely perform.
Database Architectures: Web
Advantages
• All advantages of the client/server architecture
• Increased ease of application version control
• Very good at handling intermittent use from large numbers of users (millions)
Disadvantages
• Adds another layer of complexity (web server)
• Various software apps must keep synchronized with database software
• Upgrades in browsers may require changes to application
It is the most current architecture and the most widely used in today’s connected, online world.
Databases and Business Systems
Transaction Processing System (TPS)
Purpose and Examples: Support the daily activities
Database Requirements:
• Capture transactions
• Present detailed information to users
Management Information System (MIS)
Purpose and Examples: Manage and verify the efficiency of daily activities
Database Requirements:
• Data for MIS comes from TPS
• Summarized into a set of management reports
• Record historical data to forecast future trends
Decision Support System (DSS)
Purpose and Examples: Make strategic decisions
Database Requirements:
• Contains both internal and external data
• Modeling and analysis tools
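The relationship between a TPS and an MIS can be sketched with a single table: the TPS captures each transaction and can present it in detail, while the MIS summarizes the same rows into a report. The `sale` table, its columns, and the function names below are hypothetical, and amounts are stored as integer cents to keep the arithmetic exact.

```python
import sqlite3


def open_tps():
    """Create an in-memory transaction store (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute(
        "CREATE TABLE sale ("
        " id INTEGER PRIMARY KEY,"
        " sale_date TEXT NOT NULL,"
        " product TEXT NOT NULL,"
        " amount_cents INTEGER NOT NULL)"
    )
    return conn


def record_sale(conn, sale_date, product, amount_cents):
    """TPS role: capture one transaction as it happens."""
    with conn:  # commits on success, rolls back if the insert fails
        conn.execute(
            "INSERT INTO sale (sale_date, product, amount_cents) VALUES (?, ?, ?)",
            (sale_date, product, amount_cents),
        )


def sales_detail(conn, sale_date):
    """TPS role: present detailed information for one day."""
    return conn.execute(
        "SELECT product, amount_cents FROM sale WHERE sale_date = ? ORDER BY id",
        (sale_date,),
    ).fetchall()


def daily_report(conn):
    """MIS role: summarize TPS data into (date, number of sales, total cents)."""
    return conn.execute(
        "SELECT sale_date, COUNT(*), SUM(amount_cents)"
        " FROM sale GROUP BY sale_date ORDER BY sale_date"
    ).fetchall()
```

A DSS would go one step further by combining such summaries with external data and analysis tools, which is outside the scope of this sketch.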
Databases and Business Systems (continued)
OLTP Database
• Purpose of DB: Day-to-day business processes
• Source of data: Daily transactions
• Organization of DB: Structured and normalized
• Queries (retrieve data): Standard SQL
• Inserts/updates: Frequent, limited data, must be rapid and real-time
• Space requirements: Intentionally limited for rapid response
• Time frame: Usually weeks or months
OLAP Database
• Purpose of DB: Strategic decision making
• Source of data: Internal and external data
• Organization of DB: Not normalized; star or snowflake
• Queries (retrieve data): Complex queries
• Inserts/updates: Very few, extensive and often transformed data, batches
• Space requirements: Can be very large
• Time frame: Long time series, often years
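To make the OLAP side of the comparison more concrete, here is a small sketch of a star layout: one fact table surrounded by dimension tables, loaded in a batch and queried with joins and aggregation. All table names, column names, and function names are assumptions for illustration; a real data warehouse would be far larger and would usually run on a dedicated DBMS rather than SQLite.

```python
import sqlite3


def build_star_schema():
    """Create a tiny star layout: one fact table and two dimension tables."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(
        """
        CREATE TABLE dim_date (
            date_id INTEGER PRIMARY KEY, year INTEGER, quarter INTEGER);
        CREATE TABLE dim_product (
            product_id INTEGER PRIMARY KEY, category TEXT);
        CREATE TABLE fact_sales (
            date_id INTEGER REFERENCES dim_date(date_id),
            product_id INTEGER REFERENCES dim_product(product_id),
            units INTEGER);
        """
    )
    return conn


def load_batch(conn, dates, products, facts):
    """OLAP-style load: a few large batches rather than many small updates."""
    with conn:  # one transaction for the whole batch
        conn.executemany("INSERT INTO dim_date VALUES (?, ?, ?)", dates)
        conn.executemany("INSERT INTO dim_product VALUES (?, ?)", products)
        conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)", facts)


def units_by_year_and_category(conn):
    """Analytical query: join the fact table to its dimensions and aggregate."""
    return conn.execute(
        """
        SELECT d.year, p.category, SUM(f.units)
        FROM fact_sales AS f
        JOIN dim_date AS d ON d.date_id = f.date_id
        JOIN dim_product AS p ON p.product_id = f.product_id
        GROUP BY d.year, p.category
        ORDER BY d.year, p.category
        """
    ).fetchall()
```

An OLTP query, by contrast, would typically touch one or a few rows at a time, such as inserting a single sale or looking up one customer.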
Technology Careers and Database Skills
Technology Careers Requiring Database Skills
§ Software engineer and developer
§ Business/computer systems analyst
§ Web developer
§ Computer programmer
§ Quality assurance engineer
§ Server administrator
§ Database administrator
§ Data warehouse engineer
§ Project manager
Knowledge-based Careers Requiring DB Skills
§ Financial planner
§ Market research analyst
§ Advertising manager
§ Human resource manager
§ Accountant
§ Economist
§ Sales manager
§ Sociologist
§ Management consultant
§ Public administrator