0% found this document useful (0 votes)
9 views

Lecture 04 - Chapter 6

The document discusses database management systems and how businesses can use databases and business intelligence to improve performance and decision making. It covers topics like DBMS, relational databases, using data to enhance decisions, and tools for managing large datasets like Hadoop and in-memory computing.

Uploaded by

974374542
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
9 views

Lecture 04 - Chapter 6

The document discusses database management systems and how businesses can use databases and business intelligence to improve performance and decision making. It covers topics like DBMS, relational databases, using data to enhance decisions, and tools for managing large datasets like Hadoop and in-memory computing.

Uploaded by

974374542
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 29

Chapter 6 – Database management and

Business Intelligence

Laurel Qi

1
Content agenda
 Database management systems (DBMS)
 Using database to improve business

performance and decision making


 Business Intelligence

MM3422@Laurel 2
Database management systems (DBMS)
 Computer system organizes data in a
hierarchy (components of database)
◦ Database: Group of related files
◦ File: Group of records of same type
◦ Record: Group of related fields
◦ Field: Group of characters as word(s) or number

◦ Usually a file describes an entity, e.g., person, place, thing on


which we store information. And attributes were used to
describe entity, e.g., Student_ID, Course, Date and Grade
belong to entity COURSE

MM3422@Laurel 3
Database management systems (DBMS)

Entity

Attributes

A computer system organizes data


in a hierarchy that starts with the
bit, which represents either a 0 or a
1. Bits can be grouped to form a
byte to represent one character,
number, or symbol. Bytes can be
grouped to form a field, and related
fields can be grouped to form a
record. Related records can be
collected to form a file, and related
files can be organized into a
database.

MM3422@Laurel 4
Database management systems (DBMS)
 Database
◦ a collection of data organized to serve many applications
efficiently by centralizing data and controlling redundant data
 Database management system (DBMS)
◦ software that permits an organization to centralize data,
manage them efficiently, and provide access to the stored data
by application programs

 Capabilities of DBMS:
◦ Query - Data manipulation language: Used to add, change, delete, retrieve data from
database
 Structured Query Language (SQL)
e.g.,… “select”… “from”… “where”
 Microsoft Access user can use tools to generate SQL
◦ Report - Many DBMS have report generation capabilities for creating polished reports

MM3422@Laurel 5
Database management systems (DBMS)

Database Interface Application


e.g. HRMS
programs

MM3422@Laurel 6
Database management systems (DBMS)
 Relational DBMS
◦ Represent data as two-dimensional tables called relations or files
◦ Each table contains data on entity and attributes

◦ Traditional DBMS: IBM DB2; Oracle, Microsoft Access and SQL


server; My SQL (open-source)

◦ Cloud-based relational database service providers:


 Amazon Relational Database service (Amazon RDS)
 Oracle’s own database cloud service
 Microsoft SQL Azure

Suitable
◦ users: Web-focused start-ups; SME seeking for database
capabilities at a lower price.

MM3422@Laurel 7
Database management systems (DBMS)
Table: grid of columns and rows
• Rows: Records
• Columns: Fields/attributes for entity
• Primary key: Field used to uniquely identify each record. Primary key cannot be duplicated.
• Foreign key: Primary key used in second table as look-up field to identify records from
original table. Foreign key defines the relationship between tables.

The foreign key in the second


table must be the primary key in
the first table.

MM3422@Laurel 8
Database management systems (DBMS)
An example of relational database

MM3422@Laurel 9
After understanding the DBMS, how should organization use
database to improve business performance and decision making?

MM3422@Laurel 10
Using Databases to Improve Business
Performance and Decision Making
 Database is used to……
◦ Keep track of basic transactions, e.g., paying suppliers,
processing orders, keep track of customers and pay
employees.
 Database is also used to…
◦ Help managers and employees to make better decisions.
(Pricing strategy of Starbucks)
 Big data era
◦ 6M searches per second
◦ 1M transactions per hour
◦ 4B page views per day
◦ 250B photos stored

MM3422@Laurel 11
Using Databases to Improve Business
Performance and Decision Making
 Big data era
◦ History: years ago, most of the transaction data could be stored in the
DBMS. But now…massive sets of unstructured/semi-structured data from
Web traffic, social media, sensors, E-trading systems.
• Amount: Data are more than doubling every two years, in petabytes,
exabytes. Beyond the capability of typical DBMS to capture, store, and
analyze.
• Usage: reveal more patterns and interesting anomalies than smaller data
sets, with the potential to provide new insights into customer behavior,
weather patterns, financial market activity, or other phenomena.
• Examples?
◦ Car-rental giant Hertz gathers data from web surveys, emails, text messages, web site traffic
patterns and different store branches that there would be a late return of the cars during
specific times of the day. The company arranges more managerial staff during those peak
times, and thus enhances Hertz’s performance and increased customer satisfaction.
◦ Restaurant collects credit card information and company’s social media websites -> quality is
more important than price->introducing vegetarian dishes, more seafood selections, and
more expensive wines, raising sales by more than 10%.

MM3422@Laurel 12
Using Databases to Improve Business
Performance and Decision Making

MM3422@Laurel 13
Using Databases to Improve Business
Performance and Decision Making
 The four “Vs” of big data
IBM data scientists break big data into four dimensions: volume, variety, velocity
and veracity.

Refer to: http://www.ibmbigdatahub.com/infographic/four-vs-big-data

MM3422@Laurel 14
Using Databases to Improve Business
Performance and Decision Making
 Companies need technologies or tools to
manage and analyze the big data…
 Contemporary tools:
◦ Data warehouses
Before the Big data era
◦ Data marts
◦ Hadoop
◦ In-memory computing Big data era

◦ Analytical platforms

MM3422@Laurel 15
Using Databases to Improve Business
Performance and Decision Making
 Data warehouse:
◦ Stores current and historical data from many core operational
transaction systems
◦ Consolidates( 整合 ) and standardizes( 标准化 ) information for
use across enterprise, but data cannot be altered( 更改,变动 )
◦ Data warehouse system will provide ad hoc query, analysis, and
reporting tools
 Data marts:
◦ Subset of data warehouse
◦ Summarized or highly focused portion of firm’s data for use by
specific population of users
◦ Typically focuses on single subject or line of business

MM3422@Laurel 16
Using Databases to Improve Business
Performance and Decision Making

OLTP: On-Line Transaction Processing is the process in which systems


facilitate and manage data entry and retrieval on a frequent basis. (ATM)
ETL: extraction, transformation, and loading, in which the data is cleaned and
organized into a meaningful manner.

MM3422@Laurel 17
Using Databases to Improve Business
Performance and Decision Making
In big data era… Used by Facebook and Yahoo
 Hadoop ( 分布式计算 )
◦ An open source software framework that enables distributed parallel
processing of huge amounts of data across inexpensive computers.
 Functionality:
◦ It breaks a big data problem down into sub-problems, distributes them
among up to thousands of inexpensive computer processing nodes, and
then combines the result into a smaller data set that is easier to analyze.

 Input of Hadoop:
◦ Structured transactional data (in DBMS)
◦ Semi-structured data
◦ Unstructured data (>90% of org. data).
https://www.webopedia.com/TERM/U/unstructured_data.html

MM3422@Laurel 18
Using Databases to Improve Business
Performance and Decision Making
In big data era…
 In-memory computing
◦ A method to rely on
main memory (RAM) for
data storage.

◦ Eliminate bottlenecks from retrieving and reading data in a traditional,


disk-based database and dramatically shortening query response times.
 Can reduce hours/days of processing to seconds
 Can process very large data set, e.g., data mart or small data warehouse.
 Analytic platforms (- a software suite)
◦ Commercial database vendors have also developed specialized high-speed analytic
platforms using both relational and non-relational technology that are optimized for
analyzing large datasets.

MM3422@Laurel 19
Using Databases to Improve Business
Performance and Decision Making
Traditional
data
processing

Big data
processing

A contemporary business intelligence infrastructure features capabilities and tools to


manage and analyze large quantities and different types of data from multiple sources.
Data Lake: repository for raw structured or unstructured data that has not yet been
analyzed.

MM3422@Laurel 20
Business Intelligence
 Based on Data warehouse…

 Business Intelligence:
◦ Tools for consolidating, analyzing, and providing
access to vast amounts of data to help users make
better business decisions

 Principle tools include:


◦ Software for database query and reporting
◦ Online analytical processing (OLAP)
◦ Data mining

MM3422@Laurel 21
Business Intelligence

A series of analytical tools works with data stored in databases to find


patterns and insights for helping managers and employees make better
decisions to improve organizational performance.
MM3422@Laurel 22
Business Intelligence
Query and report (advanced level)

OLTP OLAP
MM3422@Laurel 23
Business Intelligence
 Online analytical processing (OLAP)
◦ OLAP supports multidimensional data analysis
 Viewing data using multiple dimensions (data cube)
 Q: how many washers sold in the East in 2003 compared with
other regions?

OLAP enables rapid,


online answers to ad
hoc queries

MM3422@Laurel 24
Business Intelligence
 Data mining:
◦ Finds hidden patterns, relationships in large databases and infers
rules to predict future behavior. More discovery driven than OLAP
 E.g., Find buying patterns in customer data for one-to-one marketing
campaigns (body shop recommendations) or to identify profitable
customers. (credit card holder…)
 Text mining
◦ Extracts key elements from large unstructured data sets
◦ Sentiment( 看法,情绪,观点 ) analysis software
 Web mining
◦ Discovery and analysis of useful patterns and
information from web (Google trends service)

MM3422@Laurel 25
Business Intelligence
 Business Intelligence analytics
◦ Predictive analytics
 Use statistical analysis, data mining techniques, historical data,
and assumptions about future conditions to predict future
trends and behavior patterns.
◦ Big data analytics
◦ Operational intelligence and analytics
◦ Location analytics and Geographic Information
Systems (GIS) (e.g., Starbucks new shop location)

Your own examples?

MM3422@Laurel 26
Business Intelligence
 Business Intelligence (BI) envionment
 BI vendors: Oracle, SAP, IBM, Microsoft, SAS.
 Six elements in the business intelligence environment
1. Data from the business environment
2. Business intelligence infrastructure
3. Business analytics toolset
4. Managerial users and methods
5. Delivery platform – MIS, DSS, ESS
6. User interface

MM3422@Laurel 27
Business Intelligence

BUSINESS INTELLIGENCE AND ANALYTICS FOR DECISION SUPPORT


From customer transaction and
social media streams to create
real-time, personalized
shopping experiences for major
online retailers

The
display
area of
BI.

ESS
ESS

Business intelligence and analytics requires a strong database foundation,


a
set of analytic tools, and an involved management team that can ask
intelligent questions and analyze MM3422@Laurel
data. 28
Closing words
 Key Points Today
 Big data issues.
 Business intelligence and analytics.

 Home reading case:


• Can Big Data Analytics Help People Find Love?
• The Dark Side of Big Data.
• Data Drive Starbucks Location Decisions.

Thank you! Have a good weekend!


MM3422@Laurel 29

You might also like