Lecture 04 - Chapter 6
Lecture 04 - Chapter 6
Business Intelligence
Laurel Qi
1
Content agenda
Database management systems (DBMS)
Using database to improve business
MM3422@Laurel 2
Database management systems (DBMS)
Computer system organizes data in a
hierarchy (components of database)
◦ Database: Group of related files
◦ File: Group of records of same type
◦ Record: Group of related fields
◦ Field: Group of characters as word(s) or number
MM3422@Laurel 3
Database management systems (DBMS)
Entity
Attributes
MM3422@Laurel 4
Database management systems (DBMS)
Database
◦ a collection of data organized to serve many applications
efficiently by centralizing data and controlling redundant data
Database management system (DBMS)
◦ software that permits an organization to centralize data,
manage them efficiently, and provide access to the stored data
by application programs
Capabilities of DBMS:
◦ Query - Data manipulation language: Used to add, change, delete, retrieve data from
database
Structured Query Language (SQL)
e.g.,… “select”… “from”… “where”
Microsoft Access user can use tools to generate SQL
◦ Report - Many DBMS have report generation capabilities for creating polished reports
MM3422@Laurel 5
Database management systems (DBMS)
MM3422@Laurel 6
Database management systems (DBMS)
Relational DBMS
◦ Represent data as two-dimensional tables called relations or files
◦ Each table contains data on entity and attributes
Suitable
◦ users: Web-focused start-ups; SME seeking for database
capabilities at a lower price.
MM3422@Laurel 7
Database management systems (DBMS)
Table: grid of columns and rows
• Rows: Records
• Columns: Fields/attributes for entity
• Primary key: Field used to uniquely identify each record. Primary key cannot be duplicated.
• Foreign key: Primary key used in second table as look-up field to identify records from
original table. Foreign key defines the relationship between tables.
MM3422@Laurel 8
Database management systems (DBMS)
An example of relational database
MM3422@Laurel 9
After understanding the DBMS, how should organization use
database to improve business performance and decision making?
MM3422@Laurel 10
Using Databases to Improve Business
Performance and Decision Making
Database is used to……
◦ Keep track of basic transactions, e.g., paying suppliers,
processing orders, keep track of customers and pay
employees.
Database is also used to…
◦ Help managers and employees to make better decisions.
(Pricing strategy of Starbucks)
Big data era
◦ 6M searches per second
◦ 1M transactions per hour
◦ 4B page views per day
◦ 250B photos stored
MM3422@Laurel 11
Using Databases to Improve Business
Performance and Decision Making
Big data era
◦ History: years ago, most of the transaction data could be stored in the
DBMS. But now…massive sets of unstructured/semi-structured data from
Web traffic, social media, sensors, E-trading systems.
• Amount: Data are more than doubling every two years, in petabytes,
exabytes. Beyond the capability of typical DBMS to capture, store, and
analyze.
• Usage: reveal more patterns and interesting anomalies than smaller data
sets, with the potential to provide new insights into customer behavior,
weather patterns, financial market activity, or other phenomena.
• Examples?
◦ Car-rental giant Hertz gathers data from web surveys, emails, text messages, web site traffic
patterns and different store branches that there would be a late return of the cars during
specific times of the day. The company arranges more managerial staff during those peak
times, and thus enhances Hertz’s performance and increased customer satisfaction.
◦ Restaurant collects credit card information and company’s social media websites -> quality is
more important than price->introducing vegetarian dishes, more seafood selections, and
more expensive wines, raising sales by more than 10%.
MM3422@Laurel 12
Using Databases to Improve Business
Performance and Decision Making
MM3422@Laurel 13
Using Databases to Improve Business
Performance and Decision Making
The four “Vs” of big data
IBM data scientists break big data into four dimensions: volume, variety, velocity
and veracity.
MM3422@Laurel 14
Using Databases to Improve Business
Performance and Decision Making
Companies need technologies or tools to
manage and analyze the big data…
Contemporary tools:
◦ Data warehouses
Before the Big data era
◦ Data marts
◦ Hadoop
◦ In-memory computing Big data era
◦ Analytical platforms
MM3422@Laurel 15
Using Databases to Improve Business
Performance and Decision Making
Data warehouse:
◦ Stores current and historical data from many core operational
transaction systems
◦ Consolidates( 整合 ) and standardizes( 标准化 ) information for
use across enterprise, but data cannot be altered( 更改,变动 )
◦ Data warehouse system will provide ad hoc query, analysis, and
reporting tools
Data marts:
◦ Subset of data warehouse
◦ Summarized or highly focused portion of firm’s data for use by
specific population of users
◦ Typically focuses on single subject or line of business
MM3422@Laurel 16
Using Databases to Improve Business
Performance and Decision Making
MM3422@Laurel 17
Using Databases to Improve Business
Performance and Decision Making
In big data era… Used by Facebook and Yahoo
Hadoop ( 分布式计算 )
◦ An open source software framework that enables distributed parallel
processing of huge amounts of data across inexpensive computers.
Functionality:
◦ It breaks a big data problem down into sub-problems, distributes them
among up to thousands of inexpensive computer processing nodes, and
then combines the result into a smaller data set that is easier to analyze.
Input of Hadoop:
◦ Structured transactional data (in DBMS)
◦ Semi-structured data
◦ Unstructured data (>90% of org. data).
https://www.webopedia.com/TERM/U/unstructured_data.html
MM3422@Laurel 18
Using Databases to Improve Business
Performance and Decision Making
In big data era…
In-memory computing
◦ A method to rely on
main memory (RAM) for
data storage.
MM3422@Laurel 19
Using Databases to Improve Business
Performance and Decision Making
Traditional
data
processing
Big data
processing
MM3422@Laurel 20
Business Intelligence
Based on Data warehouse…
Business Intelligence:
◦ Tools for consolidating, analyzing, and providing
access to vast amounts of data to help users make
better business decisions
MM3422@Laurel 21
Business Intelligence
OLTP OLAP
MM3422@Laurel 23
Business Intelligence
Online analytical processing (OLAP)
◦ OLAP supports multidimensional data analysis
Viewing data using multiple dimensions (data cube)
Q: how many washers sold in the East in 2003 compared with
other regions?
MM3422@Laurel 24
Business Intelligence
Data mining:
◦ Finds hidden patterns, relationships in large databases and infers
rules to predict future behavior. More discovery driven than OLAP
E.g., Find buying patterns in customer data for one-to-one marketing
campaigns (body shop recommendations) or to identify profitable
customers. (credit card holder…)
Text mining
◦ Extracts key elements from large unstructured data sets
◦ Sentiment( 看法,情绪,观点 ) analysis software
Web mining
◦ Discovery and analysis of useful patterns and
information from web (Google trends service)
MM3422@Laurel 25
Business Intelligence
Business Intelligence analytics
◦ Predictive analytics
Use statistical analysis, data mining techniques, historical data,
and assumptions about future conditions to predict future
trends and behavior patterns.
◦ Big data analytics
◦ Operational intelligence and analytics
◦ Location analytics and Geographic Information
Systems (GIS) (e.g., Starbucks new shop location)
MM3422@Laurel 26
Business Intelligence
Business Intelligence (BI) envionment
BI vendors: Oracle, SAP, IBM, Microsoft, SAS.
Six elements in the business intelligence environment
1. Data from the business environment
2. Business intelligence infrastructure
3. Business analytics toolset
4. Managerial users and methods
5. Delivery platform – MIS, DSS, ESS
6. User interface
MM3422@Laurel 27
Business Intelligence
The
display
area of
BI.
ESS
ESS