0% found this document useful (0 votes)
35 views

$RD56ADG

The document discusses different schemas for multidimensional data models in data warehouses, including star schemas, snowflake schemas, and fact constellation schemas. Star schemas contain a central fact table linked to dimension tables, snowflake schemas further normalize dimension tables, and fact constellation schemas allow multiple fact tables to share dimension tables.

Uploaded by

muzammil sardar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
35 views

$RD56ADG

The document discusses different schemas for multidimensional data models in data warehouses, including star schemas, snowflake schemas, and fact constellation schemas. Star schemas contain a central fact table linked to dimension tables, snowflake schemas further normalize dimension tables, and fact constellation schemas allow multiple fact tables to share dimension tables.

Uploaded by

muzammil sardar
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PPTX, PDF, TXT or read online on Scribd
You are on page 1/ 21

Data Warehouse &

Week 5 -- Lecture 13
Data Mining
Course Teacher: Syed Saood Zia
Data Warehouse & Data Mining

Data Warehousing and Online Analytical Processing


Table of Content 3

• Data Warehouse Modeling: Data Cube and OLAP

• Schemas for Multidimensional Data Models


• Star Schema

• Snow Flake Schema

• Fact Constellation
Data Warehouse Modeling: Data Cube and
OLAP 4

• Data warehouses and OLAP tools are based on a multidimensional


data model.
• This model views data in the form of a data cube.
• A data cube allows data to be modeled and viewed in multiple
dimensions. It is defined by dimensions and facts.
• In general terms, dimensions are the perspectives or entities with
respect to which an organization wants to keep records.
Data Cube: 2-D view of Data 5
Data Cube: 3-D View 6
3-D Data Cube Representation 7
Schemas for Multidimensional Data Models 8

• The entity-relationship data model is commonly used in the design


of relational databases, where a database schema consists of a set
of entities and the relationships between them.

• Such a data model is appropriate for online transaction processing.


Schemas for Multidimensional Data Models 9

• A data warehouse, however, requires a concise, subject-oriented


schema that facilitates online data analysis.

• The most popular data model for a data warehouse is a


multidimensional model, which can exist in the form of a star
schema, a snowflake schema, or a fact constellation schema.
Star Schema 10

• The most common modeling paradigm is the star schema, in which the
data warehouse contains:

1) A large central table (fact table) containing the bulk of the data,
with no redundancy, and

2) A set of smaller attendant tables (dimension tables), one for each


dimension.

• The schema graph resembles a starburst, with the dimension tables


displayed in a radial pattern around the central fact table.
Star Schema 11

• A star schema for AllElectronics sales is shown in Figure 4.6. Sales are
considered along four dimensions: time, item, branch, and location.

• The schema contains a central fact table for sales that contains keys to
each of the four dimensions, along with two measures: dollars sold and
units sold.

• To minimize the size of the fact table, dimension identifiers (e.g., time
key and item key) are system-generated identifiers.
Star Schema 12
Snow Flake Schema 13

• The snowflake schema is a variant of the star schema model, where


some dimension tables are normalized, thereby further splitting the
data into additional tables.
• The resulting schema graph forms a shape similar to a snowflake.
• The major difference between the snowflake and star schema models
is that the dimension tables of the snowflake model may be kept in
normalized form to reduce redundancies.
• Such a table is easy to maintain and saves storage space.
Snow Flake Schema 14
Snow Flake Schema 15

• Snowflake schema. A snowflake schema for AllElectronics sales is


given in Figure 4.7.
• The sales fact table is identical to that of the star schema in
Figure 4.6.
• The main difference between the two schemas is in the definition
of dimension tables.
• The single dimension table for item in the star schema is
normalized in the snowflake schema, resulting in new item and
supplier tables.
Fact Constellation 16

• Sophisticated applications may require multiple fact tables to


share dimension tables. This kind of schema can be viewed as a
collection of stars, and hence is called a galaxy schema or a fact
constellation.
• Fact constellation. A fact constellation schema is shown in Figure
4.8.
• This schema specifies two fact tables, sales and shipping.
Fact Constellation 17
Fact Constellation 18

• A fact constellation schema allows dimension tables to be shared


between fact tables. For example, the dimensions tables for time, item,
and location are shared between the sales and shipping fact tables.
• In data warehousing, there is a distinction between a data warehouse
and a data mart.
• A data warehouse collects information about subjects that span the
entire organization, such as customers, items, sales, assets, and
personnel, and thus its scope is enterprise-wide.
• For data warehouses, the fact constellation schema is commonly used,
since it can model multiple, interrelated subjects.
Fact Constellation 19

• A fact constellation schema allows dimension tables to be shared


between fact tables. For example, the dimensions tables for time,
item, and location are shared between the sales and shipping fact
tables.

• In data warehousing, there is a distinction between a data


warehouse and a data mart.
Fact Constellation 20

• A data mart, on the other hand, is a department subset of the


data warehouse that focuses on selected subjects, and thus its
scope is department wide.

• For data marts, the star or snowflake schema is commonly used,


since both are geared toward modeling single subjects, although
the star schema is more popular and efficient.
Summary 21

• Data Warehouse Modeling: Data Cube and OLAP

• Schemas for Multidimensional Data Models


• Star Schema

• Snow Flake Schema

• Fact Constellation

You might also like