Module 2-2
Module 2-2
INTELLIGENCE
Pradeep Kumar S
IBM Software Technical Trainer
IBM India Private Limited
UNIT-4
MODULE – 2
1. Star schema
2. Populating the data
warehouse
3. Anomalies in data fields
Star schema
◦A star schema is a multi-dimensional data model used to
organize data in a database so that it is easy to understand and
analyze.
◦Star schemas can be applied to data warehouses, databases,
data marts, and other tools.
◦The star schema design is optimized for querying large data
sets.
Fact vs Dimension Table
◦A fact table represents the measures on which analysis is
performed.
◦It also contains foreign keys for the dimension keys.
◦For example − Every sale is a fact.
Fact vs Dimension Table
◦The Dimension table represents the characteristics of a
dimension.
◦A Customer dimension can have Customer_Name, Phone_No
etc.
Populating the data warehouse
◦The process of populating a data warehouse starts with
extracting data from source systems.
◦This data then undergoes further cleansing and transformation
before it is loaded into a data warehouse.
◦Strategic execution of the BI and reporting initiatives is only
possible if the right data extraction solution is in place.
Anomalies in data fields