Module 2-2

BUSINESS INTELLIGENCE
Pradeep Kumar S
IBM Software Technical Trainer
IBM India Private Limited
UNIT-4
MODULE – 2

1. Star schema
2. Populating the data warehouse
3. Anomalies in data fields
Star schema
◦A star schema is a multi-dimensional data model used to
organize data in a database so that it is easy to understand and
analyze.
◦Star schemas can be applied to data warehouses, databases,
data marts, and other tools.
◦The star schema design is optimized for querying large data
sets.
Fact vs Dimension Table
◦A fact table represents the measures on which analysis is
performed.
◦It also contains foreign keys for the dimension keys.
◦For example, every sale is a fact.
Fact vs Dimension Table
◦The Dimension table represents the characteristics of a
dimension.
◦For example, a Customer dimension can have attributes such as
Customer_Name and Phone_No.
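The relationship between the two table types can be sketched with Python's built-in sqlite3 module. This is a minimal illustration, not a production design: the table names dim_customer and fact_sales, the amount measure, and the sample rows are assumptions made for the example. Only Customer_Name and Phone_No come from the slide.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    -- Dimension table: descriptive attributes of a customer.
    CREATE TABLE dim_customer (
        customer_key  INTEGER PRIMARY KEY,
        customer_name TEXT,
        phone_no      TEXT
    );
    -- Fact table: one row per sale, holding a measure (amount)
    -- and a foreign key that points at the dimension.
    CREATE TABLE fact_sales (
        sale_id      INTEGER PRIMARY KEY,
        customer_key INTEGER REFERENCES dim_customer(customer_key),
        amount       REAL
    );
""")

# Hypothetical sample data.
conn.executemany("INSERT INTO dim_customer VALUES (?, ?, ?)",
                 [(1, "Asha", "555-0101"), (2, "Ravi", "555-0102")])
conn.executemany("INSERT INTO fact_sales VALUES (?, ?, ?)",
                 [(1, 1, 100.0), (2, 1, 50.0), (3, 2, 75.0)])

# Analysis joins the fact table to the dimension and aggregates the measure.
sales_by_customer = conn.execute("""
    SELECT c.customer_name, SUM(f.amount)
    FROM fact_sales AS f
    JOIN dim_customer AS c ON c.customer_key = f.customer_key
    GROUP BY c.customer_name
    ORDER BY c.customer_name
""").fetchall()
print(sales_by_customer)  # [('Asha', 150.0), ('Ravi', 75.0)]
```

A full star schema would surround the fact table with several dimensions (for example date and product), each joined through its own key in the same way.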
Populating the data warehouse
◦The process of populating a data warehouse starts with
extracting data from source systems.
◦This data then undergoes further cleansing and transformation
before it is loaded into a data warehouse.
◦Strategic execution of the BI and reporting initiatives is only
possible if the right data extraction solution is in place.
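The extract, cleanse/transform, and load steps described above might look like the following sketch. Everything concrete here is assumed for illustration: the in-memory CSV standing in for a source system, the sales table, and the cleansing rules (trim and title-case names, drop rows with a missing name or a non-numeric amount).

```python
import csv
import io
import sqlite3

# Hypothetical source extract; a real source would be a file or an
# operational database.
SOURCE_CSV = """customer_name,amount
 asha ,100
Ravi,50
,20
Asha,abc
"""

def extract(text):
    """Read raw rows from the source system."""
    return list(csv.DictReader(io.StringIO(text)))

def transform(rows):
    """Cleanse and standardize rows; drop those that cannot be repaired."""
    clean = []
    for row in rows:
        name = (row["customer_name"] or "").strip().title()
        try:
            amount = float(row["amount"])
        except (TypeError, ValueError):
            continue  # non-numeric amount: skip the row
        if name:      # missing name: skip the row
            clean.append((name, amount))
    return clean

def load(conn, rows):
    """Write the cleansed rows into the warehouse table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS sales (customer_name TEXT, amount REAL)")
    conn.executemany("INSERT INTO sales VALUES (?, ?)", rows)
    conn.commit()

warehouse = sqlite3.connect(":memory:")
load(warehouse, transform(extract(SOURCE_CSV)))
loaded = warehouse.execute(
    "SELECT customer_name, amount FROM sales ORDER BY amount").fetchall()
print(loaded)  # [('Ravi', 50.0), ('Asha', 100.0)]
```

Only two of the four source rows reach the warehouse. The other two are rejected during cleansing, which is why the quality of the extraction and transformation steps matters for later reporting.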
Anomalies in data fields

◦An unexpected change in a data pattern, or an event that does not
conform to the expected data pattern, is considered an anomaly.
◦In other words, an anomaly is a deviation from business as
usual.
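One simple way to make "deviation from business as usual" concrete is to flag values that lie far from the average of a series. The sketch below uses a standard-deviation rule, which is one possible technique chosen for illustration and is not named in the slides. The function name find_anomalies, the threshold of 2.0, and the daily_orders figures are all assumptions.

```python
from statistics import mean, stdev

def find_anomalies(values, threshold=2.0):
    """Return values more than `threshold` standard deviations from the mean."""
    mu = mean(values)
    sigma = stdev(values)
    if sigma == 0:
        return []  # every value is identical, so nothing deviates
    return [v for v in values if abs(v - mu) / sigma > threshold]

# Hypothetical daily order counts: eight ordinary days and one spike.
daily_orders = [100, 102, 98, 101, 99, 103, 97, 100, 250]
anomalies = find_anomalies(daily_orders)
print(anomalies)  # [250]
```

A fixed threshold like this is crude, because a large outlier inflates the standard deviation it is measured against. It is enough to show the idea of comparing each value with the expected pattern.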
Anomalies in data fields
Anomaly and its examples
◦If you are a breeder of black dogs and one puppy comes out
pink, that puppy is an anomaly.
◦The noun anomaly comes from the Greek word anomalia,
meaning “uneven” or “irregular.”
◦When something is unusual compared to similar things around
it, it is an anomaly.
Anomalies in data fields
There are three types of data anomalies:
◦Update anomalies occur when the same data is repeated in
multiple rows, and changes are made in some but not all
instances.
◦For example, if an employee's address changes and the update
is made in one row but not in others, the database will contain
inconsistent data.
◦An insertion anomaly is the inability to add data to the
database due to the absence of other data.
◦Deletion anomalies happen when the deletion of unwanted
information causes desired information to be deleted as well.
◦For example, if a single database record contains information
about a particular product along with information about a
salesperson for the company, and the record is deleted when the
salesperson quits, then the information about the product is lost
along with the salesperson information.
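The three anomalies can be reproduced on a small denormalized table. The sketch below is an illustration under assumed names: the product_sales table, its columns, the NOT NULL constraints, and the sample rows are all hypothetical, chosen so that product and salesperson information share one record as in the example above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical denormalized table: product and salesperson data in one row.
conn.execute("""
    CREATE TABLE product_sales (
        product     TEXT NOT NULL,
        price       REAL NOT NULL,
        salesperson TEXT NOT NULL,
        sp_address  TEXT NOT NULL
    )
""")
conn.executemany("INSERT INTO product_sales VALUES (?, ?, ?, ?)", [
    ("Laptop", 900.0, "Meena", "12 Lake Rd"),
    ("Phone",  500.0, "Meena", "12 Lake Rd"),
    ("Tablet", 300.0, "Arun",  "4 Hill St"),
])

# Update anomaly: the address is changed in one row but not the other.
conn.execute("UPDATE product_sales SET sp_address = '8 Park Ave' "
             "WHERE salesperson = 'Meena' AND product = 'Laptop'")
meena_addresses = conn.execute(
    "SELECT DISTINCT sp_address FROM product_sales "
    "WHERE salesperson = 'Meena' ORDER BY sp_address").fetchall()
print(meena_addresses)  # two different addresses for the same person

# Insertion anomaly: a new product cannot be stored without a salesperson.
try:
    conn.execute(
        "INSERT INTO product_sales (product, price) VALUES ('Monitor', 200.0)")
    insert_blocked = False
except sqlite3.IntegrityError:
    insert_blocked = True
print(insert_blocked)

# Deletion anomaly: removing Arun also removes the only Tablet record.
conn.execute("DELETE FROM product_sales WHERE salesperson = 'Arun'")
tablet_rows = conn.execute(
    "SELECT COUNT(*) FROM product_sales WHERE product = 'Tablet'").fetchone()[0]
print(tablet_rows)
```

Splitting the data into separate product and salesperson tables, linked by a key, is the usual way to avoid all three problems.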
Thank You !!!
