0% found this document useful (0 votes)
239 views6 pages

BI Assignment 1

The document compares OLTP and OLAP systems. OLTP systems handle transactional data and operational tasks, providing source data to data warehouses. OLAP systems analyze consolidated data from OLTP databases to help with planning, problem solving, and decision making. OLAP systems allow for complex queries and multi-dimensional views of business data and activities, but queries may take longer to run than in optimized OLTP systems due to larger data volumes and aggregation.

Uploaded by

priyangka deka
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
239 views6 pages

BI Assignment 1

The document compares OLTP and OLAP systems. OLTP systems handle transactional data and operational tasks, providing source data to data warehouses. OLAP systems analyze consolidated data from OLTP databases to help with planning, problem solving, and decision making. OLAP systems allow for complex queries and multi-dimensional views of business data and activities, but queries may take longer to run than in optimized OLTP systems due to larger data volumes and aggregation.

Uploaded by

priyangka deka
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 6

1.Compare OLPT versus OLAP. Give example where ever possible.

(5)

OLTP vs OLAP

We can divide IT systems into transactional (OLTP) and analytical (OLAP). In general, we can assume that
OLTP systems provide source data to data warehouses, whereas OLAP systems help to analyze it. 

OLTP SYSTEM OLAP SYSTEM


Online Transaction Processing Online Analytical Processing
(Operation System) (Operation System)

Source of data Operational data; OLTPs are the Consolidation data; OLAP data
original source of the data. comes from the various OLTP
Databases

Purpose of data To control and run fundamental To help with planning, problem


business tasks solving, and decision support
What the data Reveals a snapshot of ongoing Multi-dimensional views of
business processes various kinds of business
activities

Short and fast inserts and Short and fast inserts and
updates initiated by end users updates initiated by end users
Inserts and Updates
Queries Relatively standardized and Often complex queries involving
simple queries Returning aggregations
relatively few records

Processing Speed Typically very fast Depends on the amount of data


involved; batch data refreshes
and complex queries may take
many hours; query speed can be
improved by creating indexes

Space Requirements Can be relatively small if Larger due to the existence of


historical data is archived aggregation structures and
history data; requires more
indexes than OLTP

Database Design Highly normalized with many Typically de-normalized with


tables fewer tables; use of star and/or
snowflake schemas

Backup and Recovery Backup religiously; operational Instead of regular backups,


data is critical to run the some environments may
business, data loss is likely to consider simply reloading the
entail significant monetary loss OLTP data as a recovery method
and legal liability

2.Write a case study of your choice. for the following dimensioanl model.
a) Draw Star schema model. Give explanation for each.(6)
b) Draw Fact constellation model. Explain the concept of confirmed dimensions and fact table.
Give example for each(9)

a)

Dimension tables are used to describe the data we want to store. For example: a retailer might want to
store the date, store, and employee involved in a specific purchase. Each dimension table is its own
category (date, employee, store) and can have one or more attributes. For each store, we can save its
location at the city, region, state and country level. For each date, we can store the year, month, day of
the month, day of the week, etc. This is related to the hierarchy of attributes in the dimension table.

In the star schema, we’ll usually find that some attributes are a subset of other attributes in the same
record. This redundancy is deliberate and done in the name of better performance. We could use date,
location, and sales agent dimensions to aggregate (the transform part of the ETL process) and store data
inside DWH. In dimensional modeling, it’s very important to define the right dimensions and choose
proper granulation.
The Star Schema

The star schema is the simplest model used in DWH. Because the fact table is in the center of the schema
with dimension tables around it, it looks roughly like a star. This is especially apparent when the fact table
is surrounded by five dimension tables. A variant of the star schema the centipede schema, where the
fact table is surrounded by a large number of small dimension tables.

Star schemas are very commonly used in data marts. We can relate them to the top-down data model
approach. We’ll analyze two star schemas (data marts) and then combine them to make a single model.

The sales report is one today’s most common reports. As we mentioned before, in most cases we could
generate sales reports from the live system. But when data or business size makes this too cumbersome,
we’ll have to build a data warehouse or a data mart to streamline the process. After designing our star
schema, an ETL process will get the data from operational database(s), transform the data into the
proper format for the DWH, and load the data into the warehouse.

The model presented above contains of one fact table (colored light red) and five dimension tables
(colored light blue). The tables in the model are:

 fact_sales  – This table contains references to the dimension tables plus two facts (price and
quantity sold). Note that all five foreign keys together form the primary key of the table.
 dim_sales_type  – This is a sales-type dimension table with only one attribute, “ type_name ”.
 dim_employee  – This is an employee dimension table that stores basic employee attributes: full
name and birth year.
 dim_product  – This is a product dimension table with only two attributes (other than the
primary key): product name and product type.
 dim_time  – This table handles the time dimension. It contains five attributes besides the
primary key. The lowest-level data is sales by date ( action_date ). The  action_week  attribute is
the number of the week in that year (i.e. the first week in January would be given the number 1;
the last week in December would get the number 52, etc.)
The  actual_month  and  actual_year  attributes store the calendar month and year when the
sale occurred. These can be extracted from the  action_date  attribute.
The  action_weekday  attribute stores the name of the day when the sale took place.
 dim_store  – This is a store dimension. For each store we’ll save the city, region, state and
country where it is located. Here we can clearly notice that the star schema is denormalized.

b)

A Fact constellation means two or more fact tables sharing one or more dimensions. It is also
called Galaxy schema.

Fact Constellation Schema describes a logical structure of data warehouse or data mart. Fact
Constellation Schema can design with a collection of de-normalized FACT, Shared, and Conformed
Dimension tables.

We can look at the two previous models as two data marts, one for the sales department and the other
for the supply department. Each of them consists of only one fact table and a few dimensional tables. If
we wanted, we could combine these two data marts into one model. This type of schema, containing
several fact tables and sharing some dimension tables, is called a galaxy schema. Sharing dimension
tables can reduce database size, especially where shared dimensions have many possible values. Ideally,
in both data marts the dimensions are defined in the same manner. If that’s not the case, we’ll have to
adjust the dimensions to fit both needs.

A galaxy schema, built out of our two example data marts, is shown below:
3.Explain any two Business Intelligence applications with an example(5).

Business intelligence (BI) leverages software and services to transform data into actionable insights that
inform an organization’s strategic and tactical business decisions. BI tools access and analyze data sets
and present analytical findings in reports, summaries, dashboards, graphs, charts and maps to provide
users with detailed intelligence about the state of the business.

The term business intelligence often also refers to a range of tools that provide quick, easy-to-digest
access to insights about an organization's current state, based on available data. 

Reporting:

A crucial business application of BI is reporting. As we’ve covered, business intelligence tools collect and
study unstructured sets of data in addition to organizing and using them to generate a range of different
types of reports. These can include staffing, expenses, sales, customer services, and other processes.

Reporting and data analysis are similar, but they vary significantly in purpose, delivery, tasks and value.
Reporting is the process of organizing data in summaries with the intention of monitoring business
performance. Analysis is the process of exploring data to extract insights that can be applied to improve
business practices.
Basically, reporting turns data into plain information. Analysis takes data and turns it into actionable
insights. Both help businesses improve their performance and monitor operations, but use different
methods to do so. Reporting shows users what’s happening and analysis explains why it’s happening.
Both processes can be carried out using visualizations, but don’t have to.

Business intelligence tools are ideal for handling dynamic data. Historically, data visualizations were
static, and a new one would have to be created for every variable change. Modern BI software provides
interactive dashboards that can update in real time, offering a new level of usability and agility in data
analysis.

Performance management

With BI applications, organizations can monitor goal progress based on pre-defined or customizable
timeframes. The data-driven goals may include project completion deadlines, target delivery time, or
sales goals. For example, if you’d like to reach a certain sales goal, your BI system can analyze previous
months of data and suggest a reasonable goal to aim for based on past performance.

These goals can be tracked closely to deliver gets -time updates on goal progress. This helps you
understand what gaps might remain. Users can set the system to alert them when they are getting close
to a target or if the time limit is ending and they have yet to reach their goal. This helps managers and
employees alike stay on top of their progress and helps keep teams goal-oriented.

Users can also monitor goal fulfillment and use progress data to gauge the overall productivity of an
organization. Unlike instances when a substantial amount of time is lost tracking down or organizing
urgently needed data, information is always readily accessible. This saves businesses time and money —
not to mention makes your lives easier!

You might also like