Architecture
Since a data warehouse can gather information quickly and efficiently, it can
enhance business productivity.
A data warehouse also helps bring down costs by tracking trends and
patterns over a long period in a consistent and reliable manner.
The top-down view − This view allows the selection of relevant information
needed for a data warehouse.
The data source view − This view presents the information being captured,
stored, and managed by the operational system.
The data warehouse view − This view includes the fact tables and
dimension tables. It represents the information stored inside the data
warehouse.
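As a minimal sketch of the data warehouse view, the following uses Python's built-in sqlite3 module to create one fact table and two dimension tables, then joins them for a simple report. The table and column names (dim_item, dim_customer, fact_sales) are hypothetical and chosen only for illustration.

```python
import sqlite3


def build_star_schema(conn):
    """Create a tiny star schema: one fact table and two dimension tables."""
    conn.executescript("""
        CREATE TABLE dim_item (item_id INTEGER PRIMARY KEY, item_name TEXT);
        CREATE TABLE dim_customer (customer_id INTEGER PRIMARY KEY, customer_name TEXT);
        CREATE TABLE fact_sales (
            item_id INTEGER REFERENCES dim_item(item_id),
            customer_id INTEGER REFERENCES dim_customer(customer_id),
            quantity INTEGER,
            amount REAL
        );
    """)


def sales_by_item(conn):
    """Join the fact table to the item dimension and total the amount measure."""
    return conn.execute("""
        SELECT i.item_name, SUM(f.amount)
        FROM fact_sales AS f
        JOIN dim_item AS i ON i.item_id = f.item_id
        GROUP BY i.item_name
        ORDER BY i.item_name
    """).fetchall()


conn = sqlite3.connect(":memory:")
build_star_schema(conn)
conn.executemany("INSERT INTO dim_item VALUES (?, ?)", [(1, "pen"), (2, "book")])
conn.executemany("INSERT INTO dim_customer VALUES (?, ?)", [(10, "Asha")])
conn.executemany(
    "INSERT INTO fact_sales VALUES (?, ?, ?, ?)",
    [(1, 10, 2, 3.0), (2, 10, 1, 12.5), (1, 10, 1, 1.5)],
)
print(sales_by_item(conn))
```

The fact table holds only keys and measures, while the descriptive attributes live in the dimension tables.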
The business query view − It is the view of the data from the viewpoint of
the end-user.
1 of 8 2/12/2025, 8:57 PM
Data Warehousing - Architecture https://www.tutorialspoint.com/dwh/dwh_architecture.htm
Bottom Tier − The bottom tier of the architecture is the data warehouse
database server. It is typically a relational database system. Back-end
tools and utilities feed data into the bottom tier. These tools and
utilities perform the extract, clean, load, and refresh functions.
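The four back-end functions can be sketched as small Python functions. This is only an illustration under stated assumptions: the source is a plain list of dictionaries, the sales table and its columns are hypothetical, and refresh is simplified to a full reload rather than an incremental update.

```python
import sqlite3


def extract(source_rows):
    """Extract: copy raw records out of the (simulated) operational source."""
    return list(source_rows)


def clean(rows):
    """Clean: drop records with a missing amount and normalise item names."""
    cleaned = []
    for row in rows:
        if row["amount"] is None:
            continue
        cleaned.append({"item": row["item"].strip().lower(),
                        "amount": float(row["amount"])})
    return cleaned


def load(conn, rows):
    """Load: insert the cleaned records into the warehouse table."""
    conn.executemany(
        "INSERT INTO sales (item, amount) VALUES (:item, :amount)", rows
    )
    conn.commit()


def refresh(conn, source_rows):
    """Refresh (simplified): replace the table with a fresh pass over the source."""
    conn.execute("DELETE FROM sales")
    load(conn, clean(extract(source_rows)))


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (item TEXT, amount REAL)")
source = [{"item": " Pen ", "amount": "1.5"}, {"item": "Book", "amount": None}]
refresh(conn, source)
rows = conn.execute("SELECT item, amount FROM sales").fetchall()
print(rows)
```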
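Middle Tier − In the middle tier, we have the OLAP server, which can be
implemented in either of two ways: as relational OLAP (ROLAP), which maps
multidimensional operations onto a relational database, or as
multidimensional OLAP (MOLAP), which implements multidimensional data and
operations directly.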
Top Tier − This tier is the front-end client layer. It holds the query
tools, reporting tools, analysis tools, and data mining tools.
From the perspective of architecture, we have the following data warehouse models −
Virtual Warehouse
Data mart
Enterprise Warehouse
A data mart contains a subset of organization-wide data. This subset is valuable
to specific groups within an organization.
In other words, data marts contain data specific to a particular
group. For example, the marketing data mart may contain data related to items,
customers, and sales. Data marts are confined to subjects.
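A rough sketch of this idea, assuming the warehouse lives in a single SQLite table: the mart is built as a separate table holding only the rows and columns one group needs. The names warehouse_facts and marketing_mart, and the subject column, are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE warehouse_facts (subject TEXT, item TEXT, customer TEXT, amount REAL);
    INSERT INTO warehouse_facts VALUES
        ('marketing', 'pen', 'Asha', 1.5),
        ('finance', 'ledger', 'Ravi', 40.0),
        ('marketing', 'book', 'Asha', 12.5);
    -- The mart keeps only the marketing subject: items, customers, and sales amounts.
    CREATE TABLE marketing_mart AS
        SELECT item, customer, amount
        FROM warehouse_facts
        WHERE subject = 'marketing';
""")

mart_rows = conn.execute(
    "SELECT item, customer, amount FROM marketing_mart ORDER BY item"
).fetchall()
print(mart_rows)
```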
The life cycle of a data mart may become complex in the long run if its planning and
design are not organization-wide.
Load Manager
This component performs the operations required for the extract and load process.
The size and complexity of the load manager vary between specific solutions, from
one data warehouse to another.
Perform simple transformations into a structure similar to the one in the data
warehouse.
The data is extracted from the operational databases or the external information
providers. Gateways are the application programs used to extract data. A gateway is
supported by the underlying DBMS and allows a client program to generate SQL to be
executed at a server. Open Database Connectivity (ODBC) and Java Database Connectivity
(JDBC) are examples of gateways.
To minimize the total load window, the data needs to be loaded into the
warehouse in the fastest possible time.
It is more effective to load the data into a relational database before applying
transformations and checks.
While loading, it may be necessary to perform simple transformations. Once these are
complete, we are in a position to do the complex checks. Suppose we are loading
EPOS sales transactions; we then need to perform the following checks:
Strip out all the columns that are not required within the warehouse.
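The column-stripping step can be sketched in a few lines of Python. The column names below are hypothetical stand-ins for an EPOS record; a real feed would define its own list of required columns.

```python
# Hypothetical list of the columns the warehouse actually needs.
REQUIRED_COLUMNS = ("transaction_id", "item", "amount")


def strip_unneeded_columns(record, required=REQUIRED_COLUMNS):
    """Return a copy of the record with only the required columns.

    Raises KeyError if a required column is missing from the record.
    """
    return {name: record[name] for name in required}


raw = {
    "transaction_id": 7,
    "item": "pen",
    "amount": 1.5,
    "till_operator": "x",
    "receipt_footer": "Thanks",
}
stripped = strip_unneeded_columns(raw)
print(stripped)
```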
Warehouse Manager
A warehouse manager is responsible for the warehouse management process. It
consists of third-party system software, C programs, and shell scripts.
The size and complexity of warehouse managers vary between specific solutions.
Backup/Recovery tool
SQL Scripts
Creates indexes, business views, partition views against the base data.
Transforms and merges the source data into the published data warehouse.
Archives the data that has reached the end of its captured life.
Note − A warehouse manager also analyzes query profiles to determine which indexes and
aggregations are appropriate.
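Some of the warehouse manager operations listed above can be sketched with sqlite3. This is an illustration only: the sales and sales_archive tables, the index and view names, and the ISO-format sale_date column are all assumptions, and archiving here simply moves rows to another table.

```python
import sqlite3


def create_index_and_view(conn):
    """Create an index and a business view against the base data."""
    conn.executescript("""
        CREATE INDEX IF NOT EXISTS idx_sales_item ON sales (item);
        CREATE VIEW IF NOT EXISTS v_sales_by_item AS
            SELECT item, SUM(amount) AS total_amount FROM sales GROUP BY item;
    """)


def archive_old_rows(conn, cutoff_date):
    """Move rows older than cutoff_date (ISO 'YYYY-MM-DD') into the archive table."""
    with conn:  # commit both statements together, or roll back on error
        conn.execute(
            "INSERT INTO sales_archive SELECT * FROM sales WHERE sale_date < ?",
            (cutoff_date,),
        )
        conn.execute("DELETE FROM sales WHERE sale_date < ?", (cutoff_date,))


conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE sales (item TEXT, amount REAL, sale_date TEXT);
    CREATE TABLE sales_archive (item TEXT, amount REAL, sale_date TEXT);
    INSERT INTO sales VALUES
        ('pen', 1.5, '2019-01-05'),
        ('pen', 2.0, '2024-03-01'),
        ('book', 12.5, '2024-03-02');
""")
create_index_and_view(conn)
archive_old_rows(conn, "2020-01-01")
totals = conn.execute(
    "SELECT item, total_amount FROM v_sales_by_item ORDER BY item"
).fetchall()
archived = conn.execute("SELECT COUNT(*) FROM sales_archive").fetchone()[0]
print(totals, archived)
```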
Query Manager
The query manager is responsible for directing queries to the suitable tables.
The architecture of a query manager includes the following:
Stored procedures
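Directing a query to a suitable table can be sketched as a lookup: if a predefined summary table matches the grouping a query asks for, use it, otherwise fall back to the detailed table. The table names and the exact-match rule are assumptions made for illustration; a real query manager would weigh far more than this.

```python
# Hypothetical summary tables, keyed by the set of columns they are grouped on.
SUMMARY_TABLES = {
    frozenset({"item"}): "summary_sales_by_item",
    frozenset({"item", "month"}): "summary_sales_by_item_month",
}
DETAIL_TABLE = "fact_sales"  # hypothetical detailed table


def choose_table(group_by_columns):
    """Pick a summary table that matches the grouping, else the detailed table."""
    return SUMMARY_TABLES.get(frozenset(group_by_columns), DETAIL_TABLE)


print(choose_table(["item"]))
print(choose_table(["month", "item"]))
print(choose_table(["customer"]))
```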
Detailed Information
Detailed information is not kept online; rather, it is aggregated to the next level of
detail and then archived to tape. The detailed information part of the data warehouse
keeps this information in a starflake schema. Detailed information is loaded
into the data warehouse to supplement the aggregated data.
Summary Information
Summary information is the part of the data warehouse that stores predefined
aggregations. These aggregations are generated by the warehouse manager.
Summary information must be treated as transient. It changes on the go in order to
respond to changing query profiles.
It needs to be updated whenever new data is loaded into the data warehouse.
It may not have been backed up, since it can be regenerated from the
detailed information.
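Because summary information can be regenerated from the detailed information, one simple approach is to rebuild it after every load. The sketch below assumes hypothetical detail_sales and summary_sales_by_item tables in SQLite and rebuilds the whole summary each time, which is a simplification.

```python
import sqlite3


def rebuild_summary(conn):
    """Regenerate the summary table from scratch using the detailed data."""
    conn.execute("DROP TABLE IF EXISTS summary_sales_by_item")
    conn.execute("""
        CREATE TABLE summary_sales_by_item AS
            SELECT item, SUM(amount) AS total_amount
            FROM detail_sales
            GROUP BY item
    """)
    conn.commit()


def load_detail(conn, rows):
    """Load new detailed rows, then refresh the summary so it stays current."""
    conn.executemany("INSERT INTO detail_sales (item, amount) VALUES (?, ?)", rows)
    conn.commit()
    rebuild_summary(conn)


conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE detail_sales (item TEXT, amount REAL)")
load_detail(conn, [("pen", 1.5), ("book", 12.5)])
load_detail(conn, [("pen", 2.0)])
summary = conn.execute(
    "SELECT item, total_amount FROM summary_sales_by_item ORDER BY item"
).fetchall()
print(summary)
```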