0% found this document useful (0 votes)
38 views18 pages

Data Warehouse Concepts

A data warehouse is a centralized repository of integrated data from one or more disparate sources. It stores current and historical data and is optimized for querying and analysis rather than transaction processing. A data warehouse includes data from different sources, stores time-series data to allow analysis over time, and is structured for investigative analysis rather than real-time transactions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
38 views18 pages

Data Warehouse Concepts

A data warehouse is a centralized repository of integrated data from one or more disparate sources. It stores current and historical data and is optimized for querying and analysis rather than transaction processing. A data warehouse includes data from different sources, stores time-series data to allow analysis over time, and is structured for investigative analysis rather than real-time transactions.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 18

DATA WAREHOUSE

CONCEPTS
>OVERVIEW >DATA WAREHOUSE
>CHARACTERISTICS OF DATA WAREHOUSE
>HISTORY OF DATA WAREHOUSE >GOALS OF
DATA WAREHOUSE >TYPES OF DATA WAREHOUSE
>OVERVIEW:
• Data warehouse is a relational database management system (RDBMS)
construct.
• described as any centralized data repository.
• stores information oriented to satisfy decision-making requests.
• support architectures and tool for business executives.
• environment contains an extraction, transportation, and loading (ETL) solution,
an online analytical processing (OLAP) engine, customer analysis tools, and
other applications.

Source:https://www.javatpoint.com/data-warehouse
>WHAT IS DATA WAREHOUSE?

•is a subject-oriented, integrated, time-


variant, and non-volatile collection of data
in support of management’s decision making
process

Source: https://www.javatpoint.com/data-warehouse
A DATA WAREHOUSE CAN BE VIEWED AS A DATA
SYSTEM WITH THE FOLLOWING ATTRIBUTES:
• It is a database designed for investigative tasks.
• It supports a relatively small number of clients with
relatively long interactions.
• It includes current and historical data.
• Its usage is read-intensive.
• It contains a few large tables.
Source: https://www.javatpoint.com/data-warehouse
>CHARACTERISTICS OF DATA
WAREHOUSE/FEATURES

1.Subject-oriented
2.Integrated
3.Time Variant
4.Non-Volatile
Source: https://www.javatpoint.com/data-warehouse
1. SUBJECT-ORIENTED
• because it provides information about
a subject rather than organization's
ongoing operations.
• Subject can be customer, product, or
sales, etc.
• This is done by excluding data that
are not useful concerning the subject
and including all data needed by the
users to understand the subject.

Source: https://www.javatpoint.com/data-warehouse
2. INTEGRATED
•A data warehouse integrates various
heterogeneous data sources like
RDBMS, flat files, and online
transaction records.
• Itrequires performing data cleaning
and integration during data
warehousing to ensure consistency in
naming conventions, attributes types,
etc., among different data sources.

Source: https://www.javatpoint.com/data-warehouse
3. TIME-VARIANT
• Historical information is kept in a data warehouse.
• For example, one can retrieve files from 3 months,
6 months, 12 months, or even previous data from
a data warehouse.
• A data warehouse is a time-variant database,
which supports the business management in
analyzing the business and comparing the
business with different time periods like year,
quarter, month, week and date. Source: https://slideplayer.com/slide/8019533/

• These variations with a transactions system, where


often only the most current file is kept.

Source: https://www.javatpoint.com/data-warehouse
4. NON-VOLATILE
• The data warehouse is a physically separate data
storage.
• The operational updates of data do not occur in the
data warehouse.
• It usually requires only two procedures in data
accessing: Initial loading of data and access to data.
• does not require transaction processing, recovery,
and concurrency capabilities, which allows for
substantial speedup of data retrieval.
• defines that once entered into the warehouse, and
data should not change.

Source: https://www.javatpoint.com/data-warehouse
HISTORY OF DATA WAREHOUSE
Here are some key events in evolution of Data Warehouse:
• 1960- Dartmouth and General Mills in a joint research project, develop the terms
dimensions and facts.
• 1970- A Nielsen and IRI introduces dimensional data marts for retail sales.
• 1983- Tera Data Corporation introduces a database management system which is
specifically designed for decision support
• Data warehousing started in the late 1980s when IBM worker Paul Murphy and
Barry Devlin developed the Business Data Warehouse.
• However, the real concept was given by Inmon Bill. He was considered as a father of
data warehouse. He had written about a variety of topics for building, usage, and
maintenance of the warehouse & the Corporate Information Factory.
Source: https://www.guru99.com/data-warehousing.html
GOALS OF DATA WAREHOUSING

• To help reporting as well as analysis


• Maintain the organization's historical information
• Be the foundation for decision making.

Source: https://www.javatpoint.com/data-warehouse
NEED FOR DATA WAREHOUSE
• Data Warehouse is needed for the following reasons:
1. Business User: require a data warehouse to view summarized data
from the past.
2. Store historical data: Data Warehouse is required to store the time
variable data from the past.
3. Make strategic decisions: contributes to making strategic decisions.

4. For data consistency and quality: Bringing the data from different
sources at a commonplace, the user can effectively undertake to
bring the uniformity and consistency in data.

5. High response time: Data warehouse has to be ready for somewhat


unexpected loads and types of queries.

Source: https://www.javatpoint.com/data-warehouse
BENEFITS OF DATA WAREHOUSE
• Understand business trends and make better forecasting decisions.
• are designed to perform well enormous amounts of data.
• The structure of data warehouses is more accessible for end-users to
navigate, understand, and query.
• Queries that would be complex in many normalized databases could be
easier to build and maintain in data warehouses.
• is an efficient method to manage demand for lots of information from lots
of users.
• provide the capabilities to analyze a large amount of historical data.
Source: https://www.javatpoint.com/data-warehouse
TYPES OF DATA WAREHOUSE
• Three main types of Data Warehouses are:
1. Enterprise Data Warehouse:
• is a centralized warehouse.
2. Operational Data Store:
• arenothing but data store required when neither Data warehouse nor
OLTP systems support organizations reporting needs.

3. Data Mart:
• is a subset of the data warehouse.

Source: https://www.guru99.com/data-warehousing.html
WHO NEEDS DATA WAREHOUSE?
Data warehouse is needed for all types of users like:
• Decision makers
• Users who use customized, complex processes
• people who want simple technology to access the data
• people who want a systematic approach for making decisions.
• Useful for the user who wants fast performance on a huge amount of
data
• People who want to discover 'hidden patterns' of data-flows and
groupings.
Source: https://www.guru99.com/data-warehousing.html
WHAT IS A DATA WAREHOUSE USED FOR?
1. Airline:
• used for operation purpose
2. Banking:
• used in the banking sector
• used for the market research, performance analysis of the product and
operations.
3. Healthcare:
• used Data warehouse to strategize and predict outcomes
4. Public sector:
• used for intelligence gathering.

Source: https://www.guru99.com/data-warehousing.html
5. Investment and Insurance sector:
• used to analyze data patterns, customer trends, and to track market
movements.

6. Retail chain:
• used for distribution and marketing.
7. Telecommunication:
• used in this sector for product promotions, sales decisions and to make
distribution decisions.

8. Hospitality Industry:
• utilizes warehouse services to design as well as estimate their advertising
and promotion campaigns
Source: https://www.guru99.com/data-warehousing.html
DATA WAREHOUSE TOOLS
1. MarkLogic:
• MarkLogic is useful data warehousing solution that makes data integration easier
and faster using an array of enterprise features.
https://developer.marklogic.com/products/
2. Oracle:
• It offers a wide range of choice of data warehouse solutions for both on-
premises and in the cloud. It helps to optimize customer experiences by increasing
operational efficiency. https://www.oracle.com/index.html
3. Amazon RedShift:
• It is a simple and cost-effective tool to analyze all types of data using standard
SQL and existing BI tools.
• https://aws.amazon.com/redshift/?nc2=h_m1
Source: https://www.guru99.com/data-warehousing.html

You might also like