Data Warehouse Concepts
Data Warehouse Concepts
CONCEPTS
>OVERVIEW >DATA WAREHOUSE
>CHARACTERISTICS OF DATA WAREHOUSE
>HISTORY OF DATA WAREHOUSE >GOALS OF
DATA WAREHOUSE >TYPES OF DATA WAREHOUSE
>OVERVIEW:
• Data warehouse is a relational database management system (RDBMS)
construct.
• described as any centralized data repository.
• stores information oriented to satisfy decision-making requests.
• support architectures and tool for business executives.
• environment contains an extraction, transportation, and loading (ETL) solution,
an online analytical processing (OLAP) engine, customer analysis tools, and
other applications.
Source:https://www.javatpoint.com/data-warehouse
>WHAT IS DATA WAREHOUSE?
Source: https://www.javatpoint.com/data-warehouse
A DATA WAREHOUSE CAN BE VIEWED AS A DATA
SYSTEM WITH THE FOLLOWING ATTRIBUTES:
• It is a database designed for investigative tasks.
• It supports a relatively small number of clients with
relatively long interactions.
• It includes current and historical data.
• Its usage is read-intensive.
• It contains a few large tables.
Source: https://www.javatpoint.com/data-warehouse
>CHARACTERISTICS OF DATA
WAREHOUSE/FEATURES
1.Subject-oriented
2.Integrated
3.Time Variant
4.Non-Volatile
Source: https://www.javatpoint.com/data-warehouse
1. SUBJECT-ORIENTED
• because it provides information about
a subject rather than organization's
ongoing operations.
• Subject can be customer, product, or
sales, etc.
• This is done by excluding data that
are not useful concerning the subject
and including all data needed by the
users to understand the subject.
Source: https://www.javatpoint.com/data-warehouse
2. INTEGRATED
•A data warehouse integrates various
heterogeneous data sources like
RDBMS, flat files, and online
transaction records.
• Itrequires performing data cleaning
and integration during data
warehousing to ensure consistency in
naming conventions, attributes types,
etc., among different data sources.
Source: https://www.javatpoint.com/data-warehouse
3. TIME-VARIANT
• Historical information is kept in a data warehouse.
• For example, one can retrieve files from 3 months,
6 months, 12 months, or even previous data from
a data warehouse.
• A data warehouse is a time-variant database,
which supports the business management in
analyzing the business and comparing the
business with different time periods like year,
quarter, month, week and date. Source: https://slideplayer.com/slide/8019533/
Source: https://www.javatpoint.com/data-warehouse
4. NON-VOLATILE
• The data warehouse is a physically separate data
storage.
• The operational updates of data do not occur in the
data warehouse.
• It usually requires only two procedures in data
accessing: Initial loading of data and access to data.
• does not require transaction processing, recovery,
and concurrency capabilities, which allows for
substantial speedup of data retrieval.
• defines that once entered into the warehouse, and
data should not change.
Source: https://www.javatpoint.com/data-warehouse
HISTORY OF DATA WAREHOUSE
Here are some key events in evolution of Data Warehouse:
• 1960- Dartmouth and General Mills in a joint research project, develop the terms
dimensions and facts.
• 1970- A Nielsen and IRI introduces dimensional data marts for retail sales.
• 1983- Tera Data Corporation introduces a database management system which is
specifically designed for decision support
• Data warehousing started in the late 1980s when IBM worker Paul Murphy and
Barry Devlin developed the Business Data Warehouse.
• However, the real concept was given by Inmon Bill. He was considered as a father of
data warehouse. He had written about a variety of topics for building, usage, and
maintenance of the warehouse & the Corporate Information Factory.
Source: https://www.guru99.com/data-warehousing.html
GOALS OF DATA WAREHOUSING
Source: https://www.javatpoint.com/data-warehouse
NEED FOR DATA WAREHOUSE
• Data Warehouse is needed for the following reasons:
1. Business User: require a data warehouse to view summarized data
from the past.
2. Store historical data: Data Warehouse is required to store the time
variable data from the past.
3. Make strategic decisions: contributes to making strategic decisions.
4. For data consistency and quality: Bringing the data from different
sources at a commonplace, the user can effectively undertake to
bring the uniformity and consistency in data.
Source: https://www.javatpoint.com/data-warehouse
BENEFITS OF DATA WAREHOUSE
• Understand business trends and make better forecasting decisions.
• are designed to perform well enormous amounts of data.
• The structure of data warehouses is more accessible for end-users to
navigate, understand, and query.
• Queries that would be complex in many normalized databases could be
easier to build and maintain in data warehouses.
• is an efficient method to manage demand for lots of information from lots
of users.
• provide the capabilities to analyze a large amount of historical data.
Source: https://www.javatpoint.com/data-warehouse
TYPES OF DATA WAREHOUSE
• Three main types of Data Warehouses are:
1. Enterprise Data Warehouse:
• is a centralized warehouse.
2. Operational Data Store:
• arenothing but data store required when neither Data warehouse nor
OLTP systems support organizations reporting needs.
3. Data Mart:
• is a subset of the data warehouse.
Source: https://www.guru99.com/data-warehousing.html
WHO NEEDS DATA WAREHOUSE?
Data warehouse is needed for all types of users like:
• Decision makers
• Users who use customized, complex processes
• people who want simple technology to access the data
• people who want a systematic approach for making decisions.
• Useful for the user who wants fast performance on a huge amount of
data
• People who want to discover 'hidden patterns' of data-flows and
groupings.
Source: https://www.guru99.com/data-warehousing.html
WHAT IS A DATA WAREHOUSE USED FOR?
1. Airline:
• used for operation purpose
2. Banking:
• used in the banking sector
• used for the market research, performance analysis of the product and
operations.
3. Healthcare:
• used Data warehouse to strategize and predict outcomes
4. Public sector:
• used for intelligence gathering.
Source: https://www.guru99.com/data-warehousing.html
5. Investment and Insurance sector:
• used to analyze data patterns, customer trends, and to track market
movements.
6. Retail chain:
• used for distribution and marketing.
7. Telecommunication:
• used in this sector for product promotions, sales decisions and to make
distribution decisions.
8. Hospitality Industry:
• utilizes warehouse services to design as well as estimate their advertising
and promotion campaigns
Source: https://www.guru99.com/data-warehousing.html
DATA WAREHOUSE TOOLS
1. MarkLogic:
• MarkLogic is useful data warehousing solution that makes data integration easier
and faster using an array of enterprise features.
https://developer.marklogic.com/products/
2. Oracle:
• It offers a wide range of choice of data warehouse solutions for both on-
premises and in the cloud. It helps to optimize customer experiences by increasing
operational efficiency. https://www.oracle.com/index.html
3. Amazon RedShift:
• It is a simple and cost-effective tool to analyze all types of data using standard
SQL and existing BI tools.
• https://aws.amazon.com/redshift/?nc2=h_m1
Source: https://www.guru99.com/data-warehousing.html