0% found this document useful (0 votes)
2 views

Components of Data Warehouse

The document outlines the components of a data warehouse, including source data, data staging, data storage, metadata, data marts, and management/control components. It details the types of source data such as production, internal, archived, and external data, as well as the processes of data extraction, transformation, and loading. Additionally, it explains the role of metadata and the purpose of data marts in catering to specific business needs.

Uploaded by

yuvan.yuvan2004
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
2 views

Components of Data Warehouse

The document outlines the components of a data warehouse, including source data, data staging, data storage, metadata, data marts, and management/control components. It details the types of source data such as production, internal, archived, and external data, as well as the processes of data extraction, transformation, and loading. Additionally, it explains the role of metadata and the purpose of data marts in catering to specific business needs.

Uploaded by

yuvan.yuvan2004
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 2

Components of Data Warehouse:

Source Data Component in Data Warehousing


1. Production Data:
o Originates from the organization's operational systems.
o Specific data segments are selected based on the data requirements of the data
warehouse.
2. Internal Data:
o Includes private spreadsheets, reports, customer profiles, and departmental
databases maintained within the organization.
o Some of this data can be valuable for analysis and integration into the data
warehouse.
3. Archived Data:
o Historical data extracted from operational systems is periodically archived.
o These archives store older data no longer actively used in day-to-day operations.
4. External Data:
o Comes from sources outside the organization, such as industry reports, market
statistics, and third-party data providers.
o Executives often rely on this data for strategic decision-making.
Data Staging Component in Data Warehousing
1. Data Extraction:
o Data is collected from multiple sources using appropriate techniques for each
source.
2. Data Transformation:
o Extracted data is cleaned, standardized, and integrated.
o Tasks include correcting errors, filling missing values, removing duplicates, and
standardizing formats.
o Data from different sources is combined, sorted, and merged.
o Irrelevant data is purged, and meaningful data is reorganized for analysis.
3. Data Loading:
o During the initial setup, large volumes of data are loaded into the warehouse,
which is time-intensive.
o Subsequent loads involve updating and maintaining the data efficiently.
Data Warehouse Components
1. Data Storage Component:
o Data storage in a data warehouse is separate from operational data repositories.
o Operational systems store only current, highly normalized data optimized for fast
transaction processing.
o Data warehouses, in contrast, store historical and aggregated data optimized for
analysis and reporting.
2. Metadata Component:
o Acts as a data dictionary or catalog for the data warehouse.
o Stores information about data structures, records, addresses, indexes, and
relationships.
o Helps users and systems understand and manage the stored data effectively.
3. Data Marts:
o Subsets of the data warehouse designed for specific business units or user
groups.
o Focused on particular subjects, offering quicker access and simplified querying.
o While data warehouses handle broad organizational data, data marts address
specialized analytical needs.
o Modern approaches often involve building data warehouses with multiple
interconnected data marts.
4. Management and Control Component:
o The management and control component oversees data warehouse operations,
ensuring smooth data transformation, transfer, storage, and delivery.
o It works with database management systems to securely store data, monitors
data movement through staging, and regulates data delivery to users,
maintaining consistency and efficiency across all processes.

You might also like