Metadata helps in constructing, preserving, handling, and making use of the data warehouse.
There are two types of metadata in a data warehouse:
i) Technical Metadata comprises information that can be used by developers and managers when executing
warehouse development and administration tasks.
ii) Business Metadata comprises information that offers an easily understandable perspective of the data
stored in the warehouse. Metadata plays an important role in helping business and technical teams
understand the data present in the warehouse and convert it into information.
A data mart is a subset of a data warehouse focused on a particular line of business, department, or subject
area. Data marts make specific data available to a defined group of users, which allows those users to quickly
access critical insights without wasting time searching through an entire data warehouse. For example, many
companies may have a data mart that aligns with a specific department in the business, such as finance, sales,
or marketing.
Extract, Transform, Load (ETL) is a process of data integration that encompasses three steps: extraction,
transformation, and loading. In a nutshell, ETL systems take large volumes of raw data from multiple sources,
convert it for analysis, and load that data into your warehouse.
ETL saves you significant time on data extraction and preparation - time that you can better spend on
evaluating your business. Practicing ETL is also part of a healthy data management workflow, ensuring high data
quality, availability, and reliability. Each of the three major phases of ETL saves time and development
effort by running just once in a dedicated data flow:
Extract: In ETL, the first link determines the strength of the chain. The extract stage determines which data
sources to use, the refresh rate (velocity) of each source, and the priorities (extract order) between them, all of
which heavily impact your time to insight.
Transform: After extraction, the transformation process brings clarity and order to the initial data swamp. Dates
and times are combined into a single format, and strings are parsed down to their true underlying meanings.
Location data is converted to coordinates, zip codes, or cities/countries. The transform step also sums up,
rounds, and averages measures, and it deletes useless data and errors or sets them aside for later inspection. It
can also mask personally identifiable information (PII) to comply with GDPR, CCPA, and other privacy
requirements.
Load: In the last phase, much as in the first, ETL determines targets and refresh rates. The load phase also
determines whether loading will happen incrementally, or whether it will require an "upsert" (updating existing
data and inserting new data) for each new batch of data.
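The three phases above can be sketched as a minimal pipeline. This is an illustrative toy, not a production ETL tool: the source records, field names, and date formats are assumptions chosen to show normalization during the transform step.

```python
import sqlite3
from datetime import datetime

# Hypothetical raw records from two sources with inconsistent date formats.
SOURCE_A = [{"sold": "03/01/2024", "region": "north", "amount": 120.0}]
SOURCE_B = [{"sold": "2024-01-05", "region": "South", "amount": 80.5}]

def extract():
    # Extract: pull raw rows from each configured source.
    return SOURCE_A + SOURCE_B

def transform(rows):
    # Transform: normalize dates to ISO format and region names to one case.
    out = []
    for r in rows:
        for fmt in ("%d/%m/%Y", "%Y-%m-%d"):
            try:
                day = datetime.strptime(r["sold"], fmt).date().isoformat()
                break
            except ValueError:
                continue
        out.append((day, r["region"].title(), round(r["amount"], 2)))
    return out

def load(rows):
    # Load: insert the cleaned rows into the warehouse table.
    con = sqlite3.connect(":memory:")
    con.execute("CREATE TABLE sales (sold TEXT, region TEXT, amount REAL)")
    con.executemany("INSERT INTO sales VALUES (?, ?, ?)", rows)
    return con

con = load(transform(extract()))
print(con.execute("SELECT COUNT(*), SUM(amount) FROM sales").fetchone())
# → (2, 200.5)
```

In a real system each phase would run against external sources and a dedicated warehouse rather than in-memory data, but the division of labor is the same.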
ROLAP stands for Relational OLAP, an application based on relational DBMSs. It performs dynamic
multidimensional analysis of data stored in a relational database. The architecture is three-tier, with three
components: the front end (user interface), the ROLAP server (metadata and request-processing engine), and
the back end (database server). In this three-tier architecture, the user submits a request, and the ROLAP engine
converts the request into SQL and submits it to the backend database. Popular ROLAP products include
Metacube by Stanford Technology Group and Red Brick Warehouse by Red Brick Systems.
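The translation step at the heart of a ROLAP engine can be illustrated with a toy function; the fact-table and column names here are assumptions, and a real engine would of course handle joins, filters, and metadata lookups as well.

```python
# Toy sketch of the ROLAP translation step: a multidimensional request
# (a list of dimensions plus a measure) is rewritten as a SQL GROUP BY
# over the fact table. Names like "sales_fact" are illustrative only.
def to_sql(dimensions, measure, fact_table="sales_fact"):
    dims = ", ".join(dimensions)
    return (f"SELECT {dims}, SUM({measure}) "
            f"FROM {fact_table} GROUP BY {dims}")

print(to_sql(["region", "quarter"], "amount"))
# → SELECT region, quarter, SUM(amount) FROM sales_fact GROUP BY region, quarter
```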
MOLAP stands for Multidimensional Online Analytical Processing. It processes data stored in a
multidimensional cube across various combinations of dimensions. Since the data is stored in a
multidimensional structure, the MOLAP engine uses pre-computed (pre-stored) information, and it can
dynamically perform aggregation along concept hierarchies. MOLAP is very useful in time-series data analysis
and economic evaluation. Tools that incorporate MOLAP include Oracle Essbase, IBM Cognos, and Apache
Kylin.
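The pre-computation idea can be sketched with plain dictionaries standing in for cube storage; the dimensions and figures below are invented for illustration.

```python
# Minimal sketch of a MOLAP-style pre-computed cube: base cells are keyed
# by (region, quarter), and roll-ups are aggregated once, ahead of query
# time, so each query is just a dictionary lookup.
base = {
    ("North", "Q1"): 120.0, ("North", "Q2"): 90.0,
    ("South", "Q1"): 80.5,  ("South", "Q2"): 60.0,
}

# Pre-compute the roll-up along the quarter dimension (one step up the
# concept hierarchy), as a MOLAP engine would do when loading the cube.
by_region = {}
for (region, _quarter), amount in base.items():
    by_region[region] = by_region.get(region, 0.0) + amount

print(by_region["North"])  # → 210.0, answered without scanning detail rows
```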
HOLAP stands for Hybrid Online Analytical Processing. It is a hybrid of the ROLAP and MOLAP technologies,
connecting both approaches in one architecture. It stores part of the data in ROLAP form and part in MOLAP
form, and accesses whichever store a query requires. Detailed relational tables are kept in the ROLAP structure,
while data that requires multidimensional views is stored and processed using the MOLAP architecture. A
popular HOLAP product is Microsoft SQL Server 2000, which provides a hybrid OLAP server.
Desktop Online Analytical Processing (DOLAP) architecture is most suitable for local multidimensional
analysis. It is like a miniature multidimensional database, or a sub-cube of a larger business data cube.
Features of Star Schema: (i) The data is stored in a denormalized form. (ii) It provides quick query response.
(iii) The star schema is flexible and can be changed or extended easily. (iv) It reduces the complexity of
metadata for developers and end users. Advantages of Star Schema: query performance, load performance and
administration, and built-in referential integrity.
Features of Snowflake Schema: (i) It has normalized tables. (ii) It occupies less disk space. (iii) It requires
more lookup time, as many tables are interconnected and dimensions are extended. Advantages of Snowflake
Schema: i) A snowflake schema occupies a much smaller amount of disk space than a star schema; less disk
space means more convenience and less hassle. ii) The snowflake schema offers some protection from data
integrity issues, which is one reason many people prefer it. iii) Data is easy to maintain and more structured.
iv) Data quality is better than in a star schema.
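The snowflake variant of a product dimension can be sketched the same way: the category attribute is normalized out into its own table, saving space but adding an extra join. Table and column names are again illustrative.

```python
import sqlite3

con = sqlite3.connect(":memory:")
con.executescript("""
CREATE TABLE dim_category (category_id INTEGER PRIMARY KEY, category TEXT);
CREATE TABLE dim_product  (product_id INTEGER PRIMARY KEY,
                           name TEXT, category_id INTEGER);
CREATE TABLE fact_sales   (product_id INTEGER, amount REAL);
INSERT INTO dim_category VALUES (1, 'Hardware');
INSERT INTO dim_product VALUES (1, 'Widget', 1);
INSERT INTO fact_sales VALUES (1, 99.0);
""")

# The extra hop (product -> category) is the snowflake's added lookup cost.
row = con.execute("""
    SELECT c.category, SUM(f.amount)
    FROM fact_sales f
    JOIN dim_product p ON p.product_id = f.product_id
    JOIN dim_category c ON c.category_id = p.category_id
    GROUP BY c.category
""").fetchone()
print(row)  # → ('Hardware', 99.0)
```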
FACT CONSTELLATION SCHEMA :- This is another schema for representing a multidimensional model. The
term fact constellation evokes a galaxy containing several stars: it is a collection of fact schemas having one or
more dimension tables in common, as shown in the figure below. This logical representation is mainly used in
designing complex database systems.
Advantages of Fact Constellation Schema:- i) Different fact tables are explicitly assigned to the dimensions.
ii) It provides a flexible schema for implementation.
Limitations of Fact Constellation Schema:- i) The schema is complex because of the several aggregations
involved. ii) A fact constellation solution is hard to maintain and support.