SCD Stage

Type 1 and Type 2 are two methods for handling Slowly Changing Dimensions (SCDs) in a data warehouse. Type 1 overwrites old data, while Type 2 preserves history by creating multiple records with keys/version numbers. The document provides examples of how supplier data may be stored using each type, and how DataStage can be used to implement SCD processing through its SCD stage.

Uploaded by

abreddy2003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

80 views

SCD Stage

Uploaded by

abreddy2003

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 6

Slowly Changing Dimensions (SCDs) are dimensions that have data that changes

slowly, rather than changing on a time-based, regular schedule.

Type 1
The Type 1 methodology overwrites old data with new data, and therefore does not track
historical data at all.
Here is an example of a database table that keeps supplier information:
-------------------------------------------------------------------

Supplier_Key Supplier_Code Supplier_Name Supplier_State

123 ABC Acme Supply Co CA

--------------------------------------------------------------------
In this example, Supplier_Code is the natural key and Supplier_Key is a surrogate key.
Technically, the surrogate key is not necessary, since the table will be unique by the
natural key (Supplier_Code). However, the joins will perform better on an integer than
on a character string.
Now imagine that this supplier moves their headquarters to Illinois. The updated table
would simply overwrite this record:
----------------------------------------------------------------

Supplier_Key Supplier_Code Supplier_Name Supplier_State

123 ABC Acme Supply Co IL

---------------------------------------------------------------

Type 2
The Type 2 method tracks historical data by creating multiple records for a given natural
key in the dimensional tables with separate surrogate keys and/or different version
numbers. With Type 2, we have unlimited history preservation as a new record is
inserted each time a change is made.
In the same example, if the supplier moves to Illinois, the table could look like this, with
incremented version numbers to indicate the sequence of changes:
-----------------------------------------------------------------

Supplier_Key Supplier_Code Supplier_Name Supplier_State Version

123 ABC Acme Supply Co CA 0
124 ABC Acme Supply Co IL 1
-----------------------------------------------------------------

Another popular method for tuple versioning is to add effective date columns.
-----------------------------------------------------------------------------------

Supplier_Key Supplier_Code Supplier_Name Supplier_State Start_Date End_Date

123 ABC Acme Supply Co CA 01-Jan-2000 21-Dec-2004
124 ABC Acme Supply Co IL 22-Dec-2004
------------------------------------------------------------------------------------
The null End_Date in row two indicates the current tuple version. In some cases, a
standardized surrogate high date (e.g. 9999-12-31) may be used as an end date, so that
the field can be included in an index, and so that null-value substitution is not required
when querying.
How to Implement SCD using DataStage 8.1 –SCD stage?
Step 1: Create a datastage job with the below structure-
1. Source file that comes from the OLTP sources
2. Old dimesion refernce table link
3. The SCD stage
4. Target Fact Table
5. Dimesion Update/Insert link

Figure 1

Step 2: To set up the SCD properties in the SCD stage ,open the stage and access the
Fast Path
Figure 2
Step 3: The tab 2 of SCD stage is used specify the purpose of each of the pulled keys
from the referenced dimension tables.

Figure 3

Step 4: Tab 3 is used to provide the seqence generator file/table name which is used to
generate the new surrogate keys for the new or latest dimesion records.These are keys
which also get passed to the fact tables for direct load.
Figure 4

Step 5: The Tab 4 is used to set the properties for configuring the data population logic
for the new and old dimension rows. The type of activies that we can configure as a part
of this tab are:

1. Generation the new Surrogate key values to be passed to the dimension and fact table
2. Mapping the source columns with the source column
3. Setting up of the expired values for the old rows
4. Defining the values to mark the current active rows out of multiple type rows
Figure 5

Step 6: Set the derivation logic for the fact as a part of the last tab.
Figure 6
Step 7: Complete the remaining set up, run the job

Figure 7

Anil Kumar - ETL Testing - 3.2 Yrs - Resume
100% (1)
Anil Kumar - ETL Testing - 3.2 Yrs - Resume
4 pages
Slowly Changing Dimension (SCD)
No ratings yet
Slowly Changing Dimension (SCD)
4 pages
Excelvan CL720D LED Projector English User Manual
No ratings yet
Excelvan CL720D LED Projector English User Manual
23 pages
Datastage Slowly Changing Dimensions
No ratings yet
Datastage Slowly Changing Dimensions
10 pages
SCD Type 1 Implementation Using Informatica PowerCenter
No ratings yet
SCD Type 1 Implementation Using Informatica PowerCenter
7 pages
Implementing Rapidly Changing Dimension: What Are Fast Changing Dimensions?
No ratings yet
Implementing Rapidly Changing Dimension: What Are Fast Changing Dimensions?
5 pages
Trouble Shooting Manual (Part 2 of 2), Event Log Messages: Robot Controller
No ratings yet
Trouble Shooting Manual (Part 2 of 2), Event Log Messages: Robot Controller
209 pages
SCD
No ratings yet
SCD
7 pages
Slowly Changing Dimension
No ratings yet
Slowly Changing Dimension
5 pages
Slowly Changing Dimensions (SCD) - Types - Data Warehouse
No ratings yet
Slowly Changing Dimensions (SCD) - Types - Data Warehouse
3 pages
Slowly Changing Dimensions
No ratings yet
Slowly Changing Dimensions
22 pages
ODI 12c SCD Type 2 Step by Step Implementation
No ratings yet
ODI 12c SCD Type 2 Step by Step Implementation
22 pages
SCD Types Session
No ratings yet
SCD Types Session
14 pages
Understanding Slowly Changing Dimensions SCD in Data Warehousing by Mainak Das Python in Plain English
No ratings yet
Understanding Slowly Changing Dimensions SCD in Data Warehousing by Mainak Das Python in Plain English
13 pages
Datastage - Slowly Changing Dimensions - Talentain
No ratings yet
Datastage - Slowly Changing Dimensions - Talentain
7 pages
Slowly Changing Dimension: Rahma Hassan
No ratings yet
Slowly Changing Dimension: Rahma Hassan
11 pages
About Slowly Changing Dimensions - SAS
No ratings yet
About Slowly Changing Dimensions - SAS
5 pages
SCD Docs
No ratings yet
SCD Docs
13 pages
Slowly Changing Dimensions (SCDs) Guide
No ratings yet
Slowly Changing Dimensions (SCDs) Guide
14 pages
DataTeam-LMS (Slowly Changing Dimesions)
No ratings yet
DataTeam-LMS (Slowly Changing Dimesions)
19 pages
2.5+-+Slowly+Changing+Dimensions
No ratings yet
2.5+-+Slowly+Changing+Dimensions
10 pages
History Management of Data - Slowly Changing Dimensions: Marek Wancerz, Paweł Wancerz
No ratings yet
History Management of Data - Slowly Changing Dimensions: Marek Wancerz, Paweł Wancerz
3 pages
Wancerz
No ratings yet
Wancerz
2 pages
Implementation of Slowly Changing Dimension To Data Warehouse To Manage Marketing Campaigns in Banks
No ratings yet
Implementation of Slowly Changing Dimension To Data Warehouse To Manage Marketing Campaigns in Banks
8 pages
Specification Comments : Download Now
No ratings yet
Specification Comments : Download Now
1 page
SCD in Databricks
No ratings yet
SCD in Databricks
16 pages
Chapter 4 Slow Changing Dimensions
No ratings yet
Chapter 4 Slow Changing Dimensions
5 pages
SCD 20II 20implementation 20in 20datastage 207.X
No ratings yet
SCD 20II 20implementation 20in 20datastage 207.X
7 pages
SCD Data Type _Notes_250302_112719
No ratings yet
SCD Data Type _Notes_250302_112719
1 page
Data Warehousing Using Slowly Changing DimensionsSCD in Informatica
No ratings yet
Data Warehousing Using Slowly Changing DimensionsSCD in Informatica
9 pages
Slowly Changing Dimentions (SCD) - Type 1, Type 2, Type 3
No ratings yet
Slowly Changing Dimentions (SCD) - Type 1, Type 2, Type 3
3 pages
DM 0903 Data Stage Slowly Changing PDF
No ratings yet
DM 0903 Data Stage Slowly Changing PDF
32 pages
SCD Type 3 Implementation Using Informatica PowerCenter
0% (1)
SCD Type 3 Implementation Using Informatica PowerCenter
6 pages
Cost Based Optimization
No ratings yet
Cost Based Optimization
14 pages
DWH Slowly Changing Dimension
No ratings yet
DWH Slowly Changing Dimension
4 pages
Slowly Changing Dimensions Specification A Relational Algebra Approach
No ratings yet
Slowly Changing Dimensions Specification A Relational Algebra Approach
6 pages
Data Modeling, Star Schema, Snowflake Schema
No ratings yet
Data Modeling, Star Schema, Snowflake Schema
7 pages
scd type 2
No ratings yet
scd type 2
7 pages
Slowly Changing Dimension DW
No ratings yet
Slowly Changing Dimension DW
3 pages
SCD
No ratings yet
SCD
2 pages
Slowly Changing Dimension (SCD'S) : Submitted By: BALAJI K
No ratings yet
Slowly Changing Dimension (SCD'S) : Submitted By: BALAJI K
14 pages
Slowly Changing Dimensions
No ratings yet
Slowly Changing Dimensions
3 pages
My Datastage Notes - SCD
No ratings yet
My Datastage Notes - SCD
4 pages
Datawarehouse and Datamart
No ratings yet
Datawarehouse and Datamart
9 pages
Types of SCD With Example
No ratings yet
Types of SCD With Example
30 pages
Class 3
No ratings yet
Class 3
28 pages
SCD types
No ratings yet
SCD types
5 pages
DWH
No ratings yet
DWH
5 pages
Slowly Dimension On Visio
No ratings yet
Slowly Dimension On Visio
10 pages
Informatica Senarios
No ratings yet
Informatica Senarios
6 pages
What Are Slowly Changing Dimensions
No ratings yet
What Are Slowly Changing Dimensions
2 pages
In The Star Schema Design
No ratings yet
In The Star Schema Design
11 pages
DWT Chapter 2 Part 2
No ratings yet
DWT Chapter 2 Part 2
14 pages
DatawareHousing Concepts
No ratings yet
DatawareHousing Concepts
20 pages
Data Warehouse and Data Modelling
No ratings yet
Data Warehouse and Data Modelling
11 pages
Facts & Dims
No ratings yet
Facts & Dims
14 pages
Normalization: Problems of Data Redundancy
No ratings yet
Normalization: Problems of Data Redundancy
15 pages
11 Chapter11+ +Building+the+Data+Warehouse+ +part2
No ratings yet
11 Chapter11+ +Building+the+Data+Warehouse+ +part2
22 pages
How To Define/Implement Type 2 SCD in SSIS Using Slowly Changing Dimension Transformation
No ratings yet
How To Define/Implement Type 2 SCD in SSIS Using Slowly Changing Dimension Transformation
11 pages
ISA Certified Automation Professional (CAP) Associate Study Notes: 500 Study Notes for Accelerated Certification Success
From Everand
ISA Certified Automation Professional (CAP) Associate Study Notes: 500 Study Notes for Accelerated Certification Success
Steve Brown
No ratings yet
Basic DBA Query v.1: Oracle Database
From Everand
Basic DBA Query v.1: Oracle Database
Oraclesql-plsql
5/5 (1)
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
From Everand
LPIC-3 Exam 306-300 Mastery: 500 Practice Questions on High Availability & Storage Clusters
Steve Brown
No ratings yet
MySQL Crash Course: A Hands-on Introduction to Database Development
From Everand
MySQL Crash Course: A Hands-on Introduction to Database Development
Rick Silva
No ratings yet
Intrusion Detection in Homogeneous and
No ratings yet
Intrusion Detection in Homogeneous and
2 pages
Socket 1
No ratings yet
Socket 1
2 pages
Problem Oriented Software Engineering1
No ratings yet
Problem Oriented Software Engineering1
1 page
Modeling and Automated
No ratings yet
Modeling and Automated
1 page
Bhaskar Reddy
No ratings yet
Bhaskar Reddy
7 pages
Avistage Testreports Document2
No ratings yet
Avistage Testreports Document2
4 pages
QS Avi Stage Referencedocumnet
No ratings yet
QS Avi Stage Referencedocumnet
16 pages
Bandwidth Estimation For IEEE 802
No ratings yet
Bandwidth Estimation For IEEE 802
2 pages
Snowflake Roles
No ratings yet
Snowflake Roles
1 page
Instructions To Prepare For Interview
No ratings yet
Instructions To Prepare For Interview
10 pages
New Uploaded Resume
No ratings yet
New Uploaded Resume
3 pages
S.No Emp - No Emp - Name Designation Department DOJ Current - Location Mobile No
No ratings yet
S.No Emp - No Emp - Name Designation Department DOJ Current - Location Mobile No
16 pages
Ds Filter
No ratings yet
Ds Filter
2 pages
Data Stage Course Content: Unit-1 Data Warehousing Concepts
No ratings yet
Data Stage Course Content: Unit-1 Data Warehousing Concepts
3 pages
Reference Residual
No ratings yet
Reference Residual
150 pages
Bhaskar Datastage Profile2
No ratings yet
Bhaskar Datastage Profile2
7 pages
IA Rules
No ratings yet
IA Rules
120 pages
Data Stage Scenarios: Scenario1. Cummilative Sum
No ratings yet
Data Stage Scenarios: Scenario1. Cummilative Sum
13 pages
Computer Class 9 WS 1
No ratings yet
Computer Class 9 WS 1
7 pages
(Archives) Microsoft Publisher 2007: Working With Rulers & Guides
No ratings yet
(Archives) Microsoft Publisher 2007: Working With Rulers & Guides
4 pages
4020918-1300SRM1435-(09-2014)-UK-EN-APC200 código de falha
No ratings yet
4020918-1300SRM1435-(09-2014)-UK-EN-APC200 código de falha
42 pages
Interactive Whiteboard Thesis
100% (2)
Interactive Whiteboard Thesis
4 pages
Lecture 1 - Computer System Overview
No ratings yet
Lecture 1 - Computer System Overview
42 pages
Stick Sport Infographic 07
No ratings yet
Stick Sport Infographic 07
1 page
Plaintext and Ciphertext
No ratings yet
Plaintext and Ciphertext
7 pages
Array Interview Question
No ratings yet
Array Interview Question
17 pages
Indala Reader Comparison2
No ratings yet
Indala Reader Comparison2
2 pages
Oracle Programming Using PL/SQL: Level 2 WWW - Micros.umsl - Edu
No ratings yet
Oracle Programming Using PL/SQL: Level 2 WWW - Micros.umsl - Edu
2 pages
Binder 1
No ratings yet
Binder 1
36 pages
Resetear RRU - Moshell
No ratings yet
Resetear RRU - Moshell
6 pages
Viewsat 9000hd Manual
No ratings yet
Viewsat 9000hd Manual
25 pages
Arduino Water Pressure Sensor Project, Water Level Pressure Sensor
100% (1)
Arduino Water Pressure Sensor Project, Water Level Pressure Sensor
21 pages
Cardiac Surgery Textbook
No ratings yet
Cardiac Surgery Textbook
2 pages
Manual PLC Click Koyo
No ratings yet
Manual PLC Click Koyo
186 pages
Logcat
No ratings yet
Logcat
4 pages
NETBEANS
No ratings yet
NETBEANS
22 pages
Resume
No ratings yet
Resume
3 pages
Chapter 16
No ratings yet
Chapter 16
20 pages
Digital Communication Systems by Simon Haykin-115
No ratings yet
Digital Communication Systems by Simon Haykin-115
6 pages
OECD - Customisation Opportunities of IUCLID
No ratings yet
OECD - Customisation Opportunities of IUCLID
81 pages
OSP With India Localization
No ratings yet
OSP With India Localization
18 pages
GUI Continued: SEED Infotech Pvt. LTD
No ratings yet
GUI Continued: SEED Infotech Pvt. LTD
11 pages
Variabilidad Desconocida - Método Desviación Estándar Dos Especificaciones (M) Ansi-Asq-z1.9-2008
No ratings yet
Variabilidad Desconocida - Método Desviación Estándar Dos Especificaciones (M) Ansi-Asq-z1.9-2008
12 pages
A8298 Datasheet
No ratings yet
A8298 Datasheet
28 pages
CISSP For Professionals
No ratings yet
CISSP For Professionals
2 pages

SCD Stage

Uploaded by

SCD Stage

Uploaded by

Slowly Changing Dimensions (SCDs) are dimensions that have data that changes

slowly, rather than changing on a time-based, regular schedule.

Supplier_Key Supplier_Code Supplier_Name Supplier_State

Supplier_Key Supplier_Code Supplier_Name Supplier_State

Supplier_Key Supplier_Code Supplier_Name Supplier_State Version

Supplier_Key Supplier_Code Supplier_Name Supplier_State Start_Date End_Date

You might also like