0% found this document useful (0 votes)
226 views3 pages

Datastage 8.5 PDF

The document provides an overview of the content covered in a Data Stage course. It includes topics such as data warehousing concepts, ETL processes, Data Stage architecture and components, designing and running Data Stage jobs, working with file and database stages, processing stages like joining and sorting, data transformation, and performance tuning. It also covers more advanced topics like parameter sets, job sequencing, and using the Data Stage director, web console, and Information Analyzer tools.

Uploaded by

Bhaskar Reddy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
226 views3 pages

Datastage 8.5 PDF

The document provides an overview of the content covered in a Data Stage course. It includes topics such as data warehousing concepts, ETL processes, Data Stage architecture and components, designing and running Data Stage jobs, working with file and database stages, processing stages like joining and sorting, data transformation, and performance tuning. It also covers more advanced topics like parameter sets, job sequencing, and using the Data Stage director, web console, and Information Analyzer tools.

Uploaded by

Bhaskar Reddy
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 3

Data Stage Course content

Data warehousing concepts


About IBM web sphere DS & QS
 Data warehouse & Data
warehousing
 Data warehousing Architectures
 ETL project phases
 ETL Process
 ETL Architecture
 Over view of project Architecture
 ER, Dimensional Model & I E
Notation
 Data warehousing Teams & Roles
 Understand Mapping Documents
Introduction to Data Stage












History & Features


Difference between 7.5X2 & 8.x
IBS DS & QS 8.0.1
DS info Sphere 8.5 Enhancements
View on Web console
IBM Data Stage 7.5x2 & 8.x
Architecture
Traditional Batch Processing
Partition & Pipeline parallelism
Partitioning & re- Partitioning
techniques
Combinality, Combining & collecting
Techniques
Configuration File & Node
Components

 7.5X2 & 8X Enhancements-New


Features
 Parallel Processing environments
(SMP, MPP etc.)
 DS Client Components Overview
 DS architecture Components
Overview

Data Stage Jobs & Stages


 Difference between server & parallel
jobs
 Overview of Parallel jobs, server
jobs, mainframe jobs, migration jobs
and job sequencing jobs
 Repository, DS Designer Toolbar &
Palette
 Active & Passive stages
 Palette Customization
 About Link Markers
 Frame work Operators

Design, Compile & Run DS jobs


 DS Job Design Process
 Designer Canvas customization
 Compile, Force Compile & multiple
job compile
 DTD & OSH code

Working with file stages







Sequential file & stage rules


Data set & types
File set
Differences-Data et, file set & seq
file stages
 Lookup file set stage
Development/Debug & STUB Stages





Row generator & column generator


Head, tail & sample stages
Peek stage
Copy stage

Working with relational databases


 Enterprise and plug-in stages
overview
 Oracle, DB/2, SQL server stages
 TERADATA
 ODBC (oracle, SQL server, Excel &
Access)
 Dynamic RDBMS stage
 Orchestrate schema import

Processing stages
Horizontal combining
 Join stage
 Lookup stage (normal, sparse, range,
case less)s
 Merge state
 Funnel stage & funneling
Sorting & vertical combining
 In-stage sorts (traditional sort)

 Sort stage (complex & simple sorts)


 Aggregate stage
 Remove duplicate stage
Data transformation with transformer
 Basic transformer vs. parallel
transformer
 External functions & macros
 Stage variables & system variables
 Transformer constraints
 Transformer execution order
Filtering methods





Constraints & source level


Filter stage
Switch stage
External filter stage

Other stages





Modify stage
Compress & Expand stage
Decode & Encode stage
Pivot stage

Slowly changing dimensions


 Type -0, 1, 2 Dimensions
 Implementing SCD -I & II in 8.0.1 &
7.5x2
 SCD stage
 Change capture & change apply
stage
 Surrogate key stage (state file &
sequence object)

Performance tuning tips


 Performance tuning with best
practices
 Performance tuning with
partitioning techniques
 Performance Analysis & Estimate
resource
Advance topics
 Parameter set & Parameter file
creation
 Data connection
Job control
 Job sequencing (run stages, error
handling stages, flow control stages
& etc)
 Batch jobs
Data stage designer additional features
 Import & export of repository
components
 Node configuration
 Generate exports

Data stage director


 Schedule/run/validate a compiled
job
 Message handling (job & project
level)
 Monitor jobs, filter/purge logs
 Unlocking job & customize

Web console & DS administrator


 security services, session
management
 Domain management, reporting,
scheduling..
 ADD, Delete and Organize projects
Information analyzer
 Column, primary key, foreign key,
cross domain & base line analysis

You might also like