Dynamic Partitioning To Increase Parallelism in PowerCenter
Dynamic Partitioning To Increase Parallelism in PowerCenter
Availability
Informatica PowerCenter session partition can be used to process data in parallel and achieve faster
data delivery. Using Dynamic Session Partitioning capability, PowerCenter can dynamically decide
the degree of parallelism. The Integration Service scales the number of session partitions at run time
based on factors such as source database partitions or the number of CPUs on the node resulting
significant performance improvement.
The Integration Service can decide the number of session partitions at run time based different
factors.
Note : Do not configure dynamic partitioning for a session that contains manual partitions. If you set
dynamic partitioning and you manually partition the session, the session will be invalid.
Session Configuration With Dynamic Partitioning
Session can be setup for dynamic partition from Config Object Tab in the session properties. You
can choose the available partitioning options from the drop down list.
When using different partition types with dynamic partition, following rules must be considered.
Pass-through partitioning : If you change the number of partitions at a partition point, the number
of partitions in each pipeline stage changes. If you use pass-through partitioning with a relational
source, the session runs in one partition in the stage.
Key range partitioning : You must define a closed range of numbers or date keys to use dynamic
partitioning. Dynamic partitioning does not scale partitions with key range partitioning on relational
targets.
Database partitioning : When you use database partitioning, the Integration Service creates
session partitions based on the source database partitions. This can be used only with Oracle and
IBM DB2 sources.
Hash auto-keys, Hash user keys, Round robin : Use hash user keys, hash auto-keys, and round-
robin partition types to distribute rows with dynamic partitioning. Use hash user keys and hash auto-
keys partitioning when you want the Integration Service to distribute rows to the partitions by group.
Use round-robin partitioning when you want the Integration Service to distribute rows evenly to
partitions.
Pass-through partitioning :- If you change the number of partitions at a partition point, the number
of partitions in each pipeline stage changes. If you use pass-through partitioning with a relational
source, the session runs in one partition in the stage.
Key range partitioning :- You must define a closed range of numbers or date keys to use dynamic
partitioning. The keys must be numeric or date datatypes. Dynamic partitioning does not scale
partitions with key range partitioning on relational targets.
Database partitioning :- When you use database partitioning, the Integration Service creates
session partitions based on the source database partitions. Use database partitioning with Oracle
and IBM DB2 sources.
Hash auto-keys, hash user keys, or round-robin :- Use hash user keys, hash auto-keys, and
round-robin partition types to distribute rows with dynamic partitioning. Use hash user keys and hash
auto-keys partitioning when you want the Integration Service to distribute rows to the partitions by
group. Use round-robin partitioning when you want the Integration Service to distribute rows evenly
to partitions.