Ab Initio Interview Question v1.0
Ab Initio Interview Question v1.0
1.
Objective
Lateral recruitment is one of the key activities of Ab Initio Center of Excellence. This document
has been intended to support the recruitment activities across TCS in Ab Initio ETL tool.
Answers to these questions have been mentioned to carry out the interview by any BI resource.
An experienced Ab Initio candidate is expected to answer all the questions correctly to become
technically qualified for a berth in TCS.
2. Organization
The interview questions have been classified according to Ab Initio features along with blend of
basic UNIX and Data warehousing conceptual questions.
3. Question Bank
Ab Initio Components
Phases - are used to break the graph into pieces. Temporary files created during a phase will be
deleted after its completion. Phases are used to effectively separately manage resource-consuming
(memory, CPU, disk) parts of the application.
Checkpoints - created for recovery purposes. These are points where everything is written to disk.
You can recover to the latest saved point - and rerun from it. You can have phase breaks with or
without checkpoints.
A.
Q. Which one is faster for processing fixed length DMLs or delimited DMLs.
A. Fixed length DML is better because it will directly read the data of that length without
any comparison but in delimited one, every character is to be compared.
Q. What’s the difference between partition by Key and partition by round robin component?
A. Partition by key places the right data at the right partition, partition by round robin in a
round robin manner.
3.2 Parallelism
3.3 Performance
Q. Create a multifile
A. m_mkfs.
Q. Suppose in a graph a look up name is changed, how u implement the change in graph.
A. Changes in the xfr can be done by grep and sed commands.
Q. Write a command in the start script to check the size of input file is zero or not,
if zero, graph will fail.
A. size = `du <filename> | awk { print $1}`
if [$size -eq 0]; then
exit
Q. Characteristic of DW
A. Subject oriented, integrated, non-volatile and time variant.
Q. Concept of Sarogate key, by which component in AbInitio u can generate sarogate key.
A. Assign Key Component.
Q. What is Fact? Explain the different type of Facts (additive, semi additive, non Additive).
R. A fact table contains composite key where each candidate key is a foreign key to the
dimension table.
S.
A. Additive: Additive facts are facts that can be summed up through all of the dimensions in
Semi-Additive: Semi-additive facts are facts that can be summed up for some of the
dimensions in the fact table, but not the others. Ex – Current Balance, because it can be added
for all accounts, but can’t be added for each day.
Non-Additive: Non-additive facts are facts that cannot be summed up for any of the
dimensions present in the fact table. Ex- Profit margin, because it can’t be added in day level
or account level.
Conformed Dimension (Reusable): A dimension which can be shared by multiple fact tables is
known as conformed dimension
Q. Explain Factless Fact.
A. Contains a series of key values does not contain any fact or measure.
Q. What is ODS?
A. It contain near real time data. ODS is used for analytical reporting as well as source for
data warehouse.