
Abinitio Questions

Uploaded by

Pavan Kumar
Copyright
© All Rights Reserved

Compact Interview Questions on 15/12/2014

1. Tell me about yourself.


2. Tell me about a complex graph you developed.
3. I have two files, FileA and FileB. FileA is sorted, and I join the two files in a Join
component. For sorted-input I select the in-memory option and connect FileA to the driving port.
What will happen, and will the output data be sorted or not?
4. Tell me about the Join with DB component and its parameters.
5. What is a generic graph? Can generic graphs work without psets? If yes, how?
6. In a Reformat I set count to 2 and write some business logic in transform0. I want to use
that transform in transform1 of the same Reformat. How?
7. Basic questions on Scan and Rollup functionality.
8. My input is like
U1,O1,I1
U1,O1,I2
U1,O1,I3
U1,O2,I1
U2,O5,I1
U2,O5,I2
U2,O6,I2
U2,O7,I4
I would like the distinct count of orders (O).
9. Performance tuning in Ab Initio.
10. Performance tuning in the database.
11. My input contains 50 columns. I would like to display all columns except those numbered
10, 20, 30, 40 and 50.
12. How to find two patterns in a file?
13. What are the outputs of select count(*) from emp and select count(1) from emp?
14. Which components partially break pipeline parallelism?
15. Input file contains 5 records. FBE select_expr is (next_in_sequence() > 1 and
next_in_sequence() < 4). How many records are available in my output?
16. How to convert an 8-way partition to a 3-way partition?
17. What are the differences between join and lookup?
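
The expected result for question 8 can be checked with a small sketch. This is plain Python, not an Ab Initio graph, and it assumes the item values in the sample are I1 to I4 with no stray spaces; the helper name distinct_order_counts is hypothetical. In a graph, one possible approach (an assumption, not stated in the question) is to dedup on user and order and then roll up on user.

```python
from collections import defaultdict

# Sample input from question 8 as (user, order, item) rows.
# Whitespace and apparent typos in the item column are normalised (assumption).
ROWS = """\
U1,O1,I1
U1,O1,I2
U1,O1,I3
U1,O2,I1
U2,O5,I1
U2,O5,I2
U2,O6,I2
U2,O7,I4
"""

def distinct_order_counts(text):
    """Return {user: number of distinct orders} for comma-separated rows."""
    orders_by_user = defaultdict(set)
    for line in text.splitlines():
        if not line.strip():
            continue
        user, order, _item = [field.strip() for field in line.split(",")]
        orders_by_user[user].add(order)
    return {user: len(orders) for user, orders in orders_by_user.items()}

counts = distinct_order_counts(ROWS)
# Number of distinct (user, order) pairs across the whole input.
total_pairs = sum(counts.values())
print(counts)       # {'U1': 2, 'U2': 3}
print(total_pairs)  # 5
```

The set per user is what removes the repeated (user, order) combinations; the item column is ignored because the question only asks about orders.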

CTS Company Questions

1. What versions have you used in your present and previous projects?
2. What is the difference between departition and partition components?
3. What is the difference between Partition by Key and Partition by Round-robin? In which
situations did you use these components in your project?
4. cat with a multifile and with a serial file?
5. How are tables extracted with Input Table?
6. How will you get Oracle tables in Ab Initio?
7. If we connect a serial file and a multifile, will it execute or not?
8. Serial file, Reformat, Join and multifile in one flow:
does it work? In what way does it work?
9. Parameters of the Scan component?
10. What did you do in your previous project?
11. How can you generate the cost and revenue of your project?
12. How are multifiles and serial files executed in a Unix environment?
13. What components did you use in your project?
14. What is the main aim of your project?
15. How can a multifile be changed to a serial file, and a serial file to a multifile?
16. How will the data be seen in multifiles?
17. The input records are
1
2
3
.
.
10
Divide these records between 2 ports, 2 records to each port alternately.
e.g. 1,2 --> port 0
3,4 --> port 1
5,6 --> port 0

18. file1 file2

1 1
1 1
1 1
Explain what the output will be for each type of join.
19. Maxcore for the join is 100 MB.
If we receive data of the sizes below at the driving port, what will happen in each case?
a. 90 MB
b. 100 MB
c. 200 MB

20. There are 12 records in the input file and in the output we want 4 serial files with equal
amounts of data.
input file --> ?? --> 4 serial files
Which component will you use in place of ??
21. Explain SCD and CDC.
22. How to improve the performance of a graph?
23. What is the abinitiorc (.abinitiorc) file?
24. Explain the behaviour of Scan and Rollup.
25. Explain (Bitwise) organisational data flow.
26. I want to read a complete file of 100 records and find at the end which record was corrupt
and log it accordingly. How?
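
The port assignment asked for in question 17 comes down to one arithmetic expression on the record number. The sketch below is Python, not DML, and the function name port_for is hypothetical. In a graph the record number might come from something like next_in_sequence() feeding a partition expression, but that mapping is an assumption.

```python
def port_for(record_number, group_size=2, num_ports=2):
    """Port index for a 1-based record number when records are dealt out
    in consecutive groups of group_size, cycling over num_ports ports."""
    return ((record_number - 1) // group_size) % num_ports

# Deal records 1..10 out to the two ports.
ports = {0: [], 1: []}
for n in range(1, 11):
    ports[port_for(n)].append(n)

print(ports[0])  # [1, 2, 5, 6, 9, 10]
print(ports[1])  # [3, 4, 7, 8]
```

Integer division groups the records into pairs, and the modulo alternates the pairs between the ports.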
Accenture on 7th Jan 2015
1. Tell me briefly about your professional details
2. What are the input sources to your project
3. Have you ever used the Write Multiple Files component
4. How do you view a particular partition's data
5. What is a partition
6. What is a multifile
7. What is m_dump
8. How do you create a multifile system
9. What components did you use in your project
10. What are your responsibilities in your projects
11. How many partitions do you have in your database
12. How do we find the number of partitions of a database
13. What are the partition components? Tell me their parameters
14. How to see the duplicate records in a table
15. If your source is a flat file on some other server, which component do you use to access it

Accenture questions:
1. Tell me about your previous project?
2. Parallelism types?
3. Lookup: static lookup and dynamic lookup?
4. Maxcore?
5. Driving port?
6. How many inputs can we take for a join?
7. If I have one input file having 50 MB maxcore and the other 9 remaining input files
having different maxcores, which one do you select as the driving port?
8. Difference between Rollup and Scan?
9. What are the mandatory functions in both Rollup and Scan, and are there any we can
ignore?
10. Lookup local?
11. Phases and checkpoints?
12. What are $#, $? and $mpjret?
13. How will you test the dbc file?
14. What is PDL?
15. What is metaprogramming?
16. What is Conduct>It?
17. How do you delete files older than 30 days?
18. Oracle: how is the where clause executed in Oracle?
19. Second max salary in Oracle?
20. Partition by Key and Partition by Round-robin?
HCL Questions
1. Tell me about yourself?
2. What are fact and dimension?
3. How to use granularity in a fact table?
4. What schemas did you use? Explain them. Which gives better performance?
5. Types of dimensions?
6. Which table do you have to load first, and where do you read that data from?
7. What is the use of a staging area in a DWH?
8. Explain SCDs. Did you work on them?
9. Just by looking at it, how do you find whether a table belongs to SCD Type 2 or not? Is it possible?
10. Explain FBE and tell me one scenario.
11. What are PBK and PBRR? Which gives better performance?
12. Difference between Replicate and Broadcast?
13. What is a multifile? Did you work on that?
14. How to change the depth of multifiles?
15. Difference between checkpoint and phase?
16. What is a join? Explain it and give scenarios for each join type.
17. Explain what parallelism is.
18. I have one file with 10 records. How to display the count in the target table?
19. I have 3 tables; the records for department number 10 do not get into the target. How to get them?
20. There is one table in which the header has 3 fields, the detail has 3 fields and the trailer has
the record count. How to read that table with DML?
21. How to get the depth of a multifile?
22. Using the awk command, how to count the number of columns?
23. How to find a 0-byte file?
24. How to delete the first 3 characters in a line?
25. Explain Rollup and Scan. What is the difference between them?
26. Using Reformat, what operations have you done so far in your graph or project?

HCL Interview Questions (18-10-14)


1. Difference between EME and sandbox
2. Different approaches of data warehousing
3. How to move a graph from development to QA and production
4. I checked out one graph from the EME and after 10 days I check in the same graph. Is it possible,
and why
5. Graphs to achieve SCD-1, SCD-2, SCD-3
6. Difference between Partition by Expression and Filter by Expression
7. Functionality of Scan and Rollup, and package-mode stages such as initialization and rollup or
scan: how many times is each called
8. How to remove files created more than 10 days ago
9. How to display particular fields from a file
10. Input file has 100 records; how to display records 40 to 50
11. How to delete duplicate rows from a table
12. Difference between delete and truncate

IBM 10-02-2015
1. Tell me about yourself
2. Explain your project and give some measures
3. What is Rollup?
4. I have an input file which contains one column with only Y or N records. Currently the ‘N’ value
records are rejected, but I want to send all the records without rejecting. How would you
achieve it?
5. Explain Join in Ab Initio
6. What is m_dump?
7. I want to see the incomplete data records. How can you do that with m_dump?
8. What is Dedup Sorted?
9. Here is my error – Unsorted record & overwritten
10. Which version are you using? What are the latest features in 11g?
11. How to apply a referential check constraint in your table
12. Write a SQL statement using a left outer join?
13. Have you used the MFS in your project?
14. I have records in a file like a b c d e f g h j. I want to display all the characters except
b?
15. What is a regular expression?
16. I want to display all the records whose line starts with c.

IBM 08-jan-2015
1. Briefly tell me about your project
2. What is the most complex graph you faced in your project
3. What are your roles and responsibilities in your project
4. What are your sources and destinations in your project
5. Which component is used to unload data from a database, and what parameters
do you configure in that component
6. What is the process to get the DML for data being unloaded from a database
7. If I use a select statement to unload data from a database but select only 2 fields out of 7
fields, how many fields are in the output file
8. What happens when we choose the select-statement option versus the table option in Input Table:
is it the same or different
9. Without the GDE, in Unix how do I get the DML of a table into a separate file
10. How many types of joins are in the Join component
11. What are the functionalities of the Scan, Rollup and Reformat components
12. What are the m_dump and m_mkfs commands
13. Questions on wrapper scripting
14. Which command is used to replace a pattern with a new pattern
15. I run a graph; if it succeeds its output goes to one file, else error information goes to
some other file. How
16. What is the difference between the m_cat and cat commands

IBM 21/02/2015
1. Tell me about yourself
2. How to delete 30-day-old files in Unix
3. What is your scheduling tool?
4. In a Unix script you will find the ‘#!’ statement. Can you tell me what it is and how it works
internally (#!/bin/ksh)
5. How to delete the last row in a file
6. How to delete duplicates in Oracle
7. In Ab Initio, my input value is 12345 and my DML type is decimal(5,2). What is the output?
And what will happen if I have input like 1234567?
8. What is $# , $@ , $?
9. In a join I have in in0
1
1
1
1
Another file contains the same field and the same records
1
1
1
1
What will my output records be if I use inner, full outer, left outer and right outer join
10. In a join I have the input below. What will the output be if I use in-memory
x
y
z
a
b
c
11. What are the functions in the EME?
12. What do you do in your project
13. Tell me more about your project and process
14. How to generate a rank for the emp table? I want to get the record whose rank is 2
15. What is the difference between a dimensional model and an ER model
16. What are normal forms? Explain each type of normal form
17. How to delete duplicate records in emp
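
The record counts for question 9 can be worked out with a small nested-loop join. This is a Python sketch, not the Ab Initio Join component. It assumes standard SQL-style equi-join behaviour with no deduplication of either input, and the function name join is hypothetical.

```python
def join(left, right, how="inner"):
    """Nested-loop equi-join of two lists of keys; returns (left, right) pairs.
    Unmatched rows are padded with None for the outer join types."""
    out = []
    matched_right = set()
    for l in left:
        hit = False
        for j, r in enumerate(right):
            if l == r:
                out.append((l, r))
                matched_right.add(j)
                hit = True
        if not hit and how in ("left", "full"):
            out.append((l, None))
    if how in ("right", "full"):
        for j, r in enumerate(right):
            if j not in matched_right:
                out.append((None, r))
    return out

in0 = [1, 1, 1, 1]
in1 = [1, 1, 1, 1]
sizes = {how: len(join(in0, in1, how)) for how in ("inner", "left", "right", "full")}
print(sizes)  # {'inner': 16, 'left': 16, 'right': 16, 'full': 16}
```

Every record on one side matches every record on the other, so each join type yields 4 x 4 = 16 rows under these assumptions; the outer variants add nothing because no record is left unmatched.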
Cap Gemini 18/02/2015

1. How do you check out a graph in Unix
2. How to check the versions of a particular object
3. How to see the locked objects for a particular user
4. Types of SCDs, and explain SCD Type 2
5. Word count for a multifile
6. My graph is input --> RFMT --> output file. I got 100 error records; I want to get the 50th
error record. How can I get it?
7. I want to run a graph twice in the same session. How can you do that
8. How to kill a running graph
9. How to delete recovery files
10. How do you use m_cleanup
11. What is m_kill
12. What is the driving port
13. When do you go for lookup or join
14. What is lookup_match?
15. What does string_lpad do?
16. What is ramp limit?
17. Different stages of your project
18. I want to get records 20 to 30 in a file
19. How do I check whether a graph has failed
20. What is $#
21. What is the NVL function in Oracle? Give me an example
22. Like the NVL function in Oracle, is there any similar function we can use in Ab Initio?
23. What is decode?
24. What are output-index and output-indexes
25. How do you create a tag for multiple objects

SAI SYSTEMS INTERVIEW QUESTIONS 17/2/2015

1. Tell me about yourself and your projects
2. What are your roles
3. If I have a requirement, what kind of approach to estimating your task can you give me
4. Can you brief me on what denormalization is?
5. Do you know SCD Type 2? How are the records represented in Type 2
6. How many types of joins are there in Oracle
7. How do you get unique values in a table without using distinct
8. Architecture of Ab Initio
9. Main components in Ab Initio
10. What is the GDE? When you run a graph, what happens in the background
11. What is the purpose of the EME
12. Difference between a private sandbox and a user sandbox
13. Difference between .db config and .cfg
14. What are environment variables
15. How will you normally do error handling
16. If I get an error such as a truncation error, I want to handle it without stopping.
In my source table the column length is 20 (char) and the destination is 10 (varchar). I want
to capture the first ten characters without any error and without any transformation
17. What is force_error? Is there any other similar function
18. Is it possible to create a surrogate key in the GDE? Is there any component to get an
auto-incrementing value
19. What is the purpose of parallelism?
20. Different kinds of partition components
21. What important factors do you consider to improve your graph?
22. How to move a graph from development to testing (which is in a different environment)? How
do you maintain all the information
23. Is it a configured environment variable

INCADESCENT
1. Tell me about yourself
2. Do you know continuous graphs, Metadata Hub, BRE, ACE?
3. Which company do you work for and what is your total experience?
4. Which scheduler do you use in your project?
5. What do the Compare Records component and the checksum component do?
6. Explain the multifile system?
7. Explain the benefits of the Ab Initio tool
8. What is a choice parameter
9. What is a conditional parameter? What will happen if we disable a component
10. How will you generate DML dynamically
11. What is conditional DML? How do you separate header, trailer and body
12. Generate DML using Ab Initio parameters
13. How to display the data using an Ab Initio command
14. In a flat file, how will you separate header, trailer and body, and how do you validate the records
against the record count in the trailer? i.e. the number of records in the body should match the
record count in the trailer.
15. How do you eliminate duplicates in Ab Initio? Apart from sort and dedup sort, how will you
eliminate them?
16. What are the partition and departition components? Explain in one sentence each for all
departition components.
17. Display the last record using the sed command
18. How will you replace a string in Unix
19. How will you do the same using the vi editor

21. How do you select the maximum marks for each subject in a student table.
22. How do you select the top 2 maximum marks for each subject.
23. What is the difference between star schema and snowflake schema
24. What is a factless fact table.
25. Find the sum of trans_amt department-wise from the table?

Scenario-1:
cust_id trans_amt
1001 10
1001 20
1001 30
1002 5
1002 40
1002 50
.... ...
..... ...

1003 20
1003 30

Expected output-1:

cust_ID trans_amt
1001 60
-----------------------
1002 95
-----------------------
1003 50
expected output-2:

Cust_ID trans_amt
------------------
1001 10
20
30
-------------------
1002 5
40
50
------------------
1003 20
30
--------------------
26. There are two input files: file1 has 2000 records and file2 has 300 records. I want to get 2 columns
from the second file?
27. What is max-core?
28. What is max memory?
29. Table1 has 10000 records; table2 has 3 records matching table1. How to write the SQL
query and how to design the Ab Initio graph?

30. How to fetch duplicate records from a table?

Ans: select * from emp where empno in (select empno from emp group by empno having
count(empno) > 1)
31. How to see the records in a Unix file in the range 35 to 90?
Ans: sed -n 35,90p filename
32. How to search for a pattern in a file? For example, somewhere in the Unix directory tree there is
a file called xyz (A/B/D/Y/xyz) and that file contains the pattern pandu.
Ans: grep -r pandu xyzfile
33. How to find files and directories?
find . -type f
find . -type d
34. What is the difference between union and union all?
35. Are union all and full outer join the same? -- No
36. Are union and full outer join equal? -- No
37. What will happen if you give the key as NULL in the Join component?
38. In the Rollup component, what will happen if you pass the key as NULL? And when will you get
the last record and the count of all input records?
39. What is a pset and how will you create one via the command line?
40. How will you decide max-core values within a graph for two or three components?
41. Design the job for the scenario below to get the total number of products for each customer (for
example, A is a customer and has two products: SONY and SAMSUNG)

CUSTOMER PRODUCT
----------------------
A SONY
B MARTK1
C HCL SHARE
D LAVA
A SAMSUNG
B NOKIA
C HCL TECH
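
The expected answer for the customer and product scenario in question 41 can be verified with a short sketch. This is Python, not a graph. The equivalent in SQL would be a group by on customer with count(*), and in a graph possibly a Rollup keyed on customer (an assumption; the question does not name a component).

```python
from collections import Counter

# (customer, product) pairs from the scenario in question 41
RECORDS = [
    ("A", "SONY"),
    ("B", "MARTK1"),
    ("C", "HCL SHARE"),
    ("D", "LAVA"),
    ("A", "SAMSUNG"),
    ("B", "NOKIA"),
    ("C", "HCL TECH"),
]

# Count one product per record, grouped by customer.
product_counts = Counter(customer for customer, _product in RECORDS)
for customer in sorted(product_counts):
    print(customer, product_counts[customer])
# A 2
# B 2
# C 2
# D 1
```

This counts rows per customer, which equals the number of products only while each (customer, product) pair appears once, as in the sample.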

IBM
Interview Questions
1. Tell me about your professional background?
2. Tell me about two complicated graphs in your experience, which components you used in them,
and the execution flow?
3. What are the SCD types, and which component is useful for each type of SCD?
4. What is the difference between performing a join using a lookup file and an input file, and using
Join with DB and an input file?
5. When we press the run button in the GDE, what goes on in the background?
6. What is unit testing? How do you do it?
7. How do we change a 2-way partition to a 4-way partition, and which component is used to do this?
8. Difference between Partition by Round-robin and Partition by Key, and which one is better?
9. I have one multifile. If I use rm multifile_name, what happens?
10. How to find records modified in the last two days?
11. Differences between grep, fgrep and egrep?
12. Nth highest salary?
13. How do we remove duplicate rows?

Oracle Interview Questions


1. grep, fgrep, egrep differences
2. To print the last record using sed
3. One file is given; print the records in reverse order
4. I have 50 records in my input table. Without using any component, do the parallel unload,
i.e.
1-10 -----> 1st partition
11-20 ----> 2nd partition
21-40 -----> 3rd partition
remaining ------> 4th partition
5. I have a file with only one field, and the values are like
abc
123
pre
245
654
Ytr
-
-
--
How to print these fields in reverse order
6. How to print the last record for each group in Rollup
7. Differences between Broadcast and Replicate.
8. Difference between Merge and Gather
9. What are psets and their functionalities.
10. How to generate a sequence number for records in a file.
11. Nth highest salary
12. How to print unique rows from a table

Shutterfly Screening Questions


SET I
1) Oracle: there are 3 tables, one with channel id and channel name, another with date id and date,
and a fact table with date id, channel id and name. Need to find how many channels there are for a
year. (Was able to partially explain the logic, not the exact SQL query.)
2) Ab Initio: same question. (Was able to answer; explained the approach.)
3) Reformat component: what are its parameters and in which order are they executed
internally
4) Scan component
5) Rollup with Scan
6) How many types of Rollup
7) Scenario: how will you find the trailer count in a file
8) Logging mechanism
9) Production scenarios: how will you check in case of production issues, and log verification.
(Able to answer.)
10) What will the log file contain, and how will it look for a Reformat? (Replied: will be able to
see, but need to check.)
11) Comma-separated file: how will you separate each record as a vector?
(Answered partially, with one approach using string_index and allocating to a new variable.)
12) Phase and checkpoint
13) If a file gets corrupted partway through in production, can we restart from failure recovery
after cleansing?
(Explained the approach: we should correct the file and load it fresh, because records which
have already passed at the initial stage may contain junk data. Although we can load from the
checkpoint, there is a probability that records have bad data.)

SET II
1. Complex job in Ab Initio
2. Explain performance tuning in Ab Initio
3. Input file with today's inserts and updates and a target fact table with a billion records. CDC on 5
keys. Approach to update and insert into the fact using ABI and SQL.
4. Rollup component: what types of aggregation functions are provided.
5. Checkpoint and phase
6. PDL functions: scenarios where used, and what types of functions can't be used in PDL
7. Use of a generic load graph to load multiple tables in parallel if a pset is not available. Answer:
using the AB_JOB parameter
8. What is a .rec file
9. Difference between lookup and join, and where you used lookup instead of join.
10. Oracle SQL question: find the channel that generated the most revenue and the best-performing channel

SET III

1) Overview of the candidate experience?


2) Overview of candidate experience with various tools?
3) What is skew and what’s its importance?
4) How to find Co>Op version? (m_env -v)
5) Find the depth of mfs? (m_expand)
6) What is phasing & check pointing? What is the use?
7) How to calculate memory requirements for a complex graph? (see how many join, sort,
rollup, lookups components are being used, how many ways it's running, etc.)
8) How to performance tune a complex graph?
9) What are PSETs?
10) What is PDL?
11) What is max-core and how it works with components like Join, Sort, Rollup?
12) How to copy a 4 Way MFS file to 8 way MFS File?
13) How to reference external xfr files in your xfr?
14) How to run a graph in a loop without a scheduling s/w or unix scripting?
15) What are different types of parallelism in AbInitio?
16) What’s the difference in Direct and API loads?
17) When would you unload tables and then join Vs. Join in DB and then unload?
18) What is m_db command and where would you use it?
19) What is dependency analysis?
20) To create a single vector out of multiple vectors of the same type which function would
you use ?(vector_append, vector_concat, vector_merge)
21) How to calculate length of a vector?
22) How to read data from Excel files?
23) What are linked sub graphs?
24) How to find out minimum and maximum values for a key out of multiple records (Claim-
Nb, Code)? (Rollup)
25) Give us 3 examples of some unique/complex AbInitio development that you may have
done.
26) How to calculate memory requirements for a complex graph including lookups, sorts,
and partitioned components?
27) How to generate Meta Keys for Dimensions in parallel?
28) What are some of the things to address for graph performance?
29) What usability functions do you like more in GDE 3.1?
30) TYPE-II dimensions are useful for what kind of analysis?
31) How to read and publish data in/out of queues using AbInitio (when no continuous flows
are available)?
32) When would you use 'Join with DB' component?
33) How do you read multiple Excel files from a directory?
34) What are dynamic lookup files and when are they used?
35) What is component folding?
36) What's the difference between a stored PROC and FUNCTION?
37) How do you replace NULL values with default values in SQL? (DB2)
38) How are self sized vectors defined and how is the size calculated?
39) How do you check in and check out from the EME using the command line?
40) How to find the difference between the graph in your sandbox vs. the EME?
41) Find exit status of the last executed command? ($?)
SET IV

1. Start with an overview of the last project.

2. Describe the last project, including sources handled, type of data and technology used
in the project.
3. Most complex SQL you have written.
4. How to validate that a load is correct.
5. How to validate NULL values.
6. One table has user id and join date; the other table has user id and email. Pull the
common records (asked to explain inner join). Pull out the most recent join
date for each user.
7. Two tables each have 1 billion records. Best way to process these
records in Ab Initio, i.e. dealing with huge volume when a certain condition is
used to filter records.
8. Most interesting pset code you have written and are proud of: CF flows was the answer
from Amit.
9. Have you worked on lookup files? How many records does a lookup call return.
10. How to get all matching lookup records (asked about lookup_match,
lookup_count, lookup_nth).
11. Different kinds of parallelism: data, pipeline, component. (Will a join break the pipeline or
not? If yes, in which case.)
12. Driving port of an in-memory join.
13. Rollup: in-memory and sorted rollup.
14. Dedup Sorted: asked about all three options available and a little more about
unique-only.
15. Partitioned data into serial: difference between Concatenate and Merge.
16. What is skew?
17. m_ commands: m_db was the only one he was able to provide at that time.
18. Exporting of the Co>Operating System: which parameter tells the value of this.
19. How many records get loaded if Output Table runs in a parallel layout? Suppose it runs
4-way: how many entries will there be in the log details.
20. Utility load / API: which way are you loading.
21. How to make sure the process restarts after a particular point in case of any
failure (commit number), or how to commit records up to a particular number in a table.
22. Describe the SOX environment?
23. Based on some expression (condition), want to put output data on multiple out ports:
they were looking for output-indexes/output-index.
2014-03-06
1. Summary of recent work.
2. How to convert a multi-file to a serial file? (Departitioners)
3. Identify and describe the standard departitioners (Concatenate, Gather, Interleave,
Merge).
4. Given a slowly operating join component, how can one optimize it? (In which
situations to use in-memory, and which to use sorted input?)
5. What are the possible “keep” settings for the Dedup Sorted component? (first, last,
unique-only)
6. Can one de-dup data without sorting? (Yes, use in-memory Rollup instead.)
7. DATE/CHANNEL/REVENUE query question... something like
select c.name
, sum(r.revenue)
, count(distinct r.order_number) -- in case order number is repeated in the fact table
from REVENUE r
inner join DATE d -- assuming r.date_id is not nullable
on (r.date_id = d.id)
inner join CHANNEL c -- assuming r.channel_id is not nullable
on (r.channel_id = c.id)
where to_char(d.date_string, 'yyyy') = '2013' -- or whatever (DB's date format function)
group by c.name
;
8. How to optimize database queries? (Did not know for sure, beyond indices. Partitioning
and database server tuning options might be possible, but such knowledge is beyond the
candidate's experience.)
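
The DATE/CHANNEL/REVENUE query in question 7 can be tried against a toy schema. This is a minimal sketch using Python's sqlite3 module, under several assumptions: the date table is named date_dim here (hypothetical), SQLite's strftime stands in for the database's own date format function, date_string holds ISO dates, and the grouping column matches the selected c.name. The sample rows are invented purely for illustration.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
    CREATE TABLE channel  (id INTEGER PRIMARY KEY, name TEXT);
    CREATE TABLE date_dim (id INTEGER PRIMARY KEY, date_string TEXT);
    CREATE TABLE revenue  (order_number INTEGER, date_id INTEGER,
                           channel_id INTEGER, revenue REAL);
    INSERT INTO channel  VALUES (1, 'web'), (2, 'store');
    INSERT INTO date_dim VALUES (10, '2013-03-01'), (11, '2014-01-15');
    INSERT INTO revenue  VALUES
        (100, 10, 1, 20.0),
        (100, 10, 1, 5.0),   -- repeated order number
        (101, 10, 2, 7.5),
        (102, 11, 1, 99.0);  -- falls outside 2013
""")

# Revenue and distinct order count per channel for the year 2013.
rows = conn.execute("""
    SELECT c.name,
           SUM(r.revenue),
           COUNT(DISTINCT r.order_number)
    FROM revenue r
    INNER JOIN date_dim d ON r.date_id = d.id
    INNER JOIN channel  c ON r.channel_id = c.id
    WHERE strftime('%Y', d.date_string) = '2013'
    GROUP BY c.name
    ORDER BY c.name
""").fetchall()
print(rows)  # [('store', 7.5, 1), ('web', 25.0, 1)]
```

The repeated order number 100 contributes both revenue rows to the sum but is counted once, which is the reason for count(distinct ...) in the interview answer.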

RS SOFTWARE
1) I have 5 records. How do I send the same records from input to output?
2) I have 10 records and I want to send the 1st record to the output port. How?
3) I want the 3rd max salary?
4) I want the top 3 salaries?
5) Delete the duplicates by using distinct?
6) How do we know whether a file is a multifile or a serial file?
7) Pipeline parallelism?
8) Grep versions?
9) Access time in Unix?
10) Modify time in Unix?
11) How to search for Tendulkar in a file (in Unix)?
12) I want to display only directories?
13) Air lock break?
14) Air lock release?
15) Air object versions?
16) How to divide header and trailer?
17) I have 3 files. How to append one after another?
18) How many versions can we store at most?
19) How to load data into Teradata?
20) What is the utility for loading if I have millions of records?

HCL
1) Star schema, snowflake schema
2) Left outer, right outer, inner join?
3) Number of arguments in Unix?
4) How to find the previous day's file?
5) I have 3 records: the 1st has 5 columns, the 2nd has 3 columns and the 3rd has 4
columns. I want to display the last column from all 3 records
6) What is . and what is /.
7) How to know how much time a job took to run?
8) Divide the header, trailer and detail records
9) What are the tables used in your project?
10) How to deploy a script?
11) Multifile system?
12) How many types of parallelism are there?
13) Max core, max memory?
14) Difference between EME and sandbox
15) Different approaches of data warehousing
16) How to move a graph from development to QA and production
17) I checked out one graph from the EME and after 10 days I check in the same graph. Is it possible,
and why
18) Graphs to achieve SCD-1, SCD-2, SCD-3
19) Difference between Partition by Expression and Filter by Expression
20) Functionality of Scan and Rollup, and package-mode stages such as initialization and rollup or
scan: how many times is each called
21) How to remove files created more than 10 days ago
22) How to display particular fields from a file
23) Input file has 100 records; how to display records 40 to 50
24) How to delete duplicate rows from a table
25) Difference between delete and truncate

WIPRO
1) Reformat, Redefine?
2) Rollup, Scan, Aggregate?
3) Normalize, Denormalize?
4) Output index, output indexes?
5) Replicate, Broadcast?
6) Gather, Concatenate?
7) Drive parameter?
8) What are multifiles?
9) Phases and checkpoints?
10) Where will the .rec files be stored?
11) How to recover the data?
12) How to see the data partitions of a multifile?
13) What is the list command for a multifile?
14) How to find a file modified 10 days back?
15) How to check out in the GDE and in Unix?
16) How to run the graph?
17) What is a pset?
18) What is sed?
19) How to replace Hari with ramu using the sed command?
20) What is grep? Give one example?
21) I want to display unique and duplicate records?
22) Meta pivot?
23) Metaprogramming?
24) How to use an output file as a lookup?
25) Second max salary?
26) Query to delete the duplicates?

Polaris
1) How to check out a particular object?
2) What is a commit number?
3) What are the dataset components?
4) What are the output index and output indexes?
5) What is Update Table? What is Multi Update?
6) What is the difference between truncate and delete?
7) I have 100 records and I want to load records 2 to 99. How?
8) How to send files from one server to another server?
9) I want to see the zero-byte files in directories?
10) What is the use of SQL?
11) I want to display the 50th to 70th records
12) How many graphs have you developed?
13) Did you work on continuous graphs, XML components
14) Utility load and API load

Wipro
1) Why do we use a data warehouse?
2) How to convert COBOL to DML
3) What do we need to set up for COBOL to DML?
4) How to convert a COBOL REDEFINES to DML?
5) What is the command to count a multifile in Unix?
6) How can we divide the header and trailer?
7) What CDS are there? Give an example?
8) What are the load utilities? When and where can we use them?
9) Project and developed graph example?

Satyam
1. In the Rollup component, if the key is NULL, how many records come to the out port?
2. What is promotion?
3. What is dependency analysis?
4. How to find a running process which is in the background?
5. What is the difference between the Broadcast and Replicate components?
6. Which advanced Ab Initio components have you used?
7. In the string "Satish Konda", how to find the number of O's using Ab Initio?
8. In the Scan component, if the key is NULL, how many records come to the out port?
9. In a lookup file, will all matching records come to the out port?
10. What does a .dbc file contain?
11. How to test the .dbc file?
12. XIS component in Ab Initio?

SYNECHRON INTERVIEW QUESTIONS:


1) Difference between an ad hoc multifile and a multifile?
2) In Join (outer join), are you using SCD Type 2?
3) Explain the finalize function?
4) How much memory does the CPU use? How to find out?
5) Explain Checkpointed Sort in Ab Initio?
6) Are checkpoints used or not after a Sort component?
7) Runtime behavior of the Sort component?
8) Runtime behavior of the Dedup component?
9) No identifier, and we don't know the location of header and trailer. How can we separate the data?
10) In memory vs. inputs must be sorted: which one should be used? In the Rollup component, which
should be used?
11) Pipeline parallelism breaks?
12) Explain AB_JOB_PREFIX?
13) How to find the second highest salary in SQL?
14) Why is dimensional design used in a DWH?
15) Is the dimensional model normalized?
16) Is a fact table normalized?

SYNTEL INTERVIEW QUESTIONS

1. Rollup scenario?
2. Are partition components used in front of Rollup or not?
3. I have a 4-way multifile in DEV and a 6-way in SIT. Do we face any problem while working
with this?
4. What is data poisoning?
5. Have you heard of the end script?
A. If a graph fails, will the end script run or not?
B. Even if the graph fails, the end script needs to run. How?
6. In TWS, how will you check job status?
7. How can we see the error in TWS?
8. Explain a complex/critical graph in your project?
9. Join in0, in1, in2: inner join?
10. Shell scripting?
11. Conditional components?

L & T Interview Questions

1. Explain Lookup.
2. What is the difference between lookup_count and lookup_local?
3. What is the difference between Phase and Checkpoint?
4. Explain the Rollup component.
5. What is the difference between temporary mode & Expanded mode?
6. What is the difference between merge & gather?
7. Explain departition Components?
8. What is the difference between PBRR & PBK?
9. I have records 1,2,3,4,5 in one input file. It is a 2-way multifile. I want records 1,3,5 in one
output file and records 2,4 in a second output file.
10. I have records 1,2,3,4,5 in one input file. It is a 2-way multifile. I want records 1,2,5 in one
output file and records 3,4 in a second output file.
11. How to Remove Header and Trailer records?
12. I have 1,00,000 records in one file and 30 records in another file. Both files have Deptno.
How do you join these two files?
13. Which command is used for pattern search?
14. I have Dept No, Dept Name, and Salary. I want the second-highest salary in each
department. Write the query in SQL.
15. Explain the surrogate key.
16. I have a 2-way partitioned file and a 4-way partitioned file. How do you join these two files?
17. What are the Unix commands used in your Projects?
18. How many graphs were created in your project?
19. Explain the Graph names?
20. How many PSETs were created in your project?
21. Have you worked with the EME or not?
22. Explain a complex graph in your projects.
23. Scan and Rollup have similar functions, but Scan gives cumulative summary records and
Rollup gives summary records. How?
L&T interview questions
1) Tell me about your project?
2) What are the dimensions used in your project?
3) How many dimension tables are there?
4) What is fact? How many fact tables are there?
5) What is Rollup? Give one example of how it works.
6) What is Scan? Give one example of how it works.
7) What is a multifile system? How will you create multifiles?
8) Did you use multifiles in your project?
9) What is the command to get the status of the last executed command in Unix?
10) What is parallelism? How many types are there?
11) Which parallelism will break the sort?
12) What is the command to remove a particular file?
13) I have columns 1, 2, 3, 4 but I want only 2 and 3. How will you get them in Unix?
14) How to copy 3 lines in mainframe?
15) How to access mainframe data?
16) How to connect to the database?
17) What is the command to stop a running job?
Syntel interview on 12th-July-2012
1) How do you handle a comma that occurs in the data of a comma-delimited file? Answer: use repair input
component.
2) How to run a mp in Unix?
3) How to find common projects in Unix?
4) There is a fixed-length file with record length 1400 and 1000 records, in which only one
record has length < 1400. How do you find that record in Unix?
5) I have one file that contains a header, body, and trailer, but I do not know the identifier. How will you
separate them?
a) Can we connect multiple inputs to Reformat? Why?
6) I have 1000 bytes in a file. I want to cut positions 11-20, 50-60, and 700-720. How will you
do it?
7) You can include a subgraph from a common project in a private project. If you make any changes in the
subgraph, will they be reflected in the private project?
8) How will you stop the graph after Phase1?
9) What is the difference between Phase & Checkpoint?
10)What is the Primary Key?
11) what is the Difference b/w Primary key & Unique key?
12) what is Index?
13) What type of validations will you do once you get a file?
14) How to find the 3rd highest salary in each department in a lookup file?
15) How to find the length of a record in a file using Abinitio?
16) studid ---- how will you get one record for each group?
10
10
20
20
17) what is the diff b/w dbc and cfg?
18) db2/unix/abinitio ---ratings(out of 5)?
19) Do you write multiple files?
20) roll up parameter
21) To save memory, what performance tuning will you do?
22) driving port in join component

Wipro interview
1. How many components can be set in one phase?
2. If the first phase has a zero-byte file, the 2nd phase should stop automatically. How?
3. In dev the file will be DVX, in testing it will be QAX, and in production it will
be changed to PDX dynamically. How?
4. The parameter value is AAA. How can you write this in a select statement, and how can you pass this parameter to
all DB components?
5. Out of 100 records, I want to load the records from the 21st onward. How?
6. In a script file, the very first line is usually #!/bin/ksh. If we put a blank line
before it, will the script run?
7. How can u convert 4-way to 8-way?
8. Diff b/w interleave & concatenate?
9. How does the Broadcast component work differently from other partition components?
10. If you set { } as the key parameter of Join, what will happen?
11. We are passing 20 records from in0 & 40 records from in1 with 5 matching records. What is
the record count with left outer join & full outer join?
12. The graph should stop automatically after processing 20 records out of 100. How?
13. What is the alternative command for FTP?
14. What is AB_JOB_PREFIX?
15. I want to process all the files in parallel without using MFS, and the DMLs of the files are different.
How? (using PSETs)
16. How can you display the 20th record out of 100 in Abinitio and in Unix?
17. How can you change the timestamp of a file in Unix?
18. How can you count the records on the output port and on the reject port of the Reformat
component?
19. Can we use a lookup with a full outer join?
20. How can you set a global variable without using the export command in Unix?
21. What is the layout of a component?
Wipro interview
1) What are fact and dimension tables?
2) What is an aggregate table?
3) What is ODS?
4) OLAP, OLTP
5) What are sed and awk?
6) Recursively, how to find the pattern in file and also in sub directories?
7) How to know, status of the graph? ==> mpjret
8) what is the first line in unix script?
9) How to run mp in unix?
10) what is lookup_local?
11) Difference between Force error & Force abort?
12a) How to find the file count?
12b) What will happen if we give all fields in the key of Rollup with the count() function?
13) Output index, Output indexes.
14) Reformat: In select statement Age!= 45, in Transform if name=D then force_error(), explain
the behaviour.
15a) Different types of modes?
15b) Difference between API & Utility mode.
16) Case 1: Huge data, select statements how to load the data?
Case 2: Small data , select statements how to load the data?
Case 3: If normal data how to load?
17) There are header, trailer, and detail records. How do you prepare the test data?
18) Departition components?
19) Difference between Merge, Interleave, concatenate, gather
20) How to create a sequence number in a multifile, in Unix and Abinitio?

Accenture Interview Questions


1) We have one string, but we don't know the length of the string. I want the output in the
following pattern.
1,3,5,7
2,4,6,8
3,5,7,9,11
2) I have one table with studid, name, and month-wise scores in vector format. I want normalized data. Which
component would you use?
3) What is a surrogate key? What is its use?
4) What is the Sort within Groups component?
5) How many departition components are there? Which is the best?
6) what is the use of snowflake schema?
7) what is the difference between Rollup & Scan and give example?
8) what is the force error?
9) what is the mpjret? How to use?
10) We have records with null values. How do you identify them while making sure the graph does not fail?
11) Did you prepare Wrapper script? then how?
12) what is .rec
13) What is Skew
14) An employee may work on multiple tasks. First find out how many tasks each employee is involved
in, then find the top 3 employees involved in multiple tasks.
15) Override parameter in join component?
16) Driving parameter in Join?
17) what is lookup &join?
18) what is the use of lookup?
19) What will happen if a large amount of data is assigned to a lookup file?
20) To load the data equally across partitions, which component will you use?

TCS – A1
1. What is your Role in current Project
2. Explain the Flow of your current project
3. Which database are you using for your project?
4. What does the .dbc file contain?
5. What is the difference b/w Truncate and delete.
6. How do you take data from flat files?
7. What exactly does the requirement document contain?
8. How to prepare analysis document and What it contains.
9. How does the Assign Keys component work?
10. What are primary keys and foreign key
11. Explain Reformat with parameters and how it acts in Real-time.
12. Functionality of Roll-up and how to filter the records in it.
13. Which components have you worked with most?
14. What is a view?
15. What is m_dump command.
16. How do you execute a script from GDE?
17. What is wrapper script.
18. How many graphs have you developed so far?
UNKNOWN COMPANY
1.What will happen if we pass a null key in the Rollup, Scan, Sort and Join components?
2.How will you calculate second highest salary using Abinitio?
3.How will you filter out H, D, T records if the DML is not conditional?
4.How will you assign foreign key for following scenario?
I/P File
Columns
Sarika
‘Ingle
23
Dinesh
Wayal
28
O/P File
Columns
Sarika 1
‘Ingle 1
23 1
Dinesh 2
Wayal 2
28 2
5.How to find the 50th row in a text file using Unix and Abinitio? (Don’t want to use head and
tail)
6.How will you retrieve multiple rows using a single lookup function in Abinitio?
7.How will you find out Insert/Update or New/Old records using the Join component?
8.How to check multiple words available in a file using single command in Unix?
9.How will you achieve following scenario in Abinitio?
a.
Input File    Output File
A 1           A-B-C 6
B 2
C 3
b.
Input File
Empno Deptno Sal
1 10 2000
2 20 3000
6 10 7000
3 30 3000
5 20 4000
4 10 6000

OutPut File
Empno Deptno Sal Cal_sal
1 10 2000 2000
4 10 6000 8000
6 10 7000 15000
2 20 3000 3000
5 20 4000 7000
3 30 3000 3000
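
One way to reason about scenario b is as a running total of Sal within each Deptno, with the rows ordered by Deptno and then Empno. That is the "cumulative summary" behaviour these questions attribute to Scan. The Python sketch below only illustrates the logic; it is not Abinitio code, and the function name running_salary is made up for this example.

```python
from itertools import groupby
from operator import itemgetter

# Input rows from scenario b: (empno, deptno, sal)
rows = [
    (1, 10, 2000),
    (2, 20, 3000),
    (6, 10, 7000),
    (3, 30, 3000),
    (5, 20, 4000),
    (4, 10, 6000),
]


def running_salary(records):
    """Return (empno, deptno, sal, cal_sal) with a running total per deptno."""
    # Order by deptno, then empno, so each department forms one contiguous group.
    ordered = sorted(records, key=itemgetter(1, 0))
    result = []
    for _, group in groupby(ordered, key=itemgetter(1)):
        total = 0  # the accumulator restarts for every department
        for empno, deptno, sal in group:
            total += sal
            result.append((empno, deptno, sal, total))
    return result


for row in running_salary(rows):
    print(*row)
```

With the six input rows above, the printed rows match the OutPut File listed in the scenario.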
Use FUSE component and tell me the output for below two tables
Table1
CustID State Country
1001 CA US
1002 TX US
1003 NY US
1004 NJ US
Table2
AcctNo AcctName City
9999 John Hyd
8888 Aavi Chennai
7777 Nani Dubai
6666 Navin Banglore
5555 Suman Hyd

I/File
EMPno Empname Deptno
1 Anu 10
2 Bapu 20

O/File
EMPno Empname Deptno
1 Anu 10
1 Anu 10
2 Bapu 20
2 Bapu 20
SYNTEL
PARVATHI TELEPHONIC SYNTEL INTERVIEW QUESTIONS
1. Tell me about yourself.
2. Which versions are you using?
3. How do you get the source data? File or table?
4. I have a file with 20 records and I want four records in each of five output files.
5. What is m_expand?
6. Are you aware of m_mkfs? What is it?
7. In m_mkfs, if the control partition is deleted, will you get the data or not?
8. What is the check-in command? Will you give the EME path or the sandbox path?
9. Are you aware of lookups in your graph? I have a lookup with the values
1
1
1
2
2
2
3
3
3
In the Reformat I want the output to be
1
2
3
10. What are Merge (with its key) and Gather, what is the difference between them, and will Gather output come sorted or not?
11. Abe and pqr are in a file. How do you find them with grep in Unix?
12. I have 10 records. How do you get them row-wise in SQL?
13. Have you used Scan in your project?
14. Which phase are you working in?
15. What is the difference between component parallelism and data parallelism?

UNKNOWN COMPANY
1) What is the difference between dbc and cfg? When do you use these two?
2) What are the compilation errors you came across while executing your graphs?
3) What is depth_error?
4) During the execution of graph, let us say you lost the network connection, would you have to
start the process all over again or does it start from where it stopped?
5) Types of partitions and scenarios.
6) What does unused port in join component do?
7) Define Multi file system. Can you create multifile system on the same server? Also, if you
have a table that has Name, Address, Status, Position attributes, can Name and Address be
on one partition and Status and Position in the other partition?
8) What is a sandbox? Did the Co>Operating System version 2.8 have a sandbox? If not, how would
you store the respective files?
9) How did you do version control? Which tool did you use?
10) How do you troubleshoot performance issues in graph?
11) What are the usual errors that you encounter during ETL process apart from compilation
process?
12) Were you involved in production support? What were the different kinds of problems that
you encountered?
13) Please give us insight on Enterprise Meta Environment, and some possible questions on
that.
14) What are delta table and master table?
15) What error would you get when you use Partition by Round Robin and Join?
16) In which scenarios would you use Partition by Key and Partition by Round Robin, and what are
the differences between the two?
17) What are the different dimension tables that you used and some columns in the fact table?
18) How do you count the number of records in a flat file?
19) How do you count the number of records in a multifile system without using GDE?
20) What does Scan and Rollup component do and give a scenario where you used them?
21) Did you ever use user defined functions or packages? If yes, give a scenario.
22) What value do you have to give for the Record Required parameter for a natural join?
23) When do you use Partition by Expression?
24) What is Adhoc File System? Give me a scenario where you used it.
25) What are the different commands that you used when writing wrappers?
26) What do the hidden files in a sandbox represent and what does start.ksh represent?
27) What are different things that you have to consider when loading data into a table?
28) What is difference between Redefine Format and Reformat components?
29) Sometimes you have to use dynamic length strings. Can you give me one circumstance
where you need it?
30) If you have a flat file as follows:
20 General Manager Chris
30 Divisional Manager Harry
20 General Manager Mary
30 Divisional Manager Dravid
How do you count the number of records that have 20 in the first column, and likewise for 30.
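
The count asked for in question 30 is a group-by on the first field. A minimal Python sketch of that logic follows, assuming whitespace-separated fields as in the sample. The helper name count_by_first_column is hypothetical, and this is an illustration rather than the Rollup or Unix answer an interviewer might expect.

```python
from collections import Counter

# The four sample lines from the question.
sample = """20 General Manager Chris
30 Divisional Manager Harry
20 General Manager Mary
30 Divisional Manager Dravid
"""


def count_by_first_column(text):
    """Count how many lines start with each distinct first-column value."""
    counts = Counter()
    for line in text.splitlines():
        fields = line.split()
        if fields:  # skip blank lines
            counts[fields[0]] += 1
    return counts


counts = count_by_first_column(sample)
print(counts["20"], counts["30"])
```

For the sample data both keys occur twice, so the script prints 2 2.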
2nd time Interview on 29-09-2014:

1.) How to read multiple files and load into target table using utility mode?
Case -1: If table has duplicate record will graph fail?
2.) Explain with scenarios in real time where you have used API mode and Utility mode?
3.) I want to get the record count of a file holding 1 TB of data. How would you design the graph from a
performance perspective, and how would you do it with a Unix command?
4.) What is a multifile and how does it work? If I have 1000 records and round robin is a 4-way partition,
how many records go to each partition?
5.) I have a file with credit card information: customer id, transaction_amts and date. I want to
get the two highest transaction amounts for each customer.
6.) How do you remove a particular object from the given tag?
7.) How to see the primary objects and comments using air command?
8.) How comfortable are you with Unix? Give me a rating out of 5.
9.) How to delete 30 days old files?
10.) How to search for a pattern in a file in multiple directories?
11.) Have you written any Unix script? If so, explain it to me.
12.) How comfortable are you with Oracle? Give me a rating out of 5.
13.) Write down the where clause order of execution in oracle?
14.) Write the syntax for all type of Joins?
15.) Write any SQL to delete duplicates from given table?
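
Question 5 in this list (the two highest transaction amounts per customer) is a top-N-per-group problem. The Python sketch below shows one way to think about it: group the amounts by customer, then keep the two largest in each group. The sample rows, the customer ids, and the function name top_two_amounts are invented for illustration and do not come from the interview.

```python
import heapq
from collections import defaultdict

# Hypothetical sample rows: (customer_id, transaction_amt, date)
transactions = [
    ("C1", 500, "2014-09-01"),
    ("C1", 1200, "2014-09-03"),
    ("C1", 800, "2014-09-05"),
    ("C2", 300, "2014-09-02"),
    ("C2", 950, "2014-09-04"),
]


def top_two_amounts(rows):
    """Return {customer_id: [largest, second largest]} transaction amounts."""
    by_customer = defaultdict(list)
    for customer_id, amount, _date in rows:
        by_customer[customer_id].append(amount)
    # heapq.nlargest returns the amounts in descending order.
    return {cust: heapq.nlargest(2, amounts) for cust, amounts in by_customer.items()}


print(top_two_amounts(transactions))
```

A customer with only one transaction simply gets a one-element list, which may or may not be the behaviour an interviewer wants; that choice is an assumption of this sketch.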

Pandu Interview
1) A file contains 100 records, including null and duplicate records. When we load it into the
table, what is the result? (Do the duplicate and null records also come through or
not?)
2) A file contains 100 records, we have to add 50 records to that same file? (we should not take
another file)
3) If the record count is 300 the graph should execute; otherwise the graph should abort. How?
4.) If a directory contains 10 files and I want to write all 10 files into a single file, how do you do it?
5.) What is the difference between GDE versions 2.16 and 3?
6.) While checking out, did you get dependency analysis?

Other Interview Questions


1) What are the steps to run the graph
2) Explain SCD types, how to implement them in Abinitio, and what the structure of an SCD Type 2
table is.
3) What is the record required parameter in the Join component?
4) The input file has 100 records. I want records 50 to 60 as output.
5) In FBE select I have given 100. What is the output?
6) How to generate the sequence number for multifiles?
7) I want to use the same file a number of times in a graph. How can you do this?
8)what is the difference between the phase and check point
9) tell me about your yesterday's work
10)how to find the disc free space in unix
11) how to find the file as serial or multifile using unix
12) what is the dependency analysis
13)how to find the 2nd highest salary from every department
14) How to generate rank and dense rank in Oracle?
15) in department file empnum,empname,empsal,deptno
question is emp name=ashok , I want to find the dept no of Ashok in unix
16)end date is today's date , i want start date as 3 years back date,write the date function of
start date
17) print the 70 to 80 lines of file using unix don't do the changes (just print those lines only)
18) how to read the multi files
19) how to find the 3 days back modified files using unix
18) I have AP, TN, KA, PUNE in one file, now I want each state into separate output file.
How can you do this?
19) how to find directories and that particular directory-sub-directories using unix
20) explain different types of parallelism techniques, where you used in your project.
21) How to find the number of lines in a file?
22) An input column has a string value like ashok____kum___ar. I want the output as
ashok_kumar. How can you do this?
23) how to run the pset using air commands
24) what happens if the generic graph is running same time in two different windows.
25)
Input file

subject marks
english 65
hindi 70
science 75
english 55
hindi 50
science 70
How to generate the same sequence number for each group? Treat the first English, Hindi, and
Science as one group.
26) Tell me the order of execution of the WHERE, GROUP BY, and HAVING clauses in Oracle.
27)what is the first line in unix script?
28)Have you heard of the end script?
A. If a graph fails, will the end script run or not?
B. If the end script needs to run even though the graph failed, how do you do it?
29)No of arguments in unix?
30)Grep versions?
31)How will you get the count of records in a multifile using a graph?
32)Access time in unix?
33)Modify time in unix?
34)How to find 10 days back modify file?
35)How to find previous day file?
36)How to search Tendulkar in file(in unix)?
37)I want to display only directories?
38)How to divide Header, Trailer?
39)what is . and what is /.
40)I have 3 records 1st one have 5 columns, 2nd one have 3 columns, , 3rd one have 4
columns..I want to display last columns from all 3 records
41)what is the sed?
42)What are Sed, AWK ?
43)display last record using sed command ?
44)How to replace the Hari to ramu with sed command?
45)what is grep one example?
46)Recursively, how to find the pattern in file and also in sub directories?
47)how you will replace string in unix ?
48)how you will do the same using vi editor ?
49)i have file with two records
jithendra
kumar
generate 2 files which contains jithendra in one file and jithendra kumar in another file ?
50)I want to display Unique , duplicate records?
51)I want to see the zero bytes file in directories?
52)What is the command for count the multifile in unix?
53)How to find the running process which is in background?
54)how to print line nos in a file in unix ?
55)how you will improve the performance of graph
56)What is Data Poisioning?
57)How can u change the timestamp of the file in unix?
58)How can u display the 20 record from 100 in abinitio & in unix?
59)How can u set the global variable without using the export command in unix?
60)how to grep jithendra or kumar from a file ?
61)How to run mp in unix?
62)How to run a mp in Unix?
63)How to find the files created by a particular user and remove them interactively?
64)How to find common projects in Unix?
65)There is a fixed-length file with record length 1400 and 1000 records, in which only one
record has length < 1400. How do you find that record in Unix?
66)How to create a sequence number in a multifile, in Unix and Abinitio?
70)I have 1000 bytes in a file. I want to cut positions 11-20, 50-60, and 700-720. How will you
do it?
71)I have one file that contains a header, body, and trailer, but I do not know the identifier. How will you
separate them?
a) Can we connect multiple inputs to Reformat? Why?
ERRORS
1. what is the force error?
2. Difference between Force error & Force abort?
3. How many graphs have you developed?
4. Project and developed graph example?
5. What type of cleansing did you do? What type of load or data errors did you get, and how did you
fix them?
Unknown company
1. How to run the graph?
2. How to convert COBOL to DML?
3. What do we need to set up to convert COBOL to DML?
4. How to convert a COBOL REDEFINES to DML?
5. What is Data Poisioning?
6. I have 5 records. I want to send the same records from input to output. How?
7. I have 10 records and I want to send the 1st record to the output port. How?
8. How can we divide the header and trailer?
9. Multifile system?
10. What is the command for count the multifile in unix?
11. Which PDL have you used? How will you create a simple vector using PDL?
12. What is the list command for a multifile?
13. How do we know whether a file is a multifile or a serial file?
14.
15. How to see the data partitions of a multifile?
16. What SCD types are there? Give an example.
17. Pipeline parallelism?
18. Replace the 4th column with the present date, where my file is | delimited.
19. Create a generic graph that modifies a file based on the input parameter given:
20. If I give a.dat, it should modify a.dat.
21. If I give b.dat, it should modify b.dat.
22. How many types of parallelism are there?
23. Pipeline parallelism breaks?
24. Air lock break?
25. Air lock release?
26. Air object versions?
27. I have 3 files how to add one after another?
28. How many max versions we can store?
29. How to load the data in Teradata?
30. What are the load utilities? When and Where we can use it?
31. what is the utility for loading? if i have million of records?
32. How do you know how much time a job took to run?
33. How to deploy a script?
34. Max core, Max memory?
35. Reformat, Redefine?
36. What is a dynamic lookup?
37. What is the difference between lookup and join? When will you go for lookup and when for join?
38. How will you return the entire record from a lookup?
39. Rollup Scenario?
40. Scan and Rollup have similar functions, but Scan gives cumulative summary records and
Rollup gives summary records. How?
41. Rollup, Scan, Aggregate?
42. Are partition components used in front of Rollup or not?
43. In the Rollup component, if the key is null, how many records come to the out port?
44. What is promotion?
45. What is Dependency Analysis?
46. Output index, Output indexes?
47. What is the output index, output indexes?
48. Replicate, Broadcast?
49. In the Scan component, if the key is null, how many records come to the out port?
50. What is the difference between Broadcast & Replicate component?
51. Gather, Concatenate?
52. Which advanced Abinitio components have you used?
53. Satish Konda how to find out O’s using in Abinitio?
54. Drive parameter?
55. I have a 4-way multifile in DEV and a 6-way multifile in SIT. Will we face any
problem while working with this?
56. What are multifiles?
57. I have records 1,2,3,4,5 in one input file. It is a 2-way multifile. I want records 1,3,5 in one
output file and records 2,4 in a second output file.
58. I have records 1,2,3,4,5 in one input file. It is a 2-way multifile. I want records 1,2,5 in one
output file and records 3,4 in a second output file.
59. Difference Between Adhoc Multifile & Multifile?
60. Phases & Check points?
61. If a graph fails after loading 1000 records, how will you recover it? The graph contains 3
phases.
62. What is the difference between a phase and a checkpoint? When will you go for each?
63. Will you be able to avoid checkpoints when you create phases?
64. I have a total of 10 units defined in a resource pool, and the plan contains 3 tasks that run in
parallel, each requiring 10 units.
65. In this scenario, how will you handle the sequence of execution of the graphs?
66. Where are the .rec files stored?
67. How to recover the data?
68. How to checkout in GDE and Unix?
69. How to check out a particular object?
70. what is pset?
71. Meta pivot?
72. Meta programming?
73. I want to display unique and duplicate records?
74. How to use an output file as a lookup?
75. With a lookup file, will all matching records come to the out port?
76. What is commit number?
77. What are the dataset components?
78. what is the update table? what is the multi update?
79. I have 100 records and I want to load records 2 to 99. How?
80. How to send the files from one server to another server?
81. I want to display the 50th to 70th records.
82. Utility load and API load
83. Did you work on continuous graphs or XML components?
84. What is Dependency Analysis?
85. What does a .DBC file contain?
86. How do you test a .DBC file?
87. XIS component in abinitio?
88. In a Join (outer join), are you using SCD Type 2?
89. How will you implement SCD1 and SCD2?
90. Explain Final Function?
91. How much memory and CPU are in use? How do you find out?
92. Explain Checkpointed Sort in Abinitio?
93. Are checkpoints used after a Sort component or not?
94. Run Time Behavior of Sort component?
95. Runtime Behavior of DEDUP Component?
96. There is no identifier and we don’t know the location of the header and trailer. How can we
separate the data?
97. "In memory" or "Inputs must be sorted": which one should be used? In the Rollup component,
which should be used?
98. Explain AB_JOB_PREFIX?
99. In TWS, how will you check job status?
100. How can we see the error in TWS?
101. Explain a complex/critical graph in your project.
102. Explain a complex graph in your projects.
103. How many graphs were created in your project?
104. Explain the Graph names?
105. How many PSETs were created in your project?
106. Have you worked with the EME or not?
107. Join in0, In1, In2 Inner Join?
108. I have 1,00,000 records in one file and 30 records in another file. Both files have
Deptno. How do you join these two files?
109. I have a 2-way partitioned file and a 4-way partitioned file. How do you join these two files?
110. Conditional components?
111. Explain Lookup.
112. What is the difference between lookup_count and lookup_local?
113. What is the difference between Phase and Checkpoint?
114. Explain the Rollup component.
115. What is the difference between temporary mode & Expanded mode?
116. What is the difference between merge & gather?
117. Explain departition Components?
118. What is the difference between PBRR & PBK?
119. How to Remove Header and Trailer records?
120. Which command is used for pattern search?
121. How will you generate a surrogate key?
122. How will you implement SCD1 and SCD2 for an MFS file?
123. What type of validations do you normally do?
124. What is a mapping document and what does it include?
125. What methods do we have in plans?
126. What is an inherited method? Why do we need them, and how did you implement them?
127. what is opconsole ?
128. what is bridge in abinitio ? have you worked in bridges ?
129. If you do not have bridges, how are you working? How are you handling a
scenario like this?
130. If two teams need to work on the same project and a graph is locked by one
user, and you want to modify the same graph, how will you do this?
131. I thought of telling him I would ask that person to remove the lock, or I would break the lock
:P
132. How will you identify how many records were processed by a graph? How will you
improve the performance of a graph?
133. How will you check a graph in or out from Unix?
134. I have table_a and table_b.
135. Both tables have accountid in common; 1000 records are present in table_a.
136. Now I want to get all 1000 records by joining with table_b, and if accountid matches in
both tables I have to populate a new column called matched with
values yes and no (yes if it matches, no if it does not).
137. What is the abinitiorc file and what does it contain?
138. What is the execution order of parameters?
139. Difference between .project.pset and .sandbox.pset?
140. How will you see the objects locked by a user?
141. How to list the objects in a project?
142. How to see the versions of an object?
143. Difference between two versions of an object?
144. how to checkin last version of the object ?
145. how to export and import an object from gde ?
146. how to create data directories from unix ?
147. how to create a project in the EME for the first time ?
148. how to recover the plan from last checkpoint ?
149. how to clean the temporary files created by abinitio?
Unknown questions
1. Why do we use a data warehouse?
2. I want 3rd max salary?
3. I want Top 3 salaries?
4. How to find out the Second Highest Salary in SQL?
5. I have Dept No, Dept Name, and Salary. I want the second-highest salary in each
department. Write the query in SQL.
6. How to delete duplicates by using DISTINCT?
7. Star Schema, Snowflake schema
8. Explain the surrogate key.
9. Left outer, right outer, Inner join?
10. Am I able to insert a record into the fact table if the corresponding record is not present in the dimension?
11. I want to display unique and duplicate records?
12. second max salary?
13. Delete the duplicates query?
14. What is the difference between truncate and Delete?
15. what is dimension table and fact table what is the difference ?
16. what is the use of SQL?
17. I want to display the 50th to 70th records.
18. In Join (Outer Join ) You are using SCD Type -2 ?
19. Why Dimensional Design is used in DWH?
20. Is Dimensional model Is Normalized?
21. Is Fact table is Normalized?
22. Normalize, Denormalize?
23. What are star schema and snowflake schema? Which one is better, and why?
24. What is a factless fact table?
Unknown company

1. What is the difference between a phase and a checkpoint?
When will you go for each? Will you be able to avoid checkpoints when you create phases?
2. How will you improve the performance of a graph?
3. If a graph fails after loading 1000 records, how will you recover it? The graph contains 3
phases.
4. What is a resource pool?
5. I have a total of 10 units defined in a resource pool, and the plan contains 3 tasks that run in parallel,
each requiring 10 units.
In this scenario, how will you handle the sequence of execution of the graphs?
6. How will you implement SCD1 and SCD2?
7. How will you get the count of records in a multifile using a graph?
8. Which PDL have you used? How will you create a simple vector using PDL?
9. What type of cleansing did you do? What type of load or data errors did you get, and how did you fix
them?

10. How to find the files created by a particular user and remove them interactively?
11. Replace the 4th column with the present date, where my file is | delimited.
12. Create a generic graph that modifies a file based on the input parameter given.
If I give a.dat, it should modify a.dat.
If I give b.dat, it should modify b.dat.
....
I will give a mask (question not understood properly).
13. What is a dynamic lookup?
14. What is the difference between lookup and join? When will you go for lookup and when for join?
15. How will you return the entire record from a lookup?
16. What are a dimension table and a fact table, and what is the difference?
17. How will you generate a surrogate key?
18. How will you implement SCD1 and SCD2 for an MFS file?
19. What are star schema and snowflake schema? Which one is better, and why?
20. What is coincile schema? (not sure exactly what he asked)
21. What is a factless fact table?
22. What type of validations do you normally do?
23. What is a mapping document and what does it include?
24. Am I able to insert a record into the fact table if the corresponding record is not present in the dimension?
25. What methods do we have in plans?
26. What is an inherited method? Why do we need them, and how did you implement them?
27. What is opconsole?
28. in which purpose we need operational console ? if i want to create a variable in opconsole
and this variable i need to use in graphs or parameters . how you can achive this
29. what is bridge in abinitio ? have you worked in bridges ?
30 . if you are not having bridges how you are working ? how you are handling a scenario like
this ?
if two teams need to work on same project then if a graph was locked by one user and you
want to modify the same graph then how you will do this ?
i thought to tell him i will aks that guy to remove the lock or i will break the lock :P
31. how you will identify how many records proccessed by a graph how you will improve
performance of a graph .
32. how you will check in or check out of a graph from unix .
33. I have table_a and table_b.
accountid is common to both tables, and 1000 records are present in table_a.
Now I want to get all 1000 records by joining with table_b, and if accountid matches in both
tables I have to populate a new column called matched with the values yes and no (yes
if it matches and no if it does not).
34. What is the abinitiorc file? What does it contain?
35. What is the execution order of parameters?
36. Difference between .project.pset and .sandbox.pset?
37. How will you see the objects locked by a user?
38. How to list the objects in a project?

39. How to see the versions of an object?
40. How to see the difference between two versions of an object?
41. how to checkin last version of the object ?
42. how to export and import an object from gde ?
43. how to create data directories from unix ?
44. how to create a project in the EME for the first time ?
45. how to recover the plan from last checkpoint ?
46. how to clean the temporary files created by abinitio?
47. How to print line numbers in a file in Unix?
48. How to grep jithendra or kumar from a file?
49. I have a file with two records:
jithendra
kumar
Generate 2 files, one containing jithendra and the other containing jithendra kumar.

Unknown company

1. What are the prioritized rules in a transform function?
2. What are is_defined(), is_null() and is_blank()?
3. What are the different string functions used in Ab Initio?
4. I installed Ab Initio on my PC. Among the components there is no "select" component. How can I get
that component? And how can we create user-defined components?
5. What are conditional DML and structured DML?
6. What is the EME? What is the GDE? What is the Co>Operating System? What is the link between all
these? Please explain in detail.
7. Can you read multiple input files using one input file component ?(same DML)

8. Can we specify checkpoints without Phases?


9. What is the difference between the Generate Records component and the Create Data
component?
10. How do PBK and Sort, and Join (in-memory sort), work? I mean, in the Join
component I have used in-memory sort, so how will it work in terms of performance
compared with the PBK and Sort components?
11. What is the default layout of a component? Is it mandatory to set the layout of a
component?
12. Hi, I have been put into Ab Initio now. I need some material to learn it! I have to give a test
also. Can anyone help by providing material, please? How to prepare SCD2 in Ab Initio?

13. How does deadlock occur?
14. Difference between checkpoint and phase?
15. How to create a new MFS file? Where do we specify the number of partitions, e.g. 4-way or 8-way?
16. If I am loading a file of 1 million records and the graph fails after loading 10,000 records,
what will happen if we use the rollback command?
17. How can you increase the number of ports of your output flow? What is the limit? Can I
have 20 out ports?
18. What is the difference between checkpoint and phase? Say I am loading a file (containing 1 lakh
records) and my graph fails after loading the 50,000th record. What is the advantage of using
a checkpoint in this case? Will I be able to start from the 51000th record?
20. What is the difference between Reformat and Redefine Format?
21. What do you mean by a continuous graph? What is its significance?
22. What is the difference between fuse and join? Explain in detail. In which scenarios will we
use them?
23. How to load data from an Excel sheet into an input file in a graph?
24. What is the language used in Ab Initio?
25. How can you import XML repositories exported from different tools like ODI?

26. What is the difference between the .dbc and .cfg extensions? Give clear
information.
27. How can you convert 3-way to 7-way partitioning in Ab Initio?
28. What is an extension list in Ab Initio?
29. How can I implement insert, update and delete in Ab Initio? How will you view an MFS in Unix?
What is the difference between conditional DML and a conditional component? What is decoding, and what
is NVL?
30. What is a hash file in Ab Initio? How is it read in Ab Initio? In a hash file the data files are
combined (file1.dat, file2.dat, file3.dat, etc.); how will it be divided? What is meant by header
and trailer? Suppose the header and trailer had some junk data; how will you delete the junk data?
Which components are used?
31. I had 10,000 records and loaded 4000 records today. I need to load records 4001 - 10,000
the next day. How is this done in Type 1 and in Type 2?
32. The graph failed; how is it handled? Send answer with phone number? I want to speak
directly with anyone interested in Ab Initio.
33. What is the difference between In-Memory Sort and Inputs must be sorted?

34. what are the steps in actual ab initio graph processing including general,pre and post
process settings?
35. What is air_project_parameters and air_sandbox_overrides? what is the relation
between them?
36. In Join component which record will go to unused port and which will go to reject port

37. What is the max-core value? What is its use? Hi friends, if you don't mind, I have one
question: what about the READ MULTIFILE component? Please explain briefly.

38. How will you remove the header record? How will you remove the trailer record?
39. The inputs are 10,20,30,40,50 but the output came out as 0 records and the graph failed.
What went wrong?
40. What is the difference between a DB config and a CFG file?

41. Describe how you would monitor and control database physical file size and growth?
42. Describe which system or process elements you would review when trouble-shooting
general server slows ?
43. Have you ever encountered an error called "depth not equal"?
i. What are Cartesian joins?
44. How many parallelisms are in Abinitio?
45. What is the difference between partitioning with key and round robin? What is a
ramp limit?
46. What is the relation between the EME, the GDE and the Co>Operating System?
47. What kinds of layouts does Ab Initio support?
48. What is meant by fancing in Ab Initio?
49. How does MAXCORE work?
50. What is meant by Co>Operating System and why is it special for Ab Initio?

51. What is the latest version that is available in Ab-initio?


52. What is AB_LOCAL expression where do you use it in ab- initio?
53. What is $mpjret? Where it is used in ab-initio?
54. What are the different versions and releases of ABinitio ?

55. How to run the graph without GDE?


56. what is the syntax of m_dump command?
57. what is skew and skew measurement?
58. How to get DML using Utilities in UNIX?
59. what is the difference between .dbc and .cfg file?
60. How to Improve Performance of graphs in Ab initio?

61. How will you use the EME to view or publish metadata reports?
62. How can we test Ab Initio graphs manually and with automation?
63. How do we handle a DML that changes dynamically?
64. what does layout means in terms of Ab Initio?
65. What is the difference between sandbox and EME, can we perform checkin and
checkout through sandbox?
66. How might you quantitatively measure an improvement made to a query?

67. Why might the optimizer use a table scan when an index is available?
68. Describe what the SQL Server tempdb and tlog files are used for? Describe the effect of
the "checkpoint" t-sql statement?
69. What are the components new in 2.14 compared to 1.8 and State the usage of the
components?
70. What is .abinitiorc and What it contain?
71. How to Create Surrogate Key using Ab Initio?
72. What does dependency analysis mean in Ab Initio?

HCL Interview questions 07/03/2015
1. Explain SCD Type 2.
2. In a join, if I connect 4 inputs, what is the required join type for a left outer join and how does it perform?
3. I have a graph in which one flow uses PBKS with key A,B and another flow uses
PBKS with key A only. I connect both to a Join and set the key to A,B. What will the
output be?
4. In a lookup I have records like
1| Jyothi
1|ram
1|siva
1|lokesh
If I call lookup on these values, which record will come out?
5. For the same scenario, I want to get output like
1|Jyothi,ram,siva,lokesh (here my file is a lookup)
6. Here is my file:
1| sam
2|ravi
3| prabhas
Here is my code in output_index with count value 2:
if (cust_name == "sam") 0
else if (cust_name == "ravi") 1
Now I have a record prabhas, so where does that record go?
Does the graph fail?
7. How do you get rank in the Unix file system?
8. I have a file like the one below and I want to remove empty rows and empty spaces:
Test tester | ram |1
1|2|3
3|4|5
6 |7|8
9. How to delete duplicate records from a table?
10. How to get the last column from a file?

Sai Systems
1. Tell me about yourself and projects
2. What is your role
3. Denormalization process
4. Denormalization is the process of attempting to optimize the read performance of a
database by adding redundant data
5. If I give you a requirement, what kind of approach would you use to estimate your task?
6. Do you know SCD Type 2? How are the records represented in Type 2?
7. How many types of joins are there in Oracle?
8. How do you get unique values from a table without using distinct?
9. Architecture of AB Initio
10. Main Components in AB Initio
11. What is GDE , When you run a graph , what will happen in background
12. What is purpose EME
13. Difference between private sandbox and user sandbox
14. Difference between .dbc and .cfg
15. What are environment variables?
a. Variables that can be accessed through the entire environment, e.g. $AB_HOME, from dev to test
16. How do you normally do error handling?
17. If I get an error like a truncation error, I want to handle it without stopping
a. In my source table the column length is 20 (char) and the destination is varchar of length 10. Now
I want to capture the first ten characters without any error and without any
transformation
18. What is force_error? Is there any other similar function?
19. Is it possible to create a surrogate key in the GDE? Is there any component to get an
auto-incremental value?
20. What is the purpose of parallelism?
21. Different kind of partition component
22. What is important factor that you consider to improve your graph?
23. How do you move a graph from development to testing (which is a different environment)? How
do you maintain all the information?
24. Is it configured Environment variable
25. About SDLC & previous project
26. what are the top 5 errors in your project?
27. Data error mismatch
28. Performance issues
29. Setting driving properly
30. Types of parallelism
31. Different types of layouts
32. Questions on GDE, Co>Op, EME
33. What are graph-level parameters, and what is the difference between local and
formal parameters?
34. A local parameter can be given a value only at design time, while for a formal
parameter the graph prompts for the value when it runs. In other words,
the value of a formal parameter can be given at run time.
35. Kind of documents that you may receive in your project
36. Field level documents
37. Mapping documents

CTS Interview Questions

1. Which versions have you used in your present and previous projects?
2. What is the difference between departition and partition components?
3. What is the difference between partition by key and partition by round robin? In which situations
did you use these components in your project?
4. cat with a multifile and a serial file?
5. How are tables extracted into Input Table?
6. How will you get Oracle tables in Ab Initio?
7. If we connect a serial file and a multifile, does it execute or not?
8. Serial reformat joinmultifile
Is it work ?in what way it is working ?
9. Parameters of the Scan component?
10. What did you do in your previous project?
11. How can you generate the cost and revenue of your project?
12. How are multifiles and serial files executed in a Unix environment?
13. Which components did you use in your project?
14. What is the main aim of your project?
15. How can a multifile be changed to a serial file, and a serial file to a multifile?
16. How will the data be seen in multifiles?
1] 1
2
3
.
.
10
Divide these records between 2 ports, 2 in each port alternately.
e.g. 1,2 port 0
3,4 port 1
5,6 port 0
2] file1 file2
1 1
1 1
1 1
explain what will be the output for each type of join.

3] maxcore for join is 100MB


if we receive below size of records at driving port what will happen for each case?
a. 90MB
b.100 MB
c. 200 MB
4] 12 records are in the input file and in the output we want 4 serial files with equal amounts of
data.
input file --> ?? --> 4 serial files
Which component will you use in place of ??
5] Explain SCD and CDC.
6] how to improve performance of graph?
7] what is abinitio.rc file?
8] explain behaviour of scan & rollup?
9] Explain (Bitwise)organisational data flow?
10]I want to read complete file of 100 records and find at the end which record was corrupt and
log it accordingly? How?

2014-03-06
8. Summary of recent work.
9. How to convert a multi-file to a serial file? (Departitioners)
10. Identify and describe the standard departitioners (Concatenate, Gather, Interleave,
Merge).
11. Given a slowly operating join component, how can one optimize it? (In which
situations to use in-memory, and which to use sorted input?)
12. What are the possible “keep” settings for the Dedup Sorted component? (first, last,
unique-only)
13. Can one de-dup data without sorting? (Yes, use in-memory Rollup instead.)
14. DATE/CHANNEL/REVENUE query question... something like
select c.channel_name
, sum(r.revenue)
, count(distinct r.order_number) -- in case order number is repeated in the fact table
from REVENUE r
inner join DATE d -- assuming r.date_id is not nullable
on (r.date_id = d.id)
inner join CHANNEL c -- assuming r.channel_id is not nullable
on (r.channel_id = c.id)
where to_char(d.date_string, 'yyyy') = '2013' -- or whatever (DB's date format function)
group by c.channel_name
;
2. How to optimize database queries? (Did not know for sure, beyond indices. Partitioning
and database server tuning options might be possible, but such knowledge is beyond
candidate's experience.)

Unknown company
1. Tell me about yourself.
2. What are fact and dimension?
3. How to use granularity in a fact table?
4. Which schemas did you use? Explain them. Which gives better performance?
5. Types of dimensions?
6. Which table do you have to load first, and where is that data read from?
7. What is the use of a staging area in a DWH?
8. Explain SCDs. Did you work on them?
9. Just by looking at a table, how to find whether it belongs to SCD type 2 or not? Is it possible?
10. Explain FBE and tell me one scenario.
11. What are PBK and PBRR? Which gives better performance?
12. Difference between replicate and broadcast?
13. What is a multifile? Did you work on them?
14. How to change the depth of multifiles?
15. Difference between checkpoint and phase?
16. What is a join? Explain it and give scenarios for each join type.
17. Explain what parallelism is.
18. I have one file with 10 records. How to display the count in the target table?
19. I have 3 tables; from them, the records of dept no 10 should get into the target. How to do it?
20. There is a table in which the header has 3 fields, the detail has 3 fields and the trailer has
the record count. How to write the DML to read that table?
21. How to get the depth of a multifile?
22. How to count the number of columns using the awk command?
23. How to find a 0-byte file?
24. How to delete the first 3 characters in a line?
25. Explain rollup and scan. What is the difference between them?
26. Which operations have you done so far using Reformat in your graph or project?

Unknown company
1. Difference between eme& sandbox
2. different approaches of Data Warehousing
3. how to move graph from development to QA and production
4. I checked out a graph from the EME and after 10 days I check in the same graph. Is that possible, and
why?
5. Graphs to achieve SCD-1, SCD-2, SCD-3
6. Difference between partition by expression and filter by expression
7. Functionalities of scan and rollup; in package mode, how many times are stages like initialization and rollup or
scan called?
8. How to remove files which were created more than 10 days ago?
9. How to display particular fields from a file?
10. The input file has 100 records; how to display records 40 to 50?
11. How to delete duplicate rows from a table?
12. Difference between delete and truncate

Ramarao questions
1. tell me about yourself?
2. how you will write the dml for the following

20141121 # header date?
A) "yyyymmdd"
4 # trailer record count?
A) decimal(5)

1,jithu
2,chandu
3,srinu
4,sandeep
3. How will you validate the header and trailer?
4. Abort the graph if the header and trailer validation fails. If validation succeeds, create a 4-way
multifile.

5. Compare the account no in a new graph and insert the new records into the target table, and get
the total number of records inserted into the summary table.

6. How many projects have you worked on? Do you know Informatica also? What is your total dev and
production support experience?
7. How will you validate the header and trailer, i.e. the above scenario, using Unix?

8. How will you get the second record from a file without using the sed, head and tail commands?
9. I have account no and transaction date. Get the last 3 transactions for each account no (write
the query and tell me how you can implement it using a graph).
10. Get the total number of transaction dates for each account; there may be multiple records for one
day.
11. What are dimension tables and fact tables?
12. What are the different types of SCDs?
13. Explain all about SCD2.
14. I have account no as the key field. How many records will there be for SCD1, SCD2 and SCD3?

NTT DATA INTERVIEW QUESTIONS


1. Tell me about yourself
2. Explain about your current project
3. What critical error have you faced?
4. What kind of validation have you done?
5. Scenario
6. Input file

   Input    Output
1. C1       1. C1
2. A        2. A
3. B        3. a
4. C        4. B
5. D        5. C
6. a        6. c
7. d        7. D
8. c        8. d
9. How do you run graph from the background
10. How to see the detailed version for an object
11. How to move your project from dev -> quality
12. How you create tag for multiple objects
13. Scenario , I have two mfs files with having depth -2 with fields like
Acct_id, Acct_type , Amt
10,credit,100
20,debit,400
30,debit,600
In this scenario the files are already sorted by acct_id. Now I want to join the two MFS files
after sorting again on another field, acct_type. How are you going to do that?
14. How does the multifile system work for the Join component?
15. How to create a multifile system?
16. Scenario: if the graph succeeds it is OK, but if the graph fails I want to send a mail
notification. How do you do that in Ab Initio?
17. How to check the status of a graph?
18. How do you execute a ksh file?
19. When do you go for a pset? Types of interpretation?
20. How do you check out objects/graphs? (air project export command and air sandbox
command), then she asked
21. What is the difference between air project export and air sandbox export?
- Please share more on this if you have any solutions
22. Difference between broadcast & replicate
23. Given basic scenario about the rollup which needs to be group & taking sum
24. Main difference between scan & rollup
25. Scenario

Input                Output will be
cid cname            cid cname
10 a                 10 a,abc,123
20 b                 20 b,xyz
10 abc               30 def
10 123
30 def
20 xyz

25. From the above output I want to send all records except those containing the character 'a'.
26. In SQL, how do you left join two tables? Give me the query.
27. How to send records 25 to 30 to the out port?
28. I want to send values to output files according to deptno. E.g. the emp table has
10, 20, 30, 40. I want only the deptno 10 values in one output file and all the others
in another output file.
29. What will happen if we keep nulls in sort? (one sample file was given)
30. How do you eliminate duplicate records?
31. What kind of testing methods have you used in the project?
32. What kind of validation do you do in unit testing?
33. What is the difference between a dbc file and a config file?
How do you create a table at pset level?

Skytech :
1. Tell me about yourself.
2. Tell me about your project.
3. How do you do the graph performance check?
4. How do you develop a generic graph? How do you know whether a graph is a generic or a custom
graph?
5. How do you validate the header in a generic graph?
6. What types of header validation can we do?
7. Different files come from different sources, so how can you work out how to delete the header?
8. Most complex components you have worked on?
9. Have you worked on any continuous flows?
10. Main reason for using plans?
11. What is the difference between running the graph using plans and using scripting?
12. Have you written any shell scripts? In which cases did you write them?

Unknown company
1)What is the function you would use to transfer a string into a decimal?
2)How many parallelisms are in Abinitio? Please give a definition of each.
3)What is the difference between a DB config and a CFG file?
4)Have you ever encountered an error called "Pipeline Broken"? (This occurs when you
extensively create graphs; it is a trick question.)
5)How do you truncate a table? (Each candidate would say only 1 of the several ways to do
this.)
6)How do you improve the performance of a graph?
7)What is the difference between partitioning with key and round robin?
8)Have you worked with packages?
9)How do you add default rules in transformer?
10)What is a ramp limit and Maxcore values for Scan, Rollup, Sort, Replicate?
11)Have you used rollup component? Describe how.
12)How many components in your most complicated graph?
13)Do you know what a local lookup is?
14)What is Ad hoc multifile? How is it used?

Unknown company

1. tell me about your self ?


2. Do you know continuous graphs, Metadata Hub, BRE, ACE?
3. Which company do you work for and what is your total experience?
4. Which scheduler do you use in your project?
5. What do the compare records component and the checksum component do?
6. Explain the multifile system.
7. Explain the benefits of the Ab Initio tool.
8. What is a choice parameter?
9. What is a conditional parameter? What will happen if we disable a component?
10. How will you generate DML dynamically?
11. What is conditional DML? How do you separate header, trailer and body?
12. Generate DML using Ab Initio parameters.
13. How to display the data using an Ab Initio command?
14. In a flat file, how will you separate header, trailer and body, and how do you validate the records
against the record count in the trailer? I.e. the number of records in the body should match the trailer, which
contains the record count.
15. How do you eliminate duplicates in Ab Initio? Apart from sort and dedup sort, how will you
eliminate them?
16. What are the partition and departition components? Explain each of the
departition components in one sentence.
17. How to convert a multifile to a serial file using a command?

Unknown company
1. How to create sandbox by using command?
2. Is it possible to create sandbox without using GDE? If it is possible what is difference?
3. How to move the multi file path to sandbox?
4. What is difference between lookup and lookup local () functions?
5. Tell me right outer join query?
6. How to display a file and delete same file at a time by using UNIX?
7. Can you do read and write in single graph?
8. How to take the max-core value each partition?
9. What is mount?
10. What is AB_JOB?
11. What is component folding?
12. How can you find out job executed successfully or not?
13. How can you find out graph is executing in which phase?
14. What is the difference between primary key and primary index?
15. How can you change the primary index?
16. What is the difference between soft-link and hard-link?
17. How can you find the file size?
18. What is wrapper script?
19. How can you find the process id?
20. What is top-down and bottom-up approach?
21. Can you draw your project architecture?
22. What is granularity?
23. How to delete the duplicates in file?
24. What is the query to delete the duplicate rows in the table?
25. How to find out the 10th maximum salary in table?
26. How to find out the second highest salary by using ab initio?
27. What is AB_WORK_DIR?
28. What type of information is there in a .ksh?
29. What is there in AB_HOME?
30. Where is the Co>Operating System path stored?
31. When is the .ksh created?
32. How can you deploy the script?
33. If I pass 10 value in select expression of filter by expression component what is the
result?
34. How can you store file data into table?
35. Difference between dimension and fact table?
36. How can you run the graph in UNIX?
37. How to move the graphs from one environment to another environment?
38. How can you schedule the jobs?
39. How to retrieve the duplicates in the table?
40. If I give the next_in_sequence () in multi file, will u get the duplicates or not?
41. How can you apply the next_in_sequence () in multi file?
42. If I set max-core less, which type of error will you get?

Previous Questions Asked in MAHINDRA SATYAM

"Tell me about yourself" and current-project explanations are common to all interviews.
• Tell me about yourself and your current project.
• What are the components you have used in your graph?
• Explain the multifile, multifile system and multifile directory.
• Which m_ commands have you used?
• What is the difference between m_dump and m_expand?
• How do you connect to the database?
• How do you test whether your dbc file is correct or not?
• How do you change a 2-way to a 4-way multifile?
• Have you worked with the EME? If yes, tell me which air commands you
have used.
• How do you find out the version of the EME?
• Tell me the difference between API and utility.
• What are output_index and output_indexes?
• If I give a null key in the join component, will the component work?
• What are the parameters of the join component?
• What is the override key parameter in the join component?
• What is layout?
• What are the errors you have faced?
• When will you get a fatal error?
• What are the UNIX commands you have used?
• What is sed? Tell me one example where you have used it in your
project.
• What is awk? Tell me one example.
• How do you find and delete the files which were created more than 30 days
ago?
• How do you find the nth highest salary in the table?
• What is skew?
• Which parameters are resolved first, graph level or sandbox level?
• How do you connect to Oracle?
• If you select "in-memory: input need not be sorted" in the join component, how
will it work?

Previous Questions Asked in IBM

"Tell me about yourself" and current-project explanations are common to all interviews.
• What are the partition components? Tell me in detail.
• What is the purpose of the override key parameter in the join component?
• What is the difference between the join and merge components?
• If table A contains 10 records and table B contains 0 records, what is the total
number of output records if we do a cross join?
• What is a generic graph?
• How will you write the code for the Reformat component in a generic graph?
• What is a lookup template?
• What is a compressed lookup?
• What is AB unit?
• What are the graph-level parameters? How do you define them in your project?
• What are the measures of the fact table?
• I have a 2-way partitioned flow and need to generate a surrogate key for the output
file. How to generate it?
• What is a wrapper script?
• Tell me about rollup and its parameters.
• What is the difference between rollup and scan?
• What is the purpose of m_rollup?
• What are check-in and check-out? Where do we do them, and in how many ways?
• What are psets?
• Tell me the most used air commands in your project.
• Write any two sample scripts previously used in your project.

Previous Questions Asked in TCS

"Tell me about yourself" and current-project explanations are common to all interviews.
• What is your role in your current project?
• Explain the flow of your current project.
• Which database are you using for your project?
• How do you connect to your database?
• What permissions do you have to access the database?
• What does the .dbc file contain?
• What is the difference between truncate and delete?
• How do you take data from flat files?
• What exactly does the requirement document contain?
• How to prepare the analysis document, and what does it contain?
• How does the Assign Keys component work?
• How do you get the history file? What are primary keys and foreign keys?
• Explain Reformat with its parameters and how it acts in real time.
• Functionality of Rollup and how to filter records in it.
• I have 100 records of which 10 records need to be
aggregated, leaving the rest. How will you handle this?
• Which components have you worked with most?
• What is a view?
• Create a view on a table where I need only 100 rows in my view
instead of all records.
• What is the m_dump command?
• How do you execute a script from the GDE?
• What is a wrapper script?
• How many graphs have you developed up to now?

Previous Questions Asked in WIPRO

"Tell me about yourself" and current-project explanations are common to all interviews.
• Which version are you using?
• Which components are you working with?
• Which scheduling tool have you used?
• Tell me about join.
• What are the parameters of join? Explain all of them.
• What is a semi-join?
• What are Cartesian joins?
• What is lookup?
• Which partition components have you worked with?
• What is the difference between merge
and concatenate?
• What is the difference between scan and rollup?
• How will you test a dbc file from the command prompt?
• What is a generic graph?
• What is a wrapper script?
• What are m_ commands?
• What is the purpose of m_rollup?
• How to write conditional DML if the file has a huge
amount of data?
• How do you convert a 4-way MFS to a 16-way MFS?
• How would you do performance tuning for an already
built graph?
• Tell me about the sed and awk commands with some suitable
examples.
• Write two sample scripts you wrote previously.

Previous Questions Asked in POLARIS

"Tell me about yourself" and current-project explanations are common to all interviews.
• Which database are you using?
• What is the difference between a .dbc and a .cfg file?
• What is a surrogate key? How do you create it?
• What is conditional DML?
• What is a lookup file?
• Tell me briefly about parallelism.
• Which components cannot support
pipeline parallelism?
• What are the continuous components?
• What is the difference between scan and rollup?
• What is a .pset?
• Tell me the importance of the EME in Ab Initio.
• How do you run a graph in a Unix environment?
• Tell me about air commands.
• What is AB_LOCAL? Where will you use it?
• How to schedule graphs in Ab Initio?
• How do you open a multifile in Unix?
• What is the difference between sandbox and EME?
• What are check-in and check-out from the GDE and from Unix?
• I have some 100 records; from those, fetch the
records containing the word delhi between lines 56
and 89.
• How do you improve the performance of previously
built graphs?
• Tell me about version control in Ab Initio.

Most repeated questions with answers

• What is the difference between rollup and scan?

Ans: Rollup produces one summary record per key group, so it cannot generate cumulative
(running) summary records. For those we use Scan, which emits a running result for every
input record.
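
To make the contrast concrete, here is a minimal Python sketch of the two behaviours. It is only an analogy, not Ab Initio code: the function names, the `acct`/`amt` fields and the sample records are assumptions made for illustration, and the input is assumed to be already sorted by key.

```python
from itertools import groupby
from operator import itemgetter


def rollup(records, key, value):
    """Rollup-style: one summary record per key group (input sorted by key)."""
    return [
        {key: k, "total": sum(r[value] for r in group)}
        for k, group in groupby(records, key=itemgetter(key))
    ]


def scan(records, key, value):
    """Scan-style: one output per input record, carrying a running total per group."""
    out = []
    for k, group in groupby(records, key=itemgetter(key)):
        running = 0
        for r in group:
            running += r[value]
            out.append({key: k, value: r[value], "running_total": running})
    return out


# Hypothetical sample data, already sorted by the key field.
records = [
    {"acct": 10, "amt": 100},
    {"acct": 10, "amt": 50},
    {"acct": 20, "amt": 400},
]

print(rollup(records, "acct", "amt"))  # two records, one per account
print(scan(records, "acct", "amt"))    # three records, one per input record
```

The point to take away is the record count: the rollup-style function collapses each group to a single record, while the scan-style function keeps every input record and adds the cumulative value.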

• What is the difference between partitioning with key and round robin?

Ans: PARTITION BY KEY:
In this, we specify the key on which the partitioning is based. All records with the same key
value go to the same partition, so how well the data is balanced depends on the distribution
of the key values. It is useful for key-dependent parallelism.
PARTITION BY ROUND ROBIN: In this, the records are partitioned sequentially,
distributing the data evenly in block-size chunks across the output partitions. It is not key
based and results in well-balanced data, especially with a block size of 1. It is useful for
record-independent parallelism.
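
The two strategies can be sketched in a few lines of Python. This is an illustration under assumptions, not how the Co>Operating System implements its partitioners: the function names are hypothetical, and a CRC32 of the key's string form stands in for whatever hash the real component uses.

```python
import zlib


def partition_by_key(records, key, n):
    """Send every record with the same key value to the same partition."""
    parts = [[] for _ in range(n)]
    for r in records:
        # Stable stand-in hash (assumption): CRC32 of the key's string form.
        idx = zlib.crc32(str(r[key]).encode()) % n
        parts[idx].append(r)
    return parts


def partition_round_robin(records, n, block_size=1):
    """Deal records out in block_size chunks, cycling through the partitions."""
    parts = [[] for _ in range(n)]
    for i, r in enumerate(records):
        parts[(i // block_size) % n].append(r)
    return parts


# Ten records to two ports, two at a time alternately.
print(partition_round_robin(list(range(1, 11)), 2, block_size=2))
```

With `block_size=2` and two partitions, records 1,2 go to port 0, records 3,4 to port 1, records 5,6 back to port 0, and so on. Key-based partitioning gives no such balance guarantee: a skewed key distribution produces skewed partitions.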

• How do you truncate a table?

Ans: There are many ways to do it.
1. Probably the easiest way is to use Truncate Table.
2. Run SQL or Update Table can be used to do the same thing.
3. Run Program.

• What is the difference between a DB config and a CFG file?

Ans: A .dbc file has the information required for Ab Initio to connect to the database
to extract or load tables or views, while a .cfg file is the table configuration file created
by db_config when using components like Load DB Table.

 Types of parallelism in detail.

Ans: There are 3 types of parallelism in Ab Initio.
1) Data parallelism: A graph that deals with data divided into segments and operates on
each segment simultaneously uses data parallelism; the segments may be processed on
different servers at the same time. Nearly all commercial data processing tasks can use
data parallelism. To support this form of parallelism, Ab Initio provides Partition
components to segment data, and Departition components to merge segmented data back
together.
2) Pipeline parallelism: A graph with multiple components running simultaneously on
the same data uses pipeline parallelism. Each component in the pipeline continuously
reads from upstream components, processes data, and writes to downstream
components. A component does not have to wait for all the records to be processed:
since a downstream component can process records previously written by an upstream
component, both components can operate in parallel.
3) Component parallelism: A graph with multiple processes running simultaneously on
separate data uses component parallelism, i.e. two or more components process
different records in parallel.
NOTE: To limit the number of components running simultaneously, set phases in the graph.

 What is the function you would use to transfer a string into a decimal?

Ans: For converting a string to a decimal we need to typecast it using the following
syntax,
out.decimal_field :: ( decimal( size_of_decimal ) ) string_field;
The above statement converts the string to decimal and populates it to the decimal
field in output.

 How do you execute a graph from the start to the end stages? And how do you run a graph
on a non-Ab Initio system?

Ans: There are several ways to do this; here is one example. Within the graph, the
components run in the order of the phases you have defined.
You can also run the graph by creating ksh or sh scripts.

 What is data mapping and data modelling?

Ans: Data mapping deals with the transformation of the extracted data at FIELD level,
i.e. the transformation of the source field to the target field is specified by the mapping
defined on the target field. The data mapping is specified during the cleansing of the
data to be loaded.
For example:
source:
string(35) name = "Siva Krishna ";
target:
string("01") nm=NULL("");/*(maximum length is string(35))*/
Then we can have a mapping like:
Straight move. Trim the leading or trailing spaces.
The above mapping specifies the transformation of the field nm.

 What is the difference between a sandbox and the EME? Can we perform checkin
and checkout through the sandbox? Can anybody explain checkin and checkout?

Ans: Sandboxes are work areas used to develop, test or run code associated with a
given project. Only one version of the code can be held within the sandbox at any
time.
The EME Datastore contains all versions of the code that have been checked into it. A
particular sandbox is associated with only one project, whereas a project can be
checked out to a number of sandboxes.

 Explain environment variables with an example.

Ans: Environment variables serve as global variables in the Unix environment. They
are used for passing values from one shell/process to another. They are inherited by Ab Initio as
sandbox variables/graph parameters, like
AI_SORT_MAX_CORE
AI_HOME
AI_SERIAL
AI_MFS etc.
To know which variables exist in your Unix shell, find out the naming convention
and type a command like "env | grep AI". This gives you a list of all the matching
variables set in the shell. You can refer to the graph parameters/components to see
how these variables are used inside Ab Initio.

 What are the graph parameters?

Ans: There are 2 types of graph parameters in Ab Initio:

1. Local parameters
2. Formal parameters (those whose values are supplied at runtime)

 How do you improve the performance of graphs in Ab Initio? Give some examples or tips.

Ans: There are many ways to improve the performance of graphs in Ab Initio.
Here are a few points:
1. Use an MFS with Partition by Round-robin.
2. If needed, use lookup local rather than lookup when there is a large amount of data.
3. Take out unnecessary components such as Filter by Expression; put the filter logic in
Reformat/Join/Rollup instead.
4. Use Gather instead of Concatenate.
5. Tune max-core for optimal performance.
6. Try to avoid too many phases.

 What are the most commonly used components in an Ab Initio graph? Give an example of a
transformation of data, say customer data in a credit card company, into meaningful output
based on business rules.

Ans: The most commonly used components in any Ab Initio project are:
input file/output file
input table/output table
lookup file
reformat, gather, join, run sql, join with db, compress components, sort, trash, partition by
expression, partition by key, concatenate

 What is the difference between conventional loading and direct loading? When is each used in real time?

Ans: Conventional load:
Before loading the data, all the table constraints are checked against the data.
Direct load (faster loading):
All the constraints are disabled and the data is loaded directly. Later the data is
checked against the table constraints and the bad data is not indexed.
API mode uses conventional loading;
utility mode uses direct loading.

 How do you find the number of arguments defined in a graph?

Ans: $# - the number of positional parameters
$? - the exit status of the last executed command

 What is the difference between a .dbc and a .cfg file?

Ans: The .cfg file is for the remote connection and the .dbc file is for connecting to the database.
The .cfg file contains:
1. The name of the remote machine
2. The username/pwd to be used while connecting to the db
3. The location of the operating system on the remote machine
4. The connection method
The .dbc file contains:
1. The database name
2. The database version
3. Userid/pwd
4. The database character set, and some more

 How do we run a sequence of jobs, where the output of job A is the input to job B? How do we
coordinate the jobs?

Ans: By writing wrapper scripts we can control the sequence of execution of more
than one job.
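
The wrapper idea can be sketched in Python as follows. This is only an illustration under assumptions: `run_in_sequence` is a hypothetical helper, and the two commands are stand-ins for real jobs (in practice each would be a deployed graph script). The wrapper runs the jobs in order and stops at the first failure, so job B never starts if job A fails.

```python
import subprocess
import sys

def run_in_sequence(commands):
    """Run each command in order; stop at the first non-zero exit status."""
    statuses = []
    for command in commands:
        status = subprocess.run(command).returncode
        statuses.append(status)
        if status != 0:
            break  # do not start downstream jobs after a failure
    return statuses

# Hypothetical stand-ins for real jobs: one succeeds, one fails with status 3.
ok = [sys.executable, "-c", "raise SystemExit(0)"]
fail = [sys.executable, "-c", "raise SystemExit(3)"]

statuses = run_in_sequence([ok, fail, ok])
print(statuses)
```

The third job is never run because the second one returned a non-zero status.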

 How would you do performance tuning for an already built graph? Can you give some
examples?

Ans: 1) Suppose a Sort is used in front of a Merge component: there is no use in that
Sort, because sorting is built into Merge.
2) Use a lookup instead of a Join or Merge component.
3) Suppose we want to join the data coming from 2 files and we do not want duplicates: we
can use the union function instead of adding an additional component to remove duplicates.

 What is a semi-join?

Ans: In Ab Initio there are 3 types of join:
1. inner join, 2. outer join and 3. semi join.
For an inner join, the 'record_requiredn' parameter is true for all in ports.
For an outer join, it is false for all the in ports.
For a semi join, set 'record_requiredn' to true for the required port and
false for the other ports.
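
The effect of the per-port flags can be mimicked with a short Python sketch. It is an analogy, not Ab Initio code: the `join` function, its keyword arguments, and the sample data are hypothetical, and each input is simplified to one record per key.

```python
def join(in0, in1, record_required0=True, record_required1=True):
    """Join two key->value mappings; a key is dropped when a required input lacks it."""
    out = []
    for key in sorted(set(in0) | set(in1)):
        if record_required0 and key not in in0:
            continue
        if record_required1 and key not in in1:
            continue
        out.append((key, in0.get(key), in1.get(key)))
    return out

# Hypothetical inputs keyed by customer id.
customers = {1: "Asha", 2: "Ravi", 3: "Meena"}
orders = {2: "O-20", 3: "O-30", 4: "O-40"}

inner = join(customers, orders, True, True)          # both inputs required
outer = join(customers, orders, False, False)        # neither input required
one_required = join(customers, orders, True, False)  # only in0 required
print(inner)
print(outer)
print(one_required)
```

With only the first input required, every customer appears in the output, with `None` where no order matched.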

 How do you get DML using utilities in Unix?

Ans: If your source is a COBOL copybook, there is a Unix command that
generates the required DML for Ab Initio:
cobol-to-dml

 What are local and formal parameters?

Ans: Both are graph-level parameters. For a local parameter you need to initialize the
value at the time of declaration, whereas a formal parameter need not be initialized: the
graph prompts for its value at run time.

 What are BROADCAST and REPLICATE?

Ans: Broadcast - Takes data from multiple inputs, combines it and sends it to all the
output ports.
E.g. you have 2 incoming flows (this can be data parallelism or component
parallelism) on a Broadcast component, one with 10 records and the other with 20 records.
Then every outgoing flow (there can be any number of flows) will have 10 + 20 = 30
records.
Replicate - Replicates the data of a particular partition and sends it out to multiple
out ports of the component, but maintains the partition integrity.
E.g. your incoming flow to Replicate has a data parallelism level of 2, with one partition
having 10 records and the other having 20 records. Now suppose you have 3 output flows from
Replicate. Then each flow will have 2 data partitions with 10 and 20 records respectively.
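
The record counts in the two examples above can be reproduced with a small Python sketch. The functions `broadcast` and `replicate` are hypothetical stand-ins that model partitions as plain lists.

```python
def broadcast(input_partitions, n_outputs):
    """Combine all input partitions and give every output a full copy."""
    combined = [rec for part in input_partitions for rec in part]
    return [list(combined) for _ in range(n_outputs)]

def replicate(input_partitions, n_flows):
    """Copy the flow n_flows times, keeping each partition separate."""
    return [[list(part) for part in input_partitions] for _ in range(n_flows)]

# Two incoming partitions with 10 and 20 records, as in the examples.
p0 = list(range(10))
p1 = list(range(100, 120))

b = broadcast([p0, p1], 3)
r = replicate([p0, p1], 3)
print([len(out) for out in b])
print([[len(part) for part in flow] for flow in r])
```

Broadcast yields 30 records on each output, while Replicate yields three flows that each still have a 10-record and a 20-record partition.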

 What is m_dump?

Ans: The m_dump command prints the data in a formatted way.

m_dump <dml> <file.dat>

 Give an example of a real-time start script in a graph.

Ans: Here is a simple example of using a start script in a graph.
In the start script, write:
export DT=`date '+%m%d%y'`
Now the variable DT will hold today's date before the graph is run.
Somewhere in a graph transform we can then use this variable as:
out.process_dt::$DT;
which provides the value from the shell.

 How do you run a graph without the GDE?

Ans: In the GDE, choose Run > Deploy > As Script. It creates a .bat file in your host
directory; then run the .bat file from the command prompt.

 How does MAXCORE work?

Ans: max-core is a value, specified in bytes, that sets the maximum amount of memory a
component may use while it executes. If the component needs more than that, it spills
the excess data to disk.

 What is $mpjret? Where is it used in Ab Initio?

Ans: $mpjret holds the return status of the graph run. You can use it in the end script like this:

if [ $mpjret -eq 0 ]; then
echo "success"
else
mailx -s "[graphname] failed" mailid
fi

 How do you convert a 4-way MFS to an 8-way MFS?

Ans: To convert a 4-way to an 8-way partition we need to change the layout in the
partitioning component. There are separate parameters for each type of
partitioning, e.g. AI_MFS_HOME, AI_MFS_MEDIUM_HOME, AI_MFS_WIDE_HOME etc.
The appropriate parameter needs to be selected in the component layout for the type of
partitioning.

 What is the AB_LOCAL expression, and where do you use it in Ab Initio?

Ans: ablocal_expr is a parameter of the Input Table component of Ab Initio. ABLOCAL() is
replaced by the contents of ablocal_expr, which we can make use of in parallel
unloads. There are two forms of the AB_LOCAL() construct: one with no arguments and one
with a single argument, a table name (the driving table).
The AB_LOCAL() construct is useful because some complex SQL statements contain
grammar that is not recognized by the Ab Initio parser when unloading in parallel. You
can use the ABLOCAL() construct in this case to prevent the Input Table component
from parsing the SQL (it is passed through to the database). It also specifies
which table to use for the parallel clause.

 What is meant by the Co>Operating System, and why is it special for Ab Initio?

Ans: It converts the Ab Initio specific code into a format that UNIX/Windows
can understand and feeds it to the native operating system, which carries out the task.

 How will you test a dbc file from command prompt ?

ans: try "m_db test myfile.dbc"

 Which is faster to process, fixed-length DMLs or delimited DMLs, and why?

Ans: Fixed-length DMLs are faster because the data of that length is read directly
without any comparisons, whereas with delimited DMLs every character has to be compared
against the delimiter, which causes delays.
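
The reasoning can be made concrete with a Python sketch of the two parsing styles. The record layout (id 5 bytes, name 10 bytes, city 3 bytes), the `|` delimiter, and the function names are assumptions chosen for the example. The fixed-length parser slices at known offsets, while the delimited parser must inspect every byte.

```python
import struct

# Assumed fixed layout: id (5 bytes), name (10 bytes), city (3 bytes).
FIXED_LAYOUT = struct.Struct("5s10s3s")

def parse_fixed(line: bytes):
    """Cut the record at known offsets; no per-character comparison is needed."""
    return tuple(f.decode("ascii").rstrip() for f in FIXED_LAYOUT.unpack(line))

def parse_delimited(line: bytes, delimiter: bytes = b"|"):
    """Compare every byte with the delimiter to find the field boundaries."""
    fields, current = [], bytearray()
    for byte in line:
        if byte == delimiter[0]:
            fields.append(current.decode("ascii"))
            current = bytearray()
        else:
            current.append(byte)
    fields.append(current.decode("ascii"))
    return tuple(fields)

fixed_record = b"00042" + b"Siva".ljust(10) + b"DEL"
delimited_record = b"00042|Siva|DEL"

fixed_fields = parse_fixed(fixed_record)
delimited_fields = parse_delimited(delimited_record)
print(fixed_fields)
print(delimited_fields)
```

Both parsers recover the same three fields; only the amount of work per record differs.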

 What are the continuous components in Ab Initio?

Ans: Continuous components are used to create graphs that produce useful output
while running continuously.
Ex: Continuous Rollup, Continuous Update, Batch Subscribe

 How do you retrieve data from a database to use as a source, and which component is used for this?

Ans: To unload (retrieve) data from a database such as DB2, Informix, or Oracle we have
components like Input Table and Unload DB Table. Using these two components we
can unload data from the database.

 What is the relation between the EME, the GDE and the Co>Operating System?

Ans: EME stands for Enterprise Meta>Environment, GDE for Graphical Development Environment, and
the Co>Operating System can be described as the Ab Initio server. The relation between them
is as follows.
The Co>Operating System is the Ab Initio server. It is installed on a particular operating
system platform, which is called the native OS. The EME holds the metadata, transformations, db
config files, and source and target information. The GDE is the end-user environment where we
develop the graphs (similar to mappings in Informatica).
The designer uses the GDE to design graphs and saves them to the EME or to a sandbox. The GDE
is on the user side, whereas the EME is on the server side.

 What kinds of layouts does Ab Initio support?

Ans: Basically there are serial and parallel layouts supported by Ab Initio. A graph can
have both at the same time. The parallel one depends on the degree of data
parallelism. If the multifile system is 4-way parallel, then a component in a graph can
run 4-way parallel if its layout is defined with the same degree of
parallelism.

 Do you know what a local lookup is?

Ans: A Lookup File consists of data records that can be held in main memory. This
lets the transform function retrieve the records much faster than retrieving them from
disk. It allows the transform component to process the data records of multiple files
quickly.

 How many components are in your most complicated graph?

Ans: This is a tricky question; the number of components in a graph has nothing to do with the level of
knowledge a person has. On the contrary, a proper standardized and
modular parametric approach will reduce the number of components to a very few. In
a well thought modular and parametric design, mostly the graphs will have 3/4
components, which will be doing a particular task and will then call another sets of
graphs to do the next and so on. This way total numbers of distinct graphs will
drastically come down, support and maintenance will be much more simplified.
The bottomline is, there are lot more other things to plan rather than to add
components.

 How do you handle a DML that changes dynamically in Ab Initio?

Ans: If the DML changes dynamically, then both the DML and the XFR have to be passed as
graph-level parameters at runtime.

 Have you worked with packages?

Ans: Packages are simply reusable blocks of objects like transforms, user-defined
functions, DMLs etc. These packages have to be included in the transform where
you use them. For example, consider a user-defined function like
/*string_trim.xfr*/
out::trim(input_string)=
begin
let string(35) trimmed_string = string_lrtrim(input_string);
out::trimmed_string;
end
Now, the above xfr can be included in the transform where you call the function,
as
include "~/xfr/string_trim.xfr";
But this include should appear ABOVE your transform function.
For more details see "packages" in the help file.

 What are primary keys and foreign keys?

Ans: In an RDBMS, the relationship between two tables is represented as a primary key
and foreign key relationship. The table with the primary key is the parent table and the
table with the foreign key is the child table. The criterion for both tables is that there
should be a matching column.

 What are Cartesian joins?


Ans: Cartesian join will get you a Cartesian product. A Cartesian join is when you join
every row of one table to every row of another table. You can also get one by joining
every row of a table to every row of itself.
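
A Cartesian product is easy to see in a few lines of Python using `itertools.product`. The two small "tables" below are hypothetical sample data.

```python
from itertools import product

# Hypothetical tables, one column each.
colors = ["red", "green"]
sizes = ["S", "M", "L"]

# Every row of one table paired with every row of the other: 2 x 3 = 6 rows.
cartesian = list(product(colors, sizes))

# A table joined to itself in the same way: 2 x 2 = 4 rows.
self_join = list(product(colors, colors))

print(cartesian)
print(self_join)
```

The row count of the result is always the product of the input row counts, which is why an accidental Cartesian join on large tables is so expensive.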

 Explain the difference between the "truncate" and "delete" commands.

Ans: Truncate: It is a DDL command, used to remove all rows from a table or cluster.
Since it is a DDL command, it auto-commits and a rollback can't be performed. It is faster
than delete.

 How can I run 2 merged GUI map files?

Ans: Do you mean merging GUI map files in WR? If so, merging GUI map files in the
GUI map editor won't create a corresponding test script, and without a test script you can't run
a file. So it is not possible to run a file by merging 2 GUI map files.

In the context of databases and data management, the terms "truncate,"
"drop," and "delete" refer to different operations that can be performed on
database tables. Here's an explanation of each command:

1. TRUNCATE: The TRUNCATE command is used to quickly and efficiently
remove all data from a table, essentially resetting it to its original state.
When you truncate a table, you delete all the rows from the table, but the
table structure, as well as any associated indexes, constraints, or triggers,
remain intact. TRUNCATE is a DDL operation. This command is often faster
than using the DELETE command to remove all rows from a table because it
doesn't generate as much undo and redo logging.
2. DROP: The DROP command is used to completely remove an entire table
from the database. When you drop a table, all data, as well as the table
structure, indexes, constraints, and triggers associated with it, are
permanently deleted from the database. This operation is also a DDL
command that modifies the database schema. Dropping a table is an
irreversible action, and it cannot be undone. Therefore, it is crucial to be
cautious when using the DROP command, as it can lead to permanent data
loss.
3. DELETE: The DELETE command is used to remove one or more specific rows
from a table based on a specified condition. Unlike the TRUNCATE command,
DELETE is a DML (Data Manipulation Language) operation that affects only
the data within the table and not its structure. When you delete rows using
the DELETE command, the table structure remains unchanged, and any
associated indexes, constraints, or triggers are unaffected. The DELETE
command is more flexible than TRUNCATE because it allows you to specify a
condition to selectively remove certain rows from the table while leaving
others intact.

In summary, the main differences between these commands are as follows:


 TRUNCATE removes all rows from a table while keeping the table structure
intact. It is faster than DELETE and does not generate as much logging. It is a
DDL command.
 DROP completely removes a table, including its data, structure, and
associated objects. This operation is irreversible. It is also a DDL command.
 DELETE selectively removes rows from a table based on a condition while
leaving the table structure and associated objects unchanged. It is a DML
command

 The TRUNCATE command is typically considered a Data Definition
Language (DDL) command rather than a Data Manipulation Language
(DML) command. Database management systems such as Oracle and MySQL
classify TRUNCATE as DDL because it empties the table as a whole
instead of deleting individual rows.
 However, it's important to note that the classification of commands can
vary slightly between different database systems and versions. Some
databases may consider TRUNCATE as a DML command due to its
behavior resembling a DELETE statement without a WHERE clause,
which manipulates data rather than altering the table structure.
Therefore, the classification of TRUNCATE can depend on the specific
database system you are using.
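
The difference between removing rows and removing the table can be tried out with Python's built-in `sqlite3` module. This is a sketch under one important assumption: SQLite has no TRUNCATE statement, so a DELETE without a WHERE clause stands in for "remove all rows, keep the structure". The table name `emp` and its rows are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE emp (id INTEGER PRIMARY KEY, city TEXT)")
conn.executemany("INSERT INTO emp (id, city) VALUES (?, ?)",
                 [(1, "delhi"), (2, "pune"), (3, "delhi")])

def row_count():
    return conn.execute("SELECT COUNT(*) FROM emp").fetchone()[0]

def table_exists():
    query = "SELECT COUNT(*) FROM sqlite_master WHERE type='table' AND name='emp'"
    return conn.execute(query).fetchone()[0] == 1

# DELETE with a condition removes only the matching rows.
conn.execute("DELETE FROM emp WHERE city = 'delhi'")
after_selective_delete = row_count()

# Removing all rows (stand-in for TRUNCATE) leaves the table structure in place.
conn.execute("DELETE FROM emp")
after_full_delete = row_count()
structure_survives = table_exists()

# DROP removes the table itself.
conn.execute("DROP TABLE emp")
exists_after_drop = table_exists()

print(after_selective_delete, after_full_delete, structure_survives, exists_after_drop)
```

On a database that does support TRUNCATE, the middle step would use `TRUNCATE TABLE emp` instead; logging and rollback behavior then depend on the specific system.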
