Abinitio Questions
In a Join, I set the sorted-input parameter to the in-memory option and connect FileA to the driving port. What will happen, and is the output data sorted or not?
4. Tell me about the Join with DB component and its parameters.
5. What is a generic graph? Can generic graphs work without psets? If yes, how?
6. In a Reformat, I set count to 2 and write some business logic in transform0. How can I reuse that transform in transform1 of the same Reformat?
7. Basic questions on Scan and Rollup functionality.
8. My input looks like this:
U1,O1,I1
U1,O1,I2
U1,O1,I3
U1,O2,I1
U2,O5,I1
U2,O5,I2
U2,O6,I2
U2,O7,I4
I would like the distinct count of orders (O).
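The expected result for the distinct-order question can be checked with a small sketch. Python is used here only to illustrate the answer, not as the Ab Initio solution; in a graph this is commonly done by removing duplicate (user, order) pairs and then rolling up. The names ROWS, distinct_orders_per_user, and distinct_orders_overall are hypothetical, and only the user and order columns of the sample are used.

```python
from collections import defaultdict

# (user, order) pairs taken from the sample rows; the item column is not needed.
ROWS = [
    ("U1", "O1"), ("U1", "O1"), ("U1", "O1"), ("U1", "O2"),
    ("U2", "O5"), ("U2", "O5"), ("U2", "O6"), ("U2", "O7"),
]


def distinct_orders_per_user(rows):
    """Return {user: number of distinct orders for that user}."""
    orders = defaultdict(set)
    for user, order in rows:
        orders[user].add(order)
    return {user: len(seen) for user, seen in orders.items()}


def distinct_orders_overall(rows):
    """Return the number of distinct orders across all users."""
    return len({order for _, order in rows})


print(distinct_orders_per_user(ROWS))  # {'U1': 2, 'U2': 3}
print(distinct_orders_overall(ROWS))   # 5
```

The question does not say whether the count is per user or overall, so both readings are shown.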
9. Performance tuning in Ab Initio.
10. Performance tuning in the database.
11. My input contains 50 columns. I would like to display all columns except those numbered 10, 20, 30, 40, and 50.
12. How do you find two patterns in a file?
13. What are the outputs of select count(*) from emp and select count(1) from emp?
14. Which components partially break pipeline parallelism?
15. The input file contains 5 records and the Filter by Expression select_expr is (next_in_sequence() > 1 and next_in_sequence() < 4). How many records are in my output?
16. How do you convert an 8-way partition to a 3-way partition?
17. What are the differences between Join and Lookup?
20. The input file has 12 records, and in the output we want 4 serial files with equal amounts of data.
input file --> ?? --> 4 serial files
Which component will you use in the ??
21. Explain SCD and CDC.
22. How do you improve the performance of a graph?
23. What is the .abinitiorc file?
24. Explain the behaviour of Scan and Rollup.
25. Explain (bitwise) organisational data flow.
26. I want to read a complete file of 100 records, find out at the end which record was corrupt, and log it accordingly. How?
Accenture on 7th Jan 2015
1. Tell me briefly about your professional details.
2. What are the input sources to your project?
3. Have you ever used the Write Multiple Files component?
4. How do you view a particular partition's data?
5. What is a partition?
6. What is a multifile?
7. What is m_dump?
8. How do you create a multifile system?
9. Which components have you used in your project?
10. What are your responsibilities in your projects?
11. How many partitions do you have in your database?
12. How do we find the number of partitions of a database?
13. What are the partition components? Tell me their parameters.
14. How do you see the duplicate records in a table?
15. If your source is a flat file on some other server, which component do you use to access it?
Accenture questions:
1. Tell me about your previous project.
2. Parallelism types?
3. Lookup: static lookup and dynamic lookup?
4. Max-core?
5. Driving port?
6. How many inputs can a Join take?
7. If I have one input file with a 50 MB max-core and the other 9 remaining input files have different max-cores, which one do you select as the driving port?
8. Difference between Rollup and Scan?
9. What are the mandatory functions in both Rollup and Scan, and are there any we can omit?
10. Lookup local?
11. Phases and checkpoints?
12. What are $#, $?, and $mpjret?
13. How will you test the .dbc file?
14. What is PDL?
15. What is metaprogramming?
16. What is Conduct>It?
17. How do you delete files older than 30 days?
18. Oracle: how is the WHERE clause executed in Oracle?
19. Second max salary in Oracle?
20. Partition by Key and Partition by Round-robin?
HCL Questions
1. Tell me about yourself.
2. What are facts and dimensions?
3. How do you use granularity in a fact table?
4. Which schemas did you use? Explain them. Which gives better performance?
5. Types of dimensions?
6. Which table do you have to load first, and where is that data read from?
7. What is the use of a staging area in a DWH?
8. Explain SCDs. Did you work on them?
9. Just by looking at a table, how can you tell whether it is SCD Type 2 or not? Is it possible?
10. Explain FBE (Filter by Expression) and give me one scenario.
11. What are PBK and PBRR? Which gives better performance?
12. Difference between Replicate and Broadcast?
13. What is a multifile? Did you work on them?
14. How do you change the depth of multifiles?
15. Difference between checkpoint and phase?
16. What is a join? Explain it and give a scenario for each join type.
17. Explain what parallelism is.
18. I have one file with 10 records. How do I load the count into a target table?
19. I have 3 tables. How do I get the dept no. 10 records from them into the target?
20. One table has a header with 3 fields, a detail with 3 fields, and a trailer with the record count. How do you write the DML to read that table?
21. How do you get the depth of a multifile?
22. Using an awk command, how do you count the number of columns?
23. How do you find a 0-byte file?
24. How do you delete the first 3 characters in a line?
25. Explain Rollup and Scan. What is the difference between them?
26. Which operations have you done so far with Reformat in your graph or project?
IBM 10-02-2015
1. Tell me about yourself
2. Explain your project and give some measures.
3. What is Rollup?
4. I have an input file that contains one column with only Y or N records. The 'N' value records are being rejected, but I want to send all the records through without rejecting any. How would you achieve it?
5. Explain Join in Ab Initio.
6. What is m_dump?
7. I want to see the incomplete data records. How can you do that with m_dump?
8. What is Dedup Sorted?
9. Here is my error: unsorted record and overwritten.
10. Which version are you using? What are the latest features in 11g?
11. How do you add a referential check constraint to your table?
12. Write a SQL statement using a left outer join.
13. Have you used MFS in your project?
14. I have records in a file like a b c d e f g h j. I want to display all the characters except b.
15. What is a regular expression?
16. I want to display all the records whose line starts with c.
IBM 08-Jan-2015
1. Briefly tell me about your project.
2. What is the most complex graph you faced in your project?
3. What are your roles and responsibilities in your project?
4. What are the sources and destinations in your project?
5. Which component is used to unload data from a database, and what parameters do you configure in that component?
6. What is the process to get the DML for data that is being unloaded from a database?
7. If I use a select statement to unload data from a database but select only 2 fields out of 7, how many fields are in the output file?
8. What happens when we choose the select-statement option versus the table option in Input Table? Is it the same or different?
9. Without the GDE, how do I get the DML of a table into a separate file in Unix?
10. How many types of joins are there in the Join component?
11. What are the functionalities of the Scan, Rollup, and Reformat components?
12. What are the m_dump and m_mkfs commands?
13. Questions on wrapper scripting.
14. Which command is used to replace a pattern with a new pattern?
15. I run a graph: if it succeeds, its output goes to one file; otherwise, the error information goes to some other file. How?
16. What is the difference between the m_cat and cat commands?
IBM 21-02-2015
1. Tell me about yourself.
2. How do you delete 30-day-old files in Unix?
3. What is your scheduling tool?
4. In a Unix script you will find the '#!' statement. Can you tell me what it is and how it works internally (#!/bin/ksh)?
5. How do you delete the last row in a file?
6. How do you delete duplicates in Oracle?
7. In Ab Initio, my input value is 12345 and my DML type is decimal(5,2). What is the output? And what will happen if I have an input like 1234567?
8. What are $#, $@, and $?
9. In a Join, in0 contains:
1
1
1
1
The other file contains the same field and the same records:
1
1
1
1
What will my output records be with inner, full outer, left outer, and right outer joins?
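The record counts asked for above can be worked out with a minimal sketch. It assumes SQL-style join semantics where every matching pair of records is emitted and no deduplication is applied to either input; the function join_count and its how argument are hypothetical names used only for illustration, not an Ab Initio API.

```python
def join_count(left, right, how="inner"):
    """Count output records of a key join that emits every matching pair.

    how is one of "inner", "left", "right", "full" (hypothetical labels).
    """
    count = 0
    for key in set(left) | set(right):
        n_left, n_right = left.count(key), right.count(key)
        if n_left and n_right:
            # Every left record pairs with every right record for this key.
            count += n_left * n_right
        elif n_left and how in ("left", "full"):
            count += n_left
        elif n_right and how in ("right", "full"):
            count += n_right
    return count


in0 = [1, 1, 1, 1]
in1 = [1, 1, 1, 1]
for how in ("inner", "left", "right", "full"):
    print(how, join_count(in0, in1, how))  # 16 for every join type
```

Because all keys match, there are no unmatched records for the outer joins to add, so under these assumptions each join type yields 4 x 4 = 16 records.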
10. In a Join I have the inputs below. What will the output be if I use in-memory?
x
y
z
a
b
c
11. What are the functions in the EME?
12. What do you do in your project?
13. Tell me more about your project and process.
14. How do you generate a rank for the emp table, and how do I get the row whose rank is 2?
15. What is the difference between a dimensional model and an ER model?
16. What are normal forms? Explain the different types of normal forms.
17. How do you delete duplicate records in emp?
Cap Gemini 18/02.2015
INCADESCENT
1. Tell me about yourself.
2. Do you know Continuous graphs, Metadata Hub, BRE, ACE?
3. Which company do you work for, and what is your total experience?
4. Which scheduler do you use in your project?
5. What do the Compare Records component and the Checksum component do?
6. Explain the multifile system.
7. Explain the benefits of the Ab Initio tool.
8. What is a choice parameter?
9. What is a conditional parameter? What will happen if we disable a component?
10. How will you generate DML dynamically?
11. What is conditional DML? How do you separate header, trailer, and body?
12. Generate DML using Ab Initio parameters.
13. How do you display the data using an Ab Initio command?
14. In a flat file, how will you separate header, trailer, and body, and how do you validate the records against the record count in the trailer? That is, the number of records in the body should match the record count in the trailer.
15. How do you eliminate duplicates in Ab Initio? Apart from Sort and Dedup Sorted, how else will you eliminate them?
16. What are the partition and departition components? Explain each departition component in one sentence.
17. Display the last record using a sed command.
18. How will you replace a string in Unix?
19. How will you do the same using the vi editor?
21. How do you select the maximum marks for each subject in a student table?
22. How do you select the top 2 marks for each subject?
23. What is the difference between a star schema and a snowflake schema?
24. What is a factless fact table?
25. Find the sum of trans_amt department-wise from the table.
Scenario-1:
cust_id trans_amt
1001 10
1001 20
1001 30
1002 5
1002 40
1002 50
.... ...
..... ...
1003 20
1003 30
Expected output 1:
cust_ID trans_amt
1001    60
-----------------------
1002    95
-----------------------
1003    50
Expected output 2:
Cust_ID trans_amt
------------------
1001    10
        20
        30
-------------------
1002    5
        40
        50
------------------
1003    20
        30
--------------------
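Both expected outputs of Scenario-1 can be reproduced with a short sketch. This is Python for illustration only (in a graph the first output is a typical Rollup on cust_id); it uses only the rows listed in the scenario, ignores the elided "...." rows, and assumes the input is already sorted by cust_id. TRANSACTIONS, total_by_customer, and grouped_listing are hypothetical names.

```python
# Rows listed in Scenario-1; the elided rows in the source are not guessed at.
TRANSACTIONS = [
    (1001, 10), (1001, 20), (1001, 30),
    (1002, 5), (1002, 40), (1002, 50),
    (1003, 20), (1003, 30),
]


def total_by_customer(rows):
    """Expected output 1: one summed trans_amt per cust_id."""
    totals = {}
    for cust_id, amt in rows:
        totals[cust_id] = totals.get(cust_id, 0) + amt
    return totals


def grouped_listing(rows):
    """Expected output 2: show cust_id only on the first row of each group.

    Assumes rows are already sorted by cust_id.
    """
    lines, previous = [], None
    for cust_id, amt in rows:
        label = str(cust_id) if cust_id != previous else ""
        lines.append(f"{label:<8}{amt}")
        previous = cust_id
    return lines


print(total_by_customer(TRANSACTIONS))  # {1001: 60, 1002: 95, 1003: 50}
print("\n".join(grouped_listing(TRANSACTIONS)))
```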
26. There are two input files: file1 has 2000 records and file2 has 300 records. I want to get 2 columns from the second file.
27. What is max-core?
28. What is max memory?
29. table1 has 10000 records and table2 has 3 records matching table1. How do you write the SQL query, and how do you design the Ab Initio graph?
CUSTOMER PRODUCT
----------------------
A SONY
B MARTK1
C HCL SHARE
D LAVA
A SAMSUNG
B NOKIA
C HCL TECH
IBM
Interview Questions
1. Tell me about your professional background.
2. Tell me about two complicated graphs from your experience: which components did you use, and what was the execution flow?
3. What are the SCD types, and which component is useful for each type of SCD?
4. What is the difference between joining a lookup file with an input file and using Join with DB with an input file?
5. When we press the Run button in the GDE, what goes on in the background?
6. What is unit testing, and how do you do it?
7. How do we change a 2-way partition to a 4-way partition, and which component is used to do this?
8. What is the difference between Partition by Round-robin and Partition by Key, and which one is better?
9. I have one multifile. What happens if I run rm multifile_name?
10. How do you find the records modified in the last two days?
11. Differences between grep, fgrep, and egrep?
12. Nth highest salary?
13. How do we remove duplicate rows?
SET II
1. Complex job in Ab Initio
2. Explain Performance tuning in Ab Initio
3. The input file has today's inserts and updates, and the target fact table has a billion records. CDC is on 5 keys. What is the approach to update and insert into the fact table using Ab Initio and SQL?
4. Rollup component: what types of aggregation functions are provided?
5. Checkpoint and phase.
6. PDL functions: scenarios where they are used, and what types of functions can't be used in PDL?
7. Use of a generic load graph to load multiple tables in parallel if a pset is not available. Answer: using the AB_JOB parameter.
8. What is a .rec file?
9. Difference between Lookup and Join, and where did you use Lookup instead of Join?
10. Oracle SQL question: find the channel that generated the most revenue and the best-performing channel.
SET III
RS SOFTWARE
1) I have 5 records. How do I send the same records from input to output?
2) I have 10 records. How do I send only the 1st record to the output port?
3) I want the 3rd max salary.
4) I want the top 3 salaries.
5) How do you delete the duplicates using DISTINCT?
6) How do we know whether a file is a multifile or a serial file?
7) Pipeline parallelism?
8) grep versions?
9) Access time in Unix?
10) Modify time in Unix?
11) How do you search for Tendulkar in a file (in Unix)?
12) I want to display only directories.
13) air lock break?
14) air lock release?
15) air object versions?
16) How do you separate the header and trailer?
17) I have 3 files. How do I append them one after another?
18) How many versions can we store at most?
19) How do you load data into Teradata?
20) What is the utility for loading if I have millions of records?
HCL
1) Star Schema, Snowflake schema
2) Left outer, right outer, inner join?
3) Number of arguments in Unix?
4) How do you find the previous day's file?
5) I have 3 records: the 1st has 5 columns, the 2nd has 3 columns, and the 3rd has 4 columns. I want to display the last column from all 3 records.
6) What is . and what is /.
7) How do you know how much time a job took to run?
8) Separate the header, trailer, and detail records.
9) Which tables are used in your project?
10) How do you deploy a script?
11) Multifile system?
12) How many types of parallelism are there?
13) Max-core, max memory?
14) Difference between the EME and a sandbox.
15) Different approaches to data warehousing.
16) How do you move a graph from development to QA and production?
17) I checked out a graph from the EME, and after 10 days I check in the same graph. Is that possible, and why?
18) Graphs to achieve SCD-1, SCD-2, SCD-3.
19) Difference between Partition by Expression and Filter by Expression.
20) Functionalities of Scan and Rollup; in package mode, how many times are stages such as initialization and rollup or scan called?
21) How do you remove files that were created more than 10 days ago?
22) How do you display particular fields from a file?
23) The input file has 100 records. How do you display records 40 to 50?
24) How do you delete duplicate rows from a table?
25) Difference between DELETE and TRUNCATE.
WIPRO
1) Reformat, Redefine?
2) Rollup, Scan, Aggregate?
3) Normalize, Denormalize?
4) output-index, output-indexes?
5) Replicate, Broadcast?
6) Gather, Concatenate?
7) Driving parameter?
8) What are multifiles?
9) Phases and checkpoints?
10) Where are the .rec files stored?
11) How do you recover the data?
12) How do you see the data partitions of a multifile?
13) What is the list command for a multifile?
14) How do you find a file modified 10 days ago?
15) How do you check out in the GDE and in Unix?
16) How do you run the graph?
17) What is a pset?
18) What is sed?
19) How do you replace Hari with ramu using the sed command?
20) What is grep? Give one example.
21) I want to display unique and duplicate records.
22) Meta Pivot?
23) Metaprogramming?
24) How do you use an output file as a lookup?
25) Second max salary?
26) Query to delete the duplicates?
Polaris
1) How do you check out a particular object?
2) What is a commit number?
3) What are the dataset components?
4) What are output-index and output-indexes?
5) What is Update Table? What is Multi Update Table?
6) What is the difference between TRUNCATE and DELETE?
7) I have 100 records and I want to load records 2 to 99. How?
8) How do you send files from one server to another?
9) I want to see the zero-byte files in a directory.
10) What is the use of SQL?
11) I want to display the 50th to 70th records.
12) How many graphs have you developed?
13) Did you work on Continuous graphs or XML components?
14) Utility load and API load.
Wipro
1) Why do we use a data warehouse?
2) How do you convert COBOL to DML?
3) What do we need to set up for COBOL-to-DML conversion?
4) How do you convert a COBOL REDEFINES to DML?
5) What is the command to count the records in a multifile in Unix?
6) How can we separate the header and trailer?
7) What CDS are there? Give an example.
8) What are the load utilities? When and where can we use them?
9) Project and developed graph example?
Satyam
1. In a Rollup component, if the key is null, how many records come out of the out port?
2. What is promotion?
3. What is dependency analysis?
4. How do you find a running process that is in the background?
5. What is the difference between the Broadcast and Replicate components?
6. Which advanced Ab Initio components have we used?
7. How do you count the O's in Satish Konda using Ab Initio?
8. In a Scan component, if the key is null, how many records come out of the out port?
9. With a lookup file, will all matching records come to the out port?
10. What does a .dbc file contain?
11. How do you test the .dbc file?
12. XIS component in Ab Initio?
1. Explain Lookup.
2. What is the difference between lookup_count and lookup_local?
3. What is the difference between phase and checkpoint?
4. Explain the Rollup component.
5. What is the difference between template mode and expanded mode?
6. What is the difference between Merge and Gather?
7. Explain the departition components.
8. What is the difference between PBRR and PBK?
9. I have records 1,2,3,4,5 in one input file. It is a 2-way multifile. I want records 1,3,5 in one output file and records 2,4 in a second output file.
10. I have records 1,2,3,4,5 in one input file. It is a 2-way multifile. I want records 1,2,5 in one output file and records 3,4 in a second output file.
11. How do you remove header and trailer records?
12. I have 1,00,000 records in one file and 30 records in another file. How do you join those two files? Both files have Deptno.
13. Which command is used for pattern search?
14. I have Dept No, Dept Name, and Salary. I want the second-highest salary in each department. Write the query in SQL.
15. Explain the surrogate key.
16. I have a 2-way partitioned file and a 4-way partitioned file. How do I join these two files?
17. Which Unix commands are used in your projects?
18. How many graphs were created in your project?
19. Explain the graph names.
20. How many psets were created in your project?
21. Have you interacted with the EME or not?
22. Explain a complex graph in your projects.
23. Scan and Rollup have the same functions, but Scan gives cumulative summary records and Rollup gives summary records. How?
L&T interview questions
1) Tell me about your project?
2) What are the dimensions used in your project?
3) How many dimension tables are there?
4) What is a fact? How many fact tables are there?
5) What is Rollup? Give one example of how it works.
6) What is Scan? Give one example of how it works.
7) What is a multifile system? How will you create multifiles?
8) Did you use multifiles in your project?
9) What is the command for the status of the last executed command in Unix?
10) What is parallelism? How many types are there?
11) Which parallelism will break the sort?
12) What is the command to remove a particular file?
13) I have columns 1,2,3,4 but I want only 2 and 3. How will you get them in Unix?
14) How do you copy 3 lines on the mainframe?
15) How do you access mainframe data?
16) How do you connect to the database?
17) What is the command to stop a running job?
Syntel interview on 12th-July-2012
1) How do you handle a comma that appears in the data of a comma-delimited file? Answer: use the repair input component.
2) How do you run an mp in Unix?
3) How do you find common projects in Unix?
4) There is a fixed-length file with a record length of 1400 and 1000 records, in which only one record has a length < 1400. How do you find that record in Unix?
5) I have one file that contains header, body, and trailer, but I do not know the identifier. How will you separate them?
a) Can we connect multiple inputs to a Reformat? Why?
6) I have 1000 bytes in a file. I want to cut positions 11-20, 50-60, and 700-720. How will you do it?
7) You can include a subgraph from a common project in a private project. If you make any changes in the subgraph, will they be reflected in the private project?
8) How will you stop the graph after phase 1?
9) What is the difference between a phase and a checkpoint?
10) What is a primary key?
11) What is the difference between a primary key and a unique key?
12) What is an index?
13) What types of validation will you do once you get a file?
14) How do you find the 3rd-highest salary in each department in a lookup file?
15) How do you find the length of a record in a file using Ab Initio?
16) studid: how will you get one record for each group?
10
10
20
20
17) What is the difference between dbc and cfg?
18) DB2/Unix/Ab Initio: ratings (out of 5)?
19) Do you write multiple files?
20) Rollup parameters.
21) What performance tuning will you do to save memory?
22) Driving port in the Join component.
23) Can we connect multiple inputs to a Reformat? Why?
Wipro interview
1. How many components can be set in one phase?
2. If the first phase produces a zero-byte file, will the 2nd phase automatically stop? How?
3. In dev the file is DVX, in testing it is QAX, and in production it changes to PDX dynamically. How?
4. The parameter value is AAA. How can you use it in a select statement, and how can you pass this parameter to all DB components?
5. I want to load the records from the 21st onward out of 100 records. How?
6. In a script file the very first line is usually #!/bin/ksh. If we put a blank line before it, will the script run?
7. How can you convert 4-way to 8-way?
8. Difference between Interleave and Concatenate?
9. How does the Broadcast component work compared with the other partition components?
10. If you set { } as the key in the Join parameter, what will happen?
11. We are passing 20 records from in0 and 40 records from in1, with 5 matching records. What is the record count with a left outer join and with a full outer join?
12. The graph should automatically stop after processing 20 records out of 100. How?
13. What is the alternative command for FTP?
14. What is AB_JOB_PREFIX?
15. I want to run all the files in parallel without using MFS; the DMLs of the files are different. How? (Using psets.)
16. How can you display the 20th record out of 100 in Ab Initio and in Unix?
17. How can you change the timestamp of a file in Unix?
18. How can you count the records on the output port and on the reject port of the Reformat component?
19. Can we use a lookup for a full outer join?
20. How can you set a global variable without using the export command in Unix?
21. What is the layout of a component?
Wipro interview
1) What are fact and dimension tables?
2) What is an aggregate table?
3) What is an ODS?
4) OLAP, OLTP.
5) What are sed and awk?
6) How do you recursively find a pattern in files, including in subdirectories?
7) How do you know the status of a graph? ==> mpjret
8) What is the first line in a Unix script?
9) How do you run an mp in Unix?
10) What is lookup_local?
11) Difference between force_error and force_abort?
12a) How do you find the file count?
12b) What will happen if I give all fields as the key of a Rollup with the count() function?
13) output-index, output-indexes.
14) Reformat: the select expression is Age != 45, and in the transform, if name = D then force_error(). Explain the behaviour.
15a) Different types of modes?
15b) Difference between API and utility mode.
16) Case 1: Huge data with select statements: how do you load the data?
Case 2: Small data with select statements: how do you load the data?
Case 3: Normal data: how do you load it?
17) Header, trailer, and detail records are present. How do you prepare the test data?
18) Departition components?
19) Difference between Merge, Interleave, Concatenate, and Gather.
20) How do you create a sequence number in a multifile, in Unix and in Ab Initio?
TCS – A1
1. What is your role in your current project?
2. Explain the flow of your current project.
3. Which database are you using for your project?
4. What does the .dbc file contain?
5. What is the difference between TRUNCATE and DELETE?
6. How do you take data from flat files?
7. What exactly does the requirements document contain?
8. How do you prepare an analysis document, and what does it contain?
9. How does the Assign Keys component work?
10. What are primary keys and foreign keys?
11. Explain Reformat with its parameters and how it behaves in real time.
12. Functionality of Rollup and how to filter records in it.
13. Which components have you worked with most?
14. What is a view?
15. What is the m_dump command?
16. How do you execute a script from the GDE?
17. What is a wrapper script?
18. How many graphs have you developed so far?
UNKNOWN COMPANY
1. What will happen if we pass a null key to the Rollup, Scan, Sort, and Join components?
2. How will you calculate the second-highest salary using Ab Initio?
3. How will you filter out H, D, T records if the DML is not conditional?
4. How will you assign a foreign key for the following scenario?
I/P File
Columns
Sarika
Ingle
23
Dinesh
Wayal
28
O/P File
Columns
Sarika 1
Ingle 1
23 1
Dinesh 2
Wayal 2
28 2
5. How do you find the 50th row in a text file using Unix and Ab Initio? (Don't use head and tail.)
6. How will you retrieve multiple rows using a single lookup function in Ab Initio?
7. How will you find insert/update or new/old records using the Join component?
8. How do you check whether multiple words are present in a file using a single command in Unix?
9. How will you achieve the following scenarios in Ab Initio?
a.
Input File
A 1
B 2
C 3
Output File
A-B-C 6
b.
Input File
Empno Deptno Sal
1 10 2000
2 20 3000
6 10 7000
3 30 3000
5 20 4000
4 10 6000
OutPut File
Empno Deptno Sal Cal_sal
1 10 2000 2000
4 10 6000 8000
6 10 7000 15000
2 20 3000 3000
5 20 4000 7000
3 30 3000 3000
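The Cal_sal column in scenario b is a running total of Sal within each Deptno, which is the kind of cumulative output a Scan produces. A minimal Python sketch of that logic follows, for illustration only; it assumes the rows are ordered by Deptno and then Empno, as the expected output suggests, and EMPLOYEES and running_salary are hypothetical names.

```python
# (Empno, Deptno, Sal) rows from the input file in scenario b.
EMPLOYEES = [
    (1, 10, 2000), (2, 20, 3000), (6, 10, 7000),
    (3, 30, 3000), (5, 20, 4000), (4, 10, 6000),
]


def running_salary(rows):
    """Return (Empno, Deptno, Sal, Cal_sal) with a per-department running sum."""
    out, running, current_dept = [], 0, None
    # Sort by Deptno, then Empno, so each department's rows are adjacent.
    for empno, deptno, sal in sorted(rows, key=lambda r: (r[1], r[0])):
        if deptno != current_dept:
            # New key group: reset the accumulator.
            running, current_dept = 0, deptno
        running += sal
        out.append((empno, deptno, sal, running))
    return out


for row in running_salary(EMPLOYEES):
    print(*row)
```

Resetting the accumulator at each new Deptno mirrors per-key initialization; a Rollup on the same key would instead emit only the last row of each group.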
Use the Fuse component and tell me the output for the two tables below.
Table1
CustID State Country
1001   CA    US
1002   TX    US
1003   NY    US
1004   NJ    US
Table2
AcctNo AcctName City
9999   John     Hyd
8888   Aavi     Chennai
7777   Nani     Dubai
6666   Navin    Banglore
5555   Suman    Hyd
I/File
EMPno Empname Deptno
1     Anu     10
2     Bapu    20
O/File
EMPno Empname Deptno
1     Anu     10
1     Anu     10
2     Bapu    20
2     Bapu    20
SYNTEL
PARVATHI TELEPHONIC SYNTEL INTERVIEW QUESTIONS
1. Tell me about yourself.
2. Which versions are you using?
3. How do you get the source data: file or table?
4. I have a file with 20 records and I want four records in each of five output files.
5. What is m_expand?
6. Are you aware of m_mkfs? What is it?
7. With m_mkfs, if the control partition is deleted, will you still get the data or not?
8. What is the check-in command? Will you give the EME path or the sandbox path?
9. Are you aware of lookups in your graph? And I have a lookup with 1
1
1
2
2
2
3
3
3
In the Reformat I want the output data like 1
2
3?
10. What is the difference between Merge (with its key) and Gather? And in Gather, will the output come sorted or not?
11. With grep, I have abe and pqr in a file. How do I find them in Unix?
12. I have 10 records. How do I get them row-wise in SQL?
13. Have you used Scan in your project?
14. Which phase are you working in?
15. What is the difference between component parallelism and data parallelism?
UNKNOWN COMPANY
1) What is the difference between dbc and cfg? When do you use these two?
2) What are the compilation errors you came across while executing your graphs?
3) What is depth_error?
4) During the execution of graph, let us say you lost the network connection, would you have to
start the process all over again or does it start from where it stopped?
5) Types of partitions and scenarios.
6) What does unused port in join component do?
7) Define Multi file system. Can you create multifile system on the same server? Also, if you
have a table that has Name, Address, Status, Position attributes, can Name and Address be
on one partition and Status and Position in the other partition?
8) What is a sandbox? Did Co>Operating System version 2.8 have sandboxes? If not, how would you store the respective files?
9) How did you do version control? Which tool did you use?
10) How do you troubleshoot performance issues in graph?
11) What are the usual errors that you encounter during ETL process apart from compilation
process?
12) Were you involved in production support? What were the different kinds of problems that
you encountered?
13) Please give us insight on Enterprise Meta Environment, and some possible questions on
that.
14) What are delta table and master table?
15) What error would you get when you use Partition by Round Robin and Join?
16) In which scenarios would you use Partition by Key and Partition by Round Robin, and what are the differences between the two?
17) What are the different dimension tables that you used and some columns in the fact table?
18) How do you count the number of records in a flat file?
19) How do you count the number of records in a multifile system without using GDE?
20) What does Scan and Rollup component do and give a scenario where you used them?
21) Did you ever use user-defined functions or packages? If yes, give a scenario.
22) What value do you have to give the Record Required parameter for a natural join?
23) When do you use Partition by Expression?
24) What is Adhoc File System? Give me a scenario where you used it.
25) What are the different commands that you used when writing wrappers?
26) What do the hidden files in a sandbox represent and what does start.ksh represent?
27) What are different things that you have to consider when loading data into a table?
28) What is difference between Redefine Format and Reformat components?
29) Sometimes you have to use dynamic length strings. Can you give me one circumstance
where you need it?
30) If you have a flat file as follows:
20 General Manager Chris
30 Divisional Manager Harry
20 General Manager Mary
30 Divisional Manager Dravid
How do you count the number of records that have 20 in the first column, and likewise for 30.
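The count asked for in question 30 amounts to grouping on the first column. A minimal sketch in Python, for illustration only (in Unix the same idea is usually expressed with awk or with cut, sort, and uniq -c); LINES and count_by_first_column are hypothetical names, and the sample rows are copied from the question.

```python
from collections import Counter

# Sample rows copied from the flat file in the question.
LINES = [
    "20 General Manager Chris",
    "30 Divisional Manager Harry",
    "20 General Manager Mary",
    "30 Divisional Manager Dravid",
]


def count_by_first_column(lines):
    """Count records per value of the first whitespace-separated column."""
    return Counter(line.split()[0] for line in lines if line.strip())


counts = count_by_first_column(LINES)
print(counts["20"], counts["30"])  # 2 2
```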
2nd Interview on 29-09-2014:
1.) How do you read multiple files and load them into a target table using utility mode?
Case 1: If the table has duplicate records, will the graph fail?
2.) Explain real-time scenarios where you have used API mode and utility mode.
3.) I want to get the record count of a file holding 1 TB of data. How would you design the graph from a performance perspective, and how would you do it with a Unix command?
4.) What is a multifile and how does it work? If I have 1000 records and Round-robin is 4-way partitioned, how many records go to each partition?
5.) I have a file with credit card information: customer ID, transaction amounts, and date. I want to get the two highest transaction amounts for each customer.
6.) How do you remove a particular object from a given tag?
7.) How do you see the primary objects and comments using the air command?
8.) How comfortable are you with Unix? Give me a rating out of 5.
9.) How do you delete 30-day-old files?
10.) How do you search for a pattern in files across multiple directories?
11.) Have you written any Unix scripts? If so, explain.
12.) How comfortable are you with Oracle? Give me a rating out of 5.
13.) Write down the order of execution of the WHERE clause in Oracle.
14.) Write the syntax for all types of joins.
15.) Write any SQL to delete duplicates from a given table.
Pandu Interview
1) A file contains 100 records, including null and duplicate records. When we load it into the table, what is the result? (Are the duplicate and null records loaded or not?)
2) A file contains 100 records and we have to add 50 records to that same file (we should not use another file).
3) If the record count is 300 the graph should execute; if not, the graph should abort. How?
4.) A directory contains 10 files. I want to write all 10 files into a single file. How do you do it?
5.) What is the difference between GDE versions 2.16 and 3?
6.) While checking out, did you get dependency analysis?
subject marks
english 65
hindi 70
science 75
english 55
hindi 50
science 70
How do you generate the same sequence number for each group? Treat the first English, Hindi, and Science rows as one group.
26) Tell me the order of execution of the WHERE, GROUP BY, and HAVING clauses in Oracle.
27) What is the first line in a Unix script?
28) Have you heard of the end script?
A. If a graph fails, will the end script run or not?
B. Even if the graph fails, the end script needs to run. How?
29) Number of arguments in Unix?
30) grep versions?
31) How will you get the count of records in a multifile using a graph?
32) Access time in Unix?
33) Modify time in Unix?
34) How do you find a file modified 10 days ago?
35) How do you find the previous day's file?
36) How do you search for Tendulkar in a file (in Unix)?
37) I want to display only directories.
38) How do you separate the header and trailer?
39) What is . and what is /.
40) I have 3 records: the 1st has 5 columns, the 2nd has 3 columns, and the 3rd has 4 columns. I want to display the last column from all 3 records.
41) What is sed?
42) What are sed and awk?
43) Display the last record using a sed command.
44) How do you replace Hari with ramu using the sed command?
45) What is grep? Give one example.
46) How do you recursively find a pattern in files, including in subdirectories?
47) How will you replace a string in Unix?
48) How will you do the same using the vi editor?
49) I have a file with two records:
jithendra
kumar
Generate 2 files, one containing jithendra and the other containing jithendra kumar.
50) I want to display unique and duplicate records.
51) I want to see the zero-byte files in a directory.
52) What is the command to count the records in a multifile in Unix?
53) How do you find a running process that is in the background?
54) How do you print line numbers for a file in Unix?
55) How will you improve the performance of a graph?
56) What is data poisoning?
57) How can you change the timestamp of a file in Unix?
58) How can you display the 20th record out of 100 in Ab Initio and in Unix?
59) How can you set a global variable without using the export command in Unix?
60) How do you grep for jithendra or kumar in a file?
61) How do you run an mp in Unix?
63) How to find the files created by a particular user and remove them interactively?
64) How to find common projects in Unix?
65) There is a fixed-length file with record length 1400 and 1000 records, in which only one
record has length < 1400. How to find that record in Unix?
66) How to create a sequence number in a multifile, in Unix and in Ab Initio?
70) I have 1000 bytes in a file. I want to cut positions 11-20, 50-60 and 700-720. How will you
do it?
71) I have one file that contains a header, body and trailer, but I do not know the identifier.
How will you separate them?
a) Can we give multiple inputs to Reformat? Why?
ERRORS
1. What is force error?
2. Difference between force error and force abort?
10. How to find the files created by a particular user and remove them interactively?
11. Replace the 4th column with the current date, where my file is | delimited.
12. Create a generic graph that modifies a file based on the input parameter given:
if I give a.dat it should modify a.dat
if I give b.dat it should modify b.dat
....
I will give a mask (question not understood properly)
13. What is a dynamic lookup?
14. What is the difference between lookup and join? When will you go for lookup and when for join?
15. How will you return the entire record from a lookup?
16. What are a dimension table and a fact table? What is the difference?
17. How will you generate a surrogate key?
18. How will you implement SCD1 and SCD2 for an MFS file?
19. What are star schema and snowflake schema? Which one is better, and why?
20. What is coincile schema? (not sure exactly what he asked)
21. What is a factless fact table?
22. What type of validations do you normally do?
23. What is a mapping document? What does it include?
24. Am I able to insert a record into the fact table if the corresponding record is not present
in the dimension?
25. What methods do we have in plans?
26. What is an inherited method? Why do we need them? How did you implement them?
27. What is opconsole?
28. For what purpose do we need the operational console? If I want to create a variable in
opconsole and use it in graphs or parameters, how can you achieve this?
29. What is a bridge in Ab Initio? Have you worked with bridges?
30. If you do not have bridges, how are you working? How do you handle a scenario like
this?
If two teams need to work on the same project and a graph is locked by one user, and you
want to modify the same graph, how will you do this?
(I thought of telling him I would ask that person to remove the lock, or I would break the lock :P)
31. How will you identify how many records were processed by a graph? How will you improve
the performance of a graph?
32. How will you check in or check out a graph from Unix?
33. I have table_a and table_b.
accountid is common to both tables; 1000 records are present in table_a.
Now I want to get all 1000 records by joining table_b, and if accountid matches in both
tables then I have to populate a new column called matched with the values yes and no (yes
if it matches and no if it does not).
34. What is the abinitiorc file? What does it contain?
35. What is the execution order of parameters?
36. Difference between .project.pset and .sandbox.pset?
37. How will you see the objects locked by a user?
38. How to list the objects in a project?
unknown company
26. What is the difference between the .dbc and .cfg extensions? Give clear
information. Cognizant
27. How can you convert 3-way to 7-way partitioning in Ab Initio?
28. What is an extension list in Ab Initio?
29. How can I implement insert, update and delete in Ab Initio? How will you view an MFS in Unix?
What is the difference between conditional DML and a conditional component? What is decoding
and what is NVL?
30. What is a hash file in Ab Initio? How is it read in Ab Initio? In a hash file the data files are
combined (file1.dat, file2.dat, file3.dat, ...etc); how will it divide them? What is meant by header
and trailer? Suppose the header and trailer have some junk data; how will you delete the junk data?
Which components are used?
31. I have 10,000 records. I loaded 4000 records today and need to load 4001 - 10,000
the next day. How is this done in Type 1 and how in Type 2?
32. A graph failed; how is it handled?
33. What is the difference between In-Memory Sort and Inputs must be sorted?
34. what are the steps in actual ab initio graph processing including general,pre and post
process settings?
35. What is air_project_parameters and air_sandbox_overrides? what is the relation
between them?
36. In the Join component, which records go to the unused port and which go to the reject port?
37. What is the max-core value? What is the use of it? What about the Read Multifile
component? Explain briefly.
38. How will you remove the header record? How will you remove the trailer record?
39. The inputs are 10,20,30,40,50 but the output has 0 records and the graph failed.
What went wrong?
40. What is the difference between a DB config and a CFG file?
41. Describe how you would monitor and control database physical file size and growth?
42. Describe which system or process elements you would review when troubleshooting
general server slowness?
43. Have you ever encountered an error called "depth not equal"?
i. What are Cartesian joins?
44. How many parallelisms are in Abinitio?
45. What is the difference between partitioning with key and round robin? What is a
ramp limit?
46. What is the relation between EME, GDE and the Co>Operating System?
47. What kinds of layouts does Ab Initio support?
48. What is meant by fencing in Ab Initio?
49. How does MAXCORE work?
50. What is meant by the Co>Operating System and why is it special for Ab Initio?
61. How will you use EME to view or publish metadata reports?
67. Why might the optimizer use a table scan when an index is available?
68. Describe what the SQL Server tempdb and tlog files are used for? Describe the effect of
the "checkpoint" t-sql statement?
69. Which components are new in 2.14 compared to 1.8? State the usage of the
components.
70. What is .abinitiorc and what does it contain?
71. How to Create Surrogate Key using Ab Initio?
72. What does dependency analysis mean in Ab Initio?
HCL Interview questions 07/03/2015
1. Explain SCD Type 2.
2. In a join, if I connect 4 inputs, what is the required join type for a left outer join and how
does it perform?
3. I have a graph where in one flow I used PBKS with key A,B and in another flow I used
PBKS with key A only. I then connected a Join with key A,B. What will the
output be?
4. In lookup I have fields like
1| Jyothi
1|ram
1|siva
1|lokesh
If I call lookup on these values, which record will come out?
5. For the same scenario, I want to get output like
1|Jyothi,ram,siva,lokesh (here my file is a lookup)
6. Here is my file
1| sam
2|ravi
3| prabhas
Here is my code in output index with count value 2
if (cust_name == sam) 0
else
(cust_name == ravi) 1
Now I have a record prabhas, so where does the record go?
Does the graph fail?
7. How do you get rank in unix file system
8. I have a file like this and I want to remove empty rows and empty spaces
Test tester | ram |1
1|2|3
3|4|5
6 |7|8
9. How to delete duplicate records from a table
10. How to get a last column from a file
Sai Systems
1. Tell me about yourself and projects
2. What is your role
3. Denormalization process
4. Denormalization is the process of attempting to optimize the read performance of a
database by adding redundant data.
5. If I have a requirement, what kind of approach to estimating your task can you give me?
6. Do you know how records are represented in SCD Type 2?
7. How many types of joins are there in Oracle?
8. How do you get unique values from a table without using distinct?
9. Architecture of Ab Initio
10. Main components in Ab Initio
11. What is GDE? When you run a graph, what happens in the background?
12. What is the purpose of EME?
13. Difference between private sandbox and user sandbox
14. Difference between DB config and .cfg
15. What are environment variables?
a. Variables which can be accessed throughout the entire environment, e.g. $AB_HOME, dev to test
16. How will you normally do error handling?
17. If I get an error such as a truncation error, I want to handle it without stopping.
a. In my source table the column length is 20 (char) and the destination is varchar of
length 10; now I want to capture the first ten characters without any error and without any
transformation.
18. What is force error? Is there any other similar function?
19. Is it possible to create a surrogate key in GDE? Is there any component to get an
auto-incremental value?
20. What is the purpose of parallelism?
21. Different kinds of partition components
22. What is the important factor that you consider to improve your graph?
23. How to move a graph from development to testing (which is in a different environment)? How
do you maintain all the information?
24. Is it a configured environment variable?
25. About SDLC & previous project
26. What are the top 5 errors in your project?
27. Data error mismatch
28. Performance issues
29. Setting driving properly
30. Types of parallelism
31. Different types of layouts
32. Questions on GDE, Co>Op, EME
33. What are the graph-level parameters and what is the difference between local and
formal parameters?
34. A local parameter can be given only at design time, while in the case of a formal
parameter, when the graph runs it will first prompt for the values of the formal
parameters. In other words, you can say that the value of a formal parameter can be
given at run time.
35. Kind of documents that you may receive in your project
36. Field level documents
37. Mapping documents
1. What versions did you use in your present and previous projects?
2. What is the difference between departition and partition components?
3. What is the difference between partition by key and partition by round-robin? In which situations
did you use these components in your project?
4. cat with a multifile and a serial file?
5. How are tables extracted to Input Table?
6. How will you get Oracle tables in Ab Initio?
7. If we connect serial and multifile, does it execute or not?
8. Serial --> Reformat --> Join --> Multifile
Does it work? In what way does it work?
9. Parameters of the Scan component?
10. What did you do in your previous project?
11. How can you generate the cost and revenue of your project?
12. How are multifiles and serial files executed in the Unix environment?
13. What components did you use in your project?
14. What is the main aim of your project?
15. How can a multifile be changed to a serial file, and a serial file to a multifile?
16. How will the data be seen in multifiles?
1] 1
2
3
.
.
10
Divide these records between 2 ports, 2 to each port alternately.
e.g. 1,2 port 0
3,4 port 1
5,6 port 0
2] file1 file2
1 1
1 1
1 1
explain what will be the output for each type of join.
2014-03-06
8. Summary of recent work.
9. How to convert a multi-file to a serial file? (Departitioners)
10. Identify and describe the standard departitioners (Concatenate, Gather, Interleave,
Merge).
11. Given a slowly operating join component, how can one optimize it? (In which
situations to use in-memory, and which to use sorted input?)
12. What are the possible “keep” settings for the Dedup Sorted component? (first, last,
unique-only)
13. Can one de-dup data without sorting? (Yes, use in-memory Rollup instead.)
14. DATE/CHANNEL/REVENUE query question... something like
select c.name
, sum(r.revenue)
, count(distinct r.order_number) -- in case order number is repeated in the fact table
from REVENUE r
inner join DATE d -- assuming r.date_id is not nullable
on (r.date_id = d.id)
inner join CHANNEL c -- assuming r.channel_id is not nullable
on (r.channel_id = c.id)
where to_char(d.date_string, 'yyyy') = '2013' -- or whatever (DB's date format function)
group by c.name
;
2. How to optimize database queries? (Did not know for sure, beyond indices. Partitioning
and database server tuning options might be possible, but such knowledge is beyond
candidate's experience.)
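The Dedup Sorted "keep" settings from question 12 and the no-sort alternative from question 13 can be sketched in plain Python. This is only a model of the behaviour, not Ab Initio code; the function names `dedup_sorted` and `dedup_unsorted` and the sample rows are made up for illustration.

```python
from itertools import groupby


def dedup_sorted(records, key, keep="first"):
    """Model of deduplicating key-sorted records with a 'keep' setting."""
    out = []
    for _, group in groupby(records, key=key):
        group = list(group)
        if keep == "first":
            out.append(group[0])
        elif keep == "last":
            out.append(group[-1])
        elif keep == "unique-only":
            # Emit the record only when its key occurs exactly once.
            if len(group) == 1:
                out.append(group[0])
        else:
            raise ValueError(f"unknown keep setting: {keep}")
    return out


def dedup_unsorted(records, key):
    """Keep the first record per key without sorting, using an in-memory table."""
    seen = {}
    for rec in records:
        seen.setdefault(key(rec), rec)
    return list(seen.values())


rows = [("A", 1), ("A", 2), ("B", 3), ("C", 4), ("C", 5)]
by_key = lambda r: r[0]

first = dedup_sorted(rows, by_key, "first")
last = dedup_sorted(rows, by_key, "last")
unique_only = dedup_sorted(rows, by_key, "unique-only")
unsorted_first = dedup_unsorted([("C", 4), ("A", 1), ("C", 5), ("A", 2)], by_key)
```

The in-memory version trades the sort for a table that holds one entry per distinct key, so it only suits data whose key set fits in memory.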
Unknown company
1. Tell me about yourself.
2. What are fact and dimension?
3. How to use granularity in a fact table?
4. Which schemas did you use? Explain them. Which gives better performance?
5. Types of dimensions?
6. Which table do you have to load first, and where do you read that data from?
7. What is the use of a staging area in a DWH?
8. Explain SCDs. Did you work on them?
9. Just by looking at it, how do you find whether a table is SCD Type 2 or not? Is it possible?
10. Explain FBE and tell me one scenario.
11. What are PBK and PBRR? Which gives better performance?
12. Difference between Replicate and Broadcast?
13. What is a multifile? Did you work on them?
14. How to change the depth of multifiles?
15. Difference between checkpoint and phase?
16. What is a join? Explain it and give scenarios for each join type.
17. Explain what parallelism is.
18. I have one file with 10 records. How to load the count into a target table?
19. I have 3 tables; the dept no 10 records in them are not getting into the target. How to get them?
20. There is a table in which the header has 3 fields, the detail has 3 fields and the trailer has
the record count. How do you write the DML to read that table?
21. How to get the depth of a multifile?
22. Using the awk command, how to count the number of columns?
23. How to find a 0-byte file?
24. How to delete the first 3 characters in a line?
25. Explain rollup and scan. What is the difference between them?
26. Using Reformat, what operations have you done so far in your graph or project?
Unknown company
1. Difference between EME and sandbox
2. Different approaches to data warehousing
3. How to move a graph from development to QA and production
4. I checked out a graph from EME and after 10 days I check in the same graph. Is it possible, and
why?
5. Graphs to achieve SCD-1, SCD-2, SCD-3
6. Difference between partition by expression and filter by expression
7. Functionalities of scan and rollup, and how many times package-mode stages like
initialization and rollup or scan are called
8. How to remove files which were created more than 10 days ago
9. How to display particular fields from a file
10. Input file has 100 records; how to display records 40 to 50
11. How to delete duplicate rows from a table
12. Difference between delete and truncate
Ramarao questions
1. Tell me about yourself.
2. How will you write the DML for the following?
1,jithu
2,chandu
3,srinu
4,sandeep
3. How will you validate the header and trailer?
4. Abort the graph if the header and trailer validation fails. If validation succeeds, create a
4-way multifile.
5. Compare the account no in a new graph and insert the new records into the target table, and get
the total number of records inserted into the summary table.
6. How many projects have you worked on? Do you know Informatica also? What is your total
development and production support experience?
7. How will you validate the header and trailer, i.e. the above scenario, using Unix?
8. How will you get the second record from a file without using the sed, head and tail commands?
9. I have account no and transaction date. Get the last 3 transactions for each account no (write
the query and tell me how you can implement it using a graph).
10. Get the total number of transaction dates for each account; there may be multiple records for
one day.
11. What are dimension tables and fact tables?
12. What are the different types of SCDs?
13. Explain all about SCD2.
14. I have account no as the key field. How many records will there be for SCD1, SCD2 and SCD3?
Input Output
1. C1 1. C1
2. A 2. A
3. B 3. a
4. C 4. B
5. D 5. C
6. a 6. c
7. d 7. D
8. c 8. d
9. How do you run a graph in the background?
10. How to see the detailed version of an object?
11. How to move your project from dev -> quality?
12. How do you create a tag for multiple objects?
13. Scenario: I have two MFS files of depth 2 with fields like
Acct_id, Acct_type , Amt
10,credit,100
20,debit,400
30,debit,600
In this scenario the data is already sorted on acct_id. Now I want to join the two MFS files,
sorting again on another field, acct_type. How are you going to do it?
14. How does the multifile system work for the Join component?
15. How to create a multifile system?
16. Scenario: if the graph succeeds it is OK, but if my graph fails I want to send a mail
notification. How do you do that in Ab Initio?
17. How to check the status of a graph?
18. How do you execute a ksh file?
19. When do you go for a pset? Types of interpretation
20. How do you check out objects/graphs? air project export command and air sandbox
command. Then she asked:
21. What is the difference between air project export and air sandbox export?
22. Difference between Broadcast and Replicate
23. Given a basic scenario about rollup which needs to group and take a sum
24. Main difference between scan and rollup
25. Scenario
25. In the above output I want to send all records except those containing the character 'a'
26. In SQL, how do you left join two tables? Give me the query.
27. How to send records 25 to 30 to the out port
28. I want to send values to the output file with respect to deptno. E.g. in the emp table we
have 10, 20, 30, 40. Now I want only the deptno 10 values in one output file and all the others
sent to another output file.
29. What will happen if we keep null in sort? (one sample file was given)
30. How do you eliminate duplicate records?
31. What kind of testing methods have you used in the project?
32. What kind of validation is done in unit testing?
33. What is the difference between a dbc file and a config file?
How do you create a table at pset level?
Skytech :
1. Tell me about yourself.
2. Tell me about your project.
3. How do you do the graph performance check?
4. How do you develop a generic graph? How do you know whether a graph is a generic or a custom
graph?
5. How do you validate the header in a generic graph?
6. What types of header validations can we do?
7. Different files are coming from different sources, so how can you find out how to delete the header?
8. Most complex components you have worked with?
9. Have you worked on any continuous flows?
10. Main reason for using plans?
11. What is the difference between running the graph using plans and using scripting?
12. Have you written any shell scripts? In which cases did you write them?
Unknown company
1)What is the function you would use to transfer a string into a decimal?
2)How many parallelisms are in Abinitio? Please give a definition of each.
3)What is the difference between a DB config and a CFG file?
4)Have you ever encountered an error called "Pipeline Broken"? (This occurs when you
extensively create graphs; it is a trick question)
5)How do you truncate a table? (Each candidate would say only 1 of the several ways to do
this.)
6)How do you improve the performance of a graph?
7)What is the difference between partitioning with key and round robin?
8)Have you worked with packages?
9)How do you add default rules in transformer?
10)What is a ramp limit and Maxcore values for Scan, Rollup, Sort, Replicate?
11)Have you used rollup component? Describe how.
12)How many components in your most complicated graph?
13)Do you know what a local lookup is?
14)What is Ad hoc multifile? How is it used?
Unknown company
1. How to create sandbox by using command?
2. Is it possible to create sandbox without using GDE? If it is possible what is difference?
3. How to move the multi file path to sandbox?
4. What is the difference between the lookup() and lookup_local() functions?
5. Tell me right outer join query?
6. How to display a file and delete same file at a time by using UNIX?
7. Can you do read and write in single graph?
8. How to take the max-core value each partition?
9. What is mount?
10. What is AB_JOB?
11. What is component folding?
12. How can you find out job executed successfully or not?
13. How can you find out graph is executing in which phase?
14. What is the difference between primary key and primary index?
15. How can you change the primary index?
16. What is the difference between soft-link and hard-link?
17. How can you find the file size?
18. What is wrapper script?
19. How can you find the process id?
20. What is top-down and bottom-up approach?
21. Can you draw your project architecture?
22. What is granularity?
23. How to delete the duplicates in file?
24. What is the query to delete the duplicate rows in the table?
25. How to find out the 10th maximum salary in table?
26. How to find out the second highest salary by using ab initio?
27. What is AB_WORK_DIR?
28. What type of information is there in the .ksh?
29. What is there in AB_HOME?
30. Where is the Co>Operating System path stored?
31. When is the .ksh created?
32. How can you deploy the script?
33. If I pass 10 value in select expression of filter by expression component what is the
result?
34. How can you store file data into table?
35. Difference between dimension and fact table?
36. How can you run the graph in UNIX?
37. How to move the graphs from one environment to another environment?
38. How can you schedule the jobs?
39. How to retrieve the duplicates in the table?
40. If I use next_in_sequence() in a multifile, will you get duplicates or not?
41. How can you apply next_in_sequence() in a multifile?
42. If I set max-core less, which type of error will you get?
Ans: Using rollup we can't generate cumulative summary records; for that we
use scan.
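A small Python model of the difference described in this answer (not Ab Initio code): rollup gives one summary record per key group, scan gives one record per input carrying the running total. The function names `rollup_sum` and `scan_sum` and the sample data are hypothetical, and the input is assumed to be already grouped by key.

```python
from itertools import accumulate, groupby


def rollup_sum(records):
    """One summary record per key group (records must be grouped by key)."""
    return [
        (k, sum(v for _, v in grp))
        for k, grp in groupby(records, key=lambda r: r[0])
    ]


def scan_sum(records):
    """One output per input record, carrying the running total within its key group."""
    out = []
    for k, grp in groupby(records, key=lambda r: r[0]):
        values = [v for _, v in grp]
        out.extend((k, total) for total in accumulate(values))
    return out


sales = [("east", 10), ("east", 5), ("west", 7), ("west", 3), ("west", 1)]
rollup_out = rollup_sum(sales)  # 2 records, one per region
scan_out = scan_sum(sales)      # 5 records, cumulative per region
```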
What is the difference between partitioning with key and round robin?
Ans: A .dbc file has the information required for Ab Initio to connect to the database
to extract or load tables or views, while a .cfg file is the table configuration file created
by db_config when using components like Load DB Table.
What is the function you would use to transfer a string into a decimal?
Ans: For converting a string to a decimal we need to typecast it using the following
syntax,
out.decimal_field :: ( decimal( size_of_decimal ) ) string_field;
The above statement converts the string to decimal and populates it to the decimal
field in output.
How to execute the graph from start to end stages? And how do you run a graph on a
non-Ab Initio system?
Ans: There are many ways to do this; I am giving one example due to time
constraints. You can run components according to the phases you defined.
You can also run it by creating ksh or sh scripts.
Ans: Data mapping deals with the transformation of the extracted data at FIELD level,
i.e. the transformation of the source field to the target field is specified by the mapping
defined on the target field. The data mapping is specified during the cleansing of the
data to be loaded.
For example:
source;
string(35) name = "Siva Krishna ";
target;
string("01") nm=NULL("");/*(maximum length is string(35))*/
Then we can have a mapping like:
Straight move. Trim the leading or trailing spaces.
The above mapping specifies the transformation of the field nm.
What is the difference between sandbox and EME? Can we perform checkin
and checkout through the sandbox? Can anybody explain checkin and checkout?
Ans: Sandboxes are work areas used to develop, test or run code associated with a
given project. Only one version of the code can be held within the sandbox at any
time.
The EME Datastore contains all versions of the code that have been checked into it. A
particular sandbox is associated with only one project, whereas a project can be
checked out to a number of sandboxes.
Ans: There are many ways to improve the performance of graphs in Ab Initio.
I have a few points from my side.
1. Use an MFS with Partition by Round-robin.
2. If needed, use lookup local rather than lookup when there is a large amount of data.
3. Take out unnecessary components like Filter by Expression; instead provide the filter in
Reformat/Join/Rollup.
4. Use Gather instead of Concatenate.
5. Tune max-core for optimal performance.
6. Try to avoid extra phases.
What are the most commonly used components in an Ab Initio graph? Give an example of a
transformation of data, say customer data in a credit card company, into meaningful output
based on business rules.
Ans: The most commonly used components in any Ab Initio project are
input file/output file
input table/output table
lookup file
reformat, gather, join, run sql, join with db, compress components, sort, trash, partition by
expression, partition by key, concatenate
Difference between conventional loading and direct loading? When is it used in real time?
Ans: The .cfg file is for the remote connection and the .dbc file is for connecting to the database.
.cfg contains:
1. The name of the remote machine
2. The username/pwd to be used while connecting to the db.
3. The location of the operating system on the remote machine.
4. The connection method.
and the .dbc file contains:
1. The database name
2. Database version
3. Userid/pwd
4. Database character set and some more...
How do we run sequences of jobs, where the output of job A is the input to job B? How do we
coordinate the jobs?
Ans: By writing wrapper scripts we can control the sequence of execution of more
than one job.
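The idea of a wrapper controlling job order can be sketched as below. In practice the wrapper would usually be a ksh script calling deployed graph scripts; this Python version is only an illustration, and the job names and the stand-in commands are hypothetical. Each job runs only if the previous one exited with code 0.

```python
import subprocess
import sys


def run_in_sequence(jobs):
    """Run each job in order; stop at the first non-zero exit code.

    `jobs` is a list of (name, argv) pairs. Returns the names that completed.
    """
    completed = []
    for name, argv in jobs:
        result = subprocess.run(argv)
        if result.returncode != 0:
            print(f"{name} failed with exit code {result.returncode}; stopping")
            break
        completed.append(name)
    return completed


# Hypothetical stand-ins for deployed graph scripts (e.g. job_a.ksh, job_b.ksh).
ok = [sys.executable, "-c", "raise SystemExit(0)"]
bad = [sys.executable, "-c", "raise SystemExit(3)"]

done = run_in_sequence([("job_a", ok), ("job_b", bad), ("job_c", ok)])
```

Here job_c never starts because job_b fails, which is the behaviour you want when job B's output is job C's input.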
How would you do performance tuning for an already built graph? Can you give me some
examples?
Ans: Example: 1) Suppose a sort is used in front of a Merge component: there is no use in
the sort, because we have sort built into the Merge component.
2) We use a lookup instead of a Join or Merge component.
3) Suppose we want to join the data coming from 2 files and we don't want duplicates: we
will use a union function instead of adding an additional component to remove duplicates.
What is a semi-join?
Ans: If your source is a COBOL copybook, then we have a command in Unix which
generates the required DML in Ab Initio. Here it is:
cobol-to-dml.
Ans: Both are graph-level parameters, but for a local parameter you need to initialize the value
at the time of declaration, whereas for a global parameter there is no need to initialize the
value; it will prompt for that parameter at the time of running the graph.
Ans: Broadcast - Takes data from multiple inputs, combines it and sends it to all the
output ports.
E.g. - You have 2 incoming flows (this can be data parallelism or component
parallelism) on a Broadcast component, one with 10 records and the other with 20 records.
Then all the outgoing flows (there can be any number of flows) will have 10 + 20 = 30
records.
Replicate - It replicates the data for a particular partition and sends it out to multiple
out ports of the component, but maintains the partition integrity.
E.g. - Your incoming flow to Replicate has a data parallelism level of 2, with one partition
having 10 records and the other having 20 records. Now suppose you have 3 output flows from
Replicate. Then each flow will have 2 data partitions with 10 and 20 records respectively.
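The two examples in this answer (10 + 20 records, 3 outputs) can be modelled in a few lines of Python. This is a sketch of the record counts only, not Ab Initio code; partitions are plain lists and the function names are made up.

```python
def broadcast(input_partitions, n_out):
    """Every output partition receives all records from all input partitions."""
    combined = [rec for part in input_partitions for rec in part]
    return [list(combined) for _ in range(n_out)]


def replicate(input_partitions, n_flows):
    """Every output flow receives a copy of the input, partition by partition."""
    return [[list(part) for part in input_partitions] for _ in range(n_flows)]


p0 = list(range(10))        # partition with 10 records
p1 = list(range(100, 120))  # partition with 20 records

b = broadcast([p0, p1], n_out=3)
r = replicate([p0, p1], n_flows=3)

broadcast_counts = [len(out) for out in b]
replicate_counts = [[len(part) for part in flow] for flow in r]
```

Broadcast gives three outputs of 30 records each, while Replicate gives three flows that each keep the original 10/20 split.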
What is m_dump
Ans: In Run ==> Deploy >> As script, it creates a .bat file in your host directory; you
then run the .bat file from the command prompt.
Ans: Max-core is a value in bytes. Whenever a component is executed, it may use
up to the amount of memory we specified for execution.
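One way to picture a memory limit like max-core is a sort that holds only a bounded number of records at a time. The sketch below is an assumption-laden Python model, not how any Ab Initio component is implemented: it assumes the commonly described behaviour that a component spills sorted chunks to temporary storage once the limit is reached, and it uses a record count (`max_in_memory`, a made-up name) in place of bytes.

```python
import heapq


def sort_with_memory_limit(records, max_in_memory):
    """Sort records while buffering at most `max_in_memory` of them at once.

    Each full buffer is sorted and set aside as a "run" (a real component
    would write it to a temporary file); the runs are then merged.
    Returns the sorted records and the number of runs produced.
    """
    runs, buffer = [], []
    for rec in records:
        buffer.append(rec)
        if len(buffer) == max_in_memory:
            runs.append(sorted(buffer))
            buffer = []
    if buffer:
        runs.append(sorted(buffer))
    return list(heapq.merge(*runs)), len(runs)


data = [9, 3, 7, 1, 8, 2, 6, 4, 5]
sorted_all, runs_big = sort_with_memory_limit(data, max_in_memory=100)
sorted_small, runs_small = sort_with_memory_limit(data, max_in_memory=4)
```

With a generous limit everything is sorted in one run; with a small limit the same result needs several runs plus a merge, which is why a too-small value tends to cost extra I/O.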
Ans: To convert a 4-way to an 8-way partition we need to change the layout in the
partitioning component. There will be separate parameters for each type of
partitioning, e.g. AI_MFS_HOME, AI_MFS_MEDIUM_HOME, AI_MFS_WIDE_HOME etc.
The appropriate parameter needs to be selected in the component layout for the type of
partitioning.
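What changing the depth means for the data can be sketched in Python: records from 4 partitions are redistributed across 8. This is a simplified model, assuming a single round-robin pass over all records; a real partition component works per input partition, and the function name here is hypothetical.

```python
def repartition_round_robin(partitions, new_depth):
    """Redistribute records from the given partitions across `new_depth` partitions."""
    out = [[] for _ in range(new_depth)]
    i = 0
    for part in partitions:
        for rec in part:
            out[i % new_depth].append(rec)
            i += 1
    return out


four_way = [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12], [13, 14, 15, 16]]
eight_way = repartition_round_robin(four_way, 8)
```

Every record ends up in exactly one of the 8 output partitions, and the load stays even because the records are dealt out in turn.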
What is meant by the Co>Operating System and why is it special for Ab Initio?
Ans: It converts the Ab Initio-specific code into a format which UNIX/Windows
can understand and feeds it to the native operating system, which carries out the task.
Which one is faster for processing, fixed-length DMLs or delimited DMLs, and why?
Ans: Fixed-length DMLs are faster because the data of that length is read directly
without any comparisons, whereas in delimited ones every character has to be compared,
which causes delays.
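The reasoning in this answer can be illustrated with two toy parsers in Python (not DML): the fixed-length one jumps straight to known offsets, while the delimited one has to look at every character to find the field boundaries. The record layout and field widths are made up for the example.

```python
def parse_fixed(data, widths):
    """Split one fixed-length record by slicing at known offsets."""
    fields, pos = [], 0
    for w in widths:
        fields.append(data[pos:pos + w])
        pos += w
    return fields


def parse_delimited(data, delimiter="|"):
    """Split one delimited record by inspecting every character."""
    fields, current = [], []
    for ch in data:
        if ch == delimiter:
            fields.append("".join(current))
            current = []
        else:
            current.append(ch)
    fields.append("".join(current))
    return fields


fixed_record = "0001" + "Siva".ljust(10) + "0250"  # widths 4, 10, 4
fixed_fields = parse_fixed(fixed_record, [4, 10, 4])
delimited_fields = parse_delimited("0001|Siva|0250")
```

Both calls recover the same three fields; the fixed-length version does it with three slices, the delimited version with one comparison per character.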
Ans: Continuous components are used to create graphs that produce useful output files
while running continuously.
E.g. Continuous Rollup, Continuous Update, Batch Subscribe
How to retrieve data from a database to the source? Which component is used for this?
Ans: To unload (retrieve) data from a DB2, Informix or Oracle database we have
components like Input Table and Unload DB Table; using these two components we
can unload data from the database.
Ans: EME is the Enterprise Meta>Environment, GDE is the Graphical Development Environment, and
the Co>Operating System can be described as the Ab Initio server. The relation between the
Co>Op, EME and GDE is as follows:
The Co>Operating System is the Ab Initio server. It is installed on a particular OS
platform, which is called the native OS. The EME holds the metadata, transformations, db
config files, and source and target information. The GDE is the end-user environment where we
can develop the graphs (like mappings in Informatica).
The designer uses the GDE to design the graphs and saves them to the EME or the sandbox. The GDE
is on the user side, whereas the EME is on the server side.
Ans: Basically there are serial and parallel layouts supported by Ab Initio. A graph can
have both at the same time. The parallel one depends on the degree of data
parallelism. If the multifile system is 4-way parallel then a component in a graph can
run 4-way parallel if its layout is defined to match that degree of
parallelism.
Ans: This is a tricky question; the number of components in a graph has nothing to do with the
level of knowledge a person has. On the contrary, a properly standardized,
modular, parametric approach will reduce the number of components to a very few. In
a well-thought-out modular and parametric design, most graphs will have 3 or 4
components, which do a particular task and then call another set of
graphs to do the next task, and so on. This way the total number of distinct graphs will
come down drastically, and support and maintenance will be much simpler.
The bottom line is that there are a lot of other things to plan rather than just adding
components.
Ans: If the DML changes dynamically then both the DML and the xfr have to be passed as
graph-level parameters at run time.
Ans: Packages are nothing but reusable blocks of objects like transforms,
user-defined functions, DMLs etc. These packages are to be included in the transform where
you use them. For example, consider a user-defined function like
/*string_trim.xfr*/
out :: trim(input_string) =
begin
  let string(35) trimmed_string = string_lrtrim(input_string);
  out :: trimmed_string;
end;
Now, the above xfr can be included in the transform where you call the above function
as
include "~/xfr/string_trim.xfr";
But this should be included ABOVE your transform function.
For more details see the help file in "packages".
Ans: In an RDBMS the relationship between two tables is represented as a primary key
and foreign key relationship, where the primary key table is the parent table and the
foreign key table is the child table. The criterion for both tables is that there should be a
matching column.
Ans: Do you mean merging GUI map files in WR? If so, merging GUI map files in the
GUI map editor won't create a corresponding test script. Without a test script you can't run
a file. So it is impossible to run a file by merging 2 GUI map files.
In the context of databases and data management, the terms "truncate,"
"drop," and "delete" refer to different operations that can be performed on
database tables. Here's an explanation of each command: