Talend
Talend
com/blog/talend-interview-questions-answers/
What is The tMap Component? What are the Various Functions That can be Performed Using the tMap Component?
tMap in Talend is a core component of the ‘Processing’ family. It allows you to map the input to the output data.
Row: This connection represents the data flow. Some row connections are Lookup, Multiple Input/Output and
Uniques/Duplicates. Apart from these, Filter, Output, Rejects, ErrorRejects are also row connections.
Iterate: Using the iterate connection, you can perform a loop function on files in a file directory, rows or database entries.
Trigger: The dependency between Subjobs and Jobs triggered in the order as per the Trigger’s nature is created by Trigger.
Link: Using the Link connection, a user can transfer the information in a table schema to the ELT mapper in Talend
1.Subjob Triggers which include OnSubjobOK, OnSubjobError and Run if. OnSubjobOk is executed once the previous Subjob
has been executed.
2.Component Triggers which include OnComponentOK, OnComponentError and Run if. OnComponentOk is executed once
the previous component has been executed.
OnComponentOk OnSubjobOk
1. Belongs to Component Triggers 1. Belongs to Subjob Triggers
2. The linked Subjob starts executing only when the 2. The linked Subjob starts executing only when the
previous component successfully finishes its execution previous Subjob completely finishes its execution
3. This link can only be used with the first component of the
3. This link can be used with any component in a Job
Subjob
Differentiate between tMap and tJoin.
tMap tJoin
2. Can accept multiple input links (one is main and rest are
2. Can accept only two input links (main and lookup)
lookups)
3. Can have more than one output links 3. Can have only two output links (main and reject)
5. Supports inner join and left outer join 5. Supports only inner join
tDenormalizeSortedRow belongs to the ‘Processing’ family of the components. It helps in synthesizing sorted
input flow in order to save memory. It combines all input sorted rows in a group where the distinct values
are joined with item separators.
XMS parameter is used to specify the initial heap size in Java whereas XMX parameter is used to specify the
maximum heap size in Java.
o For simple Jobs, one can rely on the exception throwing process of Talend Open Studio, which is
displayed in the Run View as a red stack trace.
o Each Subjob and component has to return a code which leads the additional processing. The Subjob
Ok/Error and Component Ok/Error links can be used to direct the error towards an error handling
routine.
o The basic way of handling an error is to define an error handling Subjob which should execute
whenever an error occurs.
You can execute a Talend Job remotely from the command line. All you need to do is, export the job along with
its dependencies and then access its instructions files from the terminal.
This component transforms and routes the data from single or multiple sources to single or multiple destinations.
It is an advanced component which is sculpted for transforming and routing XML data flow. Especially when we
need to process numerous XML data sources.
How can you expand the performance of Talend job which has a complex design?
The tMap component supports multiple joins and joins models, which are as follows:
Join models: Unique join, First join and all join, etc.
The tReplicate component duplicates the incoming schema into two similar output flows. And it allows us to perform
different operations on the same schema. The tReplicate component is used to replicate a row as many times as needed.
Talend Studio allows a range of SQL templates to simplify the most common tasks. It also contains the SQL editor that
allows us to customize or design our SQL templates.
The SQL template is used with the components from the Talend ELT component which having the tSQLTemplate,
tSQLTemplateFilterColumns, tSQLTemplateRollback, tSQLTemplateCommit, tSQLTemplateAggregate,
tSQLTemplateFilterRows and tSQLTemplateMerge and these components execute the selected SQL statements.
With the help of these SQL templates, we can enhance the efficiency of our DBMS [database management system] by
storing and retrieving our data according to the structural requirements.
The tJoin component is used to perform the inner and outer join between the main data flow and lookup flow, and this
component helps us to ensure the data quality of any source data against a reference data source.
The tLogRow component is used to display data or results in the Run console window. It is mainly used to monitor data
processed.
The tSortRow component is used to sort the input data based on one or more columns by sort type and order.
The main objective of the tSortRow component is to help us to create metrics and classification of the table.
The tLoqateAddressRow component is used to compare address data against reference data to make sure that it is correct
and complete. If any changes needed, we can correct the spelling, add the missing address data like city, area of the city,
postcode or region, and any other related data.
The tXMLMap component is used to transform and route data from single or multiple sources to single or multiple
destinations.
http://rathinasamyy.blogspot.com/2015/02/talend-interview-questions-and-answers.html
1.(http://www.deepinopensource.com/talend-interview-questions/)
1. Talend – Merge multiple files into single file with sorting operation.
2. Loading Fact Table Using Talend
3. ROWNUM Analytical Function in Talend
4. SCD-2 Implementations in Talend
5. Deployment strategies in Talend
6. Custom Header Footer in Talend
7. Data Masking Using Talend
8. How to use Shared DB Connection in Talend
9. Load all rows from source to target except last 5
10. Late Arriving Dimension Using Talend
11. Date Dimension Using Talend
12. Dynamic Column Ordering Of Source File Using Talend
13. Incremental Load Using Talend
14. Getting Files From FTP Server
15. Initializing Context At Run Time Using Popup
16. User Define Function In Talend
17. Calling DB Sequence From Talend
2.(http://www.talendtutorials.com/talend-interview-questions)
1. Difference between tAggregatedRow and tAggregateSortedRow in Talend
2. How to resume job execution from same location if job get failed in Talend
3. How to execute more than one sub jobs parallel in Talend
4. How to iterate filename and directories in Talend
5. What is the difference between OnSubjobOK and OnComponentOK in Talend
6. How can you pass a value form parent job to child job in Talend
7. How to call stored procedure and function in Talend Job
8. How to export job and execute outside from Talend Studio
9. How to pass value from outside in Talend
10. Can I define schema of database or tables at run time
11. What is tReplicate in Talend
12. What is tUnite in Talend Open Studio
13. How to optimize talend job to stop outOfMemory runtime error
14. How to optimize Talend Performance
15. How to execute multipule SQL statements with one component in Talend
16. What is tSystem component in Talend
17. Can I execute multiple commands at one time with a tSystem component
18. What is difference between tMap and tFilterrow in Talend
3.(http://www.cram.com/flashcards/talend-interview-questions-5197224)
1. What is the difference between the ETL and ELT components of Talend Open Studio?
2. How does one deploy Talend projects?
3. What are the elements of a Talend project?
4. What is the most current version of Talend Open Studio?
5. How do you implement versioning for Talend jobs?
6. What is the tMap component?
7. What is the difference between the tMap and tJoin components?
8. Which *component* is used to sort data?
4.(http://msureshreddy.blogspot.in/2013/08/talend-interview-questions.html)