0% found this document useful (0 votes)
82 views

Talend Course Content

The Talend course content covers topics such as: 1) the role of open source ETL technologies and Talend concepts, 2) features of Talend including installation, GUI layout, and architecture, 3) reading from and writing to various data sources and file types, 4) using advanced transformation components, 5) parameterization with context variables and functions, and 6) scheduling jobs, controlling execution flow, and best practices for databases, files, APIs and more.

Uploaded by

Babjee Reddy
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
82 views

Talend Course Content

The Talend course content covers topics such as: 1) the role of open source ETL technologies and Talend concepts, 2) features of Talend including installation, GUI layout, and architecture, 3) reading from and writing to various data sources and file types, 4) using advanced transformation components, 5) parameterization with context variables and functions, and 6) scheduling jobs, controlling execution flow, and best practices for databases, files, APIs and more.

Uploaded by

Babjee Reddy
Copyright
© © All Rights Reserved
Available Formats
Download as DOCX, PDF, TXT or read online on Scribd
You are on page 1/ 3

Talend Course Content

1.Role of Open Source ETL Technologies in Big Data

 Overviews on: TOS (Talend Open Studio) for Data Integration


 ETL concepts
 Data warehousing concepts

2. Talend: A Revolution in Big Data

 Why Talend?
 Features
 Advantages
 Talend Installation/System Requirements
 GUI layout (designer)
 Understanding it's Basic Features
 Comparison with other market leader tools in ETL domain
 Important areas in Talend Architecture: Project
 Workspace
 Job
 Metadata
 Propagation
 Linking components

3. Talend: Read & Write various Types of Source/Target System

 Data Source Connection


 File as Source
 Create meta data
 Database as source
 Create metadata
 Using MySQL database (create tables, Insert, Update Data from talend)
 Read and write into excel files, into multiple tabs
 View data
 How to capture log and navigate around basic errors
 Role of tLogrow and how it makes developers life easy

4. Talend: How to Transform Your Business: Basic

 Using Advanced components like: tMap, tJoin, tFilter, tSortRow, tAggregateRow,


tReplicate, tSplit, Lookup, tRowGenerator

5. Talend: How to Transform Your Business: Advanced 1

 Trigger (types) and Row Types


 Context Variables (paramaterization)
 Functions (basic to advanced functions to transform business rules such as string,
date, mathematical etc.)
 Accessing job level / component level information within the job

6. Talend: How to Transform Your Business: Advanced 2

 Type Casting (convert data types among source-target platforms)


 Looping components (like tLoop, tFor)
 tFileList
 tRunJob
 How to schedule and run talend DI jobs externally (not in GUI)

Working with hierarchical file structures


o Read and Write an XML file, configure the schema and XPath expression to
parse an XML file

o Read and Write a JSON file, configure the schema and JSONPath expression
to parse a JSON file

o Read and write delimited, fixed width files.

Context Variables and global variables


o Create context/global variables

o Use context/global variables in the configuration of Talend components

o Load context variables from a flow

Best practices
o Working with databases and implementing data warehousing concepts.

o Working with files (excel, delimited, JSON, XML etc.)

o Working with REST APIs

Orchestration and controlling execution flow


o Files - Use components to list, archive, and delete files from a directory

o Database – Controlling Commit and Rollback

 COMMIT at end of job/ every x number of rows


 Rollback on error.

 Shared DB connection across jobs and subjobs

o Use triggers to connect components and subJobs.

o Orchestrate several jobs in master jobs.

o Handling Errors

 Kill a Job on a component error

 Implement a specific Job execution path on a component error

 Configure the log level in the console

You might also like