• Post category:StudyBullet-3
  • Reading time:12 mins read


Talend Tutotrial with Talend Open Studio for Data Integration, Big Data, ETL, Data Warehouse, Data Migration, Cloud, BI

What you will learn

connect your data sources, such as files, databases, XML, web services, Google Drive and more formats

build your own integration processes using practical examples and comprehensive scenarios

master the most important transformations like mappings, joins, aggregations and sorting

orchestrate processes into larger units by using preJobs, postJobs, variable and hierachies

Description

Data. Everywhere. All well-behaved in their own environment. But who actually lets them talk to each other? You do. With data integration. Become a data savant and add value with ETL and your new knowledge!


Get Instant Notification of New Courses on our Telegram channel.


Talend Open Studio is an open, flexible data integration solution. You build your processes with a graphical editor and over 600 components provide flexibility.

Each section has a practical example and you will receive this complete material at the beginning of the course. So you can not only view each section, but also compare it to your own solution. There are also extensive practical scenarios included. So you’ll be well equipped for practice!

What are the biggest topics you can expect?

  • Installation on different operating systems (Windows, Linux, Mac)
  • understanding and using important data types
  • reading and writing from databases
  • process different file formats, like Excel, XML, JSON, delimited, positional
  • create and use metadata
  • build schemas
  • use helpful keyboard shortcuts
  • retrieve data from WebServices
  • connect to GoogleDrive and fetch data
  • using iteration and loops
  • convert data flows into iterations
  • build and understand job hierarchies
  • All major transformations: Map, join, normalize, pivot, and aggregate data
  • create and extract XML and JSON
  • use regular expressions
  • Orchestrate components in processes
  • Check and improve data quality
  • Use fuzzy matching and interval matching
  • Use variables for different environments
  • Perform schema validation
  • Handle reject data separately
  • Find and fix errors quickly
  • Write meaningful logs
  • Include and react to warnings and aborts
  • Build job hierarchies and pass data between different levels
  • implement and test your own assumptions
  • configure your project for logging, versioning and context loading
  • learn best practices and establish your own
  • document items and have documentation generated

What are you waiting for? See you in the course!

English
language

Content

Course Overview
Course Overview
Your maximum success
Why Data Integration?
Why Data Integration?
Setup
Intro
Installation on Windows and Linux
Language of Talend Open Studio
Import course project
Download course files
“Hello world” example
Intro
“Hello world” with tMsgBox
Get to know the UI
Intro
Get to know the user interface
Useful helpers
Install external libraries
Find errors
Your first job
Intro
Your first job
Process files
Intro
Read files
Write files
List folder contents
File processing scenario
Understand properties
Intro
BuiltIn vs. Repository
Schema
Data types
Connection types
Properties quiz for Talend Open Studio
Process databases
Intro
Connect to a database
Write to a database
Read from a database
Iterate over database contents
Database processing Scenario
Process other formats
Intro
Process JSON
Process XML
Google Drive access
Query REST service
File downloads
Read RSS feeds
Data Sources Quiz
Use variables
Intro
Create context variables
Create variables and environments
Data types in context
Central context
Create context for metadata
Write and load context
Load context implicitly
Global variables
Variables Quiz
Transformations
Intro
Filter data
Sort data
Aggregate data
Convert data types
Split data
De-Normalization
Simple joins
Install Sakila
Mapping with Join
Mapping with filter
Mapping with transformation
Field extraction: Intro
Field extraction: JSON
Field extraction: XML
Field extraction: Positional
Field extraction: Delimited
Field extraction: Regex
Field extraction: Outro
Generate XML and JSON fields
Generate sample data
Pivot data
Transformations Quiz
Data quality
Intro
Remove duplicates
Match intervals
Make substitutions
Schema checking
Redundancy key
Fuzzy matching
File Management
Intro
Basic operations
File comparison
File properties
File lists
File compression
Temporary files
Job orchestration
Intro
Pre- & Post-Job
MessageBox
Replicate and unite
Data flow to iteration & vice versa
Loops
Wait and measure
System interaction
Job hierarchies
Orchestration Quiz
Logging
Intro
Debug data
Test assertions
Log volumes
Log errors & warnings
Log executions
Log jobs
Log jobs better
Log projects
Logging Quiz
Documentation
Intro
Use best practices
Create documentation
Conclusion & Surprise
Congratulations