Getting started with Talend Open Studio for Data Integration.
In Detail Talend Open Studio for Data Integration (TOS) is an open source graphical development environment for creating custom integrations between systems. It comes with over 600 pre-built connectors that make it quick and easy to connect databases, transform files, load data, move, copy and renam...
Clasificación: | Libro Electrónico |
---|---|
Autor principal: | |
Formato: | Electrónico eBook |
Idioma: | Inglés |
Publicado: |
Birmingham, UK :
Packt Pub.,
2012.
|
Colección: | Community experience distilled.
|
Temas: | |
Acceso en línea: | Texto completo (Requiere registro previo con correo institucional) |
Tabla de Contenidos:
- Cover; Copyright; Credits; Foreword; About the Author; Acknowledgement; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1:Knowing Talend Open Studio; What Talend Open Studio is; Use cases; History of Talend Open Studio; Benefits of Talend Open Studio; Installing Talend Open Studio; Prerequisites; Installation guide; Other useful software; Text editor; MySQL; Sample jobs and data; Summary; Chapter 2: Working with Talend Open Studio; Studio definitions; Starting the Studio; Tour of the Studio; The Repository; The design workspace; The Palette; Configuration tabs
- Outline and Code panelsCreating a new project; Creating an example job; Metadata; Summary; Chapter 3: Transforming Files; Transforming XML to CSV; Transforming CSV to XML; Maps and expressions; Advanced XML output for complex XML structures; Working with multi-schema XML files; Enriching data with lookups; Extracting data from Excel files; Extracting data from multiple sheets; Joining data from multiple sheets; Summary; Chapter 4: Working with Databases; Database metadata; Extracting data from a database; Extracts from multiple tables; Joining within the database component
- Joining outside the database componentWriting data to a database; Database to database transfer; Modifying data in a database; Dynamic database lookup; Summary; Chapter 5: Filtering, Sorting, and Other Processing Techniques; Filtering data; Simple filter; Filter and rejects; Filter and split; Sorting data; Aggregating data; Normalizing and denormalizing data; Data normalization; Data denormalization; Extracting delimited fields; Find and replace; Sampling rows; Summary; Chapter 6: Managing Files; Managing local files; Copying files; Copying and removing files; Renaming files; Deleting files
- Time-stamping a fileListing files in a directory; Checking for files; Archiving and unarchiving files; FTP file operations; FTP Metadata; FTP Put; FTP Get; FTP File Exist; FTP File List and Rename; Deleting files on an FTP server; Summary; Chapter 7: Job Orchestration; What is a subjob?; A simple subjob; On Subjob Error; On Component OK; Run If; Jobs as subjobs; Iterating and looping; Iterate connections; ForEach loop; Loop ""n"" times; Infinite loop; Duplicating and merging dataflows; Duplicating data; Merging data; Summary; Chapter 8: Managing Jobs; Job versions
- Exporting and importing jobsExporting jobs; Exporting a project; Exporting a job; Exporting a job for execution; Importing jobs; Importing a project; Importing a job; Scheduling jobs; Summary; Chapter 9: Global Variables and Contexts; Global variables; Studio global variables; User defined global variables; Contexts; Embedded context variables; Repository context variables; External context variables; Complex context variables; Using embedded, repository, and external contexts; Summary; Chapter 10: Worked Examples; Product catalog; Data import from the ERP system