Pentaho Data Integration Beginner's Guide.
This book focuses on teaching you by example. It walks you through every aspect of Pentaho Data Integration, giving systematic instructions in a friendly style and letting you learn at your computer while experimenting with the tool. The extensive use of drawings and screenshots makes the proces...
Classification: | Electronic Book |
---|---|
Main author: | |
Format: | Electronic eBook |
Language: | English |
Published: | Birmingham : Packt Publishing, 2013. |
Edition: | 2nd ed. |
Subjects: | |
Online access: | Full text |
Table of Contents:
- Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with Pentaho Data Integration; Pentaho Data Integration and Pentaho BI Suite; Exploring the Pentaho Demo; Pentaho Data Integration; Using PDI in real world scenarios; Loading data warehouses or datamarts; Integrating data; Data cleansing; Migrating information; Exporting data; Integrating PDI along with other Pentaho tools; Installing PDI; Time for action - installing PDI; Launching the PDI graphical designer - Spoon.
- Time for action - starting and customizing Spoon; Spoon; Setting preferences in the Options window; Storing transformations and jobs in a repository; Creating your first transformation; Time for action - creating a hello world transformation; Directing Kettle engine with transformations; Exploring the Spoon interface; Designing a transformation; Running and previewing the transformation; Installing MySQL; Time for action - installing MySQL on Windows; Time for action - installing MySQL on Ubuntu; Summary; Chapter 2: Getting Started with Transformations; Designing and previewing transformations.
- Time for action - creating a simple transformation and getting familiar with the design process; Getting familiar with editing features; Using the mouse-over assistance toolbar; Working with grids; Understanding the Kettle rowset; Looking at the results in the Execution Results pane; The Logging tab; The Step Metrics tab; Running transformations in an interactive fashion; Time for action - generating a range of dates and inspecting the data as it is being created; Adding or modifying fields by using different PDI steps; The Select values step; Getting fields; Date fields; Handling errors.
- Time for action - avoiding errors while converting the estimated time from string to integer; The error handling functionality; Time for action - configuring the error handling to see the description of the errors; Personalizing the error handling; Summary; Chapter 3: Manipulating Real-world Data; Reading data from files; Time for action - reading results of football matches from files; Input files; Input steps; Reading several files at once; Time for action - reading all your files at a time using a single text file input step.
- Time for action - reading all your files at a time using a single text file input step and regular expressions; Regular expressions; Troubleshooting reading files; Sending data to files; Time for action - sending the results of matches to a plain file; Output files; Output steps; Getting system information; Time for action - reading and writing matches files with flexibility; The Get System Info step; Running transformations from a terminal window; Time for action - running the matches transformation from a terminal window; XML files.
- Time for action - getting data from an XML file with information about countries.