Cargando…

Pentaho Data Integration Beginner's Guide.

This book focuses on teaching you by example. The book walks you through every aspect of Pentaho Data Integration, giving systematic instructions in a friendly style, allowing you to learn in front of your computer, playing with the tool. The extensive use of drawings and screenshots make the proces...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Roldán, María Carina
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham : Packt Publishing, 2013.
Edición:2nd ed.
Temas:
Acceso en línea:Texto completo
Tabla de Contenidos:
  • Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with Pentaho Data Integration; Pentaho Data Integration and Pentaho BI Suite; Exploring the Pentaho Demo; Pentaho Data Integration; Using PDI in real world scenarios; Loading data warehouses or datamarts; Integrating data; Data cleansing; Migrating information; Exporting data; Integrating PDI along with other Pentaho tools; Installing PDI; Time for action
  • installing PDI; Launching the PDI graphical designer
  • Spoon.
  • Time for action
  • starting and customizing SpoonSpoon; Setting preferences in the Options window; Storing transformations and jobs in a repository; Creating your first transformation; Time for action
  • creating a hello world transformation; Directing Kettle engine with transformations; Exploring the Spoon interface; Designing a transformation; Running and previewing the transformation; Installing MySQL; Time for action
  • installing MySQL on Windows; Time for action
  • installing MySQL on Ubuntu; Summary; Chapter 2: Getting Started with Transformations; Designing and previewing transformations.
  • Time for action
  • creating a simple transformation and getting familiar with the design processGetting familiar with editing features; Using the mouse-over assistance toolbar; Working with grids; Understanding the Kettle rowset; Looking at the results in the Execution Results pane; The Logging tab; The Step Metrics tab; Running transformations in an interactive fashion; Time for action
  • generating a range of dates and inspecting the data as it is being created; Adding or modifying fields by using different PDI steps; The Select values step; Getting fields; Date fields; Handling errors.
  • Time for action
  • avoiding errors while converting the estimated time from string to integerThe error handling functionality; Time for action
  • configuring the error handling to see the description of the errors; Personalizing the error handling; Summary; Chapter 3: Manipulating Real-world Data; Reading data from files; Time for action
  • reading results of football matches from files; Input files; Input steps; Reading several files at once; Time for action
  • reading all your files at a time using a single text file input step.
  • Time for action
  • reading all your files at a time using a single text file input step and regular expressionsRegular expressions; Troubleshooting reading files; Sending data to files; Time for action
  • sending the results of matches to a plain file; Output files; Output steps; Getting system information; Time for action
  • reading and writing matches files with flexibility; The Get System Info step; Running transformations from a terminal window; Time for action
  • running the matches transformation from a terminal window; XML files.
  • Time for action
  • getting data from an XML file with information about countries.