Cargando…

Pentaho data integration quick start guide : create ETL processes using Pentaho /

Chapter 6: Getting Started with PDI; Introducing PDI; Installing PDI; Configuring the graphical designer tool; Creating a simple transformation; Understanding the Kettle home directory; Summary; Other Books You May Enjoy; Index

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Roldán, María Carina (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham, UK : Packt Publishing, 2018.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)
Tabla de Contenidos:
  • Cover; Title Page; Copyright and Credits; Dedication; Packt Upsell; Foreword; Contributors; Table of Contents; Preface; Chapter 1: Getting Familiar with Spoon; Exploring the Spoon interface; Designing, previewing, and running transformations; Designing and previewing a transformation; Understanding the logging options; Understanding the Step Metrics tab; Dealing with errors while designing; Saving and running a transformation; Defining and using Kettle variables; Using named parameters; Running transformations with the Pan utility; Summary; Chapter 2: Extracting Data
  • Getting data from plain filesReading plain files; Reading files with great versatility; Reading files from remote locations; Reading files from Google Drive; Getting data from relational databases; Connecting to a database and using the database explorer; Getting data from a database; Getting data from other sources; XML and JSON; System information and Kettle variables; Combining different sources into a single dataset; Manipulating the metadata; Combining two different datasets into a single dataset; Summary; Chapter 3: Transforming Data; Transforming data in different ways
  • Extracting data from existing fieldsMore ways to create new fields; Sorting and aggregating data; Sorting data; Aggregating data; Filtering rows; Filtering rows upon conditions; Splitting the stream upon conditions; Looking up for data; Looking for data in a secondary stream; Looking up data in a database; Summary; Chapter 4: Loading Data; Generating different kinds of files; Inserting and updating data in database tables; Inserting data; Updating data; Handling errors; Loading a datamart; Loading a time dimension; Loading other kinds of dimensions
  • Loading a dimension with a combination lookup/update stepLoading a dimension with a dimension lookup/update step; Loading a fact table; Summary; Chapter 5: Orchestrating Your Work; Understanding the purpose of PDI jobs; Designing and running jobs; Creating and running a simple job ; Understanding the results of execution; Sequencing tasks; Taking a tour of the job entries; Sending emails; Combining the execution of jobs and transformations; Executing transformations from a job; Creating user-defined Kettle variables; Nesting transformations and jobs; Running jobs with the Kitchen utility