Pentaho data integration cookbook : over 100 recipes for building open source ETL solutions with Pentaho data integration /

Pentaho Data Integration Cookbook Second Edition is written in a cookbook format, presenting examples in the style of recipes. This allows you to go directly to your topic of interest, or follow topics throughout a chapter to gain a thorough in-depth knowledge. Pentaho Data Integration Cookbook Seco...

Bibliographic Details
Classification: Electronic Book
Main authors: Meadows, Alex (Author), Pulvirenti, Adrián Sergio (Author), Roldán, María Carina (Author)
Format: Electronic eBook
Language: English
Published: Birmingham : Packt Publishing, 2013.
Edition: Second edition.
Online access: Full text
Table of Contents:
  • Cover
  • Copyright
  • Credits
  • About the Author
  • About the Reviewers
  • www.PacktPub.com
  • Table of Contents
  • Preface
  • Chapter 1: Working with Databases
  • Introduction
  • Connecting to a database
  • Getting data from a database
  • Getting data from a database by providing parameters
  • Getting data from a database by running a query built at runtime
  • Inserting or updating rows in a table
  • Inserting new rows where a simple primary key has to be generated
  • Inserting new rows where the primary key has to be generated based on stored values
  • Deleting data from a table
  • Creating or altering a database table from PDI (design time)
  • Creating or altering a database table from PDI (runtime)
  • Inserting, deleting, or updating a table depending on a field
  • Changing the database connection at runtime
  • Loading a parent-child table
  • Building SQL queries via database metadata
  • Performing repetitive database design tasks from PDI
  • Chapter 2: Reading and Writing Files
  • Introduction
  • Reading a simple file
  • Reading several files at the same time
  • Reading semi-structured files
  • Reading files having one field per row
  • Reading files with some fields occupying two or more rows
  • Writing a simple file
  • Writing a semi-structured file
  • Providing the name of a file (for reading or writing) dynamically
  • Using the name of a file (or part of it) as a field
  • Reading an Excel file
  • Getting the value of specific cells in an Excel file
  • Writing an Excel file with several sheets
  • Writing an Excel file with a dynamic number of sheets
  • Reading data from an AWS S3 Instance
  • Chapter 3: Working with Big Data and Cloud Sources
  • Introduction
  • Loading data into Salesforce.com
  • Getting data from Salesforce.com
  • Loading data into Hadoop
  • Getting data from Hadoop
  • Loading data into HBase
  • Getting data from HBase
  • Loading data into MongoDB
  • Getting data from MongoDB
  • Chapter 4: Manipulating XML Structures
  • Introduction
  • Reading simple XML files
  • Specifying fields by using Path notation
  • Validating well-formed XML files
  • Validating an XML file against DTD definitions
  • Validating an XML file against an XSD schema
  • Generating a simple XML document
  • Generating complex XML structures
  • Generating an HTML page using XML and XSL transformations
  • Reading an RSS Feed
  • Generating an RSS Feed
  • Chapter 5: File Management
  • Introduction
  • Copying or moving one or more files
  • Deleting one or more files
  • Getting files from a remote server
  • Putting files on a remote server
  • Copying or moving a custom list of files
  • Deleting a custom list of files
  • Comparing files and folders
  • Working with ZIP files
  • Encrypting and decrypting files
  • Chapter 6: Looking for Data
  • Introduction