Cargando…

Learning Pentaho Data Integration 8 CE - Third Edition.

Get up and running with the Pentaho Data Integration tool using this hands-on, easy-to-read guide About This Book Manipulate your data by exploring, transforming, validating, and integrating it using Pentaho Data Integration 8 CE A comprehensive guide exploring the features of Pentaho Data Integrati...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Roldan, Maria Carina
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham : Packt Publishing, 2017.
Edición:3rd ed.
Temas:
Acceso en línea:Texto completo

MARC

LEADER 00000cam a2200000Mu 4500
001 EBOOKCENTRAL_on1015876294
003 OCoLC
005 20240329122006.0
006 m o d
007 cr |n|---|||||
008 171223s2017 enk o 000 0 eng d
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d MERUC  |d UIU  |d OCLCF  |d CHVBK  |d IDEBK  |d IDB  |d NLE  |d COO  |d OCLCQ  |d UOK  |d TEFOD  |d OCLCQ  |d WYU  |d LVT  |d C6I  |d UKAHL  |d VT2  |d OCLCQ  |d UKMGB  |d LOY  |d OCLCQ  |d OCLCO  |d K6U  |d OCLCQ  |d OCLCO  |d OCLCL 
015 |a GBB7O2790  |2 bnb 
016 7 |a 018646989  |2 Uk 
019 |a 1019733596  |a 1125095790 
020 |a 9781788290074  |q (electronic bk.) 
020 |a 1788290070  |q (electronic bk.) 
020 |z 9781788292436 
020 |z 178829243X 
029 1 |a AU@  |b 000067105877 
029 1 |a CHNEW  |b 000981704 
029 1 |a CHVBK  |b 507496825 
029 1 |a UKMGB  |b 018646989 
029 1 |a AU@  |b 000062004317 
035 |a (OCoLC)1015876294  |z (OCoLC)1019733596  |z (OCoLC)1125095790 
037 |a 600F5D36-20D1-441A-8633-99E9DD9AFB2C  |b OverDrive, Inc.  |n http://www.overdrive.com 
050 4 |a QA76.9.D3  |b .R653 2017eb 
082 0 4 |a 001.6425  |2 23 
049 |a UAMI 
100 1 |a Roldan, Maria Carina. 
245 1 0 |a Learning Pentaho Data Integration 8 CE - Third Edition. 
250 |a 3rd ed. 
260 |a Birmingham :  |b Packt Publishing,  |c 2017. 
300 |a 1 online resource (487 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
520 |a Get up and running with the Pentaho Data Integration tool using this hands-on, easy-to-read guide About This Book Manipulate your data by exploring, transforming, validating, and integrating it using Pentaho Data Integration 8 CE A comprehensive guide exploring the features of Pentaho Data Integration 8 CE Connect to any database engine, explore the databases, and perform all kind of operations on relational databases Who This Book Is For This book is a must-have for software developers, business intelligence analysts, IT students, or anyone involved or interested in developing ETL solutions. If you plan on using Pentaho Data Integration for doing any data manipulation task, this book will help you as well. This book is also a good starting point for data warehouse designers, architects, or anyone who is responsible for data warehouse projects and needs to load data into them. What You Will Learn Explore the features and capabilities of Pentaho Data Integration 8 Community Edition Install and get started with PDI Learn the ins and outs of Spoon, the graphical designer tool Learn to get data from all kind of data sources, such as plain files, Excel spreadsheets, databases, and XML files Use Pentaho Data Integration to perform CRUD (create, read, update, and delete) operations on relationaldatabases Populate a data mart with Pentaho Data Integration Use Pentaho Data Integration to organize files and folders, run daily processes, deal with errors, and more In Detail Pentaho Data Integration(PDI) is an intuitive and graphical environment packed with drag-and-drop design and powerful Extract-Tranform-Load (ETL) capabilities. This book shows and explains the new interactive features of Spoon, the revamped look and feel, and the newest features of the tool including transformations and jobs Executors and the invaluable Metadata Injection capability. We begin with the installation of PDI software and then move on to cover all the key PDI concepts. Each of the chapter introduces new features, enabling you to gradually get practicing with the tool. First, you will learn to do all kind of data manipulation and work with simple plain files. Then, the book teaches you how you can work with relational databases inside PDI. Moreover, you will be given a primer on data warehouse concepts and you will learn how to load data in a data warehouse. During the course of this book, you will be familiarized with its intuitive, graphical and drag-and-drop design envir ... 
505 0 |a Cover -- Title Page -- Copyright -- Credits -- About the Author -- About the Reviewers -- www.PacktPub.com -- Customer Feedback -- Table of Contents -- Preface -- Chapter 1: Getting Started with Pentaho Data Integration -- Pentaho Data Integration and Pentaho BI Suite -- Introducing Pentaho Data Integration -- Using PDI in real-world scenarios -- Loading data warehouses or data marts -- Integrating data -- Data cleansing -- Migrating information -- Exporting data -- Integrating PDI along with other Pentaho tools -- Installing PDI -- Launching the PDI Graphical Designer -- Spoon -- Starting and customizing Spoon -- Exploring the Spoon interface -- Extending the PDI functionality through the Marketplace -- Introducing transformations -- The basics about transformations -- Creating a Hello World! Transformation -- Designing a Transformation -- Previewing and running a Transformation -- Installing useful related software -- Summary -- Chapter 2: Getting Started with Transformations -- Designing and previewing transformations -- Getting familiar with editing features -- Using the mouseover assistance toolbar -- Adding steps and creating hops -- Working with grids -- Designing transformations -- Putting the editing features in practice -- Previewing and fixing errors as they appear -- Looking at the results in the execution results pane -- The Logging tab -- The Step Metrics tab -- Running transformations in an interactive fashion -- Understanding PDI data and metadata -- Understanding the PDI rowset -- Adding or modifying fields by using different PDI steps -- Explaining the PDI data types -- Handling errors -- Implementing the error handling functionality -- Customizing the error handling -- Summary -- Chapter 3: Creating Basic Task Flows -- Introducing jobs -- Learning the basics about jobs -- Creating a Simple Job -- Designing and running jobs. 
505 8 |a Revisiting the Spoon interface and the editing features -- Designing jobs -- Getting familiar with the job design process -- Looking at the results in the Execution results window -- The Logging tab -- The Job metrics tab -- Enriching your work by sending an email -- Running transformations from a Job -- Using the Transformation Job Entry -- Understanding and changing the flow of execution -- Changing the flow of execution based on conditions -- Forcing a status with an abort Job or success entry -- Changing the execution to be synchronous -- Managing files -- Creating a Job that moves some files -- Selecting files and folders -- Working with regular expressions -- Summarizing the Job entries that deal with files -- Customizing the file management -- Knowing the basics about Kettle variables -- Understanding the kettle.properties file -- How and when you can use variables -- Summary -- Chapter 4: Reading and Writing Files -- Reading data from files -- Reading a simple file -- Troubleshooting reading files -- Learning to read all kind of files -- Specifying the name and location of the file -- Reading several files at the same time -- Reading files that are compressed or located on a remote server -- Reading a file whose name is known at runtime -- Describing the incoming fields -- Reading Date fields -- Reading Numeric fields -- Reading only a subset of the file -- Reading the most common kinds of sources -- Reading text files -- Reading spreadsheets -- Reading XML files -- Reading JSON files -- Outputting data to files -- Creating a simple file -- Learning to create all kind of files and write data into them -- Providing the name and location of an output file -- Creating a file whose name is known only at runtime -- Creating several files whose name depend on the content of the file -- Describing the content of the output file. 
505 8 |a Formatting Date fields -- Formatting Numeric fields -- Creating the most common kinds of files -- Creating text files -- Creating spreadsheets -- Creating XML files -- Creating JSON files -- Working with Big Data and cloud sources -- Reading files from an AWS S3 instance -- Writing files to an AWS S3 instance -- Getting data from HDFS -- Sending data to HDFS -- Summary -- Chapter 5: Manipulating PDI Data and Metadata -- Manipulating simple fields -- Working with strings -- Extracting parts of strings using regular expressions -- Searching and replacing using regular expressions -- Doing some math with Numeric fields -- Operating with dates -- Performing simple operations on dates -- Subtracting dates with the Calculator step -- Getting information relative to the current date -- Using the Get System Info step -- Performing other useful operations on dates -- Getting the month names with a User Defined Java Class step -- Modifying the metadata of streams -- Working with complex structures -- Working with XML -- Introducing XML terminology -- Getting familiar with the XPath notation -- Parsing XML structures with PDI -- Reading an XML file with the Get data from XML step -- Parsing an XML structure stored in a field -- PDI Transformation and Job files -- Parsing JSON structures -- Introducing JSON terminology -- Getting familiar with the JSONPath notation -- Parsing JSON structures with PDI -- Reading a JSON file with the JSON input step -- Parsing a JSON structure stored in a field -- Summary -- Chapter 6: Controlling the Flow of Data -- Filtering data -- Filtering rows upon conditions -- Reading a file and getting the list of words found in it -- Filtering unwanted rows with a Filter rows step -- Filtering rows by using the Java Filter step -- Filtering data based on row numbers -- Splitting streams unconditionally -- Copying rows. 
505 8 |a Distributing rows -- Introducing partitioning and clustering -- Splitting the stream based on conditions -- Splitting a stream based on a simple condition -- Exploring PDI steps for splitting a stream based on conditions -- Merging streams in several ways -- Merging two or more streams -- Customizing the way of merging streams -- Looking up data -- Looking up data with a Stream lookup step -- Summary -- Chapter 7: Cleansing, Validating, and Fixing Data -- Cleansing data -- Cleansing data by example -- Standardizing information -- Improving the quality of data -- Introducing PDI steps useful for cleansing data -- Dealing with non-exact matches -- Cleansing by doing a fuzzy search -- Deduplicating non-exact matches -- Validating data -- Validating data with PDI -- Validating and reporting errors to the log -- Introducing common validations and their implementation with PDI -- Treating invalid data by splitting and merging streams -- Fixing data that doesn't match the rules -- Summary -- Chapter 8: Manipulating Data by Coding -- Doing simple tasks with the JavaScript step -- Using the JavaScript language in PDI -- Inserting JavaScript code using the JavaScript step -- Adding fields -- Modifying fields -- Organizing your code -- Controlling the flow using predefined constants -- Testing the script using the Test script button -- Parsing unstructured files with JavaScript -- Doing simple tasks with the Java Class step -- Using the Java language in PDI -- Inserting Java code using the Java Class step -- Learning to insert java code in a Java Class step -- Data types equivalence -- Adding fields -- Modifying fields -- Controlling the flow with the putRow() function -- Testing the Java Class using the Test class button -- Getting the most out of the Java Class step -- Receiving parameters -- Reading data from additional steps. 
505 8 |a Redirecting data to different target steps -- Parsing JSON structures -- Avoiding coding using purpose-built steps -- Summary -- Chapter 9: Transforming the Dataset -- Sorting data -- Sorting a dataset with the sort rows step -- Working on groups of rows -- Aggregating data -- Summarizing the PDI steps that operate on sets of rows -- Converting rows to columns -- Converting row data to column data using the Row denormaliser step -- Aggregating data with a Row Denormaliser step -- Normalizing data -- Modifying the dataset with a Row Normaliser step -- Going forward and backward across rows -- Picking rows backward and forward with the Analytic Query step -- Summary -- Chapter 10: Performing Basic Operations with Databases -- Connecting to a database and exploring its content -- Connecting with Relational Database Management Systems -- Exploring a database with the Database Explorer -- Previewing and getting data from a database -- Getting data from the database with the Table input step -- Using the Table input step to run flexible queries -- Adding parameters to your queries -- Using Kettle variables in your queries -- Inserting, updating, and deleting data -- Inserting new data into a database table -- Inserting or updating data with the Insert / Update step -- Deleting records of a database table with the Delete step -- Performing CRUD operations with more flexibility -- Verifying a connection, running DDL scripts, and doing other useful tasks -- Looking up data in different ways -- Doing simple lookups with the Database Value Lookup step -- Making a performance difference when looking up data in a database -- Performing complex database lookups -- Looking for data using a Database join step -- Looking for data using a Dynamic SQL row step -- Summary -- Chapter 11: Loading Data Marts with PDI -- Preparing the environment. 
590 |a ProQuest Ebook Central  |b Ebook Central Academic Complete 
650 0 |a Data integration (Computer science) 
650 0 |a Data mining. 
650 0 |a Decision support systems. 
650 2 |a Data Mining 
650 2 |a Decision Support Systems, Management 
650 6 |a Intégration de données (Informatique) 
650 6 |a Exploration de données (Informatique) 
650 6 |a Systèmes d'aide à la décision. 
650 7 |a Data integration (Computer science)  |2 fast 
650 7 |a Data mining  |2 fast 
650 7 |a Decision support systems  |2 fast 
758 |i has work:  |a Learning Pentaho Data Integration 8 CE - Third Edition (Text)  |1 https://id.oclc.org/worldcat/entity/E39PCY3twrg8rjvMHXcVhjmBGd  |4 https://id.oclc.org/worldcat/ontology/hasWork 
776 0 8 |i Print version:  |a Roldan, Maria Carina.  |t Learning Pentaho Data Integration 8 CE - Third Edition.  |d Birmingham : Packt Publishing, ©2017 
856 4 0 |u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=5178186  |z Texto completo 
938 |a Askews and Holts Library Services  |b ASKH  |n AH33743714 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL5178186 
938 |a ProQuest MyiLibrary Digital eBook Collection  |b IDEB  |n cis38039491 
994 |a 92  |b IZTAP