Cargando…

Hadoop MapReduce cookbook.

Individual self-contained code recipes. Solve specific problems using individual recipes, or work through the book to develop your capabilities. If you are a big data enthusiast and striving to use Hadoop to solve your problems, this book is for you. Aimed at Java programmers with some knowledge of...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Perera, Srinath
Otros Autores: Gunarathne, Thilina
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham, UK : Packt Pub., 2013.
Temas:
Acceso en línea:Texto completo
Tabla de Contenidos:
  • Table of Contents; Hadoop MapReduce Cookbook; Hadoop MapReduce Cookbook; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Support files, eBooks, discount offers and more; Why Subscribe?; Free Access for Packt account holders; Preface; What this book covers; What you need for this book; Who this book is for; Conventions; Reader feedback; Customer support; Downloading the example code; Errata; Piracy; Questions; 1. Getting Hadoop Up and Running in a Cluster; Introduction; Setting up Hadoop on your machine; Getting ready; How to do it ... ; How it works ...
  • Writing a WordCount MapReduce sample, bundling it, and running it using standalone HadoopGetting ready; How to do it ... ; How it works ... ; There's more ... ; Adding the combiner step to the WordCount MapReduce program; How to do it ... ; How it works ... ; There's more ... ; Setting up HDFS; Getting ready; How to do it ... ; How it works ... ; Using HDFS monitoring UI; Getting ready; How to do it ... ; HDFS basic command-line file operations; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Setting Hadoop in a distributed cluster environment; Getting ready; How to do it ... ; How it works ...
  • There's more ... Running the WordCount program in a distributed cluster environment; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Using MapReduce monitoring UI; How to do it ... ; How it works ... ; 2. Advanced HDFS; Introduction; Benchmarking HDFS; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Adding a new DataNode; Getting ready; How to do it ... ; There's more ... ; Rebalancing HDFS; See also; Decommissioning DataNodes; How to do it ... ; How it works ... ; See also; Using multiple disks/volumes and limiting HDFS disk usage; How to do it ...
  • Setting HDFS block sizeHow to do it ... ; There's more ... ; See also; Setting the file replication factor; How to do it ... ; How it works ... ; There's more ... ; See also; Using HDFS Java API; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Configuring the FileSystem object; Retrieving the list of data blocks of a file; See also; Using HDFS C API (libhdfs); Getting ready; How to do it ... ; How it works ... ; There's more ... ; Configuring using HDFS configuration files; See also; Mounting HDFS (Fuse-DFS); Getting ready; How to do it ... ; How it works ... ; There's more ... ; Building libhdfs.
  • See alsoMerging files in HDFS; How to do it ... ; How it works ... ; 3. Advanced Hadoop MapReduce Administration; Introduction; Tuning Hadoop configurations for cluster deployments; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Running benchmarks to verify the Hadoop installation; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Reusing Java VMs to improve the performance; How to do it ... ; How it works ... ; Fault tolerance and speculative execution; How to do it ... ; How it works ... ; Debug scripts
  • analyzing task failures; Getting ready; How to do it ...