Hadoop MapReduce cookbook.
Individual self-contained code recipes. Solve specific problems using individual recipes, or work through the book to develop your capabilities. If you are a big data enthusiast and striving to use Hadoop to solve your problems, this book is for you. Aimed at Java programmers with some knowledge of...
Clasificación: | Libro Electrónico |
---|---|
Autor principal: | |
Otros Autores: | |
Formato: | Electrónico eBook |
Idioma: | Inglés |
Publicado: |
Birmingham, UK :
Packt Pub.,
2013.
|
Temas: | |
Acceso en línea: | Texto completo |
Tabla de Contenidos:
- Table of Contents; Hadoop MapReduce Cookbook; Hadoop MapReduce Cookbook; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Support files, eBooks, discount offers and more; Why Subscribe?; Free Access for Packt account holders; Preface; What this book covers; What you need for this book; Who this book is for; Conventions; Reader feedback; Customer support; Downloading the example code; Errata; Piracy; Questions; 1. Getting Hadoop Up and Running in a Cluster; Introduction; Setting up Hadoop on your machine; Getting ready; How to do it ... ; How it works ...
- Writing a WordCount MapReduce sample, bundling it, and running it using standalone HadoopGetting ready; How to do it ... ; How it works ... ; There's more ... ; Adding the combiner step to the WordCount MapReduce program; How to do it ... ; How it works ... ; There's more ... ; Setting up HDFS; Getting ready; How to do it ... ; How it works ... ; Using HDFS monitoring UI; Getting ready; How to do it ... ; HDFS basic command-line file operations; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Setting Hadoop in a distributed cluster environment; Getting ready; How to do it ... ; How it works ...
- There's more ... Running the WordCount program in a distributed cluster environment; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Using MapReduce monitoring UI; How to do it ... ; How it works ... ; 2. Advanced HDFS; Introduction; Benchmarking HDFS; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Adding a new DataNode; Getting ready; How to do it ... ; There's more ... ; Rebalancing HDFS; See also; Decommissioning DataNodes; How to do it ... ; How it works ... ; See also; Using multiple disks/volumes and limiting HDFS disk usage; How to do it ...
- Setting HDFS block sizeHow to do it ... ; There's more ... ; See also; Setting the file replication factor; How to do it ... ; How it works ... ; There's more ... ; See also; Using HDFS Java API; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Configuring the FileSystem object; Retrieving the list of data blocks of a file; See also; Using HDFS C API (libhdfs); Getting ready; How to do it ... ; How it works ... ; There's more ... ; Configuring using HDFS configuration files; See also; Mounting HDFS (Fuse-DFS); Getting ready; How to do it ... ; How it works ... ; There's more ... ; Building libhdfs.
- See alsoMerging files in HDFS; How to do it ... ; How it works ... ; 3. Advanced Hadoop MapReduce Administration; Introduction; Tuning Hadoop configurations for cluster deployments; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Running benchmarks to verify the Hadoop installation; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Reusing Java VMs to improve the performance; How to do it ... ; How it works ... ; Fault tolerance and speculative execution; How to do it ... ; How it works ... ; Debug scripts
- analyzing task failures; Getting ready; How to do it ...