Hadoop mapreduce v2 cookbook : explore the hadoop mapreduce v2 ecosystem to gain insights from very large datasets /

If you are a Big Data enthusiast and wish to use Hadoop v2 to solve your problems, then this book is for you. This book is for Java programmers with little to moderate knowledge of Hadoop MapReduce. This is also a one-stop reference for developers and system admins who want to quickly get up to speed with using Hadoop v2. It would be helpful to have a basic knowledge of software development using Java and a basic working knowledge of Linux.

Bibliographic Details
Classification:	Electronic Book
Main Author:	Gunarathne, Thilina (Author)
Other Authors:	Jarek Blaminsky (Cover designer), Gordon, Edward (Editor), Lalwani, Puja (Editor), Paiva, Alfida (Editor), Subramanian, Laxmi (Editor)
Format:	Electronic eBook
Language:	English
Published:	Birmingham, England : Packt Publishing, 2015.
Edition:	Second edition.
Series:	Community experience distilled.
Subjects:	Electronic data processing > Distributed processing; File organization (Computer science); Traitement réparti; Fichiers (Informatique) > Organisation; COMPUTERS > Computer Literacy; COMPUTERS > Computer Science; COMPUTERS > Data Processing; COMPUTERS > Hardware > General; COMPUTERS > Information Technology; COMPUTERS > Machine Theory; COMPUTERS > Reference
Online Access:	Full text

MARC


LEADER	00000cam a2200000 i 4500
001	EBSCO_ocn905919092
003	OCoLC
005	20231017213018.0
006	m o d
007	cr cnu\|\|\|unuuu
008	150313s2015 enka o 001 0 eng d
040			\|a E7B \|b eng \|e rda \|e pn \|c E7B \|d OCLCO \|d N$T \|d OCLCQ \|d OCLCF \|d AZK \|d VT2 \|d WYU \|d RDF \|d OCLCO \|d OCLCQ \|d QGK
019			\|a 961492576 \|a 1259162596
020			\|a 9781783285488 \|q (electronic bk.)
020			\|a 1783285486 \|q (electronic bk.)
020			\|z 9781783285471
029	1		\|a AU@ \|b 000056944656
029	1		\|a CHNEW \|b 000705557
029	1		\|a DEBBG \|b BV042622513
035			\|a (OCoLC)905919092 \|z (OCoLC)961492576 \|z (OCoLC)1259162596
050		4	\|a QA76.9.D5 \|b .G863 2015
072		7	\|a COM \|x 013000 \|2 bisacsh
072		7	\|a COM \|x 014000 \|2 bisacsh
072		7	\|a COM \|x 018000 \|2 bisacsh
072		7	\|a COM \|x 067000 \|2 bisacsh
072		7	\|a COM \|x 032000 \|2 bisacsh
072		7	\|a COM \|x 037000 \|2 bisacsh
072		7	\|a COM \|x 052000 \|2 bisacsh
082	0	4	\|a 004.36 \|2 23
049			\|a UAMI
100	1		\|a Gunarathne, Thilina, \|e author.
245	1	0	\|a Hadoop mapreduce v2 cookbook : \|b explore the hadoop mapreduce v2 ecosystem to gain insights from very large datasets / \|c Thilina Gunarathne ; cover image by Jarek Blaminsky ; commissioning editor Edward Gordon ; copy editors Puja Lalwani, Alfida Paiva, Laxmi Subramanian.
250			\|a Second edition.
264		1	\|a Birmingham, England : \|b Packt Publishing, \|c 2015.
264		4	\|c ©2015
300			\|a 1 online resource (322 pages) : \|b illustrations (some color)
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
347			\|a data file
490	1		\|a Community experience distilled
500			\|a Includes index.
588	0		\|a Online resource; title from PDF title page (ebrary, viewed March 14, 2015).
505	0		\|a Cover; Copyright; Credits; About the Author; Acknowledgments; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with Hadoop v2; Introduction; Setting up Hadoop v2 on your local machine; Writing a WordCount MapReduce application, bundling it, and running it using Hadoop local mode; Adding a combiner step to the WordCount MapReduce program; Setting up HDFS; Setting up Hadoop YARN in a distributed cluster environment using Hadoop v2; Setting up Hadoop ecosystem in a distributed cluster environment using a Hadoop distribution
505	8		\|a HDFS command-line file operations; Running the WordCount program in a distributed cluster environment; Benchmarking HDFS using DFSIO; Benchmarking Hadoop MapReduce using TeraSort; Chapter 2: Cloud Deployments -- Using Hadoop YARN on Cloud Environments; Introduction; Running Hadoop MapReduce v2 computations using Amazon Elastic MapReduce; Saving money using Amazon EC2 Spot Instances to execute EMR job flows; Executing a Pig script using EMR; Executing a Hive script using EMR; Creating an Amazon EMR job flow using the AWS Command Line Interface
505	8		\|a Deploying an Apache HBase cluster on Amazon EC2 using EMR; Using EMR bootstrap actions to configure VMs for the Amazon EMR jobs; Using Apache Whirr to deploy an Apache Hadoop cluster in a cloud environment; Chapter 3: Hadoop Essentials -- Configurations, Unit Tests, and Other APIs; Introduction; Optimizing Hadoop YARN and MapReduce configurations for cluster deployments; Shared user Hadoop clusters -- using Fair and Capacity schedulers; Setting classpath precedence to user-provided JARs; Speculative execution of straggling tasks; Unit testing Hadoop MapReduce applications using MRUnit
505	8		\|a Integration testing Hadoop MapReduce applications using MiniYarnCluster; Adding a new DataNode; Decommissioning DataNodes; Using multiple disks/volumes and limiting HDFS disk usage; Setting the HDFS block size; Setting the file replication factor; Using the HDFS Java API; Chapter 4: Developing Complex Hadoop MapReduce Applications; Introduction; Choosing appropriate Hadoop data types; Implementing a custom Hadoop Writable data type; Implementing a custom Hadoop key type; Emitting data of different value types from a Mapper; Choosing a suitable Hadoop InputFormat for your input data format
505	8		\|a Adding support for new input data formats -- implementing a custom InputFormat; Formatting the results of MapReduce computations -- using Hadoop OutputFormats; Writing multiple outputs from a MapReduce computation; Hadoop intermediate data partitioning; Secondary sorting -- sorting Reduce input values; Broadcasting and distributing shared resources to tasks in a MapReduce job -- Hadoop DistributedCache; Using Hadoop with legacy applications -- Hadoop Streaming; Adding dependencies between MapReduce jobs; Hadoop counters for reporting custom metrics; Chapter 5: Analytics; Introduction
520			\|a If you are a Big Data enthusiast and wish to use Hadoop v2 to solve your problems, then this book is for you. This book is for Java programmers with little to moderate knowledge of Hadoop MapReduce. This is also a one-stop reference for developers and system admins who want to quickly get up to speed with using Hadoop v2. It would be helpful to have a basic knowledge of software development using Java and a basic working knowledge of Linux.
546			\|a English.
590			\|a eBooks on EBSCOhost \|b EBSCO eBook Subscription Academic Collection - Worldwide
650		0	\|a Electronic data processing \|x Distributed processing.
650		0	\|a File organization (Computer science)
650		6	\|a Traitement réparti.
650		6	\|a Fichiers (Informatique) \|x Organisation.
650		7	\|a COMPUTERS \|x Computer Literacy. \|2 bisacsh
650		7	\|a COMPUTERS \|x Computer Science. \|2 bisacsh
650		7	\|a COMPUTERS \|x Data Processing. \|2 bisacsh
650		7	\|a COMPUTERS \|x Hardware \|x General. \|2 bisacsh
650		7	\|a COMPUTERS \|x Information Technology. \|2 bisacsh
650		7	\|a COMPUTERS \|x Machine Theory. \|2 bisacsh
650		7	\|a COMPUTERS \|x Reference. \|2 bisacsh
650		7	\|a Electronic data processing \|x Distributed processing. \|2 fast \|0 (OCoLC)fst00906987
650		7	\|a File organization (Computer science) \|2 fast \|0 (OCoLC)fst00924147
700	1		\|a Jarek Blaminsky, \|e cover designer.
700	1		\|a Gordon, Edward, \|e editor.
700	1		\|a Lalwani, Puja, \|e editor.
700	1		\|a Paiva, Alfida, \|e editor.
700	1		\|a Subramanian, Laxmi, \|e editor.
776	0	8	\|i Print version: \|a Gunarathne, Thilina. \|t Hadoop mapreduce v2 cookbook : explore the hadoop mapreduce v2 ecosystem to gain insights from very large datasets. \|b Second edition. \|d Birmingham, England ; Mumbai, [India] : Packt Publishing, ©2015 \|h iv, 304 pages \|k Community experience distilled. \|z 9781783285471
830		0	\|a Community experience distilled.
856	4	0	\|u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=959553 \|z Full text
938			\|a ebrary \|b EBRY \|n ebr11025937
938			\|a EBSCOhost \|b EBSC \|n 959553
994			\|a 92 \|b IZTAP
