Cargando…

Optimizing Hadoop for MapReduce : learn how to configure your Hadoop cluster to run optimal MapReduce jobs /

This book is an example-based tutorial that deals with Optimizing Hadoop for MapReduce job performance. If you are a Hadoop administrator, developer, MapReduce user, or beginner, this book is the best choice available if you wish to optimize your clusters and applications. Having prior knowledge of...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Tannir, Khaled
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham, UK : Packt Pub., 2014.
Colección:Community experience distilled.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000 i 4500
001 OR_ocn883632323
003 OCoLC
005 20231017213018.0
006 m o d
007 cr cnu---unuuu
008 140714s2014 enka o 001 0 eng d
040 |a N$T  |b eng  |e rda  |e pn  |c N$T  |d UMI  |d E7B  |d COO  |d DEBBG  |d YDXCP  |d VT2  |d OCLCQ  |d OCLCO  |d OCLCF  |d OCLCO  |d D6H  |d AGLDB  |d OCLCQ  |d OCLCO  |d COCUF  |d ICA  |d CNNOR  |d OCLCQ  |d OCLCO  |d MOR  |d PIFAG  |d OCLCQ  |d OCLCO  |d U3W  |d REB  |d STF  |d VTS  |d CEF  |d NLE  |d INT  |d UKMGB  |d OCLCQ  |d WYU  |d OCLCO  |d G3B  |d ICG  |d TKN  |d OCLCQ  |d OCLCO  |d UAB  |d AU@  |d M8D  |d HS0  |d OCLCQ  |d OCLCO  |d OCLCQ  |d OCLCO 
016 7 |a 018006551  |2 Uk 
019 |a 873843694  |a 878827551  |a 894369302  |a 900287258 
020 |a 9781783285662  |q (electronic bk.) 
020 |a 1783285664  |q (electronic bk.) 
020 |z 9781783285655 
020 |z 1783285656 
029 1 |a CHNEW  |b 000691003 
029 1 |a CHNEW  |b 000691005 
029 1 |a DEBBG  |b BV042031919 
029 1 |a DEBSZ  |b 414173805 
029 1 |a DEBSZ  |b 484725882 
029 1 |a GBVCP  |b 814568637 
029 1 |a UKMGB  |b 018006551 
029 1 |a AU@  |b 000056925187 
035 |a (OCoLC)883632323  |z (OCoLC)873843694  |z (OCoLC)878827551  |z (OCoLC)894369302  |z (OCoLC)900287258 
037 |a CL0500000403  |b Safari Books Online 
050 4 |a QA76.9.D5  |b T36 2014eb 
072 7 |a COM  |x 013000  |2 bisacsh 
072 7 |a COM  |x 014000  |2 bisacsh 
072 7 |a COM  |x 018000  |2 bisacsh 
072 7 |a COM  |x 067000  |2 bisacsh 
072 7 |a COM  |x 032000  |2 bisacsh 
072 7 |a COM  |x 037000  |2 bisacsh 
072 7 |a COM  |x 052000  |2 bisacsh 
082 0 4 |a 004/.36  |2 23 
049 |a UAMI 
100 1 |a Tannir, Khaled. 
245 1 0 |a Optimizing Hadoop for MapReduce :  |b learn how to configure your Hadoop cluster to run optimal MapReduce jobs /  |c Khaled Tannir. 
264 1 |a Birmingham, UK :  |b Packt Pub.,  |c 2014. 
300 |a 1 online resource (iii, 103 pages) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
490 1 |a Community experience distilled 
500 |a Includes index. 
588 0 |a Print version record. 
505 0 |a Cover; Copyright; Credits; About the Author; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Understanding Hadoop MapReduce; The MapReduce model; Overview of Hadoop MapReduce; Hadoop MapReduce internals; Factors affecting the performance of MapReduce; Summary; Chapter 2: An Overview of the Hadoop Parameters; Investigating the Hadoop parameters; The mapred-site.xml configuration file; The CPU-related parameters; The disk I/O related parameters; The memory-related parameters; The network-related parameters; The hdfs-site.xml configuration file 
505 8 |a The core-site.xml configuration fileHadoop MapReduce metrics; Performance monitoring tools; Using Chukwa to monitor Hadoop; Using Ganglia to monitor Hadoop; Using Nagios to monitor Hadoop; Using Apache Ambari to monitor Hadoop; Summary; Chapter 3: Detecting System Bottlenecks; Performance tuning; Creating a performance baseline; Identifying resource bottlenecks; Identifying RAM bottlenecks; Identifying CPU bottlenecks; Identifying storage bottlenecks; Identifying network bandwidth bottlenecks; Summary; Chapter 4: Identifying Resource Weaknesses; Identifying cluster weakness 
505 8 |a Checking the Hadoop cluster node's healthChecking the input data size; Checking massive I/O and network traffic; Checking for insufficient concurrent tasks; Checking for CPU contention; Sizing your Hadoop cluster; Configuring your cluster correctly; Summary; Chapter 5: Enhancement of Map and Reduce Tasks; Enhancing Map tasks; Input data and block size impact; Dealing with small and unsplittable files; Reducing spilled records during the Map phase; Calculating map tasks' throughput; Enhancing Reduce tasks; Calculating reduce task throughput; Improving Reduce execution phase 
505 8 |a Tuning map and reduce parametersSummary; Chapter 6: Optimizing MapReduce Tasks; Using Combiners; Using compression; Using appropriate Writable types; Reusing types smartly; Optimizing mappers and reducers code; Summary; Chapter 7: Best Practices and Recommendations; Hardware tuning and OS recommendations; Hadoop cluster checklists; The Bios tuning checklist; OS configuration recommendations; Hadoop best practices and recommendations; Deploying Hadoop; Hadoop tuning recommendations; Using a MapReduce template class code; Summary; Index 
520 |a This book is an example-based tutorial that deals with Optimizing Hadoop for MapReduce job performance. If you are a Hadoop administrator, developer, MapReduce user, or beginner, this book is the best choice available if you wish to optimize your clusters and applications. Having prior knowledge of creating MapReduce applications is not necessary, but will help you better understand the concepts and snippets of MapReduce class template code. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
630 0 0 |a Apache Hadoop. 
630 0 7 |a Apache Hadoop  |2 fast 
650 0 |a Electronic data processing  |x Distributed processing. 
650 0 |a Cluster analysis  |x Data processing. 
650 6 |a Traitement réparti. 
650 6 |a Classification automatique (Statistique)  |x Informatique. 
650 7 |a COMPUTERS  |x Computer Literacy.  |2 bisacsh 
650 7 |a COMPUTERS  |x Computer Science.  |2 bisacsh 
650 7 |a COMPUTERS  |x Data Processing.  |2 bisacsh 
650 7 |a COMPUTERS  |x Hardware  |x General.  |2 bisacsh 
650 7 |a COMPUTERS  |x Information Technology.  |2 bisacsh 
650 7 |a COMPUTERS  |x Machine Theory.  |2 bisacsh 
650 7 |a COMPUTERS  |x Reference.  |2 bisacsh 
650 7 |a Cluster analysis  |x Data processing  |2 fast 
650 7 |a Electronic data processing  |x Distributed processing  |2 fast 
776 0 8 |i Print version:  |a Tannir, Khaled.  |t Optimizing Hadoop for MapReduce  |z 9781783285655  |w (OCoLC)879573172 
830 0 |a Community experience distilled. 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781783285655/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a ebrary  |b EBRY  |n ebr10842106 
938 |a EBSCOhost  |b EBSC  |n 707232 
938 |a YBP Library Services  |b YANK  |n 11629330 
994 |a 92  |b IZTAP