Cargando…

Hadoop backup and recovery solutions : learn the best strategies for data recovery from Hadoop backup clusters and troubleshoot problems /

If you are a Hadoop administrator and you want to get a good grounding in how to back up large amounts of data and manage Hadoop clusters, then this book is for you.

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autores principales: Barot, Gaurav (Autor), Patel, Amij (Autor), Mehta, Chintan (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham, UK : Packt Publishing, 2015.
Colección:Community experience distilled.
Temas:
Acceso en línea:Texto completo

MARC

LEADER 00000cam a2200000Ii 4500
001 EBSCO_ocn918863974
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 150817s2015 enka o 001 0 eng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d IDEBK  |d EBLCP  |d DEBSZ  |d COO  |d OCLCF  |d N$T  |d YDXCP  |d DEBBG  |d OCLCQ  |d MERUC  |d OCLCQ  |d OCLCO  |d CEF  |d NLE  |d UKMGB  |d OCLCQ  |d WYU  |d OCLCO  |d UAB  |d AU@  |d UKAHL  |d OCLCQ  |d OCLCO  |d VLY  |d AJS  |d OCLCQ  |d OCLCO  |d OCLCQ  |d QGK 
016 7 |a 018006635  |2 Uk 
019 |a 915154145  |a 923616104  |a 1259204267 
020 |a 9781783289059  |q (electronic bk.) 
020 |a 1783289058  |q (electronic bk.) 
020 |z 9781783289042 
020 |z 178328904X 
029 1 |a AU@  |b 000056111596 
029 1 |a DEBBG  |b BV043020070 
029 1 |a DEBBG  |b BV043622431 
029 1 |a DEBSZ  |b 445087633 
029 1 |a DEBSZ  |b 455696365 
029 1 |a GBVCP  |b 882743422 
029 1 |a UKMGB  |b 018006635 
035 |a (OCoLC)918863974  |z (OCoLC)915154145  |z (OCoLC)923616104  |z (OCoLC)1259204267 
037 |a CL0500000627  |b Safari Books Online 
050 4 |a QA76.9.D5 
072 7 |a COM  |x 013000  |2 bisacsh 
072 7 |a COM  |x 014000  |2 bisacsh 
072 7 |a COM  |x 018000  |2 bisacsh 
072 7 |a COM  |x 067000  |2 bisacsh 
072 7 |a COM  |x 032000  |2 bisacsh 
072 7 |a COM  |x 037000  |2 bisacsh 
072 7 |a COM  |x 052000  |2 bisacsh 
082 0 4 |a 004.10923478 
049 |a UAMI 
100 1 |a Barot, Gaurav,  |e author. 
245 1 0 |a Hadoop backup and recovery solutions :  |b learn the best strategies for data recovery from Hadoop backup clusters and troubleshoot problems /  |c Gaurav Barot, Amij Patel, Chintan Mehta. 
246 3 0 |a Learn the best strategies for data recovery from Hadoop backup clusters and troubleshoot problems 
264 1 |a Birmingham, UK :  |b Packt Publishing,  |c 2015. 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file 
490 1 |a Community experience distilled 
588 0 |a Online resource; title from cover (Safari, viewed August 13, 2015). 
500 |a Includes index. 
505 0 |a Cover -- Copyright -- Credits -- About the Authors -- About the Reviewers -- www.PacktPub.com -- Table of Contents -- Preface -- Chapter 1: Knowing Hadoop and Clustering Basics -- Understanding the need for Hadoop -- Apache Hive -- Apache Pig -- Apache HBase -- Apache HCatalog -- Understanding HDFS design -- Getting familiar with HDFS daemons -- Scenario 1 â€? writing data to the HDFS cluster -- Scenario 2 â€? reading data from the HDFS cluster -- Understanding the basics of Hadoop cluster -- Summary 
505 8 |a Chapter 2: Understanding Hadoop Backup and Recovery NeedsUnderstanding the backup and recovery philosophies -- Replication of data using DistCp -- Updating and overwriting using DistCp -- The backup philosophy -- Changes since the last backup -- The rate of new data arrival -- The size of the cluster -- Priority of the datasets -- Selecting the datasets or parts of datasets -- The timelines of data backups -- Reducing the window of possible data loss -- Backup consistency -- Avoiding invalid backups -- The recovery philosophy 
505 8 |a Knowing the necessity of backing up HadoopDetermining backup areas â€? what should I back up? -- Datasets -- Block size â€? a large file divided into blocks -- Replication factor -- A list of all the blocks of a file -- A list of DataNodes for each block â€? sorted by distance -- The ACK package -- The checksums -- The number of under-replicated blocks -- The secondary NameNode -- Active and passive nodes in second generation Hadoop -- Hardware failure -- Software failure -- Applications -- Configurations -- Is taking backup enough? 
505 8 |a Understanding the disaster recovery principleKnowing a disaster -- The need for recovery -- Understanding recovery areas -- Summary -- Chapter 3: Determining Backup Strategies -- Knowing the areas to be protected -- Understanding the common failure types -- Hardware failure -- Host failure -- Using commodity hardware -- Hardware failures may lead to loss of data -- User application failure -- Software causing task failure -- Failure of slow-running tasks -- Hadoop's handling of failing tasks -- Task failure due to data 
505 8 |a Bad data handling â€? through codeHadoop's skip mode -- Learning a way to define the backup strategy -- Why do I need a strategy? -- What should be considered in a strategy? -- Filesystem check (fsck) -- Filesystem balancer -- Upgrading your Hadoop cluster -- Designing network layout and rack awareness -- Most important areas to consider while defining a backup strategy -- Understanding the need for backing up Hive metadata -- What is Hive? -- Hive replication -- Summary -- Chapter 4: Backing Up Hadoop -- Data backup in Hadoop -- Distributed copy 
520 |a If you are a Hadoop administrator and you want to get a good grounding in how to back up large amounts of data and manage Hadoop clusters, then this book is for you. 
546 |a English. 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
630 0 0 |a Apache Hadoop. 
630 0 7 |a Apache Hadoop.  |2 fast  |0 (OCoLC)fst01911570 
650 0 |a Electronic data processing  |x Distributed processing. 
650 0 |a Information retrieval. 
650 0 |a Big data. 
650 0 |a Open source software. 
650 6 |a Traitement réparti. 
650 6 |a Recherche de l'information. 
650 6 |a Données volumineuses. 
650 6 |a Logiciels libres. 
650 7 |a information retrieval.  |2 aat 
650 7 |a COMPUTERS  |x Computer Literacy.  |2 bisacsh 
650 7 |a COMPUTERS  |x Computer Science.  |2 bisacsh 
650 7 |a COMPUTERS  |x Data Processing.  |2 bisacsh 
650 7 |a COMPUTERS  |x Hardware  |x General.  |2 bisacsh 
650 7 |a COMPUTERS  |x Information Technology.  |2 bisacsh 
650 7 |a COMPUTERS  |x Machine Theory.  |2 bisacsh 
650 7 |a COMPUTERS  |x Reference.  |2 bisacsh 
650 7 |a Big data.  |2 fast  |0 (OCoLC)fst01892965 
650 7 |a Electronic data processing  |x Distributed processing.  |2 fast  |0 (OCoLC)fst00906987 
650 7 |a Information retrieval.  |2 fast  |0 (OCoLC)fst00972619 
650 7 |a Open source software.  |2 fast  |0 (OCoLC)fst01046097 
700 1 |a Patel, Amij,  |e author. 
700 1 |a Mehta, Chintan,  |e author. 
776 0 8 |i Print version:  |a Barot, Gaurav.  |t Hadoop backup and recovery solutions : learn the best strategies for data recovery from Hadoop backup clusters and troubleshoot problems.  |d Birmingham, England ; Mumbai, [India] : Packt Publishing, ©2015  |h xi, 180 pages  |k Community experience distilled.  |z 9781783289042 
830 0 |a Community experience distilled. 
856 4 0 |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=1045696  |z Texto completo 
938 |a Askews and Holts Library Services  |b ASKH  |n AH29054194 
938 |a EBSCOhost  |b EBSC  |n 1045696 
938 |a ProQuest MyiLibrary Digital eBook Collection  |b IDEB  |n cis32249875 
938 |a YBP Library Services  |b YANK  |n 12548029 
994 |a 92  |b IZTAP