Cargando…

Learning Hadoop 2 : design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2 /

If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Fam...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autores principales: Turkington, Garry (Autor), Modena, Gabriele (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham, UK : Packt Publishing, 2015.
Colección:Community experience distilled.
Temas:
Acceso en línea:Texto completo
Texto completo

MARC

LEADER 00000cam a2200000 i 4500
001 EBSCO_ocn905091883
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 150318s2015 enka o 001 0 eng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d COO  |d N$T  |d DEBBG  |d YDXCP  |d OCLCF  |d CEF  |d UKMGB  |d UAB  |d AU@  |d UKAHL  |d SFB  |d OCLCO  |d OCLCQ  |d QGK  |d OCLCO 
015 |a GBB6G3008  |2 bnb 
016 7 |a 018006559  |2 Uk 
019 |a 1259248353 
020 |a 9781783285525  |q (electronic bk.) 
020 |a 1783285524  |q (electronic bk.) 
020 |z 1783285524 
020 |z 1783285516 
020 |z 9781783285518 
029 1 |a DEBBG  |b BV042682824 
029 1 |a DEBSZ  |b 446580252 
029 1 |a GBVCP  |b 829881018 
029 1 |a UKMGB  |b 018006559 
035 |a (OCoLC)905091883  |z (OCoLC)1259248353 
037 |a CL0500000568  |b Safari Books Online 
050 4 |a QA76.9.D5 
072 7 |a COM  |x 013000  |2 bisacsh 
072 7 |a COM  |x 014000  |2 bisacsh 
072 7 |a COM  |x 018000  |2 bisacsh 
072 7 |a COM  |x 067000  |2 bisacsh 
072 7 |a COM  |x 032000  |2 bisacsh 
072 7 |a COM  |x 037000  |2 bisacsh 
072 7 |a COM  |x 052000  |2 bisacsh 
082 0 4 |a 004.36  |2 23 
049 |a UAMI 
100 1 |a Turkington, Garry,  |e author. 
245 1 0 |a Learning Hadoop 2 :  |b design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2 /  |c Garry Turkington, Gabriele Modena. 
246 3 |a Learning Hadoop two 
246 3 0 |a Design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2 
264 1 |a Birmingham, UK :  |b Packt Publishing,  |c 2015. 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file 
490 1 |a Community experience distilled 
588 0 |a Online resource; title from cover (Safari, viewed March 10, 2015). 
500 |a Includes index. 
505 0 |a Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Introduction; A note on versioning; The background of Hadoop; Components of Hadoop; Common building blocks; Storage; Computation; Better together; Hadoop 2 -- what's the big deal?; Storage in Hadoop 2; Computation in Hadoop 2; Distributions of Apache Hadoop; A dual approach; AWS -- infrastructure on demand from Amazon; Simple Storage Service (S3); Elastic MapReduce (EMR); Getting started; Cloudera QuickStart VM; Amazon EMR; Creating an AWS account 
505 8 |a Signing up for the necessary servicesUsing Elastic MapReduce; Getting Hadoop up and running; How to use EMR; AWS credentials; The AWS command-line interface; Running the examples; Data processing with Hadoop; Why Twitter?; Building our first dataset; One service, multiple APIs; Anatomy of a Tweet; Twitter credentials; Programmatic access with Python; Summary; Chapter 2: Storage; The inner workings of HDFS; Cluster startup; NameNode startup; DataNode startup; Block replication; Command-line access to the HDFS filesystem; Exploring the HDFS filesystem; Protecting the filesystem metadata 
505 8 |a Secondary NameNode not to the rescueHadoop 2 NameNode HA; Keeping the HA NameNodes in sync; Client configuration; How a failover works; Apache ZooKeeper -- a different type of filesystem; Implementing a distributed lock with sequential ZNodes; Implementing group membership and leader election using ephemeral ZNodes; Java API; Building blocks; Further reading; Automatic NameNode failover; HDFS snapshots; Hadoop filesystems; Hadoop interfaces; Java FileSystem API; Libhdfs; Thrift ; Managing and serializing data; The Writable interface; Introducing the wrapper classes ; Array wrapper classes 
505 8 |a The Comparable and WritableComparable interfacesStoring data; Serialization and Containers; Compression; General-purpose file formats; Column-oriented data formats; RCFile; ORC; Parquet; Avro; Using the Java API; Summary; Chapter 3: Processing -- MapReduce and Beyond; MapReduce; Java API to MapReduce; The Mapper class; The Reducer class; The Driver class; Combiner; Partitioning; The optional partition function; Hadoop-provided mapper and reducer implementations; Sharing reference data; Writing MapReduce programs; Getting started; Running the examples; Local cluster; Elastic MapReduce 
505 8 |a WordCount, the Hello World of MapReduceWord co-occurrences; Trending topics; The Top N pattern; Sentiment of hashtags; Text cleanup using chain mapper; Walking through a run of a MapReduce job; Startup; Splitting the input; Task assignment; Task startup; Ongoing JobTracker monitoring; Mapper input; Mapper execution; Mapper output and reducer input; Reducer input; Reducer execution; Reducer output; Shutdown; Input/Output; InputFormat and RecordReader; Hadoop-provided InputFormat; Hadoop-provided RecordReader; OutputFormat and RecordWriter; Hadoop-provided OutputFormat; Sequence files; YARN 
520 |a If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Familiarity with Hadoop would be a plus. 
546 |a English. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
630 0 0 |a Apache Hadoop. 
630 0 7 |a Apache Hadoop  |2 fast 
650 0 |a Electronic data processing  |x Distributed processing. 
650 0 |a Big data. 
650 6 |a Traitement réparti. 
650 6 |a Données volumineuses. 
650 7 |a COMPUTERS  |x Computer Literacy.  |2 bisacsh 
650 7 |a COMPUTERS  |x Computer Science.  |2 bisacsh 
650 7 |a COMPUTERS  |x Data Processing.  |2 bisacsh 
650 7 |a COMPUTERS  |x Hardware  |x General.  |2 bisacsh 
650 7 |a COMPUTERS  |x Information Technology.  |2 bisacsh 
650 7 |a COMPUTERS  |x Machine Theory.  |2 bisacsh 
650 7 |a COMPUTERS  |x Reference.  |2 bisacsh 
650 7 |a Big data  |2 fast 
650 7 |a Electronic data processing  |x Distributed processing  |2 fast 
700 1 |a Modena, Gabriele,  |e author. 
776 0 8 |i Print version:  |a Turkington, Garry.  |t Learning Hadoop 2 : design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2.  |d Birmingham, England ; Mumbai, India : Packt Publishing, ©2015  |h x, 360 pages  |k Community experience distilled.  |z 9781783285518 
830 0 |a Community experience distilled. 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781783285518/?ar  |z Texto completo 
856 4 0 |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=956633  |z Texto completo 
938 |a Askews and Holts Library Services  |b ASKH  |n AH28256048 
938 |a EBSCOhost  |b EBSC  |n 956633 
938 |a YBP Library Services  |b YANK  |n 12298529 
994 |a 92  |b IZTAP