Learning Hadoop 2 : design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2 /
If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Fam...
Classification: | Electronic Book |
---|---|
Main authors: | Turkington, Garry; Modena, Gabriele |
Format: | Electronic eBook |
Language: | English |
Published: | Birmingham, UK : Packt Publishing, 2015. |
Series: | Community experience distilled. |
Subjects: | |
Online access: | Full text (O'Reilly), Full text (EBSCOhost) |
MARC
LEADER | 00000cam a2200000 i 4500 | ||
---|---|---|---|
001 | EBSCO_ocn905091883 | ||
003 | OCoLC | ||
005 | 20231017213018.0 | ||
006 | m o d | ||
007 | cr unu|||||||| | ||
008 | 150318s2015 enka o 001 0 eng d | ||
040 | |a UMI |b eng |e rda |e pn |c UMI |d COO |d N$T |d DEBBG |d YDXCP |d OCLCF |d CEF |d UKMGB |d UAB |d AU@ |d UKAHL |d SFB |d OCLCO |d OCLCQ |d QGK |d OCLCO | ||
015 | |a GBB6G3008 |2 bnb | ||
016 | 7 | |a 018006559 |2 Uk | |
019 | |a 1259248353 | ||
020 | |a 9781783285525 |q (electronic bk.) | ||
020 | |a 1783285524 |q (electronic bk.) | ||
020 | |z 1783285524 | ||
020 | |z 1783285516 | ||
020 | |z 9781783285518 | ||
029 | 1 | |a DEBBG |b BV042682824 | |
029 | 1 | |a DEBSZ |b 446580252 | |
029 | 1 | |a GBVCP |b 829881018 | |
029 | 1 | |a UKMGB |b 018006559 | |
035 | |a (OCoLC)905091883 |z (OCoLC)1259248353 | ||
037 | |a CL0500000568 |b Safari Books Online | ||
050 | 4 | |a QA76.9.D5 | |
072 | 7 | |a COM |x 013000 |2 bisacsh | |
072 | 7 | |a COM |x 014000 |2 bisacsh | |
072 | 7 | |a COM |x 018000 |2 bisacsh | |
072 | 7 | |a COM |x 067000 |2 bisacsh | |
072 | 7 | |a COM |x 032000 |2 bisacsh | |
072 | 7 | |a COM |x 037000 |2 bisacsh | |
072 | 7 | |a COM |x 052000 |2 bisacsh | |
082 | 0 | 4 | |a 004.36 |2 23 |
049 | |a UAMI | ||
100 | 1 | |a Turkington, Garry, |e author. | |
245 | 1 | 0 | |a Learning Hadoop 2 : |b design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2 / |c Garry Turkington, Gabriele Modena. |
246 | 3 | |a Learning Hadoop two | |
246 | 3 | 0 | |a Design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2 |
264 | 1 | |a Birmingham, UK : |b Packt Publishing, |c 2015. | |
300 | |a 1 online resource (1 volume) : |b illustrations | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
347 | |a text file | ||
490 | 1 | |a Community experience distilled | |
588 | 0 | |a Online resource; title from cover (Safari, viewed March 10, 2015). | |
500 | |a Includes index. | ||
505 | 0 | |a Cover; Copyright; Credits; About the Authors; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Introduction; A note on versioning; The background of Hadoop; Components of Hadoop; Common building blocks; Storage; Computation; Better together; Hadoop 2 -- what's the big deal?; Storage in Hadoop 2; Computation in Hadoop 2; Distributions of Apache Hadoop; A dual approach; AWS -- infrastructure on demand from Amazon; Simple Storage Service (S3); Elastic MapReduce (EMR); Getting started; Cloudera QuickStart VM; Amazon EMR; Creating an AWS account | |
505 | 8 | |a Signing up for the necessary services; Using Elastic MapReduce; Getting Hadoop up and running; How to use EMR; AWS credentials; The AWS command-line interface; Running the examples; Data processing with Hadoop; Why Twitter?; Building our first dataset; One service, multiple APIs; Anatomy of a Tweet; Twitter credentials; Programmatic access with Python; Summary; Chapter 2: Storage; The inner workings of HDFS; Cluster startup; NameNode startup; DataNode startup; Block replication; Command-line access to the HDFS filesystem; Exploring the HDFS filesystem; Protecting the filesystem metadata | |
505 | 8 | |a Secondary NameNode not to the rescue; Hadoop 2 NameNode HA; Keeping the HA NameNodes in sync; Client configuration; How a failover works; Apache ZooKeeper -- a different type of filesystem; Implementing a distributed lock with sequential ZNodes; Implementing group membership and leader election using ephemeral ZNodes; Java API; Building blocks; Further reading; Automatic NameNode failover; HDFS snapshots; Hadoop filesystems; Hadoop interfaces; Java FileSystem API; Libhdfs; Thrift; Managing and serializing data; The Writable interface; Introducing the wrapper classes; Array wrapper classes | |
505 | 8 | |a The Comparable and WritableComparable interfaces; Storing data; Serialization and Containers; Compression; General-purpose file formats; Column-oriented data formats; RCFile; ORC; Parquet; Avro; Using the Java API; Summary; Chapter 3: Processing -- MapReduce and Beyond; MapReduce; Java API to MapReduce; The Mapper class; The Reducer class; The Driver class; Combiner; Partitioning; The optional partition function; Hadoop-provided mapper and reducer implementations; Sharing reference data; Writing MapReduce programs; Getting started; Running the examples; Local cluster; Elastic MapReduce | |
505 | 8 | |a WordCount, the Hello World of MapReduce; Word co-occurrences; Trending topics; The Top N pattern; Sentiment of hashtags; Text cleanup using chain mapper; Walking through a run of a MapReduce job; Startup; Splitting the input; Task assignment; Task startup; Ongoing JobTracker monitoring; Mapper input; Mapper execution; Mapper output and reducer input; Reducer input; Reducer execution; Reducer output; Shutdown; Input/Output; InputFormat and RecordReader; Hadoop-provided InputFormat; Hadoop-provided RecordReader; OutputFormat and RecordWriter; Hadoop-provided OutputFormat; Sequence files; YARN | |
520 | |a If you are a system or application developer interested in learning how to solve practical problems using the Hadoop framework, then this book is ideal for you. You are expected to be familiar with the Unix/Linux command-line interface and have some experience with the Java programming language. Familiarity with Hadoop would be a plus. | ||
546 | |a English. | ||
590 | |a O'Reilly |b O'Reilly Online Learning: Academic/Public Library Edition | ||
590 | |a eBooks on EBSCOhost |b EBSCO eBook Subscription Academic Collection - Worldwide | ||
630 | 0 | 0 | |a Apache Hadoop. |
630 | 0 | 7 | |a Apache Hadoop |2 fast |
650 | 0 | |a Electronic data processing |x Distributed processing. | |
650 | 0 | |a Big data. | |
650 | 6 | |a Traitement réparti. | |
650 | 6 | |a Données volumineuses. | |
650 | 7 | |a COMPUTERS |x Computer Literacy. |2 bisacsh | |
650 | 7 | |a COMPUTERS |x Computer Science. |2 bisacsh | |
650 | 7 | |a COMPUTERS |x Data Processing. |2 bisacsh | |
650 | 7 | |a COMPUTERS |x Hardware |x General. |2 bisacsh | |
650 | 7 | |a COMPUTERS |x Information Technology. |2 bisacsh | |
650 | 7 | |a COMPUTERS |x Machine Theory. |2 bisacsh | |
650 | 7 | |a COMPUTERS |x Reference. |2 bisacsh | |
650 | 7 | |a Big data |2 fast | |
650 | 7 | |a Electronic data processing |x Distributed processing |2 fast | |
700 | 1 | |a Modena, Gabriele, |e author. | |
776 | 0 | 8 | |i Print version: |a Turkington, Garry. |t Learning Hadoop 2 : design and implement data processing, lifecycle management, and analytic workflows with the cutting-edge toolbox of Hadoop 2. |d Birmingham, England ; Mumbai, India : Packt Publishing, ©2015 |h x, 360 pages |k Community experience distilled. |z 9781783285518 |
830 | 0 | |a Community experience distilled. | |
856 | 4 | 0 | |u https://learning.oreilly.com/library/view/~/9781783285518/?ar |z Full text |
856 | 4 | 0 | |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=956633 |z Full text |
938 | |a Askews and Holts Library Services |b ASKH |n AH28256048 | ||
938 | |a EBSCOhost |b EBSC |n 956633 | ||
938 | |a YBP Library Services |b YANK |n 12298529 | ||
994 | |a 92 |b IZTAP |