Cargando…

Learning Big Data with Amazon Elastic MapReduce.

Amazon Elastic MapReduce is a web service used to process and store vast amount of data, and it is one of the largest Hadoop operators in the world. With the increase in the amount of data generated and collected by many businesses and the arrival of cost-effective cloud-based solutions for distribu...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Singh, Amarkant (Software developer)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Packt Publishing, 2014.
Temas:
Acceso en línea:Texto completo

MARC

LEADER 00000cam a22000007a 4500
001 EBSCO_ocn894628948
003 OCoLC
005 20231017213018.0
006 m o d
007 cr |n|||||||||
008 141107s2014 xx o 000 0 eng d
040 |a IDEBK  |b eng  |e pn  |c IDEBK  |d EBLCP  |d YDXCP  |d DEBBG  |d CHVBK  |d OCLCO  |d OCLCQ  |d OCLCF  |d FEM  |d MNU  |d N$T  |d ZCU  |d AGLDB  |d MERUC  |d OCLCQ  |d OCLCO  |d REB  |d D6H  |d OCLCQ  |d VTS  |d ICG  |d NLE  |d UKMGB  |d OCLCQ  |d OCLCO  |d WYU  |d STF  |d DKC  |d OCLCQ  |d UKAHL  |d OCLCQ  |d OCLCO  |d AJS  |d OCLCO  |d SGP  |d OCLCQ 
016 7 |a 018005576  |2 Uk 
019 |a 968068997  |a 969040859 
020 |a 1322242186  |q (electronic bk.) 
020 |a 9781322242187  |q (electronic bk.) 
020 |a 9781782173441  |q (electronic bk.) 
020 |a 1782173447  |q (electronic bk.) 
020 |a 9781782173434 
020 |a 1782173439 
024 3 |a 9781782173434 
029 1 |a AU@  |b 000062350256 
029 1 |a AU@  |b 000067107178 
029 1 |a CHNEW  |b 000889249 
029 1 |a CHVBK  |b 374479682 
029 1 |a DEBBG  |b BV043613349 
029 1 |a DEBSZ  |b 493153128 
029 1 |a DKDLA  |b 820120-katalog:999928272305765 
029 1 |a GBVCP  |b 802477178 
029 1 |a UKMGB  |b 018005576 
035 |a (OCoLC)894628948  |z (OCoLC)968068997  |z (OCoLC)969040859 
050 4 |a T55.4-60.8 
072 7 |a COM  |x 013000  |2 bisacsh 
072 7 |a COM  |x 014000  |2 bisacsh 
072 7 |a COM  |x 018000  |2 bisacsh 
072 7 |a COM  |x 067000  |2 bisacsh 
072 7 |a COM  |x 032000  |2 bisacsh 
072 7 |a COM  |x 037000  |2 bisacsh 
072 7 |a COM  |x 052000  |2 bisacsh 
082 0 4 |a 004.109236 
049 |a UAMI 
100 1 |a Singh, Amarkant  |c (Software developer) 
245 1 0 |a Learning Big Data with Amazon Elastic MapReduce. 
260 |b Packt Publishing,  |c 2014. 
300 |a 1 online resource 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file 
588 0 |a Print version record. 
520 |a Amazon Elastic MapReduce is a web service used to process and store vast amount of data, and it is one of the largest Hadoop operators in the world. With the increase in the amount of data generated and collected by many businesses and the arrival of cost-effective cloud-based solutions for distributed computing, the feasibility to crunch large amounts of data to get deep insights within a short span of time has increased greatly. This book will get you started with AWS so that you can quickly create your own account and explore the services provided, many of which you might be delighted to use. This book covers the architectural details of the MapReduce framework, Apache Hadoop, various job models on EMR, how to manage clusters on EMR, and the command-line tools available with EMR. Each chapter builds on the knowledge of the previous one, leading to the final chapter where you will learn about solving a real-world use case using Apache Hadoop and EMR. This book will, therefore, get you up and running with major Big Data technologies quickly and efficiently. 
505 0 |a Cover; Copyright; Credits; About the Authors; Acknowledgments; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Amazon Web Services; What is Amazon Web Services?; Structure and Design; Regions; Availability Zones; Services provided by AWS; Compute; Amazon EC2; Auto Scaling; Elastic Load Balancing; Amazon Workspaces; Storage; Amazon S3; Amazon EBS; Amazon Glacier; AWS Storage Gateway; AWS Import/Export; Databases; Amazon RDS; Amazon DynamoDB; Amazon Redshift; Amazon ElastiCache; Networking and CDN; Amazon VPC; Amazon Route 53; Amazon CloudFront; AWS Direct Connect 
505 8 |a AnalyticsAmazon EMR; Amazon Kinesis; AWS Data Pipeline; Application services; Amazon CloudSearch (Beta); Amazon SQS; Amazon SNS; Amazon SES; Amazon AppStream; Amazon Elastic Transcoder; Amazon SWF; Deployment and Management; AWS Identity and Access Management; Amazon CloudWatch; AWS Elastic Beanstalk; AWS CloudFormation; AWS OpsWorks; AWS CloudHSM; AWS CloudTrail; AWS Pricing; Creating an account on AWS; Step 1 -- Creating an Amazon.com account; Step 2 -- Providing a payment method; Step 3 -- Identity verification by telephone; Step 4 -- Selecting the AWS support plan 
505 8 |a Launching the AWS management consoleGetting started with Amazon EC2; How to start a machine on AWS?; Step 1 -- Choosing an Amazon Machine Image; Step 2 -- Choosing an instance type; Step 3 -- Configuring instance details; Step 4 -- Adding storage; Step 5 -- Tagging your instance; Step 6 -- Configuring a security group; Communicating with the launched instance; EC2 instance types; General purpose; Memory optimized; Compute optimized; Getting started with Amazon S3; Creating a S3 bucket; Bucket naming; S3cmd; Summary; Chapter 2: MapReduce; The map function; The reduce function; Divide and conquer 
505 8 |a What is MapReduce?The map reduce function models; The map function model; The reduce function model; Data life cycle in the MapReduce framework; Creation of input data splits; Record reader; Mapper; Combiner; Partitioner; Shuffle and sort; Reducer; Real-world examples and use cases of MapReduce; Social networks ; Media and entertainment; E-commerce and websites; Fraud detection and financial analytics; Search engines and ad networks; ETL and data analytics; Software distributions built on the MapReduce framework; Apache Hadoop; MapR; Cloudera distribution; Summary; Chapter 3: Apache Hadoop 
505 8 |a What is Apache Hadoop?Hadoop modules; Hadoop Distributed File System; Major architectural goals of HDFS; Block replication and rack awareness; The HDFS architecture; NameNode; DataNode; Apache Hadoop MapReduce; Hadoop MapReduce 1.x; JobTracker; TaskTracker; Hadoop MapReduce 2.0; Hadoop YARN; Apache Hadoop as a platform; Apache Pig; Apache Hive; Summary; Chapter 4: Amazon EMR -- Hadoop on Amazon Web Services; What is AWS EMR?; Features of EMR; Accessing Amazon EMR features; Programming on AWS EMR; The EMR architecture; Types of nodes; EMR Job Flow and Steps; Job Steps; An EMR cluster 
505 8 |a Hadoop filesystem on EMR -- S3 and HDFS 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
630 0 0 |a MapReduce (Computer file) 
630 0 7 |a MapReduce (Computer file)  |2 fast  |0 (OCoLC)fst01915150 
650 0 |a Big data. 
650 6 |a Données volumineuses. 
650 7 |a COMPUTERS  |x Computer Literacy.  |2 bisacsh 
650 7 |a COMPUTERS  |x Computer Science.  |2 bisacsh 
650 7 |a COMPUTERS  |x Data Processing.  |2 bisacsh 
650 7 |a COMPUTERS  |x Hardware  |x General.  |2 bisacsh 
650 7 |a COMPUTERS  |x Information Technology.  |2 bisacsh 
650 7 |a COMPUTERS  |x Machine Theory.  |2 bisacsh 
650 7 |a COMPUTERS  |x Reference.  |2 bisacsh 
650 7 |a Big data.  |2 fast  |0 (OCoLC)fst01892965 
776 0 8 |i Print version:  |n Druck-Ausgabe  |t Singh, Amarkant. Learning Big Data with Amazon Elastic MapReduce 
856 4 0 |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=882358  |z Texto completo 
936 |a BATCHLOAD 
938 |a Askews and Holts Library Services  |b ASKH  |n AH27198374 
938 |a EBSCOhost  |b EBSC  |n 882358 
938 |a YBP Library Services  |b YANK  |n 12148088 
994 |a 92  |b IZTAP