
Hadoop cluster deployment: construct a modern Hadoop data platform effortlessly and gain insights into how to manage clusters efficiently

This book is a step-by-step tutorial filled with practical examples which will show you how to build and manage a Hadoop cluster along with its intricacies. This book is ideal for database administrators, data engineers, and system administrators, and it will act as an invaluable reference if you ar...


Bibliographic Details
Classification: Electronic Book
Main Author: Zburivsky, Danil
Format: Electronic eBook
Language: English
Published: Birmingham UK : Packt Publishing, 2013.
Subjects:
Online Access: Full text
Table of Contents:
  • Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface.
  • Chapter 1: Setting Up Hadoop Cluster from Hardware to Distribution; Choosing Hadoop cluster hardware; Choosing the DataNode hardware; Low storage density cluster; High storage density cluster; NameNode and JobTracker hardware configuration; The NameNode hardware; The JobTracker hardware; Gateway and other auxiliary services; Network considerations; Hadoop hardware summary; Hadoop distributions; Hadoop versions; Choosing Hadoop distribution; Cloudera Hadoop distribution; Hortonworks Hadoop distribution; MapR; Choosing OS for the Hadoop cluster; Summary.
  • Chapter 2: Installing and Configuring Hadoop; Configuring OS for Hadoop cluster; Choosing and setting up the filesystem; Setting up Java Development Kit; Other OS settings; Setting up the CDH repositories; Setting up NameNode; JournalNode, ZooKeeper, and Failover Controller; Hadoop configuration files; NameNode HA configuration; JobTracker configuration; Configuring the job scheduler; DataNode configuration; TaskTracker configuration; Advanced Hadoop tuning; Summary.
  • Chapter 3: Configuring the Hadoop Ecosystem; Hosting the Hadoop ecosystem; Sqoop; Installing and configuring Sqoop; Sqoop import example; Sqoop export example; Hive; Hive architecture; Installing Hive Metastore; Installing the Hive client; Installing Hive Server; Impala; Impala architecture; Installing Impala state store; Installing the Impala server; Summary.
  • Chapter 4: Securing Hadoop Installation; Hadoop security overview; HDFS security; MapReduce security; Hadoop Service Level Authorization; Hadoop and Kerberos; Kerberos overview; Kerberos in Hadoop; Configuring Kerberos clients; Generating Kerberos principals; Enabling Kerberos for HDFS; Enabling Kerberos for MapReduce; Summary.
  • Chapter 5: Monitoring Hadoop Cluster; Monitoring strategy overview; Hadoop Metrics; JMX Metrics; Monitoring Hadoop with Nagios; Monitoring HDFS; NameNode checks; JournalNode checks; ZooKeeper checks; Monitoring MapReduce; JobTracker checks; Monitoring Hadoop With Ganglia; Summary.
  • Chapter 6: Deploying Hadoop to the Cloud; Amazon Elastic MapReduce; Installing the EMR command-line interface; Choosing the Hadoop version; Launching the EMR cluster; Temporary EMR clusters; Preparing input and output locations; Using Whirr; Installing and configuring Whirr; Summary.
  • Index.