Hadoop cluster deployment : construct a modern Hadoop data platform effortlessly and gain insights into how to manage clusters efficiently
This book is a step-by-step tutorial filled with practical examples which will show you how to build and manage a Hadoop cluster along with its intricacies. This book is ideal for database administrators, data engineers, and system administrators, and it will act as an invaluable reference if you ar...
Classification: | Electronic Book |
---|---|
Main author: | |
Format: | Electronic eBook |
Language: | English |
Published: | Birmingham UK : Packt Publishing, 2013. |
Subjects: | |
Online access: | Full text |
Table of Contents:
- Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Setting Up Hadoop Cluster - from Hardware to Distribution
- Choosing Hadoop cluster hardware; Choosing the DataNode hardware; Low storage density cluster; High storage density cluster; NameNode and JobTracker hardware configuration; The NameNode hardware; The JobTracker hardware; Gateway and other auxiliary services; Network considerations; Hadoop hardware summary; Hadoop distributions; Hadoop versions; Choosing Hadoop distribution; Cloudera Hadoop distribution.
- Hortonworks Hadoop distribution; MapR; Choosing OS for the Hadoop cluster; Summary; Chapter 2: Installing and Configuring Hadoop; Configuring OS for Hadoop cluster; Choosing and setting up the filesystem; Setting up Java Development Kit; Other OS settings; Setting up the CDH repositories; Setting up NameNode; JournalNode, ZooKeeper, and Failover Controller; Hadoop configuration files; NameNode HA configuration; JobTracker configuration; Configuring the job scheduler; DataNode configuration; TaskTracker configuration; Advanced Hadoop tuning; Summary; Chapter 3: Configuring the Hadoop Ecosystem.
- Hosting the Hadoop ecosystem; Sqoop; Installing and configuring Sqoop; Sqoop import example; Sqoop export example; Hive; Hive architecture; Installing Hive Metastore; Installing the Hive client; Installing Hive Server; Impala; Impala architecture; Installing Impala state store; Installing the Impala server; Summary; Chapter 4: Securing Hadoop Installation; Hadoop security overview; HDFS security; MapReduce security; Hadoop Service Level Authorization; Hadoop and Kerberos; Kerberos overview; Kerberos in Hadoop; Configuring Kerberos clients; Generating Kerberos principals.
- Enabling Kerberos for HDFS; Enabling Kerberos for MapReduce; Summary; Chapter 5: Monitoring Hadoop Cluster; Monitoring strategy overview; Hadoop Metrics; JMX Metrics; Monitoring Hadoop with Nagios; Monitoring HDFS; NameNode checks; JournalNode checks; ZooKeeper checks; Monitoring MapReduce; JobTracker checks; Monitoring Hadoop With Ganglia; Summary; Chapter 6: Deploying Hadoop to the Cloud; Amazon Elastic MapReduce; Installing the EMR command-line interface; Choosing the Hadoop version; Launching the EMR cluster; Temporary EMR clusters; Preparing input and output locations; Using Whirr.
- Installing and configuring Whirr; Summary; Index.