Cloudera Administration Handbook.
An easy-to-follow Apache Hadoop administrator's guide filled with practical screenshots and explanations for each step and configuration. This book is great for administrators interested in setting up and managing a large Hadoop cluster. If you are an administrator, or want to be an administrat...
Clasificación: | Libro Electrónico |
---|---|
Autor principal: | |
Formato: | Electrónico eBook |
Idioma: | Inglés |
Publicado: |
Packt Publishing,
2014.
|
Temas: | |
Acceso en línea: | Texto completo |
Tabla de Contenidos:
- Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with Apache Hadoop; History of Apache Hadoop and its trends; Components of Apache Hadoop; Understanding the Apache Hadoop daemons; Namenode; Secondary namenode; Jobtracker; Tasktracker; ResourceManager; NodeManager; Job submission in YARN; Introducing Cloudera; Introducing CDH; Responsibilities of a Hadoop administrator; Summary; Chapter 2: HDFS and MapReduce; Essentials of HDFS; Configuring HDFS; The read/write operational flow in HDFS.
- Writing files in HDFSReading files in HDFS; Understanding the namenode UI; Understanding the secondary namenode UI; Exploring HDFS commands; Commonly used HDFS commands; Commands to administer HDFS; Getting acquainted with MapReduce; Understanding the map phase; Understanding the reduce phase; Learning all about the MapReduce job flow; Configuring MapReduce; Understanding the jobtracker UI; Getting MapReduce job information; Summary; Chapter 3: Cloudera's Distribution Including Apache Hadoop
- CDH; Getting started with CDH; Understanding the CDH components; Apache Hadoop; Apache Flume NG.
- Apache SqoopApache Pig; Apache Hive; Apache ZooKeeper; Apache HBase; Apache Whirr; Snappy
- previously known as Zippy; Apache Mahout; Apache Avro; Apache Oozie; Cloudera Search; Cloudera Impala; Cloudera Hue; Beeswax
- Hive UI; Cloudera Impala UI; Pig UI; File Browser; Metastore Manager; Sqoop Jobs; Job Browser; Job Designs; Dashboard; Collection Manager; Hue Shell; HBase Browser; Installing CDH; Stopping Hadoop services; Understanding a YARN cluster; Installing the CDH components; Installing Apache Flume; Installing Apache Sqoop; Installing Apache Sqoop 2; Installing Apache Pig.
- Installing Apache HiveInstalling Apache Oozie; Installing Apache ZooKeeper; Summary; Chapter 4: Exploring HDFS Federation and Its High Availability; Implementing HDFS Federation; Configuring HDFS Federation; Configuring ViewFS for federated HDFS; Implementing HDFS High Availability; Quorum-based storage; Configuring HDFS high availability by Quorum-based storage; Shared storage using NFS; Configuring HDFS high availability by shared storage sing NFS; Configuring automatic failover for HDFS high availability; Jobtracker high availability; Configuring Jobtracker High Availability.
- Configuring automatic failover for Jobtracker high availabilitySummary; Chapter 5: Using Cloudera Manager; Introducing Cloudera Manager; Understanding the Cloudera Manager architecture; Installing Cloudera Manager; Navigating the Cloudera Manager Web console; Navigating the Home screen; Navigating the Clusters menu; Exploring the Hosts menu; Understanding the Diagnostics menu; Understanding the Audits screen; Understanding the Charts menu; Understanding the Backup menu; Understanding the Administration menu; Configuring High Availability using Cloudera Manager; Summary.