Moving Hadoop to the cloud : harnessing cloud features and flexibility for Hadoop clusters

Bibliographic Details
Classification: Electronic Book
Main Author: Havanki, Bill (Author)
Format: Electronic eBook
Language: English
Published: Sebastopol, CA : O'Reilly Media, 2017.
Edition: First edition.
Subjects:
Online Access: Full text (requires prior registration with an institutional email address)
Table of Contents:
  • Copyright; Table of Contents; Foreword; Preface; Who This Book Is For; What You Should Already Know; What This Book Leaves Out; How This Book Works; Which Software Versions This Book Uses; Conventions Used in This Book; IP Addresses; Using Code Examples; O'Reilly Safari; How to Contact Us; Acknowledgments; Part I. Introduction to the Cloud; Chapter 1. Why Hadoop in the Cloud?; What Is the Cloud?; What Does Hadoop in the Cloud Mean?; Reasons to Run Hadoop in the Cloud; Reasons to Not Run Hadoop in the Cloud; What About Security?; Hybrid Clouds; Hadoop Solutions from Cloud Providers
  • Elastic MapReduce; Google Cloud Dataproc; HDInsight; Hadoop-Like Services; A Spectrum of Choices; Getting Started; Chapter 2. Overview and Comparison of Cloud Providers; Amazon Web Services; References; Google Cloud Platform; References; Microsoft Azure; References; Which One Should You Use?; Part II. Cloud Primer; Chapter 3. Instances; Instance Types; Regions and Availability Zones; Instance Control; Temporary Instances; Spot Instances; Preemptible Instances; Images; No Instance Is an Island; Chapter 4. Networking and Security; A Drink of CIDR; Virtual Networks; Private DNS
  • Public IP Addresses and DNS; Virtual Networks and Regions; Routing; Routing in AWS; Routing in Google Cloud Platform; Routing in Azure; Network Security Rules; Inbound Versus Outbound; Allow Versus Deny; Network Security Rules in AWS; Network Security Rules in Google Cloud Platform; Network Security Rules in Azure; Putting Networking and Security Together; What About the Data?; Chapter 5. Storage; Block Storage; Block Storage in AWS; Block Storage in Google Cloud Platform; Block Storage in Azure; Object Storage; Buckets; Data Objects; Object Access; Object Storage in AWS
  • Object Storage in Google Cloud Platform; Object Storage in Azure; Cloud Relational Databases; Cloud Relational Databases in AWS; Cloud Relational Databases in Google Cloud Platform; Cloud Relational Databases in Azure; Cloud NoSQL Databases; Where to Start?; Part III. A Simple Cluster in the Cloud; Chapter 6. Setting Up in AWS; Prerequisites; Allocating Instances; Generating a Key Pair; Launching Instances; Securing the Instances; Next Steps; Chapter 7. Setting Up in Google Cloud Platform; Prerequisites; Creating a Project; Allocating Instances; SSH Keys; Creating Instances
  • Securing the Instances; Next Steps; Chapter 8. Setting Up in Azure; Prerequisites; Creating a Resource Group; Creating Resources; SSH Keys; Creating Virtual Machines; The Manager Instance; The Worker Instances; Next Steps; Chapter 9. Standing Up a Cluster; The JDK; Hadoop Accounts; Passwordless SSH; Hadoop Installation; HDFS and YARN Configuration; The Environment; XML Configuration Files; Finishing Up Configuration; Startup; SSH Tunneling; Running a Test Job; What If the Job Hangs?; Running Basic Data Loading and Analysis; Wikipedia Exports; Analyzing a Small Export; Go Bigger