Cargando…

Modern data engineering with Apache Spark : a hands-on guide for building mission-critical streaming applications /

Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Haines, Scott (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: [Berkeley, Calif.] : Apress, 2022.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)
Tabla de Contenidos:
  • Part I. The Fundamentals of Data Engineering with Spark
  • 1. Introduction to Modern Data Engineering
  • 2. Getting Started with Apache Spark
  • 3. Working with Data
  • 4. Transforming Data with Spark SQL and the DataFrame API
  • 5. Bridging Spark SQL with JDBC
  • 6. Data Discovery and the Spark SQL Catalog
  • 7. Data Pipelines & Structured Spark Applications
  • Part II. The Streaming Pipeline Ecosystem
  • 8. Workflow Orchestration with Apache Airflow
  • 9. A Gentle Introduction to Stream Processing
  • 10. Patterns for Writing Structured Streaming Applications
  • 11. Apache Kafka & Spark Structured Streaming
  • 12. Analytical Processing & Insights
  • Part III. Advanced Techniques
  • 13. Advanced Analytics with Spark Stateful Structured Streaming
  • 14. Deploying Mission Critical Spark Applications on Spark Standalone
  • 15. Deploying Mission Critical Spark Applications on Kubernetes.