Modern data engineering with Apache Spark : a hands-on guide for building mission-critical streaming applications /
Leverage Apache Spark within a modern data engineering ecosystem. This hands-on guide will teach you how to write fully functional applications, follow industry best practices, and learn the rationale behind these decisions. With Apache Spark as the foundation, you will follow a step-by-step journey...
Clasificación: | Libro Electrónico |
---|---|
Autor principal: | |
Formato: | Electrónico eBook |
Idioma: | Inglés |
Publicado: |
[Berkeley, Calif.] :
Apress,
2022.
|
Temas: | |
Acceso en línea: | Texto completo (Requiere registro previo con correo institucional) |
Tabla de Contenidos:
- Part I. The Fundamentals of Data Engineering with Spark
- 1. Introduction to Modern Data Engineering
- 2. Getting Started with Apache Spark
- 3. Working with Data
- 4. Transforming Data with Spark SQL and the DataFrame API
- 5. Bridging Spark SQL with JDBC
- 6. Data Discovery and the Spark SQL Catalog
- 7. Data Pipelines & Structured Spark Applications
- Part II. The Streaming Pipeline Ecosystem
- 8. Workflow Orchestration with Apache Airflow
- 9. A Gentle Introduction to Stream Processing
- 10. Patterns for Writing Structured Streaming Applications
- 11. Apache Kafka & Spark Structured Streaming
- 12. Analytical Processing & Insights
- Part III. Advanced Techniques
- 13. Advanced Analytics with Spark Stateful Structured Streaming
- 14. Deploying Mission Critical Spark Applications on Spark Standalone
- 15. Deploying Mission Critical Spark Applications on Kubernetes.