Cargando…

Essential PySpark for Scalable Data Analytics /

Get started with distributed computing using PySpark, a single unified framework to solve end-to-end data analytics at scale Key Features Discover how to convert huge amounts of raw data into meaningful and actionable insights Use Spark's unified analytics engine for end-to-end analytics, from...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Nudurupati, Sreeram (Autor)
Autor Corporativo: Safari, an O'Reilly Media Company
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Packt Publishing, 2021.
Edición:1st edition.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)
Tabla de Contenidos:
  • Table of Contents Distributed Computing Primer Data Ingestion Data Cleansing and Integration Real-time Data Analytics Scalable Machine Learning with PySpark Feature Engineering – Extraction, Transformation, and Selection Supervised Machine Learning Unsupervised Machine Learning Machine Learning Life Cycle Management Scaling Out Single-Node Machine Learning Using PySpark Data Visualization with PySpark Spark SQL Primer Integrating External Tools with Spark SQL The Data Lakehouse.