Cargando…

Data analysis with Python and Pyspark /

Data Analysis with Python and PySpark helps you solve the daily challenges of data science with PySpark. You'll learn how to scale your processing capabilities across multiple machines while ingesting data from any source--whether that's Hadoop clusters, cloud data storage, or local data f...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Rioux, Jonathan (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Shelter Island : Manning Publications, 2022.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)
Descripción
Sumario:Data Analysis with Python and PySpark helps you solve the daily challenges of data science with PySpark. You'll learn how to scale your processing capabilities across multiple machines while ingesting data from any source--whether that's Hadoop clusters, cloud data storage, or local data files. Once you've covered the fundamentals, you'll explore the full versatility of PySpark by building machine learning pipelines, and blending Python, pandas, and PySpark code.
Descripción Física:1 online resource (1 volume.)
ISBN:9781617297205
1617297208