ADVANCED ANALYTICS WITH PYSPARK: patterns for learning from data at scale using Python and Spark

The amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world dataset...

Bibliographic Details
Classification: Electronic Book
Main authors: Tandon, Akash (Author), Owen, Sean (Author), Wills, Josh (Author), Ryza, Sandy (Author), Laserson, Uri, 1983- (Author)
Format: Electronic eBook
Language: English
Published: [S.l.] : O'REILLY MEDIA, 2022.
Subjects:
Online access: Full text (requires prior registration with an institutional email address)

MARC

LEADER 00000cam a22000007a 4500
001 OR_on1330690750
003 OCoLC
005 20231017213018.0
006 m o d
007 cr |n|||||||||
008 220618s2022 xx o 000 0 eng d
040 |a YDX  |b eng  |c YDX  |d ORMDA  |d OCLCF  |d UKAHL  |d OCLCQ 
020 |a 9781098103620  |q (electronic bk.) 
020 |a 1098103629  |q (electronic bk.) 
020 |z 1098103653 
020 |z 9781098103651 
029 1 |a AU@  |b 000072141876 
035 |a (OCoLC)1330690750 
037 |a 9781098103644  |b O'Reilly Media 
050 4 |a QA76.9.D343 
082 0 4 |a 006.3/12  |2 23/eng/20220621 
049 |a UAMI 
100 1 |a Tandon, Akash,  |e author. 
245 0 0 |a ADVANCED ANALYTICS WITH PYSPARK  |h [electronic resource] :  |b patterns for learning from data at scale using Python and Spark /  |c Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen & Josh Wills. 
260 |a [S.l.] :  |b O'REILLY MEDIA,  |c 2022. 
300 |a 1 online resource 
520 |a The amount of data being generated today is staggering and growing. Apache Spark has emerged as the de facto tool to analyze big data and is now a critical part of the data science toolbox. Updated for Spark 3.0, this practical guide brings together Spark, statistical methods, and real-world datasets to teach you how to approach analytics problems using PySpark, Spark's Python API, and other best practices in Spark programming. Data scientists Akash Tandon, Sandy Ryza, Uri Laserson, Sean Owen, and Josh Wills offer an introduction to the Spark ecosystem, then dive into patterns that apply common techniques, including classification, clustering, collaborative filtering, and anomaly detection, to fields such as genomics, security, and finance. This updated edition also covers NLP and image processing. If you have a basic understanding of machine learning and statistics and you program in Python, this book will get you started with large-scale data analysis. Familiarize yourself with Spark's programming model and ecosystem. Learn general approaches in data science. Examine complete implementations that analyze large public datasets. Discover which machine learning tools make sense for particular problems. Explore code that can be adapted to many uses. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
630 0 0 |a SPARK (Electronic resource) 
630 0 7 |a SPARK (Electronic resource)  |2 fast  |0 (OCoLC)fst01400497 
650 0 |a Python (Computer program language) 
650 0 |a Data mining. 
650 7 |a Data mining.  |2 fast  |0 (OCoLC)fst00887946 
650 7 |a Python (Computer program language)  |2 fast  |0 (OCoLC)fst01084736 
700 1 |a Owen, Sean,  |e author. 
700 1 |a Wills, Josh,  |e author. 
700 1 |a Ryza, Sandy,  |e author. 
700 1 |a Laserson, Uri,  |d 1983-  |e author. 
776 0 8 |i Print version:  |z 1098103653  |z 9781098103651  |w (OCoLC)1272856308 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781098103644/?ar  |z Full text (requires prior registration with an institutional email address) 
938 |a Askews and Holts Library Services  |b ASKH  |n AH40320402 
938 |a YBP Library Services  |b YANK  |n 18010463 
938 |a YBP Library Services  |b YANK  |n 302973313 
994 |a 92  |b IZTAP