PySpark Cookbook : Over 60 Recipes for Implementing Big Data Processing and Analytics Using Apache Spark and Python.

This cookbook presents recipes on leveraging the power of Python and putting it to use in the Apache Spark ecosystem. By the end of this book, you will be able to solve any problem associated with building effective, data-intensive applications and performing machine learning and structured streamin...

Bibliographic Details
Classification: Electronic Book
Main Author: Drabas, Tomasz
Other Authors: Lee, Denny
Format: Electronic eBook
Language: English
Published: Birmingham : Packt Publishing Ltd, 2018.
Subjects:
Online Access: Full text

MARC

LEADER 00000cam a2200000Mi 4500
001 EBOOKCENTRAL_on1045023903
003 OCoLC
005 20240329122006.0
006 m o d
007 cr |n|---|||||
008 180721s2018 enk o 000 0 eng d
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d MERUC  |d NLE  |d CHVBK  |d OCLCO  |d OCLCQ  |d OCLCO  |d LVT  |d OCLCF  |d OCLCQ  |d UX1  |d K6U  |d OCLCO  |d OCLCQ  |d OCLCO  |d OCLCL 
019 |a 1175630327 
020 |a 9781788834254 
020 |a 1788834259 
020 |a 9781788835367 
020 |a 1788835360  |q (Trade Paper) 
024 3 |a 9781788835367 
029 1 |a AU@  |b 000066233178 
029 1 |a CHNEW  |b 001023894 
029 1 |a CHVBK  |b 530323508 
035 |a (OCoLC)1045023903  |z (OCoLC)1175630327 
037 |a B09522  |b 01201872 
050 4 |a QA76.73.S59  |b .L44 2018 
082 0 4 |a 004.2 
049 |a UAMI 
100 1 |a Drabas, Tomasz. 
245 1 0 |a PySpark Cookbook :  |b Over 60 Recipes for Implementing Big Data Processing and Analytics Using Apache Spark and Python. 
260 |a Birmingham :  |b Packt Publishing Ltd,  |c 2018. 
300 |a 1 online resource (321 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
505 0 |a Cover; Title Page; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Installing and Configuring Spark; Introduction; Installing Spark requirements; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Installing Java; Installing Python; Installing R; Installing Scala; Installing Maven; Updating PATH; Installing Spark from sources; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Installing Spark from binaries; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Configuring a local instance of Spark. 
505 8 |a Getting ready; How to do it ... ; How it works ... ; See also; Configuring a multi-node instance of Spark; Getting ready; How to do it ... ; How it works ... ; See also; Installing Jupyter; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Configuring a session in Jupyter; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Working with Cloudera Spark images; Getting ready; How to do it ... ; How it works ... ; Chapter 2: Abstracting Data with RDDs; Introduction; Creating RDDs; Getting ready; How to do it ... ; How it works ... 
505 8 |a Spark context parallelize method; .take(...) method; Reading data from files; Getting ready; How to do it ... ; How it works ... ; .textFile(...) method; .map(...) method; Partitions and performance; Overview of RDD transformations; Getting ready; How to do it ... ; .map(...) transformation; .filter(...) transformation; .flatMap(...) transformation; .distinct() transformation; .sample(...) transformation; .join(...) transformation; .repartition(...) transformation; .zipWithIndex() transformation; .reduceByKey(...) transformation; .sortByKey(...) transformation; .union(...) transformation. 
505 8 |a .mapPartitionsWithIndex(...) transformation; How it works ... ; Overview of RDD actions; Getting ready; How to do it ... ; .take(...) action; .collect() action; .reduce(...) action; .count() action; .saveAsTextFile(...) action; How it works ... ; Pitfalls of using RDDs; Getting ready; How to do it ... ; How it works ... ; Chapter 3: Abstracting Data with DataFrames; Introduction; Creating DataFrames; Getting ready; How to do it ... ; How it works ... ; There's more ... ; From JSON; From CSV; See also; Accessing underlying RDDs; Getting ready; How to do it ... ; How it works ... ; Performance optimizations. 
505 8 |a Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Inferring the schema using reflection; Getting ready; How to do it ... ; How it works ... ; See also; Specifying the schema programmatically; Getting ready; How to do it ... ; How it works ... ; See also; Creating a temporary table; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Using SQL to interact with DataFrames; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Overview of DataFrame transformations; Getting ready; How to do it ... ; The .select(...) transformation. 
500 |a The .filter(...) transformation. 
520 |a This cookbook presents recipes on leveraging the power of Python and putting it to use in the Apache Spark ecosystem. By the end of this book, you will be able to solve any problem associated with building effective, data-intensive applications and performing machine learning and structured streaming using PySpark. 
590 |a ProQuest Ebook Central  |b Ebook Central Academic Complete 
650 0 |a SPARK. 
650 0 |a Python. 
650 0 |a Application software  |x Development. 
650 6 |a Logiciels d'application  |x Développement. 
650 7 |a Data capture & analysis.  |2 bicssc 
650 7 |a Database design & theory.  |2 bicssc 
650 7 |a Information architecture.  |2 bicssc 
650 7 |a Programming & scripting languages: general.  |2 bicssc 
650 7 |a Computers  |x Data Modeling & Design.  |2 bisacsh 
650 7 |a Computers  |x Data Processing.  |2 bisacsh 
650 7 |a Computers  |x Programming Languages  |x Python.  |2 bisacsh 
650 7 |a Application software  |x Development  |2 fast 
700 1 |a Lee, Denny. 
758 |i has work:  |a PYSPARK COOKBOOK (Text)  |1 https://id.oclc.org/worldcat/entity/E39PCXQ7hPQ8wHBTf4VfGHbPgq  |4 https://id.oclc.org/worldcat/ontology/hasWork 
776 0 8 |i Print version:  |a Drabas, Tomasz.  |t PySpark Cookbook : Over 60 Recipes for Implementing Big Data Processing and Analytics Using Apache Spark and Python.  |d Birmingham : Packt Publishing Ltd, ©2018  |z 9781788835367 
856 4 0 |u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=5446040  |z Full text 
938 |a EBL - Ebook Library  |b EBLB  |n EBL5446040 
994 |a 92  |b IZTAP