Cargando…

PySpark Cookbook : Over 60 Recipes for Implementing Big Data Processing and Analytics Using Apache Spark and Python.

This cookbook presents recipes on leveraging the power of Python and putting it to use in the Apache Spark ecosystem. By the end of this book, you will be able to solve any problem associated with building effective, data-intensive applications and performing machine learning and structured streamin...

Descripción completa

Detalles Bibliográficos
Clasificación:	Libro Electrónico
Autor principal:	Drabas, Tomasz
Otros Autores:	Lee, Denny
Formato:	Electrónico eBook
Idioma:	Inglés
Publicado:	Birmingham : Packt Publishing Ltd, 2018.
Temas:	SPARK. Python. Application software > Development. Logiciels d'application > Développement. Data capture & analysis. Database design & theory. Information architecture. Programming & scripting languages: general. Computers > Data Modeling & Design. Computers > Data Processing. Computers > Programming Languages > Python. Application software > Development
Acceso en línea:	Texto completo

MARC


LEADER	00000cam a2200000Mi 4500
001	EBOOKCENTRAL_on1045023903
003	OCoLC
005	20240329122006.0
006	m o d
007	cr \|n\|---\|\|\|\|\|
008	180721s2018 enk o 000 0 eng d
040			\|a EBLCP \|b eng \|e pn \|c EBLCP \|d MERUC \|d NLE \|d CHVBK \|d OCLCO \|d OCLCQ \|d OCLCO \|d LVT \|d OCLCF \|d OCLCQ \|d UX1 \|d K6U \|d OCLCO \|d OCLCQ \|d OCLCO \|d OCLCL
019			\|a 1175630327
020			\|a 9781788834254
020			\|a 1788834259
020			\|a 9781788835367
020			\|a 1788835360 \|q (Trade Paper)
024	3		\|a 9781788835367
029	1		\|a AU@ \|b 000066233178
029	1		\|a CHNEW \|b 001023894
029	1		\|a CHVBK \|b 530323508
035			\|a (OCoLC)1045023903 \|z (OCoLC)1175630327
037			\|a B09522 \|b 01201872
050		4	\|a QA76.73.S59 \|b .L44 2018
082	0	4	\|a 004.2
049			\|a UAMI
100	1		\|a Drabas, Tomasz.
245	1	0	\|a PySpark Cookbook : \|b Over 60 Recipes for Implementing Big Data Processing and Analytics Using Apache Spark and Python.
260			\|a Birmingham : \|b Packt Publishing Ltd, \|c 2018.
300			\|a 1 online resource (321 pages)
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
588	0		\|a Print version record.
505	0		\|a Cover; Title Page; Copyright and Credits; Packt Upsell; Contributors; Table of Contents; Preface; Chapter 1: Installing and Configuring Spark; Introduction; Installing Spark requirements; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Installing Java; Installing Python; Installing R; Installing Scala; Installing Maven; Updating PATH; Installing Spark from sources; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Installing Spark from binaries; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Configuring a local instance of Spark.
505	8		\|a Getting readyHow to do it ... ; How it works ... ; See also; Configuring a multi-node instance of Spark; Getting ready; How to do it ... ; How it works ... ; See also; Installing Jupyter; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Configuring a session in Jupyter; Getting ready; How to do it ... ; How it works ... ; There's more ... ; See also; Working with Cloudera Spark images; Getting ready; How to do it ... ; How it works ... ; Chapter 2: Abstracting Data with RDDs; Introduction; Creating RDDs; Getting ready ; How to do it ... ; How it works ...
505	8		\|a Spark context parallelize method.take(...) method; Reading data from files; Getting ready ; How to do it ... ; How it works ... ; .textFile(...) method; .map(...) method; Partitions and performance; Overview of RDD transformations; Getting ready; How to do it ... ; .map(...) transformation; .filter(...) transformation; .flatMap(...) transformation; .distinct() transformation; .sample(...) transformation; .join(...) transformation; .repartition(...) transformation; .zipWithIndex() transformation; .reduceByKey(...) transformation; .sortByKey(...) transformation; .union(...) transformation.
505	8		\|a .MapPartitionsWithIndex(...) transformationHow it works ... ; Overview of RDD actions; Getting ready; How to do it ... ; .take(...) action; .collect() action; .reduce(...) action; .count() action; .saveAsTextFile(...) action; How it works ... ; Pitfalls of using RDDs; Getting ready; How to do it ... ; How it works ... ; Chapter 3: Abstracting Data with DataFrames; Introduction; Creating DataFrames; Getting ready; How to do it ... ; How it works ... ; There's more ... ; From JSON; From CSV; See also; Accessing underlying RDDs; Getting ready; How to do it ... ; How it works ... ; Performance optimizations.
505	8		\|a Getting readyHow to do it ... ; How it works ... ; There's more ... ; See also; Inferring the schema using reflection; Getting ready; How to do it ... ; How it works ... ; See also; Specifying the schema programmatically; Getting ready; How to do it ... ; How it works ... ; See also; Creating a temporary table; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Using SQL to interact with DataFrames; Getting ready; How to do it ... ; How it works ... ; There's more ... ; Overview of DataFrame transformations; Getting ready; How to do it ... ; The .select(...) transformation.
500			\|a The .filter(...) transformation.
520			\|a This cookbook presents recipes on leveraging the power of Python and putting it to use in the Apache Spark ecosystem. By the end of this book, you will be able to solve any problem associated with building effective, data-intensive applications and performing machine learning and structured streaming using PySpark.
590			\|a ProQuest Ebook Central \|b Ebook Central Academic Complete
650		0	\|a SPARK.
650		0	\|a Python.
650		0	\|a Application software \|x Development.
650		6	\|a Logiciels d'application \|x Développement.
650		7	\|a Data capture & analysis. \|2 bicssc
650		7	\|a Database design & theory. \|2 bicssc
650		7	\|a Information architecture. \|2 bicssc
650		7	\|a Programming & scripting languages: general. \|2 bicssc
650		7	\|a Computers \|x Data Modeling & Design. \|2 bisacsh
650		7	\|a Computers \|x Data Processing. \|2 bisacsh
650		7	\|a Computers \|x Programming Languages \|x Python. \|2 bisacsh
650		7	\|a Application software \|x Development \|2 fast
700	1		\|a Lee, Denny.
758			\|i has work: \|a PYSPARK COOKBOOK (Text) \|1 https://id.oclc.org/worldcat/entity/E39PCXQ7hPQ8wHBTf4VfGHbPgq \|4 https://id.oclc.org/worldcat/ontology/hasWork
776	0	8	\|i Print version: \|a Drabas, Tomasz. \|t PySpark Cookbook : Over 60 Recipes for Implementing Big Data Processing and Analytics Using Apache Spark and Python. \|d Birmingham : Packt Publishing Ltd, ©2018 \|z 9781788835367
856	4	0	\|u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=5446040 \|z Texto completo
938			\|a EBL - Ebook Library \|b EBLB \|n EBL5446040
994			\|a 92 \|b IZTAP

PySpark Cookbook : Over 60 Recipes for Implementing Big Data Processing and Analytics Using Apache Spark and Python.

MARC

Ejemplares similares