Cargando…

Apache Spark Quick Start Guide : Quickly Learn the Art of Writing Efficient Big Data Applications with Apache Spark.

Apache Spark is a ﬂexible in-memory framework that allows processing of both batch and real-time data. Its unified engine has made it quite popular for big data use cases. This book will help you to quickly get started with Apache Spark 2.0 and write efficient big data applications for a variety of...

Descripción completa

Detalles Bibliográficos
Clasificación:	Libro Electrónico
Autor principal:	Mehrotra, Shrey
Otros Autores:	Grade, Akash
Formato:	Electrónico eBook
Idioma:	Inglés
Publicado:	Birmingham : Packt Publishing Ltd, 2019.
Temas:	Spark (Electronic resource : Apache Software Foundation) Machine learning. Apprentissage automatique. COMPUTERS > Programming > Open Source. COMPUTERS > Software Development & Engineering > Tools. COMPUTERS > Software Development & Engineering > General. Machine learning
Acceso en línea:	Texto completo

MARC


LEADER	00000cam a2200000Mi 4500
001	EBSCO_on1086054843
003	OCoLC
005	20231017213018.0
006	m o d
007	cr cnu---unuuu
008	190216s2019 enk o 000 0 eng d
040			\|a EBLCP \|b eng \|e pn \|c EBLCP \|d YDX \|d UKMGB \|d OCLCO \|d TEFOD \|d N$T \|d OCLCF \|d OCLCQ \|d OCLCO \|d UKAHL \|d OCLCQ \|d OCLCO \|d K6U \|d OCLCQ \|d OCLCO
015			\|a GBB948607 \|2 bnb
016	7		\|a 019253766 \|2 Uk
019			\|a 1085783006 \|a 1086269943 \|a 1086665234
020			\|a 178934266X
020			\|a 9781789342666 \|q (electronic bk.)
020			\|z 1789349109
020			\|z 9781789349108
029	1		\|a AU@ \|b 000065065761
029	1		\|a CHNEW \|b 001040252
029	1		\|a CHVBK \|b 559039158
029	1		\|a UKMGB \|b 019253766
029	1		\|a AU@ \|b 000070435827
035			\|a (OCoLC)1086054843 \|z (OCoLC)1085783006 \|z (OCoLC)1086269943 \|z (OCoLC)1086665234
037			\|a 9781789342666 \|b Packt Publishing
050		4	\|a QA76.73.S59
072		7	\|a COM \|x 051390 \|2 bisacsh
072		7	\|a COM \|x 051440 \|2 bisacsh
072		7	\|a COM \|x 051230 \|2 bisacsh
082	0	4	\|a 005.133 \|2 23
049			\|a UAMI
100	1		\|a Mehrotra, Shrey.
245	1	0	\|a Apache Spark Quick Start Guide : \|b Quickly Learn the Art of Writing Efficient Big Data Applications with Apache Spark.
260			\|a Birmingham : \|b Packt Publishing Ltd, \|c 2019.
300			\|a 1 online resource (150 pages)
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
588	0		\|a Print version record.
505	0		\|a Cover; Title Page; Copyright and Credits; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Introduction to Apache Spark; What is Spark?; Spark architecture overview; Spark language APIs; Scala; Java; Python; R; SQL; Spark components; Spark Core; Spark SQL; Spark Streaming; Spark machine learning; Spark graph processing; Cluster manager; Standalone scheduler; YARN; Mesos; Kubernetes; Making the most of Hadoop and Spark; Summary; Chapter 2: Apache Spark Installation; AWS elastic compute cloud (EC2); Creating a free account on AWS; Connecting to your Linux instance
505	8		\|a Configuring SparkPrerequisites; Installing Java; Installing Scala; Installing Python; Installing Spark; Using Spark components; Different modes of execution; Spark sandbox; Summary; Chapter 3: Spark RDD; What is an RDD?; Resilient metadata; Programming using RDDs; Transformations and actions; Transformation; Narrow transformations; map(); flatMap(); filter(); union(); mapPartitions(); Wide transformations; distinct(); sortBy(); intersection(); subtract(); cartesian(); Action; collect(); count(); take(); top(); takeOrdered(); first(); countByValue(); reduce(); saveAsTextFile(); foreach()
505	8		\|a Types of RDDsPair RDDs; groupByKey(); reduceByKey(); sortByKey(); join(); Caching and checkpointing; Caching; Checkpointing ; Understanding partitions ; repartition() versus coalesce(); partitionBy(); Drawbacks of using RDDs; Summary; Chapter 4: Spark DataFrame and Dataset; DataFrames; Creating DataFrames; Data sources; DataFrame operations and associated functions; Running SQL on DataFrames; Temporary views on DataFrames; Global temporary views on DataFrames; Datasets; Encoders; Internal row; Creating custom encoders; Summary; Chapter 5: Spark Architecture and Application Execution Flow
505	8		\|a A sample applicationDAG constructor; Stage; Tasks; Task scheduler; FIFO; FAIR; Application execution modes; Local mode; Client mode; Cluster mode; Application monitoring; Spark UI; Application logs; External monitoring solution; Summary; Chapter 6: Spark SQL; Spark SQL; Spark metastore; Using the Hive metastore in Spark SQL; Hive configuration with Spark; SQL language manual; Database; Table and view; Load data; Creating UDFs; SQL database using JDBC; Summary; Chapter 7: Spark Streaming, Machine Learning, and Graph Analysis; Spark Streaming; Use cases; Data sources; Stream processing
505	8		\|a MicrobatchDStreams; Streaming architecture; Streaming example; Machine learning; MLlib; ML; Graph processing; GraphX; mapVertices; mapEdges; subgraph; GraphFrames; degrees; subgraphs; Graph algorithms; PageRank; Summary; Chapter 8: Spark Optimizations; Cluster-level optimizations; Memory; Disk; CPU cores; Project Tungsten; Application optimizations; Language choice; Structured versus unstructured APIs; File format choice; RDD optimizations; Choosing the right transformations; Serializing and compressing ; Broadcast variables; DataFrame and dataset optimizations; Catalyst optimizer; Storage
500			\|a Parallelism
520			\|a Apache Spark is a ﬂexible in-memory framework that allows processing of both batch and real-time data. Its unified engine has made it quite popular for big data use cases. This book will help you to quickly get started with Apache Spark 2.0 and write efficient big data applications for a variety of use cases.
590			\|a eBooks on EBSCOhost \|b EBSCO eBook Subscription Academic Collection - Worldwide
630	0	0	\|a Spark (Electronic resource : Apache Software Foundation)
630	0	7	\|a Spark (Electronic resource : Apache Software Foundation) \|2 fast
650		0	\|a Machine learning.
650		6	\|a Apprentissage automatique.
650		7	\|a COMPUTERS \|x Programming \|x Open Source. \|2 bisacsh
650		7	\|a COMPUTERS \|x Software Development & Engineering \|x Tools. \|2 bisacsh
650		7	\|a COMPUTERS \|x Software Development & Engineering \|x General. \|2 bisacsh
650		7	\|a Machine learning \|2 fast
700	1		\|a Grade, Akash.
776	0	8	\|i Print version: \|a Mehrotra, Shrey. \|t Apache Spark Quick Start Guide : Quickly Learn the Art of Writing Efficient Big Data Applications with Apache Spark. \|d Birmingham : Packt Publishing Ltd, ©2019 \|z 9781789349108
856	4	0	\|u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2018971 \|z Texto completo
938			\|a Askews and Holts Library Services \|b ASKH \|n BDZ0039650230
938			\|a ProQuest Ebook Central \|b EBLB \|n EBL5675596
938			\|a EBSCOhost \|b EBSC \|n 2018971
938			\|a YBP Library Services \|b YANK \|n 16044691
994			\|a 92 \|b IZTAP

Apache Spark Quick Start Guide : Quickly Learn the Art of Writing Efficient Big Data Applications with Apache Spark.

MARC

Ejemplares similares