
Apache Spark Quick Start Guide : Quickly Learn the Art of Writing Efficient Big Data Applications with Apache Spark.

Apache Spark is a flexible in-memory framework that allows processing of both batch and real-time data. Its unified engine has made it quite popular for big data use cases. This book will help you to quickly get started with Apache Spark 2.0 and write efficient big data applications for a variety of use cases.


Bibliographic Details
Classification: Electronic Book
Main Author: Mehrotra, Shrey
Other Authors: Grade, Akash
Format: Electronic eBook
Language: English
Published: Birmingham : Packt Publishing Ltd, 2019.
Subjects:
Online Access: Full text

MARC

LEADER 00000cam a2200000Mi 4500
001 EBSCO_on1086054843
003 OCoLC
005 20231017213018.0
006 m o d
007 cr cnu---unuuu
008 190216s2019 enk o 000 0 eng d
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d YDX  |d UKMGB  |d OCLCO  |d TEFOD  |d N$T  |d OCLCF  |d OCLCQ  |d OCLCO  |d UKAHL  |d OCLCQ  |d OCLCO  |d K6U  |d OCLCQ  |d OCLCO 
015 |a GBB948607  |2 bnb 
016 7 |a 019253766  |2 Uk 
019 |a 1085783006  |a 1086269943  |a 1086665234 
020 |a 178934266X 
020 |a 9781789342666  |q (electronic bk.) 
020 |z 1789349109 
020 |z 9781789349108 
029 1 |a AU@  |b 000065065761 
029 1 |a CHNEW  |b 001040252 
029 1 |a CHVBK  |b 559039158 
029 1 |a UKMGB  |b 019253766 
029 1 |a AU@  |b 000070435827 
035 |a (OCoLC)1086054843  |z (OCoLC)1085783006  |z (OCoLC)1086269943  |z (OCoLC)1086665234 
037 |a 9781789342666  |b Packt Publishing 
050 4 |a QA76.73.S59 
072 7 |a COM  |x 051390  |2 bisacsh 
072 7 |a COM  |x 051440  |2 bisacsh 
072 7 |a COM  |x 051230  |2 bisacsh 
082 0 4 |a 005.133  |2 23 
049 |a UAMI 
100 1 |a Mehrotra, Shrey. 
245 1 0 |a Apache Spark Quick Start Guide :  |b Quickly Learn the Art of Writing Efficient Big Data Applications with Apache Spark. 
260 |a Birmingham :  |b Packt Publishing Ltd,  |c 2019. 
300 |a 1 online resource (150 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
505 0 |a Cover; Title Page; Copyright and Credits; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Introduction to Apache Spark; What is Spark?; Spark architecture overview; Spark language APIs; Scala; Java; Python; R; SQL; Spark components; Spark Core; Spark SQL; Spark Streaming; Spark machine learning; Spark graph processing; Cluster manager; Standalone scheduler; YARN; Mesos; Kubernetes; Making the most of Hadoop and Spark; Summary; Chapter 2: Apache Spark Installation; AWS elastic compute cloud (EC2); Creating a free account on AWS; Connecting to your Linux instance 
505 8 |a Configuring Spark; Prerequisites; Installing Java; Installing Scala; Installing Python; Installing Spark; Using Spark components; Different modes of execution; Spark sandbox; Summary; Chapter 3: Spark RDD; What is an RDD?; Resilient metadata; Programming using RDDs; Transformations and actions; Transformation; Narrow transformations; map(); flatMap(); filter(); union(); mapPartitions(); Wide transformations; distinct(); sortBy(); intersection(); subtract(); cartesian(); Action; collect(); count(); take(); top(); takeOrdered(); first(); countByValue(); reduce(); saveAsTextFile(); foreach() 
505 8 |a Types of RDDs; Pair RDDs; groupByKey(); reduceByKey(); sortByKey(); join(); Caching and checkpointing; Caching; Checkpointing; Understanding partitions; repartition() versus coalesce(); partitionBy(); Drawbacks of using RDDs; Summary; Chapter 4: Spark DataFrame and Dataset; DataFrames; Creating DataFrames; Data sources; DataFrame operations and associated functions; Running SQL on DataFrames; Temporary views on DataFrames; Global temporary views on DataFrames; Datasets; Encoders; Internal row; Creating custom encoders; Summary; Chapter 5: Spark Architecture and Application Execution Flow 
505 8 |a A sample application; DAG constructor; Stage; Tasks; Task scheduler; FIFO; FAIR; Application execution modes; Local mode; Client mode; Cluster mode; Application monitoring; Spark UI; Application logs; External monitoring solution; Summary; Chapter 6: Spark SQL; Spark SQL; Spark metastore; Using the Hive metastore in Spark SQL; Hive configuration with Spark; SQL language manual; Database; Table and view; Load data; Creating UDFs; SQL database using JDBC; Summary; Chapter 7: Spark Streaming, Machine Learning, and Graph Analysis; Spark Streaming; Use cases; Data sources; Stream processing 
505 8 |a Microbatch; DStreams; Streaming architecture; Streaming example; Machine learning; MLlib; ML; Graph processing; GraphX; mapVertices; mapEdges; subgraph; GraphFrames; degrees; subgraphs; Graph algorithms; PageRank; Summary; Chapter 8: Spark Optimizations; Cluster-level optimizations; Memory; Disk; CPU cores; Project Tungsten; Application optimizations; Language choice; Structured versus unstructured APIs; File format choice; RDD optimizations; Choosing the right transformations; Serializing and compressing; Broadcast variables; DataFrame and dataset optimizations; Catalyst optimizer; Storage 
500 |a Parallelism 
520 |a Apache Spark is a flexible in-memory framework that allows processing of both batch and real-time data. Its unified engine has made it quite popular for big data use cases. This book will help you to quickly get started with Apache Spark 2.0 and write efficient big data applications for a variety of use cases. 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
630 0 0 |a Spark (Electronic resource : Apache Software Foundation) 
630 0 7 |a Spark (Electronic resource : Apache Software Foundation)  |2 fast 
650 0 |a Machine learning. 
650 6 |a Apprentissage automatique. 
650 7 |a COMPUTERS  |x Programming  |x Open Source.  |2 bisacsh 
650 7 |a COMPUTERS  |x Software Development & Engineering  |x Tools.  |2 bisacsh 
650 7 |a COMPUTERS  |x Software Development & Engineering  |x General.  |2 bisacsh 
650 7 |a Machine learning  |2 fast 
700 1 |a Grade, Akash. 
776 0 8 |i Print version:  |a Mehrotra, Shrey.  |t Apache Spark Quick Start Guide : Quickly Learn the Art of Writing Efficient Big Data Applications with Apache Spark.  |d Birmingham : Packt Publishing Ltd, ©2019  |z 9781789349108 
856 4 0 |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2018971  |z Full text 
938 |a Askews and Holts Library Services  |b ASKH  |n BDZ0039650230 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL5675596 
938 |a EBSCOhost  |b EBSC  |n 2018971 
938 |a YBP Library Services  |b YANK  |n 16044691 
994 |a 92  |b IZTAP