Cargando…

Learning Apache Spark 2 : process big data with the speed of light! /

Learn about the fastest-growing open source project in the world, and find out how it revolutionizes big data analytics About This Book Exclusive guide that covers how to get up and running with fast data processing using Apache Spark Explore and exploit various possibilities with Apache Spark using...

Descripción completa

Detalles Bibliográficos
Clasificación:	Libro Electrónico
Autor principal:	Abbasi, Muhammad Asif (Autor)
Formato:	Electrónico eBook
Idioma:	Inglés
Publicado:	Birmingham, UK : Packt Publishing, 2017.
Temas:	Spark (Electronic resource : Apache Software Foundation) Electronic data processing > Distributed processing > Management. Big data. Machine learning. Données volumineuses. Apprentissage automatique. Big data Electronic data processing > Distributed processing > Management Machine learning
Acceso en línea:	Texto completo (Requiere registro previo con correo institucional)

MARC


LEADER	00000cam a2200000Ii 4500
001	OR_ocn984515083
003	OCoLC
005	20231017213018.0
006	m o d
007	cr unu\|\|\|\|\|\|\|\|
008	170427s2017 enka ob 000 0 eng d
040			\|a UMI \|b eng \|e rda \|e pn \|c UMI \|d IDEBK \|d TOH \|d OCLCF \|d TEFOD \|d VT2 \|d OCLCQ \|d UOK \|d CEF \|d KSU \|d WYU \|d UAB \|d DST \|d OCLCO \|d OCLCQ \|d N$T \|d OCLCO
020			\|a 9781785889585 \|q (electronic bk.)
020			\|a 1785889583 \|q (electronic bk.)
020			\|z 9781785885136
029	1		\|a GBVCP \|b 1004862830
035			\|a (OCoLC)984515083
037			\|a CL0500000852 \|b Safari Books Online
037			\|a B532ED43-CF76-44DF-8402-5BE45EC31D5C \|b OverDrive, Inc. \|n http://www.overdrive.com
050		4	\|a QA76.9.D343
082	0	4	\|a 006.312 \|2 23
049			\|a UAMI
100	1		\|a Abbasi, Muhammad Asif, \|e author.
245	1	0	\|a Learning Apache Spark 2 : \|b process big data with the speed of light! / \|c Muhammad Asif Abbasi.
246	3		\|a Learning Apache Spark two
264		1	\|a Birmingham, UK : \|b Packt Publishing, \|c 2017.
300			\|a 1 online resource (1 volume) : \|b illustrations
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
588			\|a Description based on online resource; title from cover (Safari, viewed April 26, 2017).
504			\|a Includes bibliographical references.
520			\|a Learn about the fastest-growing open source project in the world, and find out how it revolutionizes big data analytics About This Book Exclusive guide that covers how to get up and running with fast data processing using Apache Spark Explore and exploit various possibilities with Apache Spark using real-world use cases in this book Want to perform efficient data processing at real time? This book will be your one-stop solution. Who This Book Is For This guide appeals to big data engineers, analysts, architects, software engineers, even technical managers who need to perform efficient data processing on Hadoop at real time. Basic familiarity with Java or Scala will be helpful. The assumption is that readers will be from a mixed background, but would be typically people with background in engineering/data science with no prior Spark experience and want to understand how Spark can help them on their analytics journey. What You Will Learn Get an overview of big data analytics and its importance for organizations and data professionals Delve into Spark to see how it is different from existing processing platforms Understand the intricacies of various file formats, and how to process them with Apache Spark. Realize how to deploy Spark with YARN, MESOS or a Stand-alone cluster manager. Learn the concepts of Spark SQL, SchemaRDD, Caching and working with Hive and Parquet file formats Understand the architecture of Spark MLLib while discussing some of the off-the-shelf algorithms that come with Spark. Introduce yourself to the deployment and usage of SparkR. Walk through the importance of Graph computation and the graph processing systems available in the market Check the real world example of Spark by building a recommendation engine with Spark using ALS. Use a Telco data set, to predict customer churn using Random Forests. In Detail Spark juggernaut keeps on rolling and getting more and more momentum each day. Spark provides key capabilities in the form of Spark SQL, Spark Streaming, Spark ML and Graph X all accessible via Java, Scala, Python and R. Deploying the key capabilities is crucial whether it is on a Standalone framework or as a part of existing Hadoop installation and configuring with Yarn and Mesos. The next part of the journey after installation is using key components, APIs, Clustering, machine learning APIs, data pipelines, parallel programming. It is important to understand why each framework component is key, how widely it is being u...
590			\|a O'Reilly \|b O'Reilly Online Learning: Academic/Public Library Edition
630	0	0	\|a Spark (Electronic resource : Apache Software Foundation)
630	0	7	\|a Spark (Electronic resource : Apache Software Foundation) \|2 fast
650		0	\|a Electronic data processing \|x Distributed processing \|x Management.
650		0	\|a Big data.
650		0	\|a Machine learning.
650		6	\|a Données volumineuses.
650		6	\|a Apprentissage automatique.
650		7	\|a Big data \|2 fast
650		7	\|a Electronic data processing \|x Distributed processing \|x Management \|2 fast
650		7	\|a Machine learning \|2 fast
856	4	0	\|u https://learning.oreilly.com/library/view/~/9781785885136/?ar \|z Texto completo (Requiere registro previo con correo institucional)
938			\|a ProQuest MyiLibrary Digital eBook Collection \|b IDEB \|n cis36983548
938			\|a EBSCOhost \|b EBSC \|n 1495816
994			\|a 92 \|b IZTAP

Learning Apache Spark 2 : process big data with the speed of light! /

MARC

Ejemplares similares