
Spark : big data cluster computing in production /

Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance toward using lightning-fast big-data clustering in production. Written by an expert team well-known in the big data community, this book walks you through the challenges in moving from pr...


Bibliographic Details
Classification:	Electronic Book
Main Authors:	Ganelin, Ilya (Author), Orhian, Ema (Author), Sasaki, Kai (Author), York, Brennon (Author)
Format:	Electronic eBook
Language:	English
Published:	Indianapolis, IN : John Wiley & Sons, Inc., [2016] ©2016
Subjects:	Spark (Electronic resource : Apache Software Foundation); Electronic data processing > Distributed processing; Big data; Parallel processing (Electronic computers); Traitement réparti; Données volumineuses; Parallélisme (Informatique); COMPUTERS > General.
Online Access:	Full text (requires prior registration with an institutional email address)

MARC


LEADER	00000cam a2200000 i 4500
001	OR_ocn945137904
003	OCoLC
005	20231017213018.0
006	m o d
007	cr cnu---unuuu
008	160319s2016 inu o 000 0 eng d
040			\|a EBLCP \|b eng \|e pn \|c EBLCP \|d N$T \|d IDEBK \|d CDX \|d YDXCP \|d OCLCF \|d DG1 \|d RECBK \|d COO \|d TEFOD \|d OCLCQ \|d DG1 \|d DEBSZ \|d CCO \|d IDB \|d MERUC \|d LOA \|d UPM \|d COCUF \|d DG1 \|d K6U \|d STF \|d PIFAG \|d FVL \|d OCLCQ \|d OCLCO \|d ZCU \|d U3W \|d OCLCQ \|d D6H \|d OCLCQ \|d OCLCO \|d WRM \|d OCLCQ \|d KSU \|d CRU \|d ICG \|d VTS \|d OCLCQ \|d OCLCO \|d VT2 \|d OCLCQ \|d G3B \|d TKN \|d OCLCQ \|d DKC \|d OCLCQ \|d UKAHL \|d OCLCQ \|d OCLCO \|d C6I \|d OCLCQ \|d UMI \|d OH1 \|d N9V \|d OCL \|d OCLCO \|d OCLCQ
019			\|a 951950544 \|a 973795037 \|a 1126097456 \|a 1129377792 \|a 1139722723 \|a 1145283517 \|a 1152996343 \|a 1228541388 \|a 1240530639 \|a 1244443506 \|a 1249215805
020			\|a 9781119254805 \|q (electronic bk.)
020			\|a 1119254809 \|q (electronic bk.)
020			\|a 9781119254041 \|q (electronic bk.)
020			\|a 1119254043 \|q (electronic bk.)
020			\|a 9781119254058 \|q (electronic bk.)
020			\|a 1119254051 \|q (electronic bk.)
020			\|z 9781119254010
020			\|z 1119254019
029	1		\|a AU@ \|b 000057258612
029	1		\|a AU@ \|b 000060080995
029	1		\|a CHBIS \|b 010879372
029	1		\|a CHNEW \|b 000895505
029	1		\|a CHNEW \|b 000945243
029	1		\|a CHVBK \|b 480262586
029	1		\|a DEBBG \|b BV043629288
029	1		\|a DEBSZ \|b 480367094
029	1		\|a DEBSZ \|b 485065916
029	1		\|a GBVCP \|b 1002764785
035			\|a (OCoLC)945137904 \|z (OCoLC)951950544 \|z (OCoLC)973795037 \|z (OCoLC)1126097456 \|z (OCoLC)1129377792 \|z (OCoLC)1139722723 \|z (OCoLC)1145283517 \|z (OCoLC)1152996343 \|z (OCoLC)1228541388 \|z (OCoLC)1240530639 \|z (OCoLC)1244443506 \|z (OCoLC)1249215805
037			\|a 981A916F-B662-49E4-841E-31BB904FAAC4 \|b OverDrive, Inc. \|n http://www.overdrive.com
050		4	\|a QA76.9.D5 \|b G358 2016
072		7	\|a COM \|x 000000 \|2 bisacsh
082	0	4	\|a 005.3/76 \|2 23
049			\|a UAMI
100	1		\|a Ganelin, Ilya, \|e author.
245	1	0	\|a Spark : \|b big data cluster computing in production / \|c Ilya Ganelin [and others].
260			\|a Indianapolis, IN : \|b John Wiley & Sons, Inc., \|c [2016]
264		4	\|c ©2016
300			\|a 1 online resource (219 pages)
336			\|a text \|b txt \|2 rdacontent
337			\|a computer \|b c \|2 rdamedia
338			\|a online resource \|b cr \|2 rdacarrier
520			\|a Spark: Big Data Cluster Computing in Production goes beyond general Spark overviews to provide targeted guidance toward using lightning-fast big-data clustering in production. Written by an expert team well-known in the big data community, this book walks you through the challenges in moving from proof-of-concept or demo Spark applications to live Spark in production. Real use cases provide deep insight into common problems, limitations, challenges, and opportunities, while expert tips and tricks help you get the most out of Spark performance. Coverage includes Spark SQL, Tachyon, Kerberos, ML Lib, YARN, and Mesos, with clear, actionable guidance on resource scheduling, db connectors, streaming, security, and much more.
588	0		\|a Print version record.
505	0		\|a Spark: Big Data Cluster Computing in Production; About the Authors; About the Technical Editors; Credits; Acknowledgments; Contents at a glance; Contents; Introduction; Chapter 1 Finishing Your Spark Job; Installation of the Necessary Components; Native Installation Using a Spark Standalone Cluster; The History of Distributed Computing That Led to Spark; Enter the Cloud; Understanding Resource Management; Using Various Formats for Storage; Text Files; Sequence Files; Avro Files; Parquet Files; Making Sense of Monitoring and Instrumentation; Spark UI; Spark Standalone UI; Metrics REST API.
505	8		\|a Metrics System; External Monitoring Tools; Summary; Chapter 2 Cluster Management; Background; Spark Components; Driver; Workers and Executors; Configuration; Spark Standalone; Architecture; Single-Node Setup Scenario; Multi-Node Setup; YARN; Architecture; Dynamic Resource Allocation; Scenario; Mesos; Setup; Architecture; Dynamic Resource Allocation; Basic Setup Scenario; Comparison; Summary; Chapter 3 Performance Tuning; Spark Execution Model; Partitioning; Controlling Parallelism; Partitioners; Shuffling Data; Shuffling and Data Partitioning; Operators and Shuffling.
505	8		\|a Shuffling Is Not That Bad After All; Serialization; Kryo Registrators; Spark Cache; Spark SQL Cache; Memory Management; Garbage Collection; Shared Variables; Broadcast Variables; Accumulators; Data Locality; Summary; Chapter 4 Security; Architecture; Security Manager; Setup Configurations; ACL; Configuration; Job Submission; Web UI; Network Security; Encryption; Event logging; Kerberos; Apache Sentry; Summary; Chapter 5 Fault Tolerance or Job Execution; Lifecycle of a Spark Job; Spark Master; Spark Driver; Spark Worker; Job Lifecycle; Job Scheduling; Scheduling within an Application.
505	8		\|a Scheduling with External Utilities; Fault Tolerance; Internal and External Fault Tolerance; Service Level Agreements (SLAs); Resilient Distributed Datasets (RDDs); Batch versus Streaming; Testing Strategies; Recommended Configurations; Summary; Chapter 6 Beyond Spark; Data Warehousing; Spark SQL CLI; Thrift JDBC/ODBC Server; Hive on Spark; Machine Learning; DataFrame; MLlib and ML; Mahout on Spark; Hivemall on Spark; External Frameworks; Spark Package; XGBoost; spark-jobserver; Future Works; Integration with the Parameter Server; Deep Learning; Enterprise Usage.
505	8		\|a Collecting User Activity Log with Spark and Kafka; Real-Time Recommendation with Spark; Real-Time Categorization of Twitter Bots; Summary; Index; EULA.
590			\|a O'Reilly \|b O'Reilly Online Learning: Academic/Public Library Edition
630	0	0	\|a Spark (Electronic resource : Apache Software Foundation)
630	0	7	\|a Spark (Electronic resource : Apache Software Foundation) \|2 fast \|0 (OCoLC)fst01938143
650		0	\|a Electronic data processing \|x Distributed processing.
650		0	\|a Big data.
650		0	\|a Parallel processing (Electronic computers)
650		6	\|a Traitement réparti.
650		6	\|a Données volumineuses.
650		6	\|a Parallélisme (Informatique)
650		7	\|a COMPUTERS \|x General. \|2 bisacsh
650		7	\|a Parallel processing (Electronic computers) \|2 fast \|0 (OCoLC)fst01052928
650		7	\|a Electronic data processing \|x Distributed processing. \|2 fast \|0 (OCoLC)fst00906987
650		7	\|a Big data. \|2 fast \|0 (OCoLC)fst01892965
700	1		\|a Orhian, Ema, \|e author.
700	1		\|a Sasaki, Kai, \|e author.
700	1		\|a York, Brennon, \|e author.
776	0	8	\|i Print version: \|a Ganelin, Ilya. \|t Spark : Big Data Cluster Computing in Production. \|d : Wiley, ©2016 \|z 9781119254805
856	4	0	\|u https://learning.oreilly.com/library/view/~/9781119254010/?ar \|z Full text (requires prior registration with an institutional email address)
938			\|a Askews and Holts Library Services \|b ASKH \|n AH30144351
938			\|a Askews and Holts Library Services \|b ASKH \|n AH30144350
938			\|a Coutts Information Services \|b COUT \|n 33055432
938			\|a EBL - Ebook Library \|b EBLB \|n EBL4451522
938			\|a EBSCOhost \|b EBSC \|n 1198682
938			\|a ProQuest MyiLibrary Digital eBook Collection \|b IDEB \|n cis33055432
938			\|a Recorded Books, LLC \|b RECE \|n rbeEB00671757
938			\|a YBP Library Services \|b YANK \|n 12976387
938			\|a YBP Library Services \|b YANK \|n 12888564
994			\|a 92 \|b IZTAP
