Cargando…

Mastering Spark with R : the complete guide to large-scale analysis and modeling /

"Authors Javier Luraschi, Kevin Kuo, and Edgar Ruiz show you how to combine R with Spark to analyze data at scale. This book covers relevant data science topics, cluster computing, and issues that will interest even the most advanced users."--Back cover

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autores principales: Luraschi, Javier (Autor), Kuo, Kevin (Autor), Ruiz, Edgar (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Sebastopol, CA : O'Reilly Media, [2019]
Edición:First edition.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000 i 4500
001 OR_on1123174078
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 191015t20192020caua ob 001 0 eng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d OCLCF  |d CDN  |d EBLCP  |d TEFOD  |d GZM  |d UKAHL  |d N$T  |d YDX  |d OCLCQ  |d OCLCO  |d NZAUC  |d OCLCQ  |d OCLCO 
019 |a 1122917346  |a 1131767521 
020 |a 9781492046349  |q (electronic bk.) 
020 |a 1492046345  |q (electronic bk.) 
020 |a 9781492046325 
020 |a 1492046329 
020 |z 9781492046370 
020 |z 149204637X 
029 1 |a AU@  |b 000071520524 
035 |a (OCoLC)1123174078  |z (OCoLC)1122917346  |z (OCoLC)1131767521 
037 |a CL0501000076  |b Safari Books Online 
050 4 |a QA276.45.R3 
082 0 4 |a 004.2/2  |2 23 
049 |a UAMI 
100 1 |a Luraschi, Javier,  |e author. 
245 1 0 |a Mastering Spark with R :  |b the complete guide to large-scale analysis and modeling /  |c Javier Luraschi, Kevin Kuo, and Edgar Ruiz. 
250 |a First edition. 
264 1 |a Sebastopol, CA :  |b O'Reilly Media,  |c [2019] 
264 4 |c ©2020 
300 |a 1 online resource (xviii, 274 pages) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Online resource; title from title page (Safari, viewed October 10, 2019). 
504 |a Includes bibliographical references and index. 
505 0 |a Intro; Copyright; Table of Contents; Foreword; Preface; Formatting; Acknowledgments; Conventions Used in This Book; Using Code Examples; O'Reilly Online Learning; How to Contact Us; Chapter 1. Introduction; Overview; Hadoop; Spark; R; sparklyr; Recap; Chapter 2. Getting Started; Overview; Prerequisites; Installing sparklyr; Installing Spark; Connecting; Using Spark; Web Interface; Analysis; Modeling; Data; Extensions; Distributed R; Streaming; Logs; Disconnecting; Using RStudio; Resources; Recap; Chapter 3. Analysis; Overview; Import; Wrangle; Built-in Functions; Correlations; Visualize 
505 8 |a Using ggplot2Using dbplot; Model; Caching; Communicate; Recap; Chapter 4. Modeling; Overview; Exploratory Data Analysis; Feature Engineering; Supervised Learning; Generalized Linear Regression; Other Models; Unsupervised Learning; Data Preparation; Topic Modeling; Recap; Chapter 5. Pipelines; Overview; Creation; Use Cases; Hyperparameter Tuning; Operating Modes; Interoperability; Deployment; Batch Scoring; Real-Time Scoring; Recap; Chapter 6. Clusters; Overview; On-Premises; Managers; Distributions; Cloud; Amazon; Databricks; Google; IBM; Microsoft; Qubole; Kubernetes; Tools; RStudio; Jupyter 
505 8 |a LivyRecap; Chapter 7. Connections; Overview; Edge Nodes; Spark Home; Local; Standalone; YARN; YARN Client; YARN Cluster; Livy; Mesos; Kubernetes; Cloud; Batches; Tools; Multiple Connections; Troubleshooting; Logging; Spark Submit; Windows; Recap; Chapter 8. Data; Overview; Reading Data; Paths; Schema; Memory; Columns; Writing Data; Copying Data; File Formats; CSV; JSON; Parquet; Others; File Systems; Storage Systems; Hive; Cassandra; JDBC; Recap; Chapter 9. Tuning; Overview; Graph; Timeline; Configuring; Connect Settings; Submit Settings; Runtime Settings; sparklyr Settings; Partitioning 
505 8 |a Implicit PartitionsExplicit Partitions; Caching; Checkpointing; Memory; Shuffling; Serialization; Configuration Files; Recap; Chapter 10. Extensions; Overview; H2O; Graphs; XGBoost; Deep Learning; Genomics; Spatial; Troubleshooting; Recap; Chapter 11. Distributed R; Overview; Use Cases; Custom Parsers; Partitioned Modeling; Grid Search; Web APIs; Simulations; Partitions; Grouping; Columns; Context; Functions; Packages; Cluster Requirements; Installing R; Apache Arrow; Troubleshooting; Worker Logs; Resolving Timeouts; Inspecting Partitions; Debugging Workers; Recap; Chapter 12. Streaming 
505 8 |a OverviewTransformations; Analysis; Modeling; Pipelines; Distributed R; Kafka; Shiny; Recap; Chapter 13. Contributing; Overview; The Spark API; Spark Extensions; Using Scala Code; Recap; Appendix A. Supplemental Code References; Preface; Formatting; Chapter 1; The World's Capacity to Store Information; Daily Downloads of CRAN Packages; Chapter 2; Prerequisites; Chapter 3; Hive Functions; Chapter 4; MLlib Functions; Chapter 6; Google Trends for On-Premises (Mainframes), Cloud Computing, and Kubernetes; Chapter 12; Stream Generator; Installing Kafka; Index; About the Authors; Colophon 
520 |a "Authors Javier Luraschi, Kevin Kuo, and Edgar Ruiz show you how to combine R with Spark to analyze data at scale. This book covers relevant data science topics, cluster computing, and issues that will interest even the most advanced users."--Back cover 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
630 0 0 |a Spark (Electronic resource : Apache Software Foundation) 
630 0 7 |a Spark (Electronic resource : Apache Software Foundation)  |2 fast 
650 0 |a R (Computer program language) 
650 0 |a Electronic data processing. 
650 0 |a Big data. 
650 6 |a R (Langage de programmation) 
650 6 |a Données volumineuses. 
650 7 |a Big data  |2 fast 
650 7 |a Electronic data processing  |2 fast 
650 7 |a R (Computer program language)  |2 fast 
700 1 |a Kuo, Kevin,  |e author. 
700 1 |a Ruiz, Edgar,  |e author. 
776 0 8 |i Print version:  |a Luraschi, Javier.  |t Mastering Spark with R : The Complete Guide to Large-Scale Analysis and Modeling.  |d Sebastopol : O'Reilly Media, Incorporated, ©2019  |z 9781492046370 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781492046363/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a Askews and Holts Library Services  |b ASKH  |n AH36840625 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL5928213 
938 |a EBSCOhost  |b EBSC  |n 2267502 
938 |a YBP Library Services  |b YANK  |n 300879994 
938 |a YBP Library Services  |b YANK  |n 16494849 
994 |a 92  |b IZTAP