Cargando…

Advanced analytics with Spark : patterns from learning from data at scale /

The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by presenting examples and a set of self-contained patterns for performing large-scale data analysis with Spark. You'll start with an introduction to Spark and its eco...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autores principales: Ryza, Sandy (Autor), Laserson, Uri (Autor), Owen, Sean (Autor), Wills, Josh (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Sebastopol, CA : O'Reilly Media, 2017.
Edición:Second edition.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)
Tabla de Contenidos:
  • Analyzing big data
  • Introduction to data analysis with Scala and Spark
  • Recommending music and the audioscrobbler data set
  • Predicting forest cover with decision trees
  • Anomaly detection in network traffic with K-means clustering
  • Understanding Wikipedia with latent semantic analysis
  • Analyzing co-occurrence networks with GraphX
  • Geospatial and temporal data analysis on the New York City taxi trip data
  • Estimating financial risk through Monte Carlo simulation
  • Analyzing genomics data and the BDG project
  • Analyzing neuroimaging data with PySpark and Thunder.