Loading…

Advanced analytics with Spark : patterns from learning from data at scale /

The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by presenting examples and a set of self-contained patterns for performing large-scale data analysis with Spark. You'll start with an introduction to Spark and its eco...

Full description

Bibliographic Details
Call Number:Libro Electrónico
Main Authors: Ryza, Sandy (Author), Laserson, Uri (Author), Owen, Sean (Author), Wills, Josh (Author)
Format: Electronic eBook
Language:Inglés
Published: Sebastopol, CA : O'Reilly Media, 2017.
Edition:Second edition.
Subjects:
Online Access:Texto completo (Requiere registro previo con correo institucional)
Table of Contents:
  • Analyzing big data
  • Introduction to data analysis with Scala and Spark
  • Recommending music and the audioscrobbler data set
  • Predicting forest cover with decision trees
  • Anomaly detection in network traffic with K-means clustering
  • Understanding Wikipedia with latent semantic analysis
  • Analyzing co-occurrence networks with GraphX
  • Geospatial and temporal data analysis on the New York City taxi trip data
  • Estimating financial risk through Monte Carlo simulation
  • Analyzing genomics data and the BDG project
  • Analyzing neuroimaging data with PySpark and Thunder.