Cargando…

Learning predictive analytics with python : gain practical insights into predictive modelling by implementing Predictive Analytics algorithms on public datasets with Python /

Annotation

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Kumar, Ashish (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham : Packt Publishing, [2016]
Ã2016
Colección:Community experience distilled.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)
Tabla de Contenidos:
  • Cover ; Copyright; Credits; Foreword; About the Author; Acknowledgments; About the Reviewer; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Getting Started with Predictive Modelling ; Introducing predictive modelling; Scope of predictive modelling; Ensemble of statistical algorithms; Statistical tools; Historical data; Mathematical function; Business context; Knowledge matrix for predictive modelling; Task matrix for predictive modelling; Applications and examples of predictive modelling; LinkedIn's ""People also viewed"" feature; What it does?; How is it done?
  • Correct targeting of online adsHow is it done?; Santa Cruz predictive policing; How is it done?; Determining the activity of a smartphone user using accelerometer data; How is it done?; Sport and fantasy leagues; How was it done?; Python and its packages
  • download and installation; Anaconda; Standalone Python; Installing a Python package; Installing pip; Installing Python packages with pip; Python and its packages for predictive modelling; IDEs for Python; Summary; Chapter 2: Data Cleaning ; Reading the data
  • variations and examples; Data frames; Delimiters.
  • Various methods of importing data in PythonCase 1
  • reading a dataset using the read_csv method; The read_csv method; Use cases of the read_csv method; Case 2
  • reading a dataset using the open method of Python; Reading a dataset line by line; Changing the delimiter of a dataset; Case 3
  • reading data from a URL; Case 4
  • miscellaneous cases; Reading from an .xls or .xlsx file; Writing to a CSV or Excel file; Basics
  • summary, dimensions, structure; Handling missing values; Checking for missing values; What constitutes missing data?; How missing values are generated and propagated.
  • Treating missing valuesDeletion; Imputation; Creating dummy variables; Visualizing a dataset by basic plotting; Scatter plots; Histograms; Boxplots; Summary; Chapter 3: Data Wrangling ; Subsetting a dataset; Selecting columns; Selecting rows; Selecting a combination of rows and columns; Creating new columns; Generating random numbers and their usage; Various methods for generating random numbers; Seeding a random number; Generating random numbers following probability distributions; Probability density function; Cumulative density function; Uniform distribution; Normal distribution.
  • Using the Monte-Carlo simulation to find the value of piGeometry and mathematics behind the calculation of pi; Generating a dummy data frame; Grouping the data
  • aggregation, filtering, and transformation; Aggregation; Filtering; Transformation; Miscellaneous operations; Random sampling
  • splitting a dataset in training and testing datasets; Method 1
  • using the Customer Churn Model; Method 2
  • using sklearn; Method 3
  • using the shuffle function; Concatenating and appending data; Merging/joining datasets; Inner Join; Left Join; Right Join; An example of the Inner Join.