Cargando…

Python Data Mining Quick Start Guide : a Beginner's Guide to Extracting Valuable Insights from Your Data.

This book is an introduction to data mining and its practical demonstration of working with real-world data sets. With this book, you will be able to extract useful insights using common Python libraries. You will also learn key stages like data loading, cleaning, analysis, visualization to build an...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Greeneltch, Nathan
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Birmingham : Packt Publishing, Limited, 2019.
Temas:
Acceso en línea:Texto completo

MARC

LEADER 00000cam a2200000Mi 4500
001 EBSCO_on1100030829
003 OCoLC
005 20231017213018.0
006 m o d
007 cr |n|---|||||
008 190518s2019 enk o 000 0 eng d
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d UKAHL  |d UKMGB  |d OCLCO  |d OCLCF  |d OCLCQ  |d CHVBK  |d YDX  |d OCLCQ  |d N$T  |d OCLCQ  |d OCLCO  |d OCLCQ 
015 |a GBB986173  |2 bnb 
016 7 |a 019396752  |2 Uk 
019 |a 1099567606 
020 |a 1789806402 
020 |a 9781789806403  |q (electronic bk.) 
029 1 |a CHNEW  |b 001058748 
029 1 |a CHVBK  |b 569753511 
029 1 |a UKMGB  |b 019396752 
029 1 |a AU@  |b 000065284873 
035 |a (OCoLC)1100030829  |z (OCoLC)1099567606 
037 |a 9781789806403  |b Packt Publishing 
050 4 |a QA76.73.P98  |b .G744 2019 
082 0 4 |a 006.312  |2 23 
049 |a UAMI 
100 1 |a Greeneltch, Nathan. 
245 1 0 |a Python Data Mining Quick Start Guide :  |b a Beginner's Guide to Extracting Valuable Insights from Your Data. 
260 |a Birmingham :  |b Packt Publishing, Limited,  |c 2019. 
300 |a 1 online resource (181 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Print version record. 
505 0 |a Cover; Title Page; Copyright and Credits; Dedication; About Packt; Contributors; Table of Contents; Preface; Chapter 1: Data Mining and Getting Started with Python Tools; Descriptive, predictive, and prescriptive analytics; What will and will not be covered in this book; Recommended readings for further explanation; Setting up Python environments for data mining; Installing the Anaconda distribution and Conda package manager; Installing on Linux; Installing on Windows; Installing on macOS; Launching the Spyder IDE; Launching a Jupyter Notebook; Installing high-performance Python distribution 
505 8 |a Recommended libraries and how to installRecommended libraries; Summary; Chapter 2: Basic Terminology and Our End-to-End Example; Basic data terminology; Sample spaces; Variable types; Data types; Basic summary statistics; An end-to-end example of data mining in Python; Loading data into memory -- viewing and managing with ease using pandas; Plotting and exploring data -- harnessing the power of Seaborn; Transforming data -- PCA and LDA with scikit-learn; Quantifying separations -- k-means clustering and the silhouette score; Making decisions or predictions; Summary 
505 8 |a Chapter 3: Collecting, Exploring, and Visualizing DataTypes of data sources and loading into pandas; Databases; Basic Structured Query Language (SQL) queries; Disks; Web sources; From URLs; From Scikit-learn and Seaborn-included sets; Access, search, and sanity checks with pandas; Basic plotting in Seaborn; Popular types of plots for visualizing data; Scatter plots; Histograms; Jointplots; Violin plots; Pairplots; Summary; Chapter 4: Cleaning and Readying Data for Analysis; The scikit-learn transformer API; Cleaning input data; Missing values; Finding and removing missing values 
505 8 |a Imputing to replace the missing valuesFeature scaling; Normalization; Standardization; Handling categorical data; Ordinal encoding; One-hot encoding; Label encoding; High-dimensional data; Dimension reduction; Feature selection; Feature filtering; The variance threshold; The correlation coefficient; Wrapper methods; Sequential feature selection; Transformation; PCA; LDA; Summary; Chapter 5: Grouping and Clustering Data; Introducing clustering concepts; Location of the group; Euclidean space (centroids); Non-Euclidean space (medioids); Similarity; Euclidean space; The Euclidean distance 
505 8 |a The Manhattan distanceMaximum distance; Non-Euclidean space; The cosine distance; The Jaccard distance; Termination condition; With known number of groupings; Without known number of groupings; Quality score and silhouette score; Clustering methods; Means separation; K-means; Finding k; K-means++; Mini batch K-means; Hierarchical clustering; Reuse the dendrogram to find number of clusters; Plot dendrogram; Density clustering; Spectral clustering; Summary; Chapter 6: Prediction with Regression and Classification; Scikit-learn Estimator API; Introducing prediction concepts 
500 |a Prediction nomenclature 
520 |a This book is an introduction to data mining and its practical demonstration of working with real-world data sets. With this book, you will be able to extract useful insights using common Python libraries. You will also learn key stages like data loading, cleaning, analysis, visualization to build an efficient data mining pipeline. 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
650 0 |a Data mining. 
650 0 |a Python (Computer program language) 
650 2 |a Data Mining 
650 6 |a Exploration de données (Informatique) 
650 6 |a Python (Langage de programmation) 
650 7 |a Data mining.  |2 fast  |0 (OCoLC)fst00887946 
650 7 |a Python (Computer program language)  |2 fast  |0 (OCoLC)fst01084736 
776 0 8 |i Print version:  |a Greeneltch, Nathan.  |t Python Data Mining Quick Start Guide : A Beginner's Guide to Extracting Valuable Insights from Your Data.  |d Birmingham : Packt Publishing, Limited, ©2019  |z 9781789800265 
856 4 0 |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2111782  |z Texto completo 
938 |a Askews and Holts Library Services  |b ASKH  |n BDZ0040020888 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL5761233 
938 |a EBSCOhost  |b EBSC  |n 2111782 
938 |a YBP Library Services  |b YANK  |n 300485162 
994 |a 92  |b IZTAP