Cargando…

Python data analytics : data analysis and science using Pandas, Matplotlib and the Python programming language /

Python Data Analytics will help you tackle the world of data acquisition and analysis using the power of the Python language. At the heart of this book lies the coverage of pandas, an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools fo...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Nelli, Fabio (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: [New York, NY] : Apress, [2015]
Colección:Expert's voice in Python.
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000 i 4500
001 OR_ocn923327741
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 151009s2015 nyua o 001 0 eng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d OCLCF  |d DEBBG  |d DEBSZ  |d CEF  |d OCLCQ  |d UWO  |d UAB  |d AU@  |d UBY  |d OCLCO  |d OCLCQ  |d OCLCO 
019 |a 1067210748 
020 |a 1484209591 
020 |a 9781484209592 
020 |a 9781484209585 
020 |a 1484209583 
020 |z 9781484209592 
024 7 |a 10.1007/978-1-4842-0958-5  |2 doi 
029 1 |a AU@  |b 000061146035 
029 1 |a DEBBG  |b BV043020438 
029 1 |a DEBSZ  |b 455700117 
029 1 |a GBVCP  |b 882744887 
035 |a (OCoLC)923327741  |z (OCoLC)1067210748 
037 |a CL0500000654  |b Safari Books Online 
050 4 |a QA76.73.P98 
072 7 |a UMX  |2 bicssc 
072 7 |a UMC  |2 bicssc 
072 7 |a COM051010  |2 bisacsh 
072 7 |a COM010000  |2 bisacsh 
082 0 4 |a 005.13  |2 23 
049 |a UAMI 
100 1 |a Nelli, Fabio,  |e author. 
245 1 0 |a Python data analytics :  |b data analysis and science using Pandas, Matplotlib and the Python programming language /  |c Fabio Nelli. 
246 3 0 |a Data analysis and science using Pandas, Matplotlib and the Python programming language 
264 1 |a [New York, NY] :  |b Apress,  |c [2015] 
264 2 |a New York, NY :  |b Distributed to the Book trade worldwide by Springer Science+Business Media New York 
264 4 |c ©2015 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file 
347 |b PDF 
490 1 |a The expert's voice in Python 
588 0 |a Print version record. 
500 |a Includes index. 
520 |a Python Data Analytics will help you tackle the world of data acquisition and analysis using the power of the Python language. At the heart of this book lies the coverage of pandas, an open source, BSD-licensed library providing high-performance, easy-to-use data structures and data analysis tools for the Python programming language. Author Fabio Nelli expertly shows the strength of the Python programming language when applied to processing, managing and retrieving information. Inside, you will see how intuitive and flexible it is to discover and communicate meaningful patterns of data using Python scripts, reporting systems, and data export. This book examines how to go about obtaining, processing, storing, managing and analyzing data using the Python programming language. You will use Python and other open source tools to wrangle data and tease out interesting and important trends in that data that will allow you to predict future patterns. Whether you are dealing with sales data, investment data (stocks, bonds, etc.), medical data, web page usage, or any other type of data set, Python can be used to interpret, analyze, and glean information from a pile of numbers and statistics. This book is an invaluable reference with its examples of storing and accessing data in a database; it walks you through the process of report generation; it provides three real world case studies or examples that you can take with you for your everyday analysis needs. 
505 0 |a Intro -- Contents at a Glance -- Contents -- About the Author -- About the Technical Reviewer -- Acknowledgments -- Chapter 1: An Introduction to Data Analysis -- Data Analysis -- Knowledge Domains of the Data Analyst -- Computer Science -- Mathematics and Statistics -- Machine Learning and Artificial Intelligence -- Professional Fields of Application -- Understanding the Nature of the Data -- When the Data Become Information -- When the Information Becomes Knowledge -- Types of Data -- The Data Analysis Process -- Problem Definition -- Data Extraction -- Data Preparation -- Data Exploration/Visualization -- Predictive Modeling -- Model Validation -- Deployment -- Quantitative and Qualitative Data Analysis -- Open Data -- Python and Data Analysis -- Conclusions -- Chapter 2: Introduction to the Python's World -- Python-The Programming Language -- Python-The Interpreter -- Cython -- Jython -- PyPy -- Python 2 and Python 3 -- Installing Python -- Python Distributions -- Anaconda -- Enthought Canopy -- Python(x, y) -- Using Python -- Python Shell -- Run an Entire Program Code -- Implement the Code Using an IDE -- Interact with Python -- Writing Python Code -- Make Calculations -- Import New Libraries and Functions -- Data Structure -- Functional Programming (Only for Python 3.4) -- Indentation -- IPython -- IPython Shell -- IPython Qt-Console -- IPython Notebook -- The Jupyter Project -- PyPI-The Python Package Index -- The IDEs for Python -- IDLE (Integrated DeveLopment Environment) -- Spyder -- Eclipse (pyDev) -- Sublime -- Liclipse -- NinjaIDE -- Komodo IDE -- SciPy -- NumPy -- Pandas -- matplotlib -- Conclusions -- Chapter 3: The NumPy Library -- NumPy: A Little History -- The NumPy Installation -- Ndarray: The Heart of the Library -- Create an Array -- Types of Data -- The dtype Option -- Intrinsic Crea tion of an Array -- Basic Operations. 
505 8 |a Arit hmetic Operators -- The M atrix Product -- Increm ent and Decrement Operators -- Universal Functions (ufunc) -- Aggregat e Functions -- Indexing, Slicing, and Iterating -- Indexing -- Slicing -- Iterating an Array -- Conditions an d Boolean Arrays -- Shape Manipulation -- Array Manipulation -- Joining Arrays -- Splitting Arrays -- General Concepts -- Copies or Views of Objects -- Vectorization -- Broadcasting -- Structured Arrays -- Reading and Writing Array Data on Files -- Loading and Saving Data in Binary Files -- Reading File with T abular Data -- Conclusions -- Chapter 4: The pandas Library-An Introduction -- pandas: The Python Data Analysis Library -- Installation -- Installation from Anaconda -- Installation from PyPI -- Installation on Linux -- Installation from Source -- A Module Repository for Windows -- Test Your pandas Installation -- Getting Started with pandas -- Introduction to pandas Data Structures -- The Series -- Declaring a Series -- Selecting the Internal Elements -- Assigning Values to the Elements -- Defining Series from NumPy Arrays and Other Series -- Filtering Values -- Operations and Mathematical Functions -- Evaluating Values -- NaN Values -- Series as Dictionaries -- Operations between Series -- The DataFrame -- Defining a DataFrame -- Selecting Elements -- Assigning Values -- Membership of a Value -- Deleting a Column -- Filtering -- DataFrame from Nested dict -- Transposition of a DataFrame -- The Index Objects -- Methods on Index -- Index with Duplicate Labels -- Other Functionalities on Indexes -- Reindexing -- Dropping -- Arithmetic and Data Alignment -- Operations between Data Structures -- Flexible Arithmetic Methods -- Operations between DataFrame and Series -- Function Application and Mapping -- Functions by Element -- Functions by Row or Column -- Statistics Functions -- Sorting and Ranking. 
505 8 |a Correlation and Covariance -- "Not a Number" Data -- Assigning a NaN Value -- Filtering Out NaN Values -- Filling in NaN Occurrences -- Hierarchical Indexing and Leveling -- Reordering and Sorting Levels -- Summary Statistic by Level -- Conclusions -- Chapter 5: pandas: Reading and Writing Data -- I/O API Tools -- CSV and Textual Files -- Reading Data in CSV or Text Files -- Using RegExp for Parsing TXT Files -- Reading TXT Files into Parts or Partially -- Writing Data in CSV -- Reading and Writing HTML Files -- Writing Data in HTML -- Reading Data from an HTML File -- Reading Data from XML -- Reading and Writing Data on Microsoft Excel Files -- JSON Data -- The Format HDF5 -- Pickle-Python Object Serialization -- Serialize a Python Object with cPickle -- Pickling with pandas -- Interacting with Databases -- Loading and Writing Data with SQLite3 -- Loading and Writing Data with PostgreSQL -- Reading and Writing Data with a NoSQL Database: MongoDB -- Conclusions -- Chapter 6: pandas in Depth: Data Manipulation -- Data Preparation -- Merging -- Merging on Index -- Concatenating -- Combining -- Pivoting -- Pivoting with Hierarchical Indexing -- Pivoting from "Long" to "Wide" Format -- Removing -- Data Transformation -- Removing Duplicates -- Mapping -- Replacing Values via Mapping -- Adding Values via Mapping -- Rename the Indexes of the Axes -- Discretization and Binning -- Detecting and Filtering Outliers -- Permutation -- Random Sampling -- String Manipulation -- Built-in Methods for Manipulation of Strings -- Regular Expressions -- Data Aggregation -- GroupBy -- A Practical Example -- Hierarchical Grouping -- Group Iteration -- Chain of Transformations -- Functions on Groups -- Advanced Data Aggregation -- Conclusions -- Chapter 7: Data Visualization with matplotlib -- The matplotlib Library -- Installation -- IPython and IPython QtConsole. 
505 8 |a Matplotlib Architecture -- Backend Layer -- Artist Layer -- Scripting Layer (pyplot) -- pylab and pyplot -- pyplot -- A Simple Interactive Chart -- Set the Properties of the Plot -- matplotlib and NumPy -- Using the kwargs -- Working with Multiple Figures and Axes -- Adding Further Elements to the Chart -- Adding Text -- Adding a Grid -- Adding a Legend -- Saving Your Charts -- Saving the Code -- Converting Your Session as an HTML File -- Saving Your Chart Directly as an Image -- Handling Date Values -- Chart Typology -- Line Chart -- Line Charts with pandas -- Histogram -- Bar Chart -- Horizontal Bar Chart -- Multiserial Bar Chart -- Multiseries Bar Chart with pandas DataFrame -- Multiseries Stacked Bar Charts -- Stacked Bar Charts with pandas DataFrame -- Other Bar Chart Representations -- Pie Charts -- Pie Charts with pandas DataFrame -- Advanced Charts -- Contour Plot -- Polar Chart -- mplot3d -- 3D Surfaces -- Scatter Plot in 3D -- Bar Chart 3D -- Multi-Panel Plots -- Display Subplots within Other Subplots -- Grids of Subplots -- Conclusions -- Chapter 8: Machine Learning with scikit-learn -- The scikit-learn Library -- Machine Learning -- Supervised and Unsupervised Learning -- Training Set and Testing Set -- Supervised Learning with scikit-learn -- The Iris Flower Dataset -- The PCA Decomposition -- K-Nearest Neighbors Classifier -- Diabetes Dataset -- Linear Regression: The Least Square Regression -- Support Vector Machines (SVMs) -- Support Vector Classification (SVC) -- Nonlinear SVC -- Plotting Different SVM Classifiers Using the Iris Dataset -- Support Vector Regression (SVR) -- Conclusions -- Chapter 9: An Example-Meteorological Data -- A Hypothesis to Be Tested: The Influence of the Proximity of the Sea -- The System in the Study: The Adriatic Sea and the Po Valley -- Data Source -- Data Analysis on IPython Notebook -- The RoseWind. 
505 8 |a Calculating the Distribution of the Wind Speed Means -- Conclusions -- Chapter 10: Embedding the JavaScript D3 Library in IPython Notebook -- The Open Data Source for Demographics -- The JavaScript D3 Library -- Drawing a Clustered Bar Chart -- The Choropleth Maps -- The Choropleth Map of the US Population in 2014 -- Conclusions -- Chapter 11: Recognizing Handwritten Digits -- Handwriting Recognition -- Recognizing Handwritten Digits with scikit-learn -- The Digits Dataset -- Learning and Predicting -- Conclusions -- Appendix A: Writing Mathematical Expressions with LaTeX -- With matplotlib -- With IPython Notebook in a Markdown Cell -- With IPython Notebook in a Python 2 Cell -- Subscripts and Superscripts -- Fractions, Binomials, and Stacked Numbers -- Radicals -- Fonts -- Accents -- Appendix B: Open Data Sources -- Political and Government Data -- Health Data -- Social Data -- Miscellaneous and Public Data Sets -- Financial Data -- Climatic Data -- Sports Data -- Publications, Newspapers, and Books -- Musical Data -- Index. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
650 0 |a Python (Computer program language) 
650 0 |a Data mining. 
650 2 |a Data Mining 
650 6 |a Python (Langage de programmation) 
650 6 |a Exploration de données (Informatique) 
650 7 |a Data mining  |2 fast 
650 7 |a Python (Computer program language)  |2 fast 
776 0 8 |i Print version:  |a Nelli, Fabio.  |t Python data analytics.  |d Berkeley, CA : Apress, 2015  |z 1484209591  |w (OCoLC)919853056 
830 0 |a Expert's voice in Python. 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781484209585/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
994 |a 92  |b IZTAP