|
|
|
|
LEADER |
00000cam a2200000Ii 4500 |
001 |
OR_ocn913875222 |
003 |
OCoLC |
005 |
20231017213018.0 |
006 |
m o d |
007 |
cr unu|||||||| |
008 |
150716s2015 caua o 001 0 eng d |
040 |
|
|
|a UMI
|b eng
|e rda
|e pn
|c UMI
|d I3U
|d OCLCF
|d DEBBG
|d DEBSZ
|d CEF
|d UAB
|d DST
|d OCLCO
|d OCLCQ
|
020 |
|
|
|z 9781491910290
|
020 |
|
|
|a 1491910291
|
020 |
|
|
|a 9781491910290
|
029 |
1 |
|
|a DEBBG
|b BV043019775
|
029 |
1 |
|
|a DEBSZ
|b 455693463
|
029 |
1 |
|
|a GBVCP
|b 882742035
|
035 |
|
|
|a (OCoLC)913875222
|
037 |
|
|
|a CL0500000612
|b Safari Books Online
|
050 |
|
4 |
|a QA76.73.P98
|
082 |
0 |
4 |
|a 005.13/3
|2 23
|
049 |
|
|
|a UAMI
|
100 |
1 |
|
|a Mitchell, Ryan,
|e author.
|
245 |
1 |
0 |
|a Web scraping with Python :
|b collecting data from the modern web /
|c Ryan Mitchell.
|
246 |
3 |
0 |
|a Collecting data from the modern web
|
250 |
|
|
|a First edition.
|
264 |
|
1 |
|a Sebastopol, CA :
|b O'Reilly Media,
|c 2015.
|
300 |
|
|
|a 1 online resource (1 volume) :
|b illustrations
|
336 |
|
|
|a text
|b txt
|2 rdacontent
|
337 |
|
|
|a computer
|b c
|2 rdamedia
|
338 |
|
|
|a online resource
|b cr
|2 rdacarrier
|
588 |
|
|
|a Description based on online resource; title from title page (Safari, viewed June 30, 2015).
|
500 |
|
|
|a Includes index.
|
505 |
0 |
|
|a ""Copyright""; ""Table of Contents""; ""Preface""; ""What Is Web Scraping?""; ""Why Web Scraping?""; ""About This Book""; ""Conventions Used in This Book""; ""Using Code Examples""; ""Safari® Books Online""; ""How to Contact Us""; ""Acknowledgments""; ""Part I. Building Scrapers""; ""Chapter 1. Your First Web Scraper""; ""Connecting""; ""An Introduction to BeautifulSoup""; ""Installing BeautifulSoup""; ""Running BeautifulSoup""; ""Connecting Reliably""; ""Chapter 2. Advanced HTML Parsing""; ""You Don't Always Need a Hammer""; ""Another Serving of BeautifulSoup""
|
505 |
8 |
|
|a ""Find() and findAll() with BeautifulSoup""""Other BeautifulSoup Objects""; ""Navigating Trees""; ""Regular Expressions""; ""Regular Expressions and BeautifulSoup""; ""Accessing Attributes""; ""Lambda Expressions""; ""Beyond BeautifulSoup""; ""Chapter 3. Starting to Crawl""; ""Traversing a Single Domain""; ""Crawling an Entire Site""; ""Collecting Data Across an Entire Site""; ""Crawling Across the Internet""; ""Crawling with Scrapy""; ""Chapter 4. Using APIs""; ""How APIs Work""; ""Common Conventions""; ""Methods""; ""Authentication""; ""Responses""; ""API Calls""; ""Echo Nest""
|
505 |
8 |
|
|a ""A Few Examples""""Twitter""; ""Getting Started""; ""A Few Examples""; ""Google APIs""; ""Getting Started""; ""A Few Examples""; ""Parsing JSON""; ""Bringing It All Back Home""; ""More About APIs""; ""Chapter 5. Storing Data""; ""Media Files""; ""Storing Data to CSV""; ""MySQL""; ""Installing MySQL""; ""Some Basic Commands""; ""Integrating with Python""; ""Database Techniques and Good Practice""; """Six Degrees" in MySQL""; ""Email""; ""Chapter 6. Reading Documents""; ""Document Encoding""; ""Text""; ""Text Encoding and the Global Internet""; ""CSV""; ""Reading CSV Files""; ""PDF""
|
505 |
8 |
|
|a ""Microsoft Word and .docx""""Part II. Advanced Scraping""; ""Chapter 7. Cleaning Your Dirty Data""; ""Cleaning in Code""; ""Data Normalization""; ""Cleaning After the Fact""; ""OpenRefine""; ""Chapter 8. Reading and Writing Natural Languages""; ""Summarizing Data""; ""Markov Models""; ""Six Degrees of Wikipedia: Conclusion""; ""Natural Language Toolkit""; ""Installation and Setup""; ""Statistical Analysis with NLTK""; ""Lexicographical Analysis with NLTK""; ""Additional Resources""; ""Chapter 9. Crawling Through Forms and Logins""; ""Python Requests Library""; ""Submitting a Basic Form""
|
505 |
8 |
|
|a ""Radio Buttons, Checkboxes, and Other Inputs""""Submitting Files and Images""; ""Handling Logins and Cookies""; ""HTTP Basic Access Authentication""; ""Other Form Problems""; ""Chapter 10. Scraping JavaScript""; ""A Brief Introduction to JavaScript""; ""Common JavaScript Libraries""; ""Ajax and Dynamic HTML""; ""Executing JavaScript in Python with Selenium""; ""Handling Redirects""; ""Chapter 11. Image Processing and Text Recognition""; ""Overview of Libraries""; ""Pillow""; ""Tesseract""; ""NumPy""; ""Processing Well-Formatted Text""; ""Scraping Text from Images on Websites""
|
520 |
|
|
|a Learn web scraping and crawling techniques to access unlimited data from any web source in any format. With this practical guide, you'll learn how to use Python scripts and web APIs to gather and process data from thousands-or even millions-of web pages at once. Ideal for programmers, security professionals, and web administrators familiar with Python, this book not only teaches basic web scraping mechanics, but also delves into more advanced topics, such as analyzing raw data or using scrapers for frontend website testing. Code samples are available to help you understand the concepts in practice.
|
590 |
|
|
|a O'Reilly
|b O'Reilly Online Learning: Academic/Public Library Edition
|
650 |
|
0 |
|a Python (Computer program language)
|
650 |
|
0 |
|a Data mining.
|
650 |
|
0 |
|a Automatic data collection systems.
|
650 |
|
2 |
|a Data Mining
|
650 |
|
6 |
|a Python (Langage de programmation)
|
650 |
|
6 |
|a Exploration de données (Informatique)
|
650 |
|
6 |
|a Collecte automatique des données.
|
650 |
|
7 |
|a Automatic data collection systems.
|2 fast
|0 (OCoLC)fst00822733
|
650 |
|
7 |
|a Data mining.
|2 fast
|0 (OCoLC)fst00887946
|
650 |
|
7 |
|a Python (Computer program language)
|2 fast
|0 (OCoLC)fst01084736
|
856 |
4 |
0 |
|u https://learning.oreilly.com/library/view/~/9781491910283/?ar
|z Texto completo (Requiere registro previo con correo institucional)
|
994 |
|
|
|a 92
|b IZTAP
|