Hands-on web scraping with Python : perform advanced scraping operations using various Python libraries and tools such as Selenium, Regex, and others

Web scraping is an essential technique used in many organizations to scrape valuable data from web pages. This book will help you master web scraping techniques and methodologies using Python libraries and other popular tools such as Selenium. By the end of this book, you will have learned how to ef...

Bibliographic Details
Classification: Electronic Book
Main author: Chapagain, Anish (Author)
Format: Electronic eBook
Language: English
Published: Birmingham, UK : Packt, [2019]
Subjects:
Online access: Full text

MARC

LEADER 00000cam a2200000 i 4500
001 EBSCO_on1109795904
003 OCoLC
005 20231017213018.0
006 m o d
007 cr |n|---|||||
008 190727s2019 enk ob 000 0 eng d
040 |a EBLCP  |b eng  |e pn  |c EBLCP  |d UKMGB  |d OCLCO  |d OCLCQ  |d EBLCP  |d OCLCF  |d TEFOD  |d YDX  |d TEFOD  |d UKAHL  |d OCLCQ  |d N$T  |d OCLCQ  |d OCLCO  |d NZAUC  |d OCLCQ  |d OCLCO 
015 |a GBB9C8173  |2 bnb 
016 7 |a 019478456  |2 Uk 
019 |a 1109765268 
020 |a 9781789536195  |q (electronic bk.) 
020 |a 1789536197  |q (electronic bk.) 
020 |z 9781789533392  |q (pbk.) 
029 1 |a AU@  |b 000066231004 
029 1 |a UKMGB  |b 019478456 
029 1 |a AU@  |b 000065674819 
029 1 |a AU@  |b 000070435876 
035 |a (OCoLC)1109795904  |z (OCoLC)1109765268 
037 |a 9781789536195  |b Packt Publishing 
037 |a 37BF8E93-F0EA-4517-BB09-D637AAABB5AF  |b OverDrive, Inc.  |n http://www.overdrive.com 
050 4 |a QA76.9.D3385 
082 0 4 |a 006.3/12  |2 23 
049 |a UAMI 
100 1 |a Chapagain, Anish,  |e author. 
245 1 0 |a Hands-on web scraping with Python :  |b perform advanced scraping operations using various Python libraries and tools such as Selenium, Regex, and others /  |c Anish Chapagain. 
264 1 |a Birmingham, UK :  |b Packt,  |c [2019] 
264 4 |c ©2019 
300 |a 1 online resource (337 pages) 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
504 |a Includes bibliographical references. 
588 0 |a Print version record. 
505 0 |a Cover; Title Page; Copyright and Credits; Dedication; About Packt; Contributors; Table of Contents; Preface; Section 1: Introduction to Web Scraping; Chapter 1: Web Scraping Fundamentals; Introduction to web scraping; Understanding web development and technologies; HTTP; HTML; HTML elements and attributes; Global attributes; XML; JavaScript; JSON; CSS; AngularJS; Data finding techniques for the web; HTML page source; Case 1; Case 2; Developer tools; Sitemaps; The robots.txt file; Summary; Further reading; Section 2: Beginning Web Scraping 
505 8 |a Chapter 2: Python and the Web -- Using urllib and Requests; Technical requirements; Accessing the web with Python; Setting things up; Loading URLs; URL handling and operations with urllib and requests; urllib; requests; Implementing HTTP methods; GET; POST; Summary; Further reading; Chapter 3: Using LXML, XPath, and CSS Selectors; Technical requirements; Introduction to XPath and CSS selector; XPath; CSS selectors; Element selectors; ID and class selectors; Attribute selectors; Pseudo selectors; Using web browser developer tools for accessing web content; HTML elements and DOM navigation 
505 8 |a XPath and CSS selectors using DevTools; Scraping using lxml, a Python library; lxml by examples; Example 1 -- reading XML from file and traversing through its elements; Example 2 -- reading HTML documents using lxml.html; Example 3 -- reading and parsing HTML for retrieving HTML form type element attributes; Web scraping using lxml; Example 1 -- extracting selected data from a single page using lxml.html.xpath; Example 2 -- looping with XPath and scraping data from multiple pages; Example 3 -- using lxml.cssselect to scrape content from a page; Summary; Further reading 
505 8 |a Chapter 4: Scraping Using pyquery -- a Python Library; Technical requirements; Introduction to pyquery; Exploring pyquery; Loading documents; Element traversing, attributes, and pseudo-classes; Iterating; Web scraping using pyquery; Example 1 -- scraping data science announcements; Example 2 -- scraping information from nested links; Example 3 -- extracting AHL Playoff results; Example 4 -- collecting URLs from sitemap.xml; Case 1 -- using the HTML parser; Case 2 -- using the XML parser; Summary; Further reading; Chapter 5: Web Scraping Using Scrapy and Beautiful Soup; Technical requirements 
505 8 |a Web scraping using Beautiful Soup; Introduction to Beautiful Soup; Exploring Beautiful Soup; Searching, traversing, and iterating; Using children and parents; Using next and previous; Using CSS Selectors; Example 1 -- listing elements with the data-id attribute; Example 2 -- traversing through elements; Example 3 -- searching elements based on attribute values; Building a web crawler; Web scraping using Scrapy; Introduction to Scrapy; Setting up a project; Generating a Spider; Creating an item; Extracting data; Using XPath; Using CSS Selectors; Data from multiple pages; Running and exporting 
500 |a Deploying a web crawler 
520 |a Web scraping is an essential technique used in many organizations to scrape valuable data from web pages. This book will help you master web scraping techniques and methodologies using Python libraries and other popular tools such as Selenium. By the end of this book, you will have learned how to efficiently scrape different websites. 
590 |a eBooks on EBSCOhost  |b EBSCO eBook Subscription Academic Collection - Worldwide 
650 0 |a Data mining. 
650 0 |a Python (Computer program language) 
650 2 |a Data Mining 
650 6 |a Exploration de données (Informatique) 
650 6 |a Python (Langage de programmation) 
650 7 |a Data mining  |2 fast 
650 7 |a Python (Computer program language)  |2 fast 
776 0 8 |i Print version:  |a Chapagain, Anish.  |t Hands-On Web Scraping with Python : Perform Advanced Scraping Operations Using Various Python Libraries and Tools Such As Selenium, Regex, and Others.  |d Birmingham : Packt Publishing, Limited, ©2019  |z 9781789533392 
856 4 0 |u https://ebsco.uam.elogim.com/login.aspx?direct=true&scope=site&db=nlebk&AN=2201137  |z Texto completo 
938 |a Askews and Holts Library Services  |b ASKH  |n BDZ0040267433 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL5830822 
938 |a EBSCOhost  |b EBSC  |n 2201137 
938 |a YBP Library Services  |b YANK  |n 300717306 
994 |a 92  |b IZTAP