Getting Started with Beautiful Soup
This book is a practical, hands-on guide that takes you through the techniques of web scraping using Beautiful Soup. Getting Started with Beautiful Soup is great for anybody who is interested in website scraping and extracting information. However, a basic knowledge of Python, HTML tags, and CSS is...
Classification: | Electronic Book |
---|---|
Main author: | |
Format: | Electronic eBook |
Language: | English |
Published: | Birmingham, UK: Packt Publishing, 2014. |
Subjects: | |
Online access: | Full text |
Table of Contents:
- Cover; Copyright; Credits; About the Author; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Installing Beautiful Soup; Installing Beautiful Soup; Installing Beautiful Soup in Linux; Installing Beautiful Soup using package manager; Installing Beautiful Soup using pip or easy_install; Installing Beautiful Soup using pip; Installing Beautiful Soup using easy_install; Installing Beautiful Soup in Windows; Verifying Python path in Windows; Installing Beautiful Soup using setup.py; Using Beautiful Soup without installation; Verifying the installation; Quick reference
- Finding all tertiary consumers; Understanding parameters used with find_all(); Searching for Tags in relation; Searching for the parent tags; Searching for siblings; Searching for next; Searching for previous; Using search methods to scrape information from a web page; Quick reference; Summary; Chapter 4: Navigation Using Beautiful Soup; Navigation using Beautiful Soup; Navigating down; Using the name of the child tag; Using predefined attributes; Special attributes for navigating down; Navigating up; The .parent attribute; The .parents attribute; Navigating sideways to the siblings
- The .next_sibling attribute; The .previous_sibling attribute; Navigating to the previous and next objects parsed; Quick reference; Summary; Chapter 5: Modifying Content Using Beautiful Soup; Modifying Tag using Beautiful Soup; Modifying the name property of Tag; Modifying the attribute values of Tag; Updating the existing attribute value of Tag; Adding new attribute values to Tag; Deleting the tag attributes; Adding a new tag; Modifying string contents; Using .string to modify the string content; Adding strings using .append(), insert(), and new_string(); Deleting tags from the HTML document
- Deleting the producer using decompose(); Deleting the producer using extract(); Deleting the contents of a tag using Beautiful Soup; Special functions to modify content; Quick reference; Summary; Chapter 6: Encoding Support in Beautiful Soup; Encoding in Beautiful Soup; Understanding the original encoding of the HTML document; Specifying the encoding of the HTML document; Output encoding; Quick reference; Summary; Chapter 7: Output in Beautiful Soup; Formatted printing; Unformatted printing; Output formatters in Beautiful Soup; The minimal formatter; The html formatter; The None formatter