Automated data collection with R : a practical guide to Web scraping and text mining /
Automated data collection with R : a practical guide to Web scraping and text mining /
"This book provides a unified framework of web scraping and information extraction from text data with R for the social sciences"--
Clasificación: | Libro Electrónico |
---|---|
Autores principales: | , , , |
Formato: | Electrónico eBook |
Idioma: | Inglés |
Publicado: |
Chichester, England :
Wiley,
2015.
|
Temas: | |
Acceso en línea: | Texto completo |
MARC
LEADER | 00000cam a2200000 i 4500 | ||
---|---|---|---|
001 | EBOOKCENTRAL_ocn896846732 | ||
003 | OCoLC | ||
005 | 20240329122006.0 | ||
006 | m o d | ||
007 | cr cn||||||||| | ||
008 | 141106t20152015enka ob 001 0 eng d | ||
040 | |a E7B |b eng |e rda |e pn |c E7B |d OCLCO |d GPM |d YDXCP |d IAI |d OCLCF |d UMI |d COO |d DEBBG |d EBLCP |d DEBSZ |d OCLCO |d B24X7 |d OCLCQ |d OCLCO |d D6H |d OCLCQ |d COCUF |d UAB |d MOR |d CCO |d PIFAG |d ZCU |d LIV |d MERUC |d YDX |d OCLCQ |d U3W |d STF |d ICG |d INT |d VT2 |d OCLCQ |d WYU |d TKN |d OCLCQ |d DKC |d AU@ |d OCLCQ |d UWK |d OCLCQ |d TUHNV |d OCLCO |d OCLCQ |d OCLCO |d OCLCL | ||
066 | |c (S | ||
019 | |a 894170700 |a 898424053 |a 903395741 |a 1055317279 |a 1066448487 |a 1081254698 |a 1148111336 | ||
020 | |a 9781118834787 |q (e-book) | ||
020 | |a 111883478X |q (e-book) | ||
020 | |a 9781118834800 | ||
020 | |a 1118834801 | ||
020 | |a 111883481X | ||
020 | |a 9781118834817 | ||
020 | |z 9781118834817 | ||
029 | 1 | |a CHNEW |b 000889182 | |
029 | 1 | |a DEBBG |b BV042487499 | |
029 | 1 | |a DEBBG |b BV043613251 | |
029 | 1 | |a DEBSZ |b 420093303 | |
029 | 1 | |a DEBSZ |b 434828971 | |
029 | 1 | |a DKDLA |b 800010-katalog:99122788306405763 | |
035 | |a (OCoLC)896846732 |z (OCoLC)894170700 |z (OCoLC)898424053 |z (OCoLC)903395741 |z (OCoLC)1055317279 |z (OCoLC)1066448487 |z (OCoLC)1081254698 |z (OCoLC)1148111336 | ||
037 | |a CL0500000551 |b Safari Books Online | ||
050 | 4 | |a QA76.9.D343 |b .A986 2015eb | |
072 | 7 | |a COM |x 000000 |2 bisacsh | |
082 | 0 | 4 | |a 006.3/12 |2 23 |
084 | |a COM021030 |2 bisacsh | ||
049 | |a UAMI | ||
100 | 1 | |a Munzert, Simon, |e author. | |
245 | 1 | 0 | |a Automated data collection with R : |b a practical guide to Web scraping and text mining / |c Simon Munzert, Christian Ruoba, Peter Meiboner, Dominic Nyhuis. |
264 | 1 | |a Chichester, England : |b Wiley, |c 2015. | |
264 | 4 | |c ©2015 | |
300 | |a 1 online resource (xxii, 452 pages) : |b illustrations | ||
336 | |a text |b txt |2 rdacontent | ||
337 | |a computer |b c |2 rdamedia | ||
338 | |a online resource |b cr |2 rdacarrier | ||
504 | |a Includes bibliographical references and index. | ||
588 | 0 | |a Print version record. | |
520 | |a "This book provides a unified framework of web scraping and information extraction from text data with R for the social sciences"-- |c Provided by publisher | ||
505 | 0 | |a Automated Data Collection with R; Contents; Preface; What you won't learn from reading this book; Why R?; Recommended reading to get started with R; Typographic conventions; The book's website; Disclaimer; Acknowledgments; 1 Introduction; 1.1 Case study: World Heritage Sites in Danger; 1.2 Some remarks on web data quality; 1.3 Technologies for disseminating, extracting, and storing web data; 1.3.1 Technologies for disseminating content on the Web; 1.3.2 Technologies for information extraction from web documents; 1.3.3 Technologies for data storage; 1.4 Structure of the book. | |
505 | 8 | |a Part One A Primer on Web and Data Technologies2 HTML; 2.1 Browser presentation and source code; 2.2 Syntax rules; 2.2.1 Tags, elements, and attributes; 2.2.2 Tree structure; 2.2.3 Comments; 2.2.4 Reserved and special characters; 2.2.5 Document type definition; 2.2.6 Spaces and line breaks; 2.3 Tags and attributes; 2.3.1 The anchor tag ; 2.3.2 The metadata tag ; 2.3.3 The external reference tag ; 2.3.4 Emphasizing tags, ; 2.3.5 The paragraphs tag ; 2.3.6 Heading tags, ; 2.3.7 Listing content with, and. | |
505 | 8 | |a 2.3.8 The organizational tags and 2.3.9 The tag and its companions; 2.3.10 The foreign script tag ; 2.3.11 Table tags, and ; 2.4 Parsing; 2.4.1 What is parsing?; 2.4.2 Discarding nodes; 2.4.3 Extracting information in the building process; Summary; Further reading; Problems; 3 XML and JSON; 3.1 A short example XML document; 3.2 XML syntax rules; 3.2.1 Elements and attributes; 3.2.2 XML structure; 3.2.3 Naming and special characters; 3.2.4 Comments and character data; 3.2.5 XML syntax summary; 3.3 When is an XML document well formed or valid? | |
505 | 8 | |a 3.4 XML extensions and technologies3.4.1 Namespaces; 3.4.2 Extensions of XML; 3.4.3 Example: Really Simple Syndication; 3.4.4 Example: scalable vector graphics; 3.5 XML and R in practice; 3.5.1 Parsing XML; 3.5.2 Basic operations on XML documents; 3.5.3 From XML to data frames or lists; 3.5.4 Event-driven parsing; 3.6 A short example JSON document; 3.7 JSON syntax rules; 3.8 JSON and R in practice; Summary; Further reading; Problems; 4 XPath; 4.1 XPath-a query language for web documents; 4.2 Identifying node sets with XPath; 4.2.1 Basic structure of an XPath query; 4.2.2 Node relations. | |
505 | 8 | |a 4.2.3 XPath predicates4.3 Extracting node elements; 4.3.1 Extending the fun argument; 4.3.2 XML namespaces; 4.3.3 Little XPath helper tools; Summary; Further reading; Problems; 5 HTTP; 5.1 HTTP fundamentals; 5.1.1 A short conversation with a web server; 5.1.2 URL syntax; 5.1.3 HTTP messages; 5.1.4 Request methods; 5.1.5 Status codes; 5.1.6 Header fields; 5.2 Advanced features of HTTP; 5.2.1 Identification; 5.2.2 Authentication; 5.2.3 Proxies; 5.3 Protocols beyond HTTP; 5.3.1 HTTP Secure; 5.3.2 FTP; 5.4 HTTP in action; 5.4.1 The libcurl library; 5.4.2 Basic request methods. | |
590 | |a ProQuest Ebook Central |b Ebook Central Academic Complete | ||
650 | 0 | |a Data mining. | |
650 | 0 | |a Automatic data collection systems. | |
650 | 0 | |a Social sciences |x Research |x Data processing. | |
650 | 0 | |a R (Computer program language) | |
650 | 4 | |a Automatic data collection systems. | |
650 | 4 | |a Data mining. | |
650 | 4 | |a R (Computer program language) | |
650 | 4 | |a Social sciences |x Research |x Data processing. | |
650 | 6 | |a Exploration de données (Informatique) | |
650 | 6 | |a Collecte automatique des données. | |
650 | 6 | |a Sciences sociales |x Recherche |x Informatique. | |
650 | 6 | |a R (Langage de programmation) | |
650 | 7 | |a COMPUTERS |x Database Management |x Data Mining. |2 bisacsh | |
650 | 7 | |a COMPUTERS |x General. |2 bisacsh | |
650 | 7 | |a Automatic data collection systems |2 fast | |
650 | 7 | |a Data mining |2 fast | |
650 | 7 | |a R (Computer program language) |2 fast | |
650 | 7 | |a Social sciences |x Research |x Data processing |2 fast | |
700 | 1 | |a Ruoba, Christin, |e author. | |
700 | 1 | |a Meiboner, Peter, |e author. | |
700 | 1 | |a Nyhuis, Dominic, |e author. | |
758 | |i has work: |a Automated data collection with R (Text) |1 https://id.oclc.org/worldcat/entity/E39PCG4vQPbGr68vTprvP3HxcK |4 https://id.oclc.org/worldcat/ontology/hasWork | ||
776 | 0 | 8 | |i Print version: |t Automated data collection with R : a practical guide to web scraping and text mining. |d Chichester, England : Wiley, ©2015 |h xxii, 452 pages |z 9781118834817 |w (DLC) 2014032266 |
856 | 4 | 0 | |u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=1824310 |z Texto completo |
880 | 1 | 0 | |6 245-00/(S |a Automated data collection with R : |b a practical guide to Web scraping and text mining / |c Simon Munzert, Christian Ruoba, Peter Meiβner, Dominic Nyhuis. |
936 | |a BATCHLOAD | ||
938 | |a Books 24x7 |b B247 |n bks00072676 | ||
938 | |a ProQuest Ebook Central |b EBLB |n EBL1824310 | ||
938 | |a ebrary |b EBRY |n ebr10960598 | ||
938 | |a YBP Library Services |b YANK |n 12136922 | ||
938 | |a YBP Library Services |b YANK |n 11129876 | ||
938 | |a YBP Library Services |b YANK |n 12677446 | ||
994 | |a 92 |b IZTAP |