Automated data collection with R : a practical guide to Web scraping and text mining /

"This book provides a unified framework of web scraping and information extraction from text data with R for the social sciences"--

Bibliographic Details
Classification: Electronic Book
Main Authors: Munzert, Simon (Author), Rubba, Christian (Author), Meißner, Peter (Author), Nyhuis, Dominic (Author)
Format: Electronic eBook
Language: English
Published: Chichester, England : Wiley, 2015.
Subjects:
Online Access: Full text

MARC

LEADER 00000cam a2200000 i 4500
001 EBOOKCENTRAL_ocn896846732
003 OCoLC
005 20240329122006.0
006 m o d
007 cr cn|||||||||
008 141106t20152015enka ob 001 0 eng d
040 |a E7B  |b eng  |e rda  |e pn  |c E7B  |d OCLCO  |d GPM  |d YDXCP  |d IAI  |d OCLCF  |d UMI  |d COO  |d DEBBG  |d EBLCP  |d DEBSZ  |d OCLCO  |d B24X7  |d OCLCQ  |d OCLCO  |d D6H  |d OCLCQ  |d COCUF  |d UAB  |d MOR  |d CCO  |d PIFAG  |d ZCU  |d LIV  |d MERUC  |d YDX  |d OCLCQ  |d U3W  |d STF  |d ICG  |d INT  |d VT2  |d OCLCQ  |d WYU  |d TKN  |d OCLCQ  |d DKC  |d AU@  |d OCLCQ  |d UWK  |d OCLCQ  |d TUHNV  |d OCLCO  |d OCLCQ  |d OCLCO  |d OCLCL 
066 |c (S 
019 |a 894170700  |a 898424053  |a 903395741  |a 1055317279  |a 1066448487  |a 1081254698  |a 1148111336 
020 |a 9781118834787  |q (e-book) 
020 |a 111883478X  |q (e-book) 
020 |a 9781118834800 
020 |a 1118834801 
020 |a 111883481X 
020 |a 9781118834817 
020 |z 9781118834817 
029 1 |a CHNEW  |b 000889182 
029 1 |a DEBBG  |b BV042487499 
029 1 |a DEBBG  |b BV043613251 
029 1 |a DEBSZ  |b 420093303 
029 1 |a DEBSZ  |b 434828971 
029 1 |a DKDLA  |b 800010-katalog:99122788306405763 
035 |a (OCoLC)896846732  |z (OCoLC)894170700  |z (OCoLC)898424053  |z (OCoLC)903395741  |z (OCoLC)1055317279  |z (OCoLC)1066448487  |z (OCoLC)1081254698  |z (OCoLC)1148111336 
037 |a CL0500000551  |b Safari Books Online 
050 4 |a QA76.9.D343  |b .A986 2015eb 
072 7 |a COM  |x 000000  |2 bisacsh 
082 0 4 |a 006.3/12  |2 23 
084 |a COM021030  |2 bisacsh 
049 |a UAMI 
100 1 |a Munzert, Simon,  |e author. 
245 1 0 |a Automated data collection with R :  |b a practical guide to Web scraping and text mining /  |c Simon Munzert, Christian Rubba, Peter Meißner, Dominic Nyhuis. 
264 1 |a Chichester, England :  |b Wiley,  |c 2015. 
264 4 |c ©2015 
300 |a 1 online resource (xxii, 452 pages) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
504 |a Includes bibliographical references and index. 
588 0 |a Print version record. 
520 |a "This book provides a unified framework of web scraping and information extraction from text data with R for the social sciences"--  |c Provided by publisher 
505 0 |a Automated Data Collection with R; Contents; Preface; What you won't learn from reading this book; Why R?; Recommended reading to get started with R; Typographic conventions; The book's website; Disclaimer; Acknowledgments; 1 Introduction; 1.1 Case study: World Heritage Sites in Danger; 1.2 Some remarks on web data quality; 1.3 Technologies for disseminating, extracting, and storing web data; 1.3.1 Technologies for disseminating content on the Web; 1.3.2 Technologies for information extraction from web documents; 1.3.3 Technologies for data storage; 1.4 Structure of the book. 
505 8 |a Part One A Primer on Web and Data Technologies; 2 HTML; 2.1 Browser presentation and source code; 2.2 Syntax rules; 2.2.1 Tags, elements, and attributes; 2.2.2 Tree structure; 2.2.3 Comments; 2.2.4 Reserved and special characters; 2.2.5 Document type definition; 2.2.6 Spaces and line breaks; 2.3 Tags and attributes; 2.3.1 The anchor tag ; 2.3.2 The metadata tag ; 2.3.3 The external reference tag ; 2.3.4 Emphasizing tags, ; 2.3.5 The paragraphs tag ; 2.3.6 Heading tags, ; 2.3.7 Listing content with, and. 
505 8 |a 2.3.8 The organizational tags and ; 2.3.9 The tag and its companions; 2.3.10 The foreign script tag ; 2.3.11 Table tags, and ; 2.4 Parsing; 2.4.1 What is parsing?; 2.4.2 Discarding nodes; 2.4.3 Extracting information in the building process; Summary; Further reading; Problems; 3 XML and JSON; 3.1 A short example XML document; 3.2 XML syntax rules; 3.2.1 Elements and attributes; 3.2.2 XML structure; 3.2.3 Naming and special characters; 3.2.4 Comments and character data; 3.2.5 XML syntax summary; 3.3 When is an XML document well formed or valid? 
505 8 |a 3.4 XML extensions and technologies; 3.4.1 Namespaces; 3.4.2 Extensions of XML; 3.4.3 Example: Really Simple Syndication; 3.4.4 Example: scalable vector graphics; 3.5 XML and R in practice; 3.5.1 Parsing XML; 3.5.2 Basic operations on XML documents; 3.5.3 From XML to data frames or lists; 3.5.4 Event-driven parsing; 3.6 A short example JSON document; 3.7 JSON syntax rules; 3.8 JSON and R in practice; Summary; Further reading; Problems; 4 XPath; 4.1 XPath-a query language for web documents; 4.2 Identifying node sets with XPath; 4.2.1 Basic structure of an XPath query; 4.2.2 Node relations. 
505 8 |a 4.2.3 XPath predicates; 4.3 Extracting node elements; 4.3.1 Extending the fun argument; 4.3.2 XML namespaces; 4.3.3 Little XPath helper tools; Summary; Further reading; Problems; 5 HTTP; 5.1 HTTP fundamentals; 5.1.1 A short conversation with a web server; 5.1.2 URL syntax; 5.1.3 HTTP messages; 5.1.4 Request methods; 5.1.5 Status codes; 5.1.6 Header fields; 5.2 Advanced features of HTTP; 5.2.1 Identification; 5.2.2 Authentication; 5.2.3 Proxies; 5.3 Protocols beyond HTTP; 5.3.1 HTTP Secure; 5.3.2 FTP; 5.4 HTTP in action; 5.4.1 The libcurl library; 5.4.2 Basic request methods. 
590 |a ProQuest Ebook Central  |b Ebook Central Academic Complete 
650 0 |a Data mining. 
650 0 |a Automatic data collection systems. 
650 0 |a Social sciences  |x Research  |x Data processing. 
650 0 |a R (Computer program language) 
650 4 |a Automatic data collection systems. 
650 4 |a Data mining. 
650 4 |a R (Computer program language) 
650 4 |a Social sciences  |x Research  |x Data processing. 
650 6 |a Exploration de données (Informatique) 
650 6 |a Collecte automatique des données. 
650 6 |a Sciences sociales  |x Recherche  |x Informatique. 
650 6 |a R (Langage de programmation) 
650 7 |a COMPUTERS  |x Database Management  |x Data Mining.  |2 bisacsh 
650 7 |a COMPUTERS  |x General.  |2 bisacsh 
650 7 |a Automatic data collection systems  |2 fast 
650 7 |a Data mining  |2 fast 
650 7 |a R (Computer program language)  |2 fast 
650 7 |a Social sciences  |x Research  |x Data processing  |2 fast 
700 1 |a Rubba, Christian,  |e author. 
700 1 |a Meißner, Peter,  |e author. 
700 1 |a Nyhuis, Dominic,  |e author. 
758 |i has work:  |a Automated data collection with R (Text)  |1 https://id.oclc.org/worldcat/entity/E39PCG4vQPbGr68vTprvP3HxcK  |4 https://id.oclc.org/worldcat/ontology/hasWork 
776 0 8 |i Print version:  |t Automated data collection with R : a practical guide to web scraping and text mining.  |d Chichester, England : Wiley, ©2015  |h xxii, 452 pages  |z 9781118834817  |w (DLC) 2014032266 
856 4 0 |u https://ebookcentral.uam.elogim.com/lib/uam-ebooks/detail.action?docID=1824310  |z Full text 
880 1 0 |6 245-00/(S  |a Automated data collection with R :  |b a practical guide to Web scraping and text mining /  |c Simon Munzert, Christian Rubba, Peter Meiβner, Dominic Nyhuis. 
936 |a BATCHLOAD 
938 |a Books 24x7  |b B247  |n bks00072676 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL1824310 
938 |a ebrary  |b EBRY  |n ebr10960598 
938 |a YBP Library Services  |b YANK  |n 12136922 
938 |a YBP Library Services  |b YANK  |n 11129876 
938 |a YBP Library Services  |b YANK  |n 12677446 
994 |a 92  |b IZTAP