Cargando…

Unstructured data analysis : entity resolution and regular expressions in SAS /

Unstructured data is the most voluminous form of data in the world, and several elements are critical for any advanced analytics practitioner leveraging SAS software to effectively address the challenge of deriving value from that data. This book covers the five critical elements of entity extractio...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Windham, K. Matthew (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: Cary, NC : SAS Institute, [2018]
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000 i 4500
001 OR_on1057309674
003 OCoLC
005 20231017213018.0
006 m o d
007 cr unu||||||||
008 181017s2018 ncua ob 000 0 eng d
040 |a UMI  |b eng  |e rda  |e pn  |c UMI  |d OCLCF  |d TOH  |d STF  |d G3B  |d CZL  |d OCLCQ  |d OCLCO  |d OCLCQ  |d OCL  |d EBLCP  |d N$T  |d YDX  |d TEFOD  |d UKAHL  |d VT2  |d NZAUC 
019 |a 1054022328  |a 1054226346  |a 1055260201  |a 1272921913 
020 |a 9781635267099 
020 |a 1635267099 
020 |z 9781629598420 
020 |a 9781635267112  |q (electronic bk.) 
020 |a 1635267110  |q (electronic bk.) 
020 |z 1629598429 
029 1 |a AU@  |b 000069002957 
029 1 |a AU@  |b 000065066670 
035 |a (OCoLC)1057309674  |z (OCoLC)1054022328  |z (OCoLC)1054226346  |z (OCoLC)1055260201  |z (OCoLC)1272921913 
037 |a CL0500000998  |b Safari Books Online 
050 4 |a QA276.45.S27 
072 7 |a COM  |x 051010  |2 bisacsh 
082 0 4 |a 005.55  |2 23 
049 |a UAMI 
100 1 |a Windham, K. Matthew,  |e author. 
245 1 0 |a Unstructured data analysis :  |b entity resolution and regular expressions in SAS /  |c K. Matthew Windham. 
264 1 |a Cary, NC :  |b SAS Institute,  |c [2018] 
264 4 |c ©2018 
300 |a 1 online resource (1 volume) :  |b illustrations 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
588 0 |a Online resource; title from cover (Safari, viewed October 17, 2018). 
504 |a Includes bibliographical references. 
520 |a Unstructured data is the most voluminous form of data in the world, and several elements are critical for any advanced analytics practitioner leveraging SAS software to effectively address the challenge of deriving value from that data. This book covers the five critical elements of entity extraction, unstructured data, entity resolution, entity network mapping and analysis, and entity management. By following examples of how to apply processing to unstructured data, readers will derive tremendous long-term value from this book as they enhance the value they realize from SAS products. 
505 0 |a Intro; Contents; About This Book; Software Used to Develop the Book's Content; Example Code and Data; SAS University Edition; Acknowledgments; Chapter 1: Getting Started with Regular Expressions; 1.1.1 Defining Regular Expressions; 1.1.2 Motivational Examples; 1.1.3 RegEx Essentials; 1.1.4 RegEx Test Code; 1.3.1 Wildcard; 1.3.2 Word; 1.3.3 Non-word; 1.3.4 Tab; 1.3.5 Whitespace; 1.3.6 Non-whitespace; 1.3.7 Digit; 1.3.8 Non-digit; 1.3.9 Newline; 1.3.10 Bell; 1.3.11 Control Character; 1.3.12 Octal; 1.3.13 Hexadecimal; 1.4.1 List; 1.4.2 Not List; 1.4.3 Range; 1.5.1 Case Modifiers 
505 8 |a 1.5.2 Repetition Modifiers1.6.1 Ignore Case; 1.6.2 Single Line; 1.6.3 Multiline; 1.6.4 Compile Once; 1.6.5 Substitution Operator; 1.7.1 Start of Line; 1.7.2 End of Line; 1.7.3 Word Boundary; 1.7.4 Non-word Boundary; 1.7.5 String Start; Chapter 2: Using Regular Expressions in SAS; 2.1.1 Capture Buffer; 2.2.1 PRXPARSE; 2.2.2 PRXMATCH; 2.2.3 PRXCHANGE; 2.2.4 PRXPOSN; 2.2.5 PRXPAREN; 2.3.1 CALL PRXCHANGE; 2.3.2 CALL PRXPOSN; 2.3.3 CALL PRXSUBSTR; 2.3.4 CALL PRXNEXT; 2.3.5 CALL PRXDEBUG; 2.3.6 CALL PRXFREE; 2.4.1 Data Cleansing and Standardization; 2.4.2 Information Extraction 
505 8 |a 2.4.3 Search and ReplacementChapter 3: Entity Resolution Analytics; 3.3.1 Entity Extraction; 3.3.2 Extract, Transform, and Load; 3.3.3 Entity Resolution; 3.3.4 Entity Network Mapping and Analysis; 3.3.5 Entity Management; 3.4.1 Establish Clear Goals; 3.4.2 Verify Proper Data Inventory; 3.4.3 Create SMART Objectives; Chapter 4: Entity Extraction; 4.3.1 Webpage; 4.3.2 File System; 4.4.1 Social Security Number; 4.4.2 Phone Number; 4.4.3 Address; 4.4.4 Website; 4.4.5 Corporation Name; Chapter 5: Extract, Transform, Load; 5.2.1 PROC CONTENTS; 5.2.2 PROC FREQ; 5.2.3 PROC MEANS 
505 8 |a 5.4.1 Hexadecimal to Decimal5.4.2 Working with Dates; 5.6.1 Quantile Binning; 5.6.2 Bucket Binning; Chapter 6: Entity Resolution; 6.1.1 Exact Matching; 6.1.2 Fuzzy Matching; 6.1.3 Error Handling; 6.2.1 INDEX=; 6.3.1 COMPGED and COMPLEV; 6.3.2 SOUNDEX; 6.3.3 Putting Things Together; Chapter 7: Entity Network Mapping and Analysis; 7.2.1 Shared Entity Attributes; 7.2.2 Entity Interactions; 7.3.1 Articulation Points and Biconnected Components; 7.3.2 Minimum Spanning Trees; 7.3.3 Clique Detection; 7.3.4 Minimum Cut; 7.3.5 Shortest Paths; Chapter 8: Entity Management 
505 8 |a Appendix A: Additional ResourcesA. 2.1 Non-Printing Characters; A.2.2 Printing Characters; A.4.1 Random PII Generator; A.4.2 Output 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
630 0 0 |a SAS (Computer file) 
630 0 7 |a SAS (Computer file)  |2 fast  |0 (OCoLC)fst01364029 
650 0 |a SAS (Computer program language) 
650 0 |a Text processing (Computer science) 
650 0 |a Programming languages (Electronic computers)  |x Syntax. 
650 0 |a Word processing operations. 
650 0 |a Word processing. 
650 6 |a SAS (Langage de programmation) 
650 6 |a Traitement de texte. 
650 6 |a Langages de programmation  |x Syntaxe. 
650 7 |a Word processing operations.  |2 fast  |0 (OCoLC)fst01180160 
650 7 |a Word processing.  |2 fast  |0 (OCoLC)fst01180127 
650 7 |a Programming languages (Electronic computers)  |x Syntax.  |2 fast  |0 (OCoLC)fst01078721 
650 7 |a SAS (Computer program language)  |2 fast  |0 (OCoLC)fst01738546 
650 7 |a Text processing (Computer science)  |2 fast  |0 (OCoLC)fst01148527 
650 7 |a COMPUTERS  |x Programming Languages  |x General.  |2 bisacsh 
776 0 8 |i Print version:  |a Windham, Matthew.  |t Unstructured Data Analysis : Entity Resolution and Regular Expressions in SAS.  |d Cary, NC : SAS Institute, ©2018  |z 9781629598420 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781635267099/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a Askews and Holts Library Services  |b ASKH  |n AH35195381 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL5517178 
938 |a EBSCOhost  |b EBSC  |n 1901481 
938 |a YBP Library Services  |b YANK  |n 15722368 
938 |a YBP Library Services  |b YANK  |n 15727038 
994 |a 92  |b IZTAP