NATURAL LANGUAGE PROCESSING WITH SPARK NLP : learning to understand text at scale.

If you want to build an enterprise-quality application that uses natural language text but aren't sure where to begin or what tools to use, this practical guide will help get you started. Alex Thomas, principal data scientist at Wisecube, shows software engineers and data scientists how to buil...

Bibliographic Details
Classification: Electronic Book
Main author: THOMAS, ALEX
Format: Electronic eBook
Language: English
Published: [Place of publication not identified] O'REILLY MEDIA, 2020.
Subjects:
Online access: Full text (requires prior registration with an institutional email address)

MARC

LEADER 00000cam a2200000M 4500
001 OR_on1160226562
003 OCoLC
005 20231017213018.0
006 m o d
007 cr |n|||||||||
008 200703s2020 xx ob 001 0 eng d
040 |a YDX  |b eng  |e pn  |c YDX  |d EBLCP  |d UKAHL  |d UMI  |d OCLCF  |d YDX  |d N$T  |d OCLCQ  |d OCLCO  |d OCLCQ 
019 |a 1163183470  |a 1163383838  |a 1164347342  |a 1177135301 
020 |a 9781492047735  |q (electronic bk.) 
020 |a 1492047732  |q (electronic bk.) 
020 |a 9781492047711 
020 |a 1492047716 
020 |z 1492047767 
020 |z 9781492047766 
029 1 |a AU@  |b 000071521101 
035 |a (OCoLC)1160226562  |z (OCoLC)1163183470  |z (OCoLC)1163383838  |z (OCoLC)1164347342  |z (OCoLC)1177135301 
037 |a CL0501000125  |b Safari Books Online 
050 4 |a QA76.9.N38 
082 0 4 |a 006.35  |2 23 
049 |a UAMI 
100 1 |a THOMAS, ALEX. 
245 1 0 |a NATURAL LANGUAGE PROCESSING WITH SPARK NLP :  |b learning to understand text at scale. 
260 |a [Place of publication not identified]  |b O'REILLY MEDIA,  |c 2020. 
300 |a 1 online resource 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
504 |a Includes bibliographical references and index. 
505 0 |a Intro -- Copyright -- Table of Contents -- Preface -- Why Natural Language Processing Is Important and Difficult -- Background -- Philosophy -- Conventions Used in This Book -- Using Code Examples -- O'Reilly Online Learning -- How to Contact Us -- Acknowledgments -- Part I. Basics -- Chapter 1. Getting Started -- Introduction -- Other Tools -- Setting Up Your Environment -- Prerequisites -- Starting Apache Spark -- Checking Out the Code -- Getting Familiar with Apache Spark -- Starting Apache Spark with Spark NLP -- Loading and Viewing Data in Apache Spark -- Hello World with Spark NLP 
505 8 |a Chapter 2. Natural Language Basics -- What Is Natural Language? -- Origins of Language -- Spoken Language Versus Written Language -- Linguistics -- Phonetics and Phonology -- Morphology -- Syntax -- Semantics -- Sociolinguistics: Dialects, Registers, and Other Varieties -- Formality -- Context -- Pragmatics -- Roman Jakobson -- How To Use Pragmatics -- Writing Systems -- Origins -- Alphabets -- Abjads -- Abugidas -- Syllabaries -- Logographs -- Encodings -- ASCII -- Unicode -- UTF-8 -- Exercises: Tokenizing -- Tokenize English -- Tokenize Greek -- Tokenize Ge'ez (Amharic) -- Resources 
505 8 |a Chapter 3. NLP on Apache Spark -- Parallelism, Concurrency, Distributing Computation -- Parallelization Before Apache Hadoop -- MapReduce and Apache Hadoop -- Apache Spark -- Architecture of Apache Spark -- Physical Architecture -- Logical Architecture -- Spark SQL and Spark MLlib -- Transformers -- Estimators and Models -- Evaluators -- NLP Libraries -- Functionality Libraries -- Annotation Libraries -- NLP in Other Libraries -- Spark NLP -- Annotation Library -- Stages -- Pretrained Pipelines -- Finisher -- Exercises: Build a Topic Model -- Resources -- Chapter 4. Deep Learning Basics 
505 8 |a Gradient Descent -- Backpropagation -- Convolutional Neural Networks -- Filters -- Pooling -- Recurrent Neural Networks -- Backpropagation Through Time -- Elman Nets -- LSTMs -- Exercise 1 -- Exercise 2 -- Resources -- Part II. Building Blocks -- Chapter 5. Processing Words -- Tokenization -- Vocabulary Reduction -- Stemming -- Lemmatization -- Stemming Versus Lemmatization -- Spelling Correction -- Normalization -- Bag-of-Words -- CountVectorizer -- N-Gram -- Visualizing: Word and Document Distributions -- Exercises -- Resources -- Chapter 6. Information Retrieval -- Inverted Indices 
505 8 |a Building an Inverted Index -- Vector Space Model -- Stop-Word Removal -- Inverse Document Frequency -- In Spark -- Exercises -- Resources -- Chapter 7. Classification and Regression -- Bag-of-Words Features -- Regular Expression Features -- Feature Selection -- Modeling -- Naïve Bayes -- Linear Models -- Decision/Regression Trees -- Deep Learning Algorithms -- Iteration -- Exercises -- Chapter 8. Sequence Modeling with Keras -- Sentence Segmentation -- (Hidden) Markov Models -- Section Segmentation -- Part-of-Speech Tagging -- Conditional Random Field -- Chunking and Syntactic Parsing 
520 |a If you want to build an enterprise-quality application that uses natural language text but aren't sure where to begin or what tools to use, this practical guide will help get you started. Alex Thomas, principal data scientist at Wisecube, shows software engineers and data scientists how to build scalable natural language processing (NLP) applications using deep learning and the Apache Spark NLP library. Through concrete examples, practical and theoretical explanations, and hands-on exercises for using NLP on the Spark processing framework, this book teaches you everything from basic linguistics and writing systems to sentiment analysis and search engines. You'll also explore special concerns for developing text-based applications, such as performance. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
630 0 0 |a Spark (Electronic resource : Apache Software Foundation) 
630 0 7 |a Spark (Electronic resource : Apache Software Foundation)  |2 fast  |0 (OCoLC)fst01938143 
650 0 |a Natural language processing (Computer science) 
650 0 |a Application software  |x Development. 
650 0 |a Text data mining. 
650 6 |a Traitement automatique des langues naturelles. 
650 6 |a Logiciels d'application  |x Développement. 
650 7 |a Application software  |x Development.  |2 fast  |0 (OCoLC)fst00811707 
650 7 |a Natural language processing (Computer science)  |2 fast  |0 (OCoLC)fst01034365 
650 7 |a Text data mining.  |2 fast  |0 (OCoLC)fst02008831 
776 0 8 |i Print version:  |z 1492047767  |z 9781492047766  |w (OCoLC)1138608548 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781492047759/?ar  |z Full text (requires prior registration with an institutional email address) 
938 |a Askews and Holts Library Services  |b ASKH  |n AH38049679 
938 |a Askews and Holts Library Services  |b ASKH  |n AH37493609 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL6236165 
938 |a EBSCOhost  |b EBSC  |n 2508326 
938 |a YBP Library Services  |b YANK  |n 301350440 
994 |a 92  |b IZTAP