Cargando…

Practical Apache Lucene 8 : uncover the search capabilities of your application /

Gain a thorough knowledge of Lucene's capabilities and use it to develop your own search applications. This book explores the Java-based, high-performance text search engine library used to build search capabilities in your applications. Starting with the basics of Lucene and searching, you wil...

Descripción completa

Detalles Bibliográficos
Clasificación:Libro Electrónico
Autor principal: Sharma, Atri (Autor)
Formato: Electrónico eBook
Idioma:Inglés
Publicado: [Berkeley, CA] : Apress, [2020]
Temas:
Acceso en línea:Texto completo (Requiere registro previo con correo institucional)

MARC

LEADER 00000cam a2200000 i 4500
001 OR_on1202752026
003 OCoLC
005 20231017213018.0
006 m o d
007 cr cnu---unuuu
008 201103s2020 cau o 001 0 eng d
040 |a YDX  |b eng  |e rda  |e pn  |c YDX  |d YDXIT  |d OCLCO  |d VT2  |d UKAHL  |d EBLCP  |d SFB  |d DCT  |d OCLCF  |d GW5XE  |d CQ$  |d N$T  |d K6U  |d OCL  |d OCLCQ  |d OCLCO  |d COM  |d OCLCQ 
019 |a 1224921446  |a 1225891559  |a 1264848820 
020 |a 9781484263457  |q (electronic bk.) 
020 |a 1484263456  |q (electronic bk.) 
020 |z 1484263448 
020 |z 9781484263440 
024 7 |a 10.1007/978-1-4842-6345-7  |2 doi 
029 1 |a AU@  |b 000068143734 
029 1 |a AU@  |b 000068747935 
029 1 |a AU@  |b 000068856609 
035 |a (OCoLC)1202752026  |z (OCoLC)1224921446  |z (OCoLC)1225891559  |z (OCoLC)1264848820 
050 4 |a TK5105.8885.L84  |b S53 2020 
072 7 |a UMX  |2 bicssc 
072 7 |a COM051280  |2 bisacsh 
072 7 |a UMX  |2 thema 
082 0 4 |a 005.133  |2 23 
049 |a UAMI 
100 1 |a Sharma, Atri,  |e author. 
245 1 0 |a Practical Apache Lucene 8 :  |b uncover the search capabilities of your application /  |c Atri Sharma. 
264 1 |a [Berkeley, CA] :  |b Apress,  |c [2020] 
300 |a 1 online resource 
336 |a text  |b txt  |2 rdacontent 
337 |a computer  |b c  |2 rdamedia 
338 |a online resource  |b cr  |2 rdacarrier 
347 |a text file 
347 |b PDF 
500 |a Includes index. 
520 |a Gain a thorough knowledge of Lucene's capabilities and use it to develop your own search applications. This book explores the Java-based, high-performance text search engine library used to build search capabilities in your applications. Starting with the basics of Lucene and searching, you will learn about the types of queries used in it and also take a look at scoring models. Applying this basic knowledge, you will develop a hello world app using basic Lucene queries and explore functions like scoring and document level boosting. Along the way you will also uncover the concepts of partial searching and matching in Lucene and then learn how to integrate geographical information (geospatial data) in Lucene using spatial queries and n-dimensional indexing. This will prepare you to build a location-aware search engine with a representative data set that allows location constraints to be specified during a search. You'll also develop a text classifier using Lucene and Apache Mahout, a popular machine learning framework. After a detailed review of performance bench-marking and common issues associated with it, you'll learn some of the best practices of tuning the performance of your application. By the end of the book you'll be able to build your first Lucene patch, where you will not only write your patch, but also test it and ensure it adheres to community coding standards. You will: Master the basics of Apache Lucene. Utilize different query types in Apache Lucene. Explore scoring and document level boosting. Integrate geospatial data into your application. 
588 0 |a Online resource; title from digital title page (viewed on January 11, 2021). 
505 0 |a Intro -- Table of Contents -- About the Author -- About the Technical Reviewer -- Acknowledgments -- Introduction -- Chapter 1: Hola, Lucene! -- Key Features of Lucene -- Information Retrieval Basics -- Linear Scan -- Stop List -- Stemming -- Term -- Term-Document Incidence Matrix -- Serving Queries Using a Term-Document Incidence Matrix -- Basic Terminology -- Heart of Lucene's Data Representation -- Lucene's Inverted Index Structure -- On-Disk Representation of a Lucene Index -- Terms Dictionary -- Frequencies File -- Positions File -- Queries on Lucene -- Structure of a Lucene Query 
505 8 |a Fields -- Types of Queries in Lucene -- Lucene vs. Relational Databases -- Chapter 2: Hello World: The Lucene Way -- Indexing Data in Lucene -- Document -- Analyzers -- StandardAnalyzer -- StopAnalyzer -- SimpleAnalyzer -- IndexWriter -- Directory -- Create Documents -- Create Index and Write Documents -- Adding Data to the Index -- Bringing It All Together -- TestClass -- Document Search -- QueryParser -- TopDocs -- IndexSearcher -- IndexReader -- Searching -- Boolean Model -- What Is Relevance? -- Scoring Algorithms -- TF/IDF -- Vector Space Model -- Scoring Example -- Lucene's Scoring Model 
505 8 |a Fields -- Similarity -- Boosting -- Collectors -- Chapter 3: Core Search Fundamentals -- Codecs -- DocValues -- Phrase Queries -- Term Vectors -- BooleanQuery -- MultiTermQuery -- QueryCache -- Scorer as Part of the Search Process -- Chapter 4: Spatial Indexing -- Spatial Module -- What Are Geohashes? -- Quad Trees -- K-D Trees -- BKD Trees -- Using Spatial Indexing -- Chapter 5: Location-Aware Search Engines -- Why Use a Search Engine for Geographic Searches? -- Range Queries -- Function Queries -- Geospatial Basics -- Representing Spatial Data -- Tiered Design for Storage -- Geohashes 
505 8 |a Spatial Data with Text Search -- Distance Calculations -- Bounding Box Filter -- A Point on Distance Calculation -- Chapter 6: Introducing Machine Learning with Apache Mahout -- Origin of Apache Mahout -- Why Apache Mahout? -- Introduction to Machine Learning -- Learning -- Collaborative Filtering -- Clustering -- Categorization -- Converting from Lucene Components to Mahout Components -- Integrating Lucene with Mahout -- lucene.vector -- Lucene2seq -- Java Version of Lucene2seq -- Putting It All Together -- Chapter 7: Improving Lucene's Performance -- Increase Indexing Speed 
505 8 |a Reuse Field Instances -- The Curious Case of Large Commits -- Reuse Tokens in Analyzers -- Tuning Flush Intervals -- Increase mergeFactor -- Choosing the Correct Analyzers -- Use Multiple Threads with One IndexWriter -- Index into Separate Indexes and Then Merge -- Improve Search Performance -- Use the Latest Version of Lucene -- Use IndexReader with the readOnly Attribute Equal to True -- Use MMapDirectory/NIOFSDirectory -- Decrease mergeFactor -- Ignore First Query's Performance -- Avoid Reopening IndexSearcher Instances -- Share IndexSearcher Instances 
588 0 |a Print version record. 
590 |a O'Reilly  |b O'Reilly Online Learning: Academic/Public Library Edition 
630 0 0 |a Lucene (Electronic resource) 
630 0 7 |a Lucene (Electronic resource)  |2 fast  |0 (OCoLC)fst01403193 
650 0 |a Web search engines. 
650 0 |a Internet searching. 
650 0 |a Java (Computer program language) 
650 6 |a Moteurs de recherche sur Internet. 
650 6 |a Recherche sur Internet. 
650 6 |a Java (Langage de programmation) 
650 7 |a Web search engines.  |2 fast  |0 (OCoLC)fst01173234 
650 7 |a Internet searching.  |2 fast  |0 (OCoLC)fst00977289 
650 7 |a Application software.  |2 fast  |0 (OCoLC)fst00811706 
650 7 |a Computer programming.  |2 fast  |0 (OCoLC)fst00872390 
650 7 |a Java (Computer program language)  |2 fast  |0 (OCoLC)fst00982065 
650 7 |a Open source software.  |2 fast  |0 (OCoLC)fst01046097 
776 0 8 |i Print version:  |a Sharma, Atri.  |t Practical Apache Lucene 8.  |d [Berkeley, CA] : Apress, [2020]  |z 9781484263457  |w (OCoLC)1182514259 
856 4 0 |u https://learning.oreilly.com/library/view/~/9781484263457/?ar  |z Texto completo (Requiere registro previo con correo institucional) 
938 |a Askews and Holts Library Services  |b ASKH  |n AH37890087 
938 |a ProQuest Ebook Central  |b EBLB  |n EBL6419844 
938 |a EBSCOhost  |b EBSC  |n 2663168 
938 |a YBP Library Services  |b YANK  |n 17092711 
994 |a 92  |b IZTAP