Cargando…

Apache Solr 4 cookbook : over 100 recipes to make Apache Solr faster, more reliable, and return better results /

Apache Solr is a blazing fast, scalable, open source enterprise search server built upon Apache Lucene. Solr is wildly popular because it supports complex search criteria, faceting, result highlighting, query-completion, query spell-checking, and relevancy tuning, amongst other numerous features. To...

Descripción completa

Detalles Bibliográficos
Clasificación:	Libro Electrónico
Autor principal:	Kuć, Rafał
Formato:	Electrónico eBook
Idioma:	Inglés
Publicado:	Birmingham : Packt Pub., 2013.
Edición:	2nd ed.
Colección:	Community experience distilled.
Temas:	Search engines > Programming. Open source software. Web search engines. Moteurs de recherche > Programmation. Logiciels libres. Moteurs de recherche sur Internet. COMPUTERS > Web > Search Engines. Open source software Search engines > Programming Web search engines
Acceso en línea:	Texto completo

Tabla de Contenidos:

Cover; Copyright; Credits; About the Author; Acknowledgement; About the Reviewers; www.PacktPub.com; Table of Contents; Preface; Chapter 1: Apache Solr Configuration; Introduction; Running Solr on Jetty; Running Solr on Apache Tomcat; Installing a standalone ZooKeeper; Clustering your data; Choosing the right directory implementation; Configuring spellchecker to not use its own index; Solr cache configuration; How to fetch and index web pages; How to set up the extracting request handler; Changing the default similarity implementation; Chapter 2: Indexing Your Data; Introduction.
Indexing PDF filesGenerating unique fields automatically; Extracting metadata from binary files; How to properly configure Data Import Handler with JDBC; Indexing data from a database using Data Import Handler; How to import data using Data Import Handler and delta query; How to use Data Import Handler with the URL data source; How to modify data while importing with Data Import Handler; Updating a single field of your document; Handling multiple currencies; Detecting the document's language; Optimizing your primary key field indexing; Chapter 3: Analyzing Your Text Data; Introduction.
Storing additional information using payloadsEliminating XML and HTML tags from text; Copying the contents of one field to another; Changing words to other words; Splitting text by CamelCase; Splitting text by whitespace only; Making plural words singular without stemming; Lowercasing the whole string; Storing geographical points in the index; Stemming your data; Preparing text to perform an efficient trailing wildcard search; Splitting text by numbers and non-whitespace characters; Using Hunspell as a stemmer; Using your own stemming dictionary; Protecting words from being stemmed.
Chapter 4: Querying SolrIntroduction; Asking for a particular field value; Sorting results by a field value; How to search for a phrase, not a single word; Boosting phrases over words; Positioning some documents over others on a query; Positioning documents with words closer to each other first; Sorting results by a distance from a point; Getting documents with only a partial match; Affecting scoring with functions; Nesting queries; Modifying returned documents; Using parent-child relationships; Ignoring typos in terms of performance; Detecting and omitting duplicate documents.
Using field aliasesReturning a value of a function in the results; Chapter 5: Using the Faceting Mechanism; Introduction; Getting the number of documents with the same field value; Getting the number of documents with the same value range; Getting the number of documents matching the query and subquery; Removing filters from faceting results; Sorting faceting results in alphabetical order; Implementing the autosuggest feature using faceting; Getting the number of documents that don't have a value in the field; Having two different facet limits for two different fields in the same query.

Apache Solr 4 cookbook : over 100 recipes to make Apache Solr faster, more reliable, and return better results /

Ejemplares similares