5. Query
• Highlighting
– Activated by query (hl=true)
• Text Analysis
– Use dictionary and thesaurus
– Relevancy searches
– Spelling suggestions
– Search by similarity (“More like this”)
– Fuzzy (Damerau-Levenshtein distance)
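The Damerau-Levenshtein distance mentioned above counts insertions, deletions, substitutions, and swaps of adjacent characters. A minimal sketch of the restricted variant (optimal string alignment) follows; it only illustrates the metric and is not how Solr or Lucene implement fuzzy matching. The function name `osa_distance` is made up for this example.

```python
def osa_distance(a: str, b: str) -> int:
    """Optimal string alignment distance (restricted Damerau-Levenshtein)."""
    rows, cols = len(a) + 1, len(b) + 1
    # d[i][j] = distance between the first i chars of a and the first j chars of b
    d = [[0] * cols for _ in range(rows)]
    for i in range(rows):
        d[i][0] = i  # delete all i characters
    for j in range(cols):
        d[0][j] = j  # insert all j characters

    for i in range(1, rows):
        for j in range(1, cols):
            cost = 0 if a[i - 1] == b[j - 1] else 1
            d[i][j] = min(
                d[i - 1][j] + 1,         # deletion
                d[i][j - 1] + 1,         # insertion
                d[i - 1][j - 1] + cost,  # substitution (or match)
            )
            # Swap of two adjacent characters counts as a single edit
            if i > 1 and j > 1 and a[i - 1] == b[j - 2] and a[i - 2] == b[j - 1]:
                d[i][j] = min(d[i][j], d[i - 2][j - 2] + 1)
    return d[-1][-1]


print(osa_distance("solr", "slor"))  # one adjacent swap
```

A plain Levenshtein distance would count the swapped pair in "slor" as two edits; treating it as one is what makes the metric tolerant of common typing mistakes.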
6. Query
• Querying data
– Words
– Words by field
– Sorted results (sort)
• Faceted Search
– Categories
7. Query
• Faceted Search: could the queries become a problem?
– Example
http://localhost:8983/solr/select?
q=video&rows=0&facet=true&facet.field=inStock
&facet.query=price:[*+TO+500]
&facet.query=price:[500+TO+*]
&facet.prefix=xx&facet.limit=5&facet.mincount=1
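Long facet URLs like the one above are easier to maintain when assembled from a parameter list. The sketch below rebuilds the same request with Python's standard library; it only constructs the URL and does not contact a server. The function name `build_facet_url` is made up for this example.

```python
from urllib.parse import urlencode

BASE_URL = "http://localhost:8983/solr/select"


def build_facet_url(base_url: str = BASE_URL) -> str:
    # A list of pairs (not a dict) because facet.query is repeated
    params = [
        ("q", "video"),
        ("rows", "0"),                        # no documents, facet counts only
        ("facet", "true"),                    # turn faceting on
        ("facet.field", "inStock"),           # count documents per field value
        ("facet.query", "price:[* TO 500]"),  # count for an arbitrary query
        ("facet.query", "price:[500 TO *]"),
        ("facet.prefix", "xx"),               # only facet terms with this prefix
        ("facet.limit", "5"),                 # at most 5 facet values
        ("facet.mincount", "1"),              # hide values with zero matches
    ]
    # Keep ':', '[', ']' and '*' readable; spaces are encoded as '+'
    return base_url + "?" + urlencode(params, safe=":[]*")


print(build_facet_url())
```

Each `facet.query` adds one more count for Solr to compute on every request, which is one reason a facet-heavy URL can become expensive.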
8. Data indexing
• Native Solr XML
• CSV
• Database (DIH)
• Rich Documents
• Crawler
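As an illustration of the first two input formats, the sketch below converts CSV rows into a Solr XML `<add>` message. The function name `csv_to_solr_xml` and the sample fields (`id`, `name`, `inStock`) are assumptions for this example; a real schema defines its own fields.

```python
import csv
import io
import xml.etree.ElementTree as ET


def csv_to_solr_xml(csv_text: str) -> str:
    """Turn CSV text (with a header row) into a Solr <add> update message."""
    add = ET.Element("add")
    for row in csv.DictReader(io.StringIO(csv_text)):
        doc = ET.SubElement(add, "doc")
        for name, value in row.items():
            if value:  # skip empty cells instead of sending empty fields
                field = ET.SubElement(doc, "field", name=name)
                field.text = value
    return ET.tostring(add, encoding="unicode")


# Hypothetical sample data
sample = "id,name,inStock\n1,Video card,true\n2,Monitor,\n"
print(csv_to_solr_xml(sample))
```

The resulting message would then be posted to the core's update handler; Solr can also ingest CSV directly, so this conversion is only meant to show how the two formats relate.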
9. Index
• Is the index growing larger than you expected?
• It can be tuned through:
– Index segment size
– Index segment merging
13. SolrCaching
• Keeps searched documents in cache
• Two implementations
– solr.search.LRUCache (LRU = Least Recently Used, in memory)
– solr.search.FastLRUCache (since version 1.4)
• How to use
– filterCache
– queryResultCache
– documentCache (loads everything into memory)
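To make the "Least Recently Used" policy concrete, here is a minimal sketch of an LRU cache. It only illustrates the eviction rule and is not Solr's implementation; the class and method names are made up for this example.

```python
from collections import OrderedDict


class LRUCache:
    """Fixed-size cache that evicts the least recently used entry."""

    def __init__(self, size: int):
        self.size = size
        self._data = OrderedDict()  # oldest entry first, newest last

    def get(self, key):
        if key not in self._data:
            return None  # cache miss
        self._data.move_to_end(key)  # mark as most recently used
        return self._data[key]

    def put(self, key, value):
        self._data[key] = value
        self._data.move_to_end(key)
        if len(self._data) > self.size:
            self._data.popitem(last=False)  # drop the least recently used


cache = LRUCache(size=2)
cache.put("q=video", [1, 2, 3])
cache.put("q=audio", [4])
cache.get("q=video")          # "q=video" is now the most recent entry
cache.put("q=camera", [5])    # evicts "q=audio"
print(cache.get("q=audio"))   # None
```

The cache size is the main trade-off: a larger cache avoids more repeated work but holds more memory.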
14. Cluster – Carrot2
• Search Results Clustering Engine
• Search in many nodes
• Live Demo
– http://search.carrot2.org/stable/search
Filter cache (used in 3 cases)
1. caches the results of the “fq” parameters
2. caches facets
3. Sort -> if it is set to true