Solr Tuning: Improving Relevancy
Drupal for ng_os

An overview of some of the tips and tricks to improve search results.

Published in: Technology
Transcript of "Drupal for ng_os"

    1. Solr Tuning: Improving Relevancy
    2. Agenda
       ✤ Solr Schema Basics
       ✤ Some of Drupal’s Apache Solr Hooks
       ✤ Why isn’t my content at the top?
          ✤ Admin Field Analyser
          ✤ Query Debug
       ✤ Other Apache Solr Modules
    3. Solr Schema Basics - schema.xml
       ✤ Types
          ✤ Raw - int, long, string, date, double, etc.
          ✤ Custom - made up of raw plus
             ✤ analyzers - index and query
             ✤ tokenizers - break up content
             ✤ filters
    4. Schema Cont.
       <fieldType name="text" class="solr.TextField" positionIncrementGap="100">
         <analyzer type="index">
           <charFilter class="solr.MappingCharFilterFactory" mapping="ISOLatin1.txt"/>
           <tokenizer class="solr.WhitespaceTokenizerFactory"/>
           <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="false"/>
           <filter class="solr.StopFilterFactory" words="stopwords.txt"/>
           <filter class="solr.WordDelimiterFilterFactory" protected="protwords.txt" splitOnCaseChange="1"/>
           <filter class="solr.LowerCaseFilterFactory"/>
           <filter class="solr.SnowballPorterFilterFactory" language="English" protected="protwords.txt"/>
           <filter class="solr.RemoveDuplicatesTokenFilterFactory"/>
         </analyzer>
         <analyzer type="query">
           <!-- Repeated For Query -->
         </analyzer>
       </fieldType>
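The analyzer on slide 4 is an ordered chain: one tokenizer followed by filters applied in sequence. The sketch below is a toy Python imitation of three of those stages (whitespace tokenizing, stop-word removal, lowercasing) to show why the order matters. It is not Solr's implementation; the function names and the `STOPWORDS` set are hypothetical stand-ins, and it assumes the stop filter matches case-sensitively unless told otherwise.

```python
# Toy imitation of part of an analysis chain. Not Solr code.
# STOPWORDS is a hypothetical stand-in for the contents of stopwords.txt.
STOPWORDS = {"the", "of", "a"}


def whitespace_tokenize(text):
    """Split on runs of whitespace, like a whitespace tokenizer."""
    return text.split()


def stop_filter(tokens, ignore_case=False):
    """Drop stop words; matching is case-sensitive unless ignore_case is set."""
    if ignore_case:
        return [t for t in tokens if t.lower() not in STOPWORDS]
    return [t for t in tokens if t not in STOPWORDS]


def lowercase_filter(tokens):
    """Lowercase every token."""
    return [t.lower() for t in tokens]


def analyze(text, ignore_case=False):
    """Run the stages in the same order as the slide: stop filter, then lowercase."""
    tokens = whitespace_tokenize(text)
    tokens = stop_filter(tokens, ignore_case=ignore_case)
    return lowercase_filter(tokens)


if __name__ == "__main__":
    # "The" survives the case-sensitive stop filter because lowercasing runs later.
    print(analyze("The teaching of Primary Literacy"))
    print(analyze("The teaching of Primary Literacy", ignore_case=True))
```

In this toy, the capitalised "The" slips past the stop filter because lowercasing happens afterwards. That kind of ordering effect is what the Admin Field Analyser helps to spot.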
    5. Apache Solr Module Hooks
       hook_apachesolr_modify_query
       ✤ Provides ability to override standard Solr query
       ✤ Useful additions include:
          ✤ Query Filtering
          ✤ AND/OR keyword separators
          ✤ Term boosting - primary AND literacy^30
          ✤ Term Proximity - "primary literacy"~2
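The boosting and proximity examples on slide 5 are ordinary Lucene query syntax, so the strings can be assembled with a few small helpers. This is a minimal Python sketch of the string building only. The helper names are hypothetical, and it does not show the Drupal hook itself, which is written in PHP.

```python
# Hypothetical helpers that build Lucene-style query strings.
# They only format text; they do not talk to Solr or to Drupal.

def boost(term, factor):
    """Append a boost factor to a term, e.g. literacy^30."""
    return f"{term}^{factor}"


def proximity(phrase, slop):
    """Quote a phrase and allow its words to be up to `slop` positions apart."""
    return f'"{phrase}"~{slop}'


def join_terms(terms, operator="AND"):
    """Join terms with an explicit AND/OR keyword separator."""
    if operator not in ("AND", "OR"):
        raise ValueError("operator must be 'AND' or 'OR'")
    return f" {operator} ".join(terms)


if __name__ == "__main__":
    print(join_terms(["primary", boost("literacy", 30)]))  # primary AND literacy^30
    print(proximity("primary literacy", 2))                # "primary literacy"~2
```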
    6. Apache Solr Module Hooks
       ✤ hook_apachesolr_process_results
       ✤ hook_apachesolr_update_index
       ✤ hook_apachesolr_search_result
       ✤ hook_apachesolr_prepare_query
       ✤ hook_apachesolr_types_exclude
       ✤ hook_apachesolr_node_exclude
       ✤ and more added per release...
    7. Why is my content at the top?
       Admin Field Analyser
    8. Why is my content at the top?
       ✤ Query Debug
          ✤ http://localhost:8080/solr2/core3/select/?q=primary+literacy&version=2.2&start=0&rows=10&indent=on&debugQuery=true

       <str name="http://archive.teachfind.com/becta/localauthorities.becta.org.uk/index.html">
       2.3873122 = (MATCH) sum of:
         0.84451145 = (MATCH) weight(body:primari in 14212), product of:
           0.59476876 = queryWeight(body:primari), product of:
             1.54579 = idf(docFreq=13202, maxDocs=22788)
             0.38476688 = queryNorm
           1.4198987 = (MATCH) fieldWeight(body:primari in 14212), product of:
             2.4494898 = tf(termFreq(body:primari)=6)
             1.54579 = idf(docFreq=13202, maxDocs=22788)
             0.375 = fieldNorm(field=body, doc=14212)
         1.5428007 = (MATCH) weight(body:literaci in 14212), product of:
           0.8038968 = queryWeight(body:literaci), product of:
             2.0893087 = idf(docFreq=7666, maxDocs=22788)
             0.38476688 = queryNorm
           1.9191527 = (MATCH) fieldWeight(body:literaci in 14212), product of:
             2.4494898 = tf(termFreq(body:literaci)=6)
             2.0893087 = idf(docFreq=7666, maxDocs=22788)
             0.375 = fieldNorm(field=body, doc=14212)
       </str>
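The debug output on slide 8 can be checked by hand. The Python sketch below recomputes the score from the raw counts, assuming Lucene's classic TF-IDF formulas: tf is the square root of the term frequency, and idf is 1 + ln(maxDocs / (docFreq + 1)). Those formulas reproduce the figures on the slide to within rounding. The function names are illustrative, and queryNorm and fieldNorm are copied from the output rather than derived.

```python
import math

# Recompute the debugQuery explanation from slide 8.
# Assumes classic Lucene TF-IDF; queryNorm and fieldNorm are taken from the output as given.

def tf(term_freq):
    """Term frequency factor: square root of the raw count."""
    return math.sqrt(term_freq)


def idf(doc_freq, max_docs):
    """Inverse document frequency: rarer terms score higher."""
    return 1.0 + math.log(max_docs / (doc_freq + 1))


def term_score(term_freq, doc_freq, max_docs, query_norm, field_norm):
    """weight = queryWeight * fieldWeight, as in the explanation tree."""
    term_idf = idf(doc_freq, max_docs)
    query_weight = term_idf * query_norm
    field_weight = tf(term_freq) * term_idf * field_norm
    return query_weight * field_weight


MAX_DOCS = 22788
QUERY_NORM = 0.38476688
FIELD_NORM = 0.375

primari = term_score(6, 13202, MAX_DOCS, QUERY_NORM, FIELD_NORM)
literaci = term_score(6, 7666, MAX_DOCS, QUERY_NORM, FIELD_NORM)
total = primari + literaci

if __name__ == "__main__":
    print(f"body:primari  {primari:.4f}")
    print(f"body:literaci {literaci:.4f}")
    print(f"sum           {total:.4f}")
```

Both terms occur six times in the document, so the difference in their scores comes entirely from idf: "literaci" appears in fewer documents (7666 versus 13202) and therefore contributes more.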
    9. Other Modules
       ✤ Multi Server
       ✤ Attachments
       ✤ Apache Solr Views
       ✤ Autocomplete
       ✤ A12:Find (coming soon)
          ✤ Solr 3.2
          ✤ Field Grouping
          ✤ Drupal 6 and 7 support
          ✤ Geo Local
          ✤ Industry Specific Synonyms
    10. Questions?