Drupal for ng_os
Upcoming SlideShare
Loading in...5
×

Like this? Share it with your network

Share

Drupal for ng_os

  • 495 views
Uploaded on

An overview of some of the tips and tricks to improve search results.

An overview of some of the tips and tricks to improve search results.

More in: Technology
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Be the first to comment
    Be the first to like this
No Downloads

Views

Total Views
495
On Slideshare
495
From Embeds
0
Number of Embeds
0

Actions

Shares
Downloads
1
Comments
0
Likes
0

Embeds 0

No embeds

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
    No notes for slide
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n

Transcript

  • 1. Solr
Tuning:Improving
Relevancy
  • 2. Agenda✤ Solr Schema Basics✤ Some of Drupal’s Apache Solr Hooks✤ Why isn’t my content at the top? ✤ Admin Field Analyser ✤ Query Debug✤ Other Apache Solr Modules
  • 3. Solr Schema Basics - Schema.xml✤ Types ✤ Raw - int, long, string, date, double, etc ✤ Custom - Made up of raw plus ✤ analyzers - index and query ✤ tokenizers - break up content ✤ filters
  • 4. Schema Cont..<fieldType name="text" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <charFilter class="solr.MappingCharFilterFactory" mapping="ISOLatin1.txt"/> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"ignoreCase="true" expand="false"/> <filter class="solr.StopFilterFactory" words="stopwords.txt"/> <filter class="solr.WordDelimiterFilterFactory" protected="protwords.txt"splitOnCaseChange="1" /> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.SnowballPorterFilterFactory" language="English"protected="protwords.txt"/> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> </analyzer> <analyzer type="query"><!-- Repeated For Query --></analyzer></fieldType>
  • 5. Apache Solr Module Hookshook_apachesolr_modify_query ✤ Provides ability to override standard solr query ✤ Useful additions include: ✤ Query Filtering ✤ AND/OR keyword separators ✤ Term boosting - primary AND literacy^30 ✤ Term Proximity “primary literacy”~2
  • 6. Apache Solr Module Hooks✤ hook_apachesolr_process_results✤ hook_apachesolr_update_index✤ hook_apachesolr_search_result✤ hook_apachesolr_prepare_query✤ hook_apachesolr_types_exclude✤ hook_apachesolr_node_exclude✤ and more added per release....
  • 7. Why is my content at the top? Admin Field Analyser
  • 8. Why is my content at the top?✤ Query Debug ✤ http://localhost:8080/solr2/core3/select/?q=primary +literacy&version=2.2&start=0&rows=10&indent=on&debugQuery=true <str name="http://archive.teachfind.com/becta/localauthorities.becta.org.uk/index.html"> 2.3873122 = (MATCH) sum of: 0.84451145 = (MATCH) weight(body:primari in 14212), product of: 0.59476876 = queryWeight(body:primari), product of: 1.54579 = idf(docFreq=13202, maxDocs=22788) 0.38476688 = queryNorm 1.4198987 = (MATCH) fieldWeight(body:primari in 14212), product of: 2.4494898 = tf(termFreq(body:primari)=6) 1.54579 = idf(docFreq=13202, maxDocs=22788) 0.375 = fieldNorm(field=body, doc=14212) 1.5428007 = (MATCH) weight(body:literaci in 14212), product of: 0.8038968 = queryWeight(body:literaci), product of: 2.0893087 = idf(docFreq=7666, maxDocs=22788) 0.38476688 = queryNorm 1.9191527 = (MATCH) fieldWeight(body:literaci in 14212), product of: 2.4494898 = tf(termFreq(body:literaci)=6) 2.0893087 = idf(docFreq=7666, maxDocs=22788) 0.375 = fieldNorm(field=body, doc=14212) </str>
  • 9. Other Modules✤ Multi Server✤ Attachments✤ Apache Solr Views✤ Autocomplete✤ A12:Find (coming soon) ✤ Solr 3.2 ✤ Field Grouping ✤ Drupal 6 and 7 support ✤ Geo Local ✤ Industry Specific Synonyms
  • 10. Questions?