• Share
  • Email
  • Embed
  • Like
  • Save
  • Private Content
Drupal for ng_os
 

Drupal for ng_os

on

  • 419 views

An overview of some of the tips and tricks to improve search results.

An overview of some of the tips and tricks to improve search results.

Statistics

Views

Total Views
419
Views on SlideShare
419
Embed Views
0

Actions

Likes
0
Downloads
1
Comments
0

0 Embeds 0

No embeds

Accessibility

Categories

Upload Details

Uploaded via as Apple Keynote

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n
  • \n

Drupal for ng_os Drupal for ng_os Presentation Transcript

  • Solr
Tuning:Improving
Relevancy
  • Agenda✤ Solr Schema Basics✤ Some of Drupal’s Apache Solr Hooks✤ Why isn’t my content at the top? ✤ Admin Field Analyser ✤ Query Debug✤ Other Apache Solr Modules
  • Solr Schema Basics - Schema.xml✤ Types ✤ Raw - int, long, string, date, double, etc ✤ Custom - Made up of raw plus ✤ analyzers - index and query ✤ tokenizers - break up content ✤ filters
  • Schema Cont..<fieldType name="text" class="solr.TextField" positionIncrementGap="100"> <analyzer type="index"> <charFilter class="solr.MappingCharFilterFactory" mapping="ISOLatin1.txt"/> <tokenizer class="solr.WhitespaceTokenizerFactory"/> <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt"ignoreCase="true" expand="false"/> <filter class="solr.StopFilterFactory" words="stopwords.txt"/> <filter class="solr.WordDelimiterFilterFactory" protected="protwords.txt"splitOnCaseChange="1" /> <filter class="solr.LowerCaseFilterFactory"/> <filter class="solr.SnowballPorterFilterFactory" language="English"protected="protwords.txt"/> <filter class="solr.RemoveDuplicatesTokenFilterFactory"/> </analyzer> <analyzer type="query"><!-- Repeated For Query --></analyzer></fieldType>
  • Apache Solr Module Hookshook_apachesolr_modify_query ✤ Provides ability to override standard solr query ✤ Useful additions include: ✤ Query Filtering ✤ AND/OR keyword separators ✤ Term boosting - primary AND literacy^30 ✤ Term Proximity “primary literacy”~2
  • Apache Solr Module Hooks✤ hook_apachesolr_process_results✤ hook_apachesolr_update_index✤ hook_apachesolr_search_result✤ hook_apachesolr_prepare_query✤ hook_apachesolr_types_exclude✤ hook_apachesolr_node_exclude✤ and more added per release....
  • Why is my content at the top? Admin Field Analyser
  • Why is my content at the top?✤ Query Debug ✤ http://localhost:8080/solr2/core3/select/?q=primary +literacy&version=2.2&start=0&rows=10&indent=on&debugQuery=true <str name="http://archive.teachfind.com/becta/localauthorities.becta.org.uk/index.html"> 2.3873122 = (MATCH) sum of: 0.84451145 = (MATCH) weight(body:primari in 14212), product of: 0.59476876 = queryWeight(body:primari), product of: 1.54579 = idf(docFreq=13202, maxDocs=22788) 0.38476688 = queryNorm 1.4198987 = (MATCH) fieldWeight(body:primari in 14212), product of: 2.4494898 = tf(termFreq(body:primari)=6) 1.54579 = idf(docFreq=13202, maxDocs=22788) 0.375 = fieldNorm(field=body, doc=14212) 1.5428007 = (MATCH) weight(body:literaci in 14212), product of: 0.8038968 = queryWeight(body:literaci), product of: 2.0893087 = idf(docFreq=7666, maxDocs=22788) 0.38476688 = queryNorm 1.9191527 = (MATCH) fieldWeight(body:literaci in 14212), product of: 2.4494898 = tf(termFreq(body:literaci)=6) 2.0893087 = idf(docFreq=7666, maxDocs=22788) 0.375 = fieldNorm(field=body, doc=14212) </str>
  • Other Modules✤ Multi Server✤ Attachments✤ Apache Solr Views✤ Autocomplete✤ A12:Find (coming soon) ✤ Solr 3.2 ✤ Field Grouping ✤ Drupal 6 and 7 support ✤ Geo Local ✤ Industry Specific Synonyms
  • Questions?