Intro to Solr in Drupal
Upcoming SlideShare
Loading in...5
×
 

Intro to Solr in Drupal

on

  • 801 views

Does your website have a ton of data? How do your users find the relevant pages among all the noise in your site? ...

Does your website have a ton of data? How do your users find the relevant pages among all the noise in your site?

Solr can help deliver the pertinent search results to your users regardless of your site's size.

Apache Solr is a Java program that integrates with the Drupal contrib module that allows your users to quickly search millions of records and narrow down the results with minimal system impact.

Statistics

Views

Total Views
801
Slideshare-icon Views on SlideShare
790
Embed Views
11

Actions

Likes
0
Downloads
5
Comments
0

1 Embed 11

http://mis-gryphon.mediacurrentstaging.info 11

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • In this example Walmart found that conversion rates were directly affected by site load times. While this example is for sites it still applies to search.

Intro to Solr in Drupal Intro to Solr in Drupal Presentation Transcript

  • Intro to Solr
  • DrupalConPortland
  • Andrew RileyDirector of Drupal Development@andrewmriley
  • AgendaSearch?WhySolr?SearchingBehindtheScenes
  • Search?
  • What is Search?Search (v): to go or look through (a place, area, etc.)carefully in order to find something missing or lost: Isearched the desk for the letter.Source: http://dictionary.reference.com/browse/search@Mediacurrent
  • Why Users Search• Navigation doesnt make sense• It can be faster• Lots of data• Frequent data changes• Might just be looking for something@Mediacurrent
  • Search Problems• Search accuracy• Too much data• Slow response• Wrong results@Mediacurrent
  • WhySolr?
  • HistorySolr was initially created in 2004 as an in-houseproject for CNET. It was open sourced in 2006 anddonated to the Apache Software Foundation.@Mediacurrent
  • Lucene• Solr is a layer on top of Lucene• Lucene is a library• Solr stores files in Lucene format*http://wiki.apache.org/solr/SolrPerformanceData@Mediacurrent
  • SpeedSearch speed is important!@Mediacurrent
  • SpeedSource: Web Performance Today http://j.mp/12h8wLZ@Mediacurrent
  • Speed• Important!• It scales well• No database required• Clustering & Sharding• Netflix runs 1.2MM q/day on 4 servers**http://wiki.apache.org/solr/SolrPerformanceData@Mediacurrent
  • Natural Results• Stemming: Blogging vs. Blog• Stop Word Removal: The• Synonyms: Tissue vs Kleenex• Highly Configurable@Mediacurrent
  • Drupal Search• Not stemmed by default• Queries the database• Stores tokenized words in a single largetable• Much slower to index@Mediacurrent
  • VS@Mediacurrent
  • Searching
  • Ordering• Score• Comes from Lucene• Not "out of 100"• Bigger score firstMore Info: http://lucene.apache.org/core/3_6_1/scoring.html???201200199184@Mediacurrent
  • Facets• Users do the work• Fixes too much data• Native to Solr• Requires the Facet APImodule• Shopping Sites@Mediacurrent
  • Behind theScenes
  • Index?• Index contains Documents• Documents have Fields• Fields have Terms• ~2 minutes for updates• Uses Lucene syntax@Mediacurrent
  • Tokenizing• Splits words and numbers"this" "is" "blogging"• Excludes Stopwords"this" "blogging"• Handles Stemming (if enabled)"this" "blog"• Very configurable@Mediacurrent
  • Bias• Adjusts the order of search results• Works on: Content Type, Fields,Comments, Promoted to Home Page andmore• Can be dynamic with custom modules.@Mediacurrent
  • Recap
  • Modules• Apache Solr (apachesolr)• Facet API (facetapi)• Chaos tool suite (ctools)@Mediacurrent
  • Overall• Search is becoming more and moreimportant• You want to control your search results• If you dont provide a good searchexperience, somebody else will.• Solr doesnt have to be complex.• Solr is fast and scales.@Mediacurrent
  • Thank You!Questions?@Mediacurrent Mediacurrent.comandrew.riley@mediacurrent.com@andrewmrileyslideshare.net/mediacurrent