Solr in Drupal
Upcoming SlideShare
Loading in...5
×
 

Solr in Drupal

on

  • 6,155 views

Presentation held at Apache Lucene EuroCon 2010 (http://lucene-eurocon.org/)

Presentation held at Apache Lucene EuroCon 2010 (http://lucene-eurocon.org/)

Statistics

Views

Total Views
6,155
Views on SlideShare
6,121
Embed Views
34

Actions

Likes
6
Downloads
60
Comments
0

4 Embeds 34

http://www.slideshare.net 17
http://debategraph.org 15
http://borort.blogspot.com 1
http://www.linkedin.com 1

Accessibility

Categories

Upload Details

Uploaded via as Microsoft PowerPoint

Usage Rights

© All Rights Reserved

Report content

Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
  • Full Name Full Name Comment goes here.
    Are you sure you want to
    Your message goes here
    Processing…
Post Comment
Edit your comment
  • We can see lots of search parameters: status, priority, component etc.
  • To modify relevancy values, you can map field boosting values to different Drupal features: whether a node is promoted to the front page, or is sticky, the number of comments, recently commented nodes etc.
  • You can boost or diminish the ranking of individual content types, or exclude content types from being indexed altogether.
  • Dries Buytaert is the creator of Drupal, now the head of Acquia.com
  • Here you can see how many searches from a music site are being filtered by genre or instrumentation.

Solr in Drupal Solr in Drupal Presentation Transcript

  • Bringing Solr to Drupal
    1
    A General and a Library-Specific Use CaseKirály PétereXtensible Catalog
  • Two ways of using Solr in Drupal
    2
    General solution: Apache Solr Search Integration and related modules: Stats, Autocomplete, Multisite, Ajax, Biblio, Attachments, Übercart (e-commerce integration), Views, Multilingual, Geospatial and many othersFits for the overall needs, uses predefined fields
    Library specific solution: eXtensible Catalog modulesFits for library needs, uses dynamic fields
  • Part 1 – the general solution
    3
    General solution: Apache Solr Search Integration and related modules: Stats, Autocomplete, Multisite, Ajax, Biblio, Attachments, Übercart (e-commerce integration), Views, Multilingual, Geospatial and many othersFits for the overall needs, uses predefined fields
    Library specific solution: eXtensible Catalog modulesFits for library needs, uses dynamic fields
    This part of the presentation is based on the works and previous presentations of
    Robert Douglass.
  • Drupal architecture
    4
    Relational databasefor storage
    Solr index for search
  • Purposes
    5
    Creating a general insfrastructure which is usable in every Drupal installation
    Core module, and additional module for covering specific Solr functionalities (statistics, autocomplete etc.)
    Replace the original (and still default) MySQL-based search feature
  • 6
    sort
    Facet 1
    Facet 2
  • 7
    List of modules page
    sort
    module search
    API version
    categories
  • Advanced search: issues
    8
  • Whitehouse.gov on Drupal & Solr
    9
    sort
    facets
  • Boosting by Drupal specific properties
    10
  • Boosting and ignoring by document type
    11
  • Boosting by fields/ HTML tags
    12
  • More like this implementation
    13
  • Solrin Views integration
    14
    Views is a very popular module, helps creating interactive DB queries andresult pages. Now it can handle Apache Solr as data source.
  • Part of the Views admin page
    15
    You can specify fields, sorting, filters, layout,
    arguments, behaviours and more
  • Using Tika: file search
    16
  • Indexing/searching multiple sites
    17
  • Search in comments
    18
  • CCK date searching
    19
    Content Construction Kit: popular module to create document and field types. CCK date is a special field type handling dates.
  • statistics
    20
    impressive numbers – that’s why welove Solr…
  • Statistics of facet usage
    21
  • autocomplete
    22
  • Future plans
    23
    Crawling with Nutch
    Geospatial search
    eDismax (Solr 1.5)
    Drupal 7
    API changes
    Improving documentation
  • People behind these modules
    24
    Robert Douglass(DE) http://drupal.org/user/5449
    Alejandro Garza (MX) http://drupal.org/user/153120
    Peter Wolanin (US) http://drupal.org/user/49851
    James McKinney (CA) http://drupal.org/user/472460
    Scott Reynolds (US) http://drupal.org/user/60009
    Mike O'Connor (US) http://drupal.org/user/104525
    Markus Kalkbrenner (DE) http://drupal.org/user/124705
    and others…
  • Links
    25
    apachesolr http://drupal.org/project/apachesolr (this is the best starting point)
    content recommendation patchhttp://drupal.org/node/372767
    views integration http://drupal.org/project/apachesolr_views, http://acquia.com/node/911667
    file search http://drupal.org/project/apachesolr_attachments, http://acquia.com/node/1129446
    date facet for CCK fieldhttp://drupal.org/node/558160
    statisticshttp://drupal.org/project/apachesolr_stats
    multisitehttp://drupal.org/project/apachesolr_multisitesearch
    autocompletehttp://drupal.org/project/apachesolr_autocomplete
  • Part 2 – Library specific solution
    26
    General solution: Apache Solr Search Integration and related modules: Stats, Autocomplete, Multisite, Ajax, Biblio, Attachments, Übercart (e-commerce integration), Views, Multilingual, Geospatial and many othersFits for the overall needs, uses predefined fields
    Library specific solution: eXtensible Catalog modulesFits for library needs, uses dynamic fields
  • About eXtensible Catalog
    27
    a project creating an open source next generation library ‘discovery interface’ and an FRBR-based metadata platform
    started in 2007
    driven by new theories of library science, cultural anthropology and the practice of web 2.0, library 2.0
    Universities of Rochester, Notre Dame, Cornell, North Carolina at Charlotte, Rochester Institute of Technology and CARLI consortium
  • Architecture
    28
    Drupal CMS
    MARC Normalization
    DCNormalization
    XC DrupalToolkit
    Transformation
    Aggregation
    XC Metadata Services Toolkit
    circulation data
    XC NCIPToolkit
    XC OAI Toolkit
    Integrated Library System
    Repository
  • Purposes of XC Drupal Toolkit
    29
    integrate library data into a popular content management system
    customizable functionalities
    customizable interface(s)
    internationalization, localization
    5000+ custom modules, 20+ library specific modules
    wide range on mashup options
    all features are available through user interfaces
  • Search results
    30
    bibliographical data
    cover images
    highlighted terms
    facets
    availability information
  • Customized interface (Kyushu University)
    31
  • Similar documents
    32
  • XML attribute handling
    33
    subject=„History” and subject_OCLC=„History”subject_OCLC=„History”subject=„History” and subject_type=„OCLC”none
    <subject type="OCLC">History</subject>
    could be indexed as…
  • Mapping schema fields to Solr types
    34
  • Setup a facet
    35
    Aggregating values ofdifferent fields into onefacet
    specifySolr type
    custom PHP code to modify field values(conditions)
  • Custom PHP code for displaying title
    36
  • Getting records into Drupal: OAI harvesting
    37
    List of scheduled harvests
    Harvest is running
  • Data flow between components
    38
    OAI-PMHprovider
    Drupal
    batch delete/insert documents
    creating nodes
    MySQL
    Solr
  • Creating a ‘more like this’ parameter set
    39
    Saving parameters for ‘More like this’ functionality
  • Creating highlighter
    40
    wrapper around the highlighter’s parameters
  • Setting up field and date facet properties
    41
    Date facet properties
    Field facet properties
  • Putting facets together: ‘facet group’
    42
    General properties
    List of facets, and their type
  • Reordering facets
    43
    Just drag and drop
    You haven’t saved changes!
  • Using facet term list in search form
    44
    dropdown fullfilled withlanguagefacet terms
    dropdown definition
  • Adding widgets to UI: navigation bar
    45
    definition of navigation bar
    navigation bar in action
  • Links
    46
    Project page http://eXtensibleCatalog.org
    XC Drupal Toolkit http://drupal.org/project/xc
    Metadata Services Toolkit http://code.google.com/p/xcmetadataservicestoolkit
    OAI Toolkit http://code.google.com/p/xcoaitoolkit
    NCIP Toolkit http://code.google.com/p/xcnciptoolkit
    Developers:
    Mlen-Too Wesley(GH) http://drupal.org/user/318924
    Király Péter (H) http://drupal.org/user/352587, http://twitter.com/kiru