Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

1,091 views

Published on

"eCommerce Search with Apache Solr", Grant Ingersoll

Published in: Technology, Design
0 Comments
6 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
1,091
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
22
Comments
0
Likes
6
Embeds 0
No embeds

No notes for slide
  • but not as a search engine for content
    more like a search engine for behavior
  • Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

    1. 1. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 eCommerce Search with Apache Solr Grant Ingersoll CTO, LucidWorks Twitter: @gsingers
    2. 2. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Tales from the trenches • The case of the missing data • The power of suggestion
    3. 3. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Topics • Solr powered commerce – Companies – Features • Relevance, relevance, relevance • Demo
    4. 4. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Solr Powers Leading eCommerce and Consumer Sites
    5. 5. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Basic Features for eCommerce • High quality OOTB relevance • Facets – Range, Term/Category, Hierarchical, Pivot • Highlighting • Did you mean? • Boosting/Blocking/Landing Pages • Easy scale
    6. 6. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Advanced Features • Spatial – Local – Route finding – Open Hours, etc. • Function Queries – Inventory, Margin • Stats Component – Missing data – Bounds, etc.
    7. 7. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Tips and Tricks
    8. 8. Confidential and Proprietary © Copyright 2014 Look Before You Leap • Before undertaking any relevance tuning, you need to define what “better search” means to you • Once determined, many ways to test/measure • Once tested, many ways to fix http://www.betternetworker.com/files/useruploads/16675/leap.jpg
    9. 9. Confidential and Proprietary © Copyright 2014 Understand your… • Domain – Types of documents – Languages present – Document structures, metadata and other features – Lexical resources: jargon, synonyms, abbreviations... – Relationships between documents • Users – Sophistication/Expertise – Search and Discovery needs – Known Item vs. Keyword • Tolerance for Pain – Managers – Business Interests – Release cycles – Obsession in finding the one true relevance model (hint, it doesn’t exist) – “explain() blindness”
    10. 10. Confidential and Proprietary © Copyright 2014 Known Item vs. Keyword  eCommerce search often has a split between known item and keyword search  You probably have more “wiggle” room for relevancy on keyword search  E.g. What should be the top result for a search on “women’s shoes”?  Known Item should have best matches at the top  More in a moment
    11. 11. Confidential and Proprietary © Copyright 2014 Debugging • Check the analysis (more in the next slide) • Check for data quality issues • Check your query constructs (slop, boosts, etc.) • Try alternate query representations • (exact match)^100 OR (sloppy phrase match)^50 OR (OR query) • Use Lucene’s explain() or Solr’s &debugQuery
    12. 12. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Signal Processing for Search and Discovery • Signals power modern relevance – Clicks, conversions, sharing, history, signatures • LucidWorks 5 makes it easy to capture and leverage signals – Recommendations, analytics, discovery • Simplifies your data workflow • Simplify your operational footprint
    13. 13. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Solr Powered Signal Processing • Use Case: eCommerce • Data: – Product catalog (~1.2m items) – Click data (~3.9M clicks)
    14. 14. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Metadata • http://www.lucidworks.com • grant@lucidworks.com • @gsingers • Lucene/Solr Revolution – Washington DC, Nov 11-14 – http://www.lucenerevolution.org

    ×