SlideShare a Scribd company logo
1 of 14
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
eCommerce Search with Apache Solr
Grant Ingersoll
CTO, LucidWorks
Twitter: @gsingers
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Tales from the trenches
• The case of the missing data
• The power of suggestion
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Topics
• Solr powered commerce
– Companies
– Features
• Relevance, relevance, relevance
• Demo
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Solr Powers Leading eCommerce and
Consumer Sites
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Basic Features for eCommerce
• High quality OOTB relevance
• Facets
– Range, Term/Category, Hierarchical, Pivot
• Highlighting
• Did you mean?
• Boosting/Blocking/Landing Pages
• Easy scale
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Advanced Features
• Spatial
– Local
– Route finding
– Open Hours, etc.
• Function Queries
– Inventory, Margin
• Stats Component
– Missing data
– Bounds, etc.
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Tips and Tricks
Confidential and Proprietary © Copyright 2014
Look Before You Leap
• Before undertaking any relevance tuning, you need
to define what “better search” means to you
• Once determined, many ways to test/measure
• Once tested, many ways to fix
http://www.betternetworker.com/files/useruploads/16675/leap.jpg
Confidential and Proprietary © Copyright 2014
Understand your…
• Domain
– Types of documents
– Languages present
– Document structures,
metadata and other features
– Lexical resources: jargon,
synonyms, abbreviations...
– Relationships between
documents
• Users
– Sophistication/Expertise
– Search and Discovery needs
– Known Item vs. Keyword
• Tolerance for Pain
– Managers
– Business Interests
– Release cycles
– Obsession in finding the one
true relevance model (hint, it
doesn’t exist)
– “explain() blindness”
Confidential and Proprietary © Copyright 2014
Known Item vs. Keyword
 eCommerce search often has a split between known item and
keyword search
 You probably have more “wiggle” room for relevancy on
keyword search
 E.g. What should be the top result for a search on “women’s shoes”?
 Known Item should have best matches at the top
 More in a moment
Confidential and Proprietary © Copyright 2014
Debugging
• Check the analysis (more in the next slide)
• Check for data quality issues
• Check your query constructs (slop, boosts, etc.)
• Try alternate query representations
• (exact match)^100 OR (sloppy phrase match)^50 OR (OR query)
• Use Lucene’s explain() or Solr’s &debugQuery
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Signal Processing for Search and Discovery
• Signals power modern relevance
– Clicks, conversions, sharing, history, signatures
• LucidWorks 5 makes it easy to capture and
leverage signals
– Recommendations, analytics, discovery
• Simplifies your data workflow
• Simplify your operational footprint
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Solr Powered Signal Processing
• Use Case: eCommerce
• Data:
– Product catalog (~1.2m items)
– Click data (~3.9M clicks)
Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013
Metadata
• http://www.lucidworks.com
• grant@lucidworks.com
• @gsingers
• Lucene/Solr Revolution
– Washington DC, Nov 11-14
– http://www.lucenerevolution.org

More Related Content

Viewers also liked

Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search EngineChicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search EngineLucidworks (Archived)
 
Indexing Text and HTML Files with Solr
Indexing Text and HTML Files with SolrIndexing Text and HTML Files with Solr
Indexing Text and HTML Files with SolrLucidworks (Archived)
 
Using Solr to find the Right Person for the Right Job
Using Solr to find the Right Person for the Right JobUsing Solr to find the Right Person for the Right Job
Using Solr to find the Right Person for the Right JobLucidworks (Archived)
 
How The Guardian Embraced the Internet using Content, Search, and Open Source
How The Guardian Embraced the Internet using Content, Search, and Open SourceHow The Guardian Embraced the Internet using Content, Search, and Open Source
How The Guardian Embraced the Internet using Content, Search, and Open SourceLucidworks (Archived)
 
Tate Tyler - Designing the Search Experience
Tate Tyler - Designing the Search ExperienceTate Tyler - Designing the Search Experience
Tate Tyler - Designing the Search ExperienceLucidworks (Archived)
 
I love you mommy
I love you mommyI love you mommy
I love you mommyNyiah
 
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...Marty Kaszubowski
 
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with SearchChicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with SearchLucidworks (Archived)
 
Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14
Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14
Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14Marty Kaszubowski
 
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
 SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and SolrLucidworks (Archived)
 

Viewers also liked (16)

Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search EngineChicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
Chicago Solr Meetup - June 10th: This Ain't Your Parents' Search Engine
 
What’s New in Apache Lucene 2.9
What’s New in Apache Lucene 2.9What’s New in Apache Lucene 2.9
What’s New in Apache Lucene 2.9
 
Indexing Text and HTML Files with Solr
Indexing Text and HTML Files with SolrIndexing Text and HTML Files with Solr
Indexing Text and HTML Files with Solr
 
Web Design Course Overview
Web Design Course OverviewWeb Design Course Overview
Web Design Course Overview
 
Crazy
CrazyCrazy
Crazy
 
Using Solr to find the Right Person for the Right Job
Using Solr to find the Right Person for the Right JobUsing Solr to find the Right Person for the Right Job
Using Solr to find the Right Person for the Right Job
 
How The Guardian Embraced the Internet using Content, Search, and Open Source
How The Guardian Embraced the Internet using Content, Search, and Open SourceHow The Guardian Embraced the Internet using Content, Search, and Open Source
How The Guardian Embraced the Internet using Content, Search, and Open Source
 
Tate Tyler - Designing the Search Experience
Tate Tyler - Designing the Search ExperienceTate Tyler - Designing the Search Experience
Tate Tyler - Designing the Search Experience
 
I love you mommy
I love you mommyI love you mommy
I love you mommy
 
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
Technology opportunities in hampton roads (kaszubowski ), nasa technology day...
 
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with SearchChicago Solr Meetup - June 10th: Exploring Hadoop with Search
Chicago Solr Meetup - June 10th: Exploring Hadoop with Search
 
Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14
Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14
Center for Enterprise Innovation (CEI) Summary for HREDA, 9-25-14
 
Web Design Course FETAC Level 5
Web Design Course FETAC Level 5 Web Design Course FETAC Level 5
Web Design Course FETAC Level 5
 
What’s New in Apache Lucene 2.9
What’s New in Apache Lucene 2.9What’s New in Apache Lucene 2.9
What’s New in Apache Lucene 2.9
 
Presentation: IT Wizard Summer Camp
Presentation: IT Wizard Summer CampPresentation: IT Wizard Summer Camp
Presentation: IT Wizard Summer Camp
 
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
 SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
SFBay Area Solr Meetup - July 15th: Integrating Hadoop and Solr
 

Similar to Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

EIS_B2Bonline_PaulWlodarczyk_World-Class-Product-Data_2015-04-27
EIS_B2Bonline_PaulWlodarczyk_World-Class-Product-Data_2015-04-27EIS_B2Bonline_PaulWlodarczyk_World-Class-Product-Data_2015-04-27
EIS_B2Bonline_PaulWlodarczyk_World-Class-Product-Data_2015-04-27Paul Wlodarczyk
 
SPLive Orlando - Beyond the Search Center - Application or Solution?
SPLive Orlando - Beyond the Search Center - Application or Solution?SPLive Orlando - Beyond the Search Center - Application or Solution?
SPLive Orlando - Beyond the Search Center - Application or Solution?Agnes Molnar
 
How to Use Site Search to Drive Conversions and Create Customers
How to Use Site Search to Drive Conversions and Create CustomersHow to Use Site Search to Drive Conversions and Create Customers
How to Use Site Search to Drive Conversions and Create CustomersEarley Information Science
 
Oracle BI Big Data and Bics
Oracle BI Big Data and BicsOracle BI Big Data and Bics
Oracle BI Big Data and BicsDarren Grogan
 
ITAM Review Oracle Seminar NY Pallisade Compliance Presentation
ITAM Review Oracle Seminar NY Pallisade Compliance PresentationITAM Review Oracle Seminar NY Pallisade Compliance Presentation
ITAM Review Oracle Seminar NY Pallisade Compliance PresentationMartin Thompson
 
SPC117 - How to manage and troubleshoot Search
SPC117 - How to manage and troubleshoot SearchSPC117 - How to manage and troubleshoot Search
SPC117 - How to manage and troubleshoot SearchAgnes Molnar
 
Solr site search makes shopping simple
Solr site search makes shopping simpleSolr site search makes shopping simple
Solr site search makes shopping simpleRyan Street
 
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...lucenerevolution
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...Concept Searching, Inc
 
Google's rapid, recent transformation of search
Google's rapid, recent transformation of searchGoogle's rapid, recent transformation of search
Google's rapid, recent transformation of searchTop Floor Technologies
 
Model Confidence for Master Data with David Loshin
Model Confidence for Master Data with David LoshinModel Confidence for Master Data with David Loshin
Model Confidence for Master Data with David LoshinEmbarcadero Technologies
 
Distillery Credentials May 2014
Distillery Credentials May 2014Distillery Credentials May 2014
Distillery Credentials May 2014Sam Jeffries
 
Google For Linked In09 25 12
Google For Linked In09 25 12Google For Linked In09 25 12
Google For Linked In09 25 12jhayesdc
 
Search and social patents for 2012 and beyond
Search and social patents for 2012 and beyondSearch and social patents for 2012 and beyond
Search and social patents for 2012 and beyondBill Slawski
 
The Future of Enterprise Search - #SPSUK Keynote
The Future of Enterprise Search - #SPSUK KeynoteThe Future of Enterprise Search - #SPSUK Keynote
The Future of Enterprise Search - #SPSUK KeynoteAgnes Molnar
 
Information Discovery and Search Strategies for Evidence-Based Research
Information Discovery and Search Strategies for Evidence-Based ResearchInformation Discovery and Search Strategies for Evidence-Based Research
Information Discovery and Search Strategies for Evidence-Based ResearchDavid Nzoputa Ofili
 

Similar to Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr (20)

Using LWE/Solr/Lucene for eCom
Using LWE/Solr/Lucene for eComUsing LWE/Solr/Lucene for eCom
Using LWE/Solr/Lucene for eCom
 
EIS_B2Bonline_PaulWlodarczyk_World-Class-Product-Data_2015-04-27
EIS_B2Bonline_PaulWlodarczyk_World-Class-Product-Data_2015-04-27EIS_B2Bonline_PaulWlodarczyk_World-Class-Product-Data_2015-04-27
EIS_B2Bonline_PaulWlodarczyk_World-Class-Product-Data_2015-04-27
 
SPLive Orlando - Beyond the Search Center - Application or Solution?
SPLive Orlando - Beyond the Search Center - Application or Solution?SPLive Orlando - Beyond the Search Center - Application or Solution?
SPLive Orlando - Beyond the Search Center - Application or Solution?
 
Contextualized Customer Journeys
Contextualized Customer JourneysContextualized Customer Journeys
Contextualized Customer Journeys
 
How to Use Site Search to Drive Conversions and Create Customers
How to Use Site Search to Drive Conversions and Create CustomersHow to Use Site Search to Drive Conversions and Create Customers
How to Use Site Search to Drive Conversions and Create Customers
 
Oracle BI Big Data and Bics
Oracle BI Big Data and BicsOracle BI Big Data and Bics
Oracle BI Big Data and Bics
 
ITAM Review Oracle Seminar NY Pallisade Compliance Presentation
ITAM Review Oracle Seminar NY Pallisade Compliance PresentationITAM Review Oracle Seminar NY Pallisade Compliance Presentation
ITAM Review Oracle Seminar NY Pallisade Compliance Presentation
 
SPC117 - How to manage and troubleshoot Search
SPC117 - How to manage and troubleshoot SearchSPC117 - How to manage and troubleshoot Search
SPC117 - How to manage and troubleshoot Search
 
Solr site search makes shopping simple
Solr site search makes shopping simpleSolr site search makes shopping simple
Solr site search makes shopping simple
 
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
Krellenstein lucene revolution_2011_keynote_once_future_history_enterprise se...
 
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
You Spoke, We Listened – Achieving a New Level of Search Optimization with Go...
 
Search Engine Optimisation (SEO) Basics Training - April 2013
Search Engine Optimisation (SEO) Basics Training - April 2013Search Engine Optimisation (SEO) Basics Training - April 2013
Search Engine Optimisation (SEO) Basics Training - April 2013
 
Google's rapid, recent transformation of search
Google's rapid, recent transformation of searchGoogle's rapid, recent transformation of search
Google's rapid, recent transformation of search
 
Model Confidence for Master Data with David Loshin
Model Confidence for Master Data with David LoshinModel Confidence for Master Data with David Loshin
Model Confidence for Master Data with David Loshin
 
Distillery Credentials May 2014
Distillery Credentials May 2014Distillery Credentials May 2014
Distillery Credentials May 2014
 
Upc jornadas tic 2015 final
Upc jornadas tic 2015 finalUpc jornadas tic 2015 final
Upc jornadas tic 2015 final
 
Google For Linked In09 25 12
Google For Linked In09 25 12Google For Linked In09 25 12
Google For Linked In09 25 12
 
Search and social patents for 2012 and beyond
Search and social patents for 2012 and beyondSearch and social patents for 2012 and beyond
Search and social patents for 2012 and beyond
 
The Future of Enterprise Search - #SPSUK Keynote
The Future of Enterprise Search - #SPSUK KeynoteThe Future of Enterprise Search - #SPSUK Keynote
The Future of Enterprise Search - #SPSUK Keynote
 
Information Discovery and Search Strategies for Evidence-Based Research
Information Discovery and Search Strategies for Evidence-Based ResearchInformation Discovery and Search Strategies for Evidence-Based Research
Information Discovery and Search Strategies for Evidence-Based Research
 

More from Lucidworks (Archived)

Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...Lucidworks (Archived)
 
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for BusinessSFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for BusinessLucidworks (Archived)
 
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
SFBay Area Solr Meetup - June 18th: Benchmarking Solr PerformanceSFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
SFBay Area Solr Meetup - June 18th: Benchmarking Solr PerformanceLucidworks (Archived)
 
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com SearchMinneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com SearchLucidworks (Archived)
 
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...Lucidworks (Archived)
 
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...Lucidworks (Archived)
 
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DCBig Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DCLucidworks (Archived)
 
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DCWhat's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DCLucidworks (Archived)
 
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DCSolr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DCLucidworks (Archived)
 
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DCIntro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DCLucidworks (Archived)
 
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DCTest Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DCLucidworks (Archived)
 
Building a data driven search application with LucidWorks SiLK
Building a data driven search application with LucidWorks SiLKBuilding a data driven search application with LucidWorks SiLK
Building a data driven search application with LucidWorks SiLKLucidworks (Archived)
 
Introducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinarIntroducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinarLucidworks (Archived)
 
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks Lucidworks (Archived)
 
Seeley yonik solr performance key innovations
Seeley yonik   solr performance key innovationsSeeley yonik   solr performance key innovations
Seeley yonik solr performance key innovationsLucidworks (Archived)
 
Implementing Click-through Relevance Ranking in Solr and LucidWorks Enterprise
Implementing Click-through Relevance Ranking in Solr and LucidWorks EnterpriseImplementing Click-through Relevance Ranking in Solr and LucidWorks Enterprise
Implementing Click-through Relevance Ranking in Solr and LucidWorks EnterpriseLucidworks (Archived)
 

More from Lucidworks (Archived) (20)

Integrating Hadoop & Solr
Integrating Hadoop & SolrIntegrating Hadoop & Solr
Integrating Hadoop & Solr
 
The Data-Driven Paradigm
The Data-Driven ParadigmThe Data-Driven Paradigm
The Data-Driven Paradigm
 
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
Downtown SF Lucene/Solr Meetup - September 17: Thoth: Real-time Solr Monitori...
 
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for BusinessSFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
SFBay Area Solr Meetup - June 18th: Box + Solr = Content Search for Business
 
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
SFBay Area Solr Meetup - June 18th: Benchmarking Solr PerformanceSFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
SFBay Area Solr Meetup - June 18th: Benchmarking Solr Performance
 
What's new in solr june 2014
What's new in solr june 2014What's new in solr june 2014
What's new in solr june 2014
 
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com SearchMinneapolis Solr Meetup - May 28, 2014: Target.com Search
Minneapolis Solr Meetup - May 28, 2014: Target.com Search
 
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
Exploration of multidimensional biomedical data in pub chem, Presented by Lia...
 
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...Unstructured   Or: How I Learned to Stop Worrying and Love the xml, Presented...
Unstructured Or: How I Learned to Stop Worrying and Love the xml, Presented...
 
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DCBig Data Challenges, Presented by Wes Caldwell at SolrExchage DC
Big Data Challenges, Presented by Wes Caldwell at SolrExchage DC
 
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DCWhat's New  in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
What's New in Lucene/Solr Presented by Grant Ingersoll at SolrExchage DC
 
Solr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DCSolr At AOL, Presented by Sean Timm at SolrExchage DC
Solr At AOL, Presented by Sean Timm at SolrExchage DC
 
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DCIntro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
Intro to Solr Cloud, Presented by Tim Potter at SolrExchage DC
 
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DCTest Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
Test Driven Relevancy, Presented by Doug Turnbull at SolrExchage DC
 
Building a data driven search application with LucidWorks SiLK
Building a data driven search application with LucidWorks SiLKBuilding a data driven search application with LucidWorks SiLK
Building a data driven search application with LucidWorks SiLK
 
Introducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinarIntroducing LucidWorks App for Splunk Enterprise webinar
Introducing LucidWorks App for Splunk Enterprise webinar
 
Solr4 nosql search_server_2013
Solr4 nosql search_server_2013Solr4 nosql search_server_2013
Solr4 nosql search_server_2013
 
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks
Lucene/Solr Revolution 2013: Paul Doscher Opening Remarks
 
Seeley yonik solr performance key innovations
Seeley yonik   solr performance key innovationsSeeley yonik   solr performance key innovations
Seeley yonik solr performance key innovations
 
Implementing Click-through Relevance Ranking in Solr and LucidWorks Enterprise
Implementing Click-through Relevance Ranking in Solr and LucidWorks EnterpriseImplementing Click-through Relevance Ranking in Solr and LucidWorks Enterprise
Implementing Click-through Relevance Ranking in Solr and LucidWorks Enterprise
 

Recently uploaded

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Patryk Bandurski
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationSafe Software
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksSoftradix Technologies
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersThousandEyes
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDGMarianaLemus7
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonetsnaman860154
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Neo4j
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsMemoori
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Allon Mureinik
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxOnBoard
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Enterprise Knowledge
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreternaman860154
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...shyamraj55
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationSlibray Presentation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationRidwan Fadjar
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountPuma Security, LLC
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticscarlostorres15106
 

Recently uploaded (20)

Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
Integration and Automation in Practice: CI/CD in Mule Integration and Automat...
 
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry InnovationBeyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
Beyond Boundaries: Leveraging No-Code Solutions for Industry Innovation
 
Benefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other FrameworksBenefits Of Flutter Compared To Other Frameworks
Benefits Of Flutter Compared To Other Frameworks
 
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for PartnersEnhancing Worker Digital Experience: A Hands-on Workshop for Partners
Enhancing Worker Digital Experience: A Hands-on Workshop for Partners
 
APIForce Zurich 5 April Automation LPDG
APIForce Zurich 5 April  Automation LPDGAPIForce Zurich 5 April  Automation LPDG
APIForce Zurich 5 April Automation LPDG
 
How to convert PDF to text with Nanonets
How to convert PDF to text with NanonetsHow to convert PDF to text with Nanonets
How to convert PDF to text with Nanonets
 
Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024Build your next Gen AI Breakthrough - April 2024
Build your next Gen AI Breakthrough - April 2024
 
AI as an Interface for Commercial Buildings
AI as an Interface for Commercial BuildingsAI as an Interface for Commercial Buildings
AI as an Interface for Commercial Buildings
 
Pigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food ManufacturingPigging Solutions in Pet Food Manufacturing
Pigging Solutions in Pet Food Manufacturing
 
The transition to renewables in India.pdf
The transition to renewables in India.pdfThe transition to renewables in India.pdf
The transition to renewables in India.pdf
 
Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)Injustice - Developers Among Us (SciFiDevCon 2024)
Injustice - Developers Among Us (SciFiDevCon 2024)
 
Maximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptxMaximizing Board Effectiveness 2024 Webinar.pptx
Maximizing Board Effectiveness 2024 Webinar.pptx
 
Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024Designing IA for AI - Information Architecture Conference 2024
Designing IA for AI - Information Architecture Conference 2024
 
Presentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreterPresentation on how to chat with PDF using ChatGPT code interpreter
Presentation on how to chat with PDF using ChatGPT code interpreter
 
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
Automating Business Process via MuleSoft Composer | Bangalore MuleSoft Meetup...
 
Connect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck PresentationConnect Wave/ connectwave Pitch Deck Presentation
Connect Wave/ connectwave Pitch Deck Presentation
 
My Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 PresentationMy Hashitalk Indonesia April 2024 Presentation
My Hashitalk Indonesia April 2024 Presentation
 
Breaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path MountBreaking the Kubernetes Kill Chain: Host Path Mount
Breaking the Kubernetes Kill Chain: Host Path Mount
 
DMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special EditionDMCC Future of Trade Web3 - Special Edition
DMCC Future of Trade Web3 - Special Edition
 
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmaticsKotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
Kotlin Multiplatform & Compose Multiplatform - Starter kit for pragmatics
 

Minneapolis Solr Meetup - May 28, 2014: eCommerce Search with Apache Solr

  • 1. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 eCommerce Search with Apache Solr Grant Ingersoll CTO, LucidWorks Twitter: @gsingers
  • 2. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Tales from the trenches • The case of the missing data • The power of suggestion
  • 3. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Topics • Solr powered commerce – Companies – Features • Relevance, relevance, relevance • Demo
  • 4. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Solr Powers Leading eCommerce and Consumer Sites
  • 5. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Basic Features for eCommerce • High quality OOTB relevance • Facets – Range, Term/Category, Hierarchical, Pivot • Highlighting • Did you mean? • Boosting/Blocking/Landing Pages • Easy scale
  • 6. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Advanced Features • Spatial – Local – Route finding – Open Hours, etc. • Function Queries – Inventory, Margin • Stats Component – Missing data – Bounds, etc.
  • 7. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Tips and Tricks
  • 8. Confidential and Proprietary © Copyright 2014 Look Before You Leap • Before undertaking any relevance tuning, you need to define what “better search” means to you • Once determined, many ways to test/measure • Once tested, many ways to fix http://www.betternetworker.com/files/useruploads/16675/leap.jpg
  • 9. Confidential and Proprietary © Copyright 2014 Understand your… • Domain – Types of documents – Languages present – Document structures, metadata and other features – Lexical resources: jargon, synonyms, abbreviations... – Relationships between documents • Users – Sophistication/Expertise – Search and Discovery needs – Known Item vs. Keyword • Tolerance for Pain – Managers – Business Interests – Release cycles – Obsession in finding the one true relevance model (hint, it doesn’t exist) – “explain() blindness”
  • 10. Confidential and Proprietary © Copyright 2014 Known Item vs. Keyword  eCommerce search often has a split between known item and keyword search  You probably have more “wiggle” room for relevancy on keyword search  E.g. What should be the top result for a search on “women’s shoes”?  Known Item should have best matches at the top  More in a moment
  • 11. Confidential and Proprietary © Copyright 2014 Debugging • Check the analysis (more in the next slide) • Check for data quality issues • Check your query constructs (slop, boosts, etc.) • Try alternate query representations • (exact match)^100 OR (sloppy phrase match)^50 OR (OR query) • Use Lucene’s explain() or Solr’s &debugQuery
  • 12. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Signal Processing for Search and Discovery • Signals power modern relevance – Clicks, conversions, sharing, history, signatures • LucidWorks 5 makes it easy to capture and leverage signals – Recommendations, analytics, discovery • Simplifies your data workflow • Simplify your operational footprint
  • 13. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Solr Powered Signal Processing • Use Case: eCommerce • Data: – Product catalog (~1.2m items) – Click data (~3.9M clicks)
  • 14. Confidential and Proprietary © Copyright 2014Confidential and Proprietary © Copyright 2013 Metadata • http://www.lucidworks.com • grant@lucidworks.com • @gsingers • Lucene/Solr Revolution – Washington DC, Nov 11-14 – http://www.lucenerevolution.org

Editor's Notes

  1. but not as a search engine for content more like a search engine for behavior