SlideShare a Scribd company logo
http://flickr.com/photos/thepma/640151954/
Wednesday, February 25, 2009                                               1
THE RISING SUN:
                               Making the most of Solr power


                                   code4lib '09, Providence, RI
                                          Erik Hatcher
                               erik.hatcher@lucidimagination.com


Wednesday, February 25, 2009                                       2
#9 Performance




                    Measure

Wednesday, February 25, 2009                    3
#8 Memory



                    omitNorms
                    omitTf (1.4)
Wednesday, February 25, 2009               4
#7 Query Parsing




                    quot;It Dependsquot;
                             More Like This:
            http://www.lucidimagination.com/blog/2009/02/22/
                        exploring-query-parsers/
Wednesday, February 25, 2009                                   5
#6 Data Import


                      DIH, Solr Cell,
                      CSV, LuSql, APIs,...
                      solr-ruby Mapper
Wednesday, February 25, 2009                    6
Obligatory XML




Wednesday, February 25, 2009                    7
#5 Request Handlers



                 Leverage Solr's
                 configurability
Wednesday, February 25, 2009                         8
http://localhost:8983/solr/
                                   timeline.xml




Wednesday, February 25, 2009                             9
#4 Solr as IR toolkit


                 TermVectorRequestHandler
                 StatsComponent
                 Luke

Wednesday, February 25, 2009                           10
#3 LocalSolr




                    geo searching

Wednesday, February 25, 2009                  11
#2 Faceting


                 Solr 1.4 performance!
                 (*bows* to Yonik)
                 Multi-select
Wednesday, February 25, 2009                 12
Multi-select faceting




   http://www.lucidimagination.com/search/p:lucene,solr/s:email,wiki/l:user?q=marc
Wednesday, February 25, 2009                                                         13
#1 User Interface



                    The Interface is
                       the App
Wednesday, February 25, 2009                       14
SolrJS




Wednesday, February 25, 2009            15
Solritas


                 Velocity
                 templated
                 response
Wednesday, February 25, 2009              16
#0 Community




        solr-user@lucene.apache.org


Wednesday, February 25, 2009                  17
quot;LucidFindquot;

                        http://www.lucidimagination.com/search
                               find@lucidimagination.com




Wednesday, February 25, 2009                                     18
Lucid Supports Solr
                                   lucidimagination.com

Wednesday, February 25, 2009                              19
Lucid Articles

                      Grant Ingersoll                          Erik Hatcher

                               Getting Started with Lucene        Getting Started with Solr
                                                                  (includes screencast), co-
                                                                  authored with Jonathan
                               Debugging Relevance Issues in
                                                                  Knudsen
                               Search

                                                               Sami Siren
                               Optimizing Findability in
                               Lucene and Solr
                                                                  Content Extraction with Tika
                      Yonik Seeley
                                                               Mark Miller
                               Faceted Search with Solr
                                                                  Scaling Lucene and Solr




Wednesday, February 25, 2009                                                                     20
Lucid Podcasts

                 Interviews with:
                      Doug Cutting (creator of Lucene)
                      Ryan McKinley (Solr committer)
                      Chris Hostetter (Solr committer)
                 More coming!

Wednesday, February 25, 2009                             21
Lucid Blogging

                 Mark Miller: quot;Exploring Query Parsersquot;, quot;Highlighting
                 Highlighter Thoughtsquot;, quot;Investigating OOM and other
                 JVM issuesquot;, quot;Looking forward to new features in Solr
                 1.4quot;

                 Erik Hatcher: quot;acts_as_solr with rich document
                 indexingquot;

                 Grant Ingersoll: quot;Sorting, Faceting, and Schema Design
                 in Solrquot;

Wednesday, February 25, 2009                                              22
Pre-release ebook available now!

Wednesday, February 25, 2009                   23
The Good Ol' Blues Brothers
                             Boys




Wednesday, February 25, 2009                     24

More Related Content

More from Erik Hatcher

Ted Talk
Ted TalkTed Talk
Ted Talk
Erik Hatcher
 
Solr Payloads
Solr PayloadsSolr Payloads
Solr Payloads
Erik Hatcher
 
it's just search
it's just searchit's just search
it's just search
Erik Hatcher
 
Lucene's Latest (for Libraries)
Lucene's Latest (for Libraries)Lucene's Latest (for Libraries)
Lucene's Latest (for Libraries)
Erik Hatcher
 
Solr Indexing and Analysis Tricks
Solr Indexing and Analysis TricksSolr Indexing and Analysis Tricks
Solr Indexing and Analysis Tricks
Erik Hatcher
 
Solr Powered Libraries
Solr Powered LibrariesSolr Powered Libraries
Solr Powered Libraries
Erik Hatcher
 
Solr Query Parsing
Solr Query ParsingSolr Query Parsing
Solr Query ParsingErik Hatcher
 
"Solr Update" at code4lib '13 - Chicago
"Solr Update" at code4lib '13 - Chicago"Solr Update" at code4lib '13 - Chicago
"Solr Update" at code4lib '13 - Chicago
Erik Hatcher
 
Query Parsing - Tips and Tricks
Query Parsing - Tips and TricksQuery Parsing - Tips and Tricks
Query Parsing - Tips and Tricks
Erik Hatcher
 
Solr 4
Solr 4Solr 4
Solr 4
Erik Hatcher
 
Solr Recipes
Solr RecipesSolr Recipes
Solr Recipes
Erik Hatcher
 
Lucene for Solr Developers
Lucene for Solr DevelopersLucene for Solr Developers
Lucene for Solr Developers
Erik Hatcher
 
Introduction to Solr
Introduction to SolrIntroduction to Solr
Introduction to Solr
Erik Hatcher
 
Introduction to Solr
Introduction to SolrIntroduction to Solr
Introduction to Solr
Erik Hatcher
 
Lucene for Solr Developers
Lucene for Solr DevelopersLucene for Solr Developers
Lucene for Solr DevelopersErik Hatcher
 
Introduction to Solr
Introduction to SolrIntroduction to Solr
Introduction to SolrErik Hatcher
 
Rapid Prototyping with Solr
Rapid Prototyping with SolrRapid Prototyping with Solr
Rapid Prototyping with SolrErik Hatcher
 
Lucene for Solr Developers
Lucene for Solr DevelopersLucene for Solr Developers
Lucene for Solr DevelopersErik Hatcher
 
What's New in Solr 3.x / 4.0
What's New in Solr 3.x / 4.0What's New in Solr 3.x / 4.0
What's New in Solr 3.x / 4.0Erik Hatcher
 

More from Erik Hatcher (20)

Ted Talk
Ted TalkTed Talk
Ted Talk
 
Solr Payloads
Solr PayloadsSolr Payloads
Solr Payloads
 
it's just search
it's just searchit's just search
it's just search
 
Lucene's Latest (for Libraries)
Lucene's Latest (for Libraries)Lucene's Latest (for Libraries)
Lucene's Latest (for Libraries)
 
Solr Indexing and Analysis Tricks
Solr Indexing and Analysis TricksSolr Indexing and Analysis Tricks
Solr Indexing and Analysis Tricks
 
Solr Powered Libraries
Solr Powered LibrariesSolr Powered Libraries
Solr Powered Libraries
 
Solr Query Parsing
Solr Query ParsingSolr Query Parsing
Solr Query Parsing
 
"Solr Update" at code4lib '13 - Chicago
"Solr Update" at code4lib '13 - Chicago"Solr Update" at code4lib '13 - Chicago
"Solr Update" at code4lib '13 - Chicago
 
Query Parsing - Tips and Tricks
Query Parsing - Tips and TricksQuery Parsing - Tips and Tricks
Query Parsing - Tips and Tricks
 
Solr 4
Solr 4Solr 4
Solr 4
 
Solr Recipes
Solr RecipesSolr Recipes
Solr Recipes
 
Lucene for Solr Developers
Lucene for Solr DevelopersLucene for Solr Developers
Lucene for Solr Developers
 
Introduction to Solr
Introduction to SolrIntroduction to Solr
Introduction to Solr
 
Solr Flair
Solr FlairSolr Flair
Solr Flair
 
Introduction to Solr
Introduction to SolrIntroduction to Solr
Introduction to Solr
 
Lucene for Solr Developers
Lucene for Solr DevelopersLucene for Solr Developers
Lucene for Solr Developers
 
Introduction to Solr
Introduction to SolrIntroduction to Solr
Introduction to Solr
 
Rapid Prototyping with Solr
Rapid Prototyping with SolrRapid Prototyping with Solr
Rapid Prototyping with Solr
 
Lucene for Solr Developers
Lucene for Solr DevelopersLucene for Solr Developers
Lucene for Solr Developers
 
What's New in Solr 3.x / 4.0
What's New in Solr 3.x / 4.0What's New in Solr 3.x / 4.0
What's New in Solr 3.x / 4.0
 

Recently uploaded

The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
Safe Software
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Nexer Digital
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Paige Cruz
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
Guy Korland
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
Aftab Hussain
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
Neo4j
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
Kari Kakkonen
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
Neo4j
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
SOFTTECHHUB
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
RinaMondal9
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
ControlCase
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
Matthew Sinclair
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
DianaGray10
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
Matthew Sinclair
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
mikeeftimakis1
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
Neo4j
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
Neo4j
 

Recently uploaded (20)

The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Essentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FMEEssentials of Automations: The Art of Triggers and Actions in FME
Essentials of Automations: The Art of Triggers and Actions in FME
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?Elizabeth Buie - Older adults: Are we really designing for our future selves?
Elizabeth Buie - Older adults: Are we really designing for our future selves?
 
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdfObservability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
Observability Concepts EVERY Developer Should Know -- DeveloperWeek Europe.pdf
 
GraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge GraphGraphRAG is All You need? LLM & Knowledge Graph
GraphRAG is All You need? LLM & Knowledge Graph
 
Removing Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software FuzzingRemoving Uninteresting Bytes in Software Fuzzing
Removing Uninteresting Bytes in Software Fuzzing
 
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
GraphSummit Singapore | The Future of Agility: Supercharging Digital Transfor...
 
Climate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing DaysClimate Impact of Software Testing at Nordic Testing Days
Climate Impact of Software Testing at Nordic Testing Days
 
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
GraphSummit Singapore | Enhancing Changi Airport Group's Passenger Experience...
 
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
Why You Should Replace Windows 11 with Nitrux Linux 3.5.0 for enhanced perfor...
 
Free Complete Python - A step towards Data Science
Free Complete Python - A step towards Data ScienceFree Complete Python - A step towards Data Science
Free Complete Python - A step towards Data Science
 
PCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase TeamPCI PIN Basics Webinar from the Controlcase Team
PCI PIN Basics Webinar from the Controlcase Team
 
20240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 202420240605 QFM017 Machine Intelligence Reading List May 2024
20240605 QFM017 Machine Intelligence Reading List May 2024
 
UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5UiPath Test Automation using UiPath Test Suite series, part 5
UiPath Test Automation using UiPath Test Suite series, part 5
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
20240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 202420240607 QFM018 Elixir Reading List May 2024
20240607 QFM018 Elixir Reading List May 2024
 
Introduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - CybersecurityIntroduction to CHERI technology - Cybersecurity
Introduction to CHERI technology - Cybersecurity
 
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
GraphSummit Singapore | Neo4j Product Vision & Roadmap - Q2 2024
 
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024GraphSummit Singapore | The Art of the  Possible with Graph - Q2 2024
GraphSummit Singapore | The Art of the Possible with Graph - Q2 2024
 

The Rising Sun: Making the most of Solr power

  • 2. THE RISING SUN: Making the most of Solr power code4lib '09, Providence, RI Erik Hatcher erik.hatcher@lucidimagination.com Wednesday, February 25, 2009 2
  • 3. #9 Performance Measure Wednesday, February 25, 2009 3
  • 4. #8 Memory omitNorms omitTf (1.4) Wednesday, February 25, 2009 4
  • 5. #7 Query Parsing quot;It Dependsquot; More Like This: http://www.lucidimagination.com/blog/2009/02/22/ exploring-query-parsers/ Wednesday, February 25, 2009 5
  • 6. #6 Data Import DIH, Solr Cell, CSV, LuSql, APIs,... solr-ruby Mapper Wednesday, February 25, 2009 6
  • 8. #5 Request Handlers Leverage Solr's configurability Wednesday, February 25, 2009 8
  • 9. http://localhost:8983/solr/ timeline.xml Wednesday, February 25, 2009 9
  • 10. #4 Solr as IR toolkit TermVectorRequestHandler StatsComponent Luke Wednesday, February 25, 2009 10
  • 11. #3 LocalSolr geo searching Wednesday, February 25, 2009 11
  • 12. #2 Faceting Solr 1.4 performance! (*bows* to Yonik) Multi-select Wednesday, February 25, 2009 12
  • 13. Multi-select faceting http://www.lucidimagination.com/search/p:lucene,solr/s:email,wiki/l:user?q=marc Wednesday, February 25, 2009 13
  • 14. #1 User Interface The Interface is the App Wednesday, February 25, 2009 14
  • 16. Solritas Velocity templated response Wednesday, February 25, 2009 16
  • 17. #0 Community solr-user@lucene.apache.org Wednesday, February 25, 2009 17
  • 18. quot;LucidFindquot; http://www.lucidimagination.com/search find@lucidimagination.com Wednesday, February 25, 2009 18
  • 19. Lucid Supports Solr lucidimagination.com Wednesday, February 25, 2009 19
  • 20. Lucid Articles Grant Ingersoll Erik Hatcher Getting Started with Lucene Getting Started with Solr (includes screencast), co- authored with Jonathan Debugging Relevance Issues in Knudsen Search Sami Siren Optimizing Findability in Lucene and Solr Content Extraction with Tika Yonik Seeley Mark Miller Faceted Search with Solr Scaling Lucene and Solr Wednesday, February 25, 2009 20
  • 21. Lucid Podcasts Interviews with: Doug Cutting (creator of Lucene) Ryan McKinley (Solr committer) Chris Hostetter (Solr committer) More coming! Wednesday, February 25, 2009 21
  • 22. Lucid Blogging Mark Miller: quot;Exploring Query Parsersquot;, quot;Highlighting Highlighter Thoughtsquot;, quot;Investigating OOM and other JVM issuesquot;, quot;Looking forward to new features in Solr 1.4quot; Erik Hatcher: quot;acts_as_solr with rich document indexingquot; Grant Ingersoll: quot;Sorting, Faceting, and Schema Design in Solrquot; Wednesday, February 25, 2009 22
  • 23. Pre-release ebook available now! Wednesday, February 25, 2009 23
  • 24. The Good Ol' Blues Brothers Boys Wednesday, February 25, 2009 24