2. Who am I?
• Anshum Gupta, Apache Lucene/Solr committer,
Lucidworks Employee.
• Search and related stuff for 9+ years.
• Apache Lucene since 2006 and Solr since 2010 but
consistent community involvement since 2012
• Organizations I am or have been a part of:
3. Apache Solr has a huge install base and tremendous momentum
Solr is both established & growing
250,000+
most widely used search
solution on the planet. 8M+ total downloads
monthly downloads
You use Solr everyday.
Solr has tens of thousands
of applications in production.
2500+ open Solr jobs.
Activity Summary
30 Day summary
Aug 18 - Sep 17 2014
• 128 Commits
• 18 Contributors
12 Month Summary
Sep 17, 2013 - Sep 17, 2014
• 1351 Commits
• 29 Contributors
via https://www.openhub.net/p/solr
7. New Age Search
• Everyone… startups, websites
• Special use cases
• E-commerce
• Mails and personal data
• Personal data - Across devices
• Social and Local!
• Analytics
8. Decision making!
• Short time frame
• Confidence measure:
• Getting started quick
• Configure and see the tip of the iceberg
• Issues only uncover later in the story
10. Times… they are a changin…
• Download
• cd solr
• Standalone: bin/solr start
• SolrCloud, example, interactive:
• bin/solr start -e cloud (< 2 minutes!)
11. Let’s index some data…
• Auto Generation of Unique Key
• Solr accepts a single doc
12. Managed Schema
• Solr is the schema owner
• REST APIs - Hide the implementation details
• When you know what you got
• Or when you don’t! (Schema-less mode)
• Update and Addition of Fields and FieldTypes
More reading: https://lucidworks.com/blog/schemaless-solr-part-1/
13. Configuration APIs
• Configure Solr using APIs
• solrconfig.xml… What did you say?
17. Solr Scale Toolkit
• Easily deploy SolrCloud clusters
• Live patching and rolling restarts
• Dependency on AWS soon to go away
• Chef or Puppet still are valid approaches
More reading: http://lucidworks.com/blog/introducing-the-solr-scale-toolkit/
18. Talking about the Admin UI…
• Already improved from 3.x
• Uploading documents
• Collections API is coming soon
Collection Actions
19. There’s so much more…
• Self describing handlers
• Improved SolrJ API
• More support for other languages
• HDFS: Auto addition of replicas
• Cross Data-center replication
• SOLR - Make an application, not ‘war’.
20. It’s easy.. and stable!
• Benchmarking
• Tons of users testing it
• Evolving test framework
21. Solr scalability is unmatched.
• 10TB+ Index Size
• 10 Billion+ Documents
• 100 Million+ Daily Requests
23. Where is it headed?
• Download
• See that server directory?
• Use start scripts
• Send a document, or a few…
• Things don’t really look the way they should?
• Use the schema APIs
• Add fields… not enough?
• Add field types and then add fields
• Configure Solr using REST APIs
For Production:
• Use Solr Scale Toolkit to deploy,
patch and manage!
• Configure Solr using REST APIs