Anyone who has tried integrating search into an application knows how powerful Solr is, but has probably also wished it were simpler to get started with and simpler to take to production.
I will talk about the features recently added to Solr that make it easier to use, and about some of the changes we plan to add soon to make the experience even better.
2. Apache Solr has a huge install base and tremendous momentum
Solr is both established & growing
• Most widely used search solution on the planet
• 8M+ total downloads
• 250,000+ monthly downloads
• You use Solr every day.
• Solr has tens of thousands of applications in production.
• 2500+ open Solr jobs.
Activity Summary
30 Day summary
Aug 18 - Sep 17 2014
• 128 Commits
• 18 Contributors
12 Month Summary
Sep 17, 2013 - Sep 17, 2014
• 1351 Commits
• 29 Contributors
via https://www.openhub.net/p/solr
3. Search - Until recently
• Large organizations (Enterprise)
• Expensive
• Complex
• $$$$$
4. New Age Search
• Everyone… startups, websites
• Special use cases
• E-commerce
• Mails and personal data
• Personal data - Across devices
• Social and Local!
• Analytics
5. Decision making!
• Short time frame
• Confidence measure:
• Getting started quickly
• Configure and see the tip of the iceberg
• Issues only surface later in the story
7. Times… they are a changin…
• Download
• cd solr
• Standalone: bin/solr start
• SolrCloud, example, interactive:
• bin/solr start -e cloud (< 2 minutes!)
8. Let’s index some data
• Flexible JSON Indexing - Solr accepts any JSON document and can index it in whatever format you require
• More reading: https://lucidworks.com/blog/indexing-custom-json-data/
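As a rough illustration of the flexible JSON indexing described above, the sketch below builds (but does not send) a request to Solr's /update/json/docs endpoint, using the split parameter to break a nested document into child records and f parameters to map JSON paths to field names. The host, the collection name "collection1", the sample document, and the helper name build_json_index_request are all assumptions made for this example.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request


def build_json_index_request(base_url, collection, doc, split, field_mappings):
    """Build (without sending) a POST to Solr's custom JSON indexing endpoint.

    split          -- JSON path at which the document is split into Solr docs
    field_mappings -- list of "solr_field:/json/path" strings (the f parameter)
    """
    params = [("split", split)]
    params += [("f", mapping) for mapping in field_mappings]
    params.append(("commit", "true"))
    url = f"{base_url}/{collection}/update/json/docs?{urlencode(params)}"
    return Request(
        url,
        data=json.dumps(doc).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical sample document: one student with two exam records.
student = {
    "first": "John",
    "last": "Doe",
    "exams": [
        {"subject": "Maths", "marks": 90},
        {"subject": "Biology", "marks": 86},
    ],
}

request = build_json_index_request(
    "http://localhost:8983/solr",  # assumed local Solr address
    "collection1",                 # assumed collection name
    student,
    split="/exams",
    field_mappings=[
        "first:/first",
        "last:/last",
        "subject:/exams/subject",
        "marks:/exams/marks",
    ],
)
# To actually send it against a running Solr:
#   from urllib.request import urlopen; urlopen(request)
```

With this mapping each entry under "exams" would become its own Solr document, carrying the parent's first and last names alongside its own subject and marks.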
9. Managed Schema
• Solr is the schema owner
• REST APIs - Hide the implementation details
• Schema-less mode
• Update and Addition of Fields and FieldTypes
• More reading: https://lucidworks.com/blog/schemaless-solr-part-1/
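To make the "add fields through REST" idea concrete, here is a minimal sketch that builds a Schema API request adding one field. It uses the add-field command posted to /schema, which is the form found in Solr 5 and later; the API available at the time of this talk may have looked different. The host, collection name, field name, and the helper build_add_field_request are assumptions for illustration, and "text_general" is assumed to exist as a field type in the target schema.

```python
import json
from urllib.request import Request


def build_add_field_request(base_url, collection, name, field_type,
                            stored=True, indexed=True):
    """Build (without sending) a Schema API request that adds one field."""
    command = {
        "add-field": {
            "name": name,
            "type": field_type,
            "stored": stored,
            "indexed": indexed,
        }
    }
    return Request(
        f"{base_url}/{collection}/schema",
        data=json.dumps(command).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Hypothetical field on an assumed local collection.
add_title = build_add_field_request(
    "http://localhost:8983/solr", "collection1",
    name="title", field_type="text_general",
)
# To send it against a running Solr with a managed schema:
#   from urllib.request import urlopen; urlopen(add_title)
```

Because Solr owns the schema in this mode, the change is made through the API rather than by hand-editing a schema file.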
10. Configuration APIs
• Configure Solr using APIs
• solrconfig.xml… What did you say?
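A minimal sketch of what configuring Solr through an API, instead of editing solrconfig.xml, can look like. It builds a Config API request with the set-property command as found in Solr 5 and later; the exact API shape at the time of the talk may have differed. The host, collection name, chosen property value, and the helper build_set_property_request are assumptions for illustration.

```python
import json
from urllib.request import Request


def build_set_property_request(base_url, collection, prop, value):
    """Build (without sending) a Config API request that sets one property."""
    command = {"set-property": {prop: value}}
    return Request(
        f"{base_url}/{collection}/config",
        data=json.dumps(command).encode("utf-8"),
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# Example: set the hard auto-commit interval to 15 seconds (value is arbitrary).
set_autocommit = build_set_property_request(
    "http://localhost:8983/solr", "collection1",
    "updateHandler.autoCommit.maxTime", 15000,
)
# To send it against a running Solr:
#   from urllib.request import urlopen; urlopen(set_autocommit)
```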
11. Solr Scale Toolkit
• Easily deploy SolrCloud clusters
• Live patching and rolling restarts
• Dependency on AWS soon to go away
• Chef or Puppet are still valid approaches
• More reading: http://lucidworks.com/blog/introducing-the-solr-scale-toolkit/
12. Talking about the Admin UI…
• Already improved from 3.x
• Uploading documents
• Collections API is coming soon
Collection Actions
13. Recently Added Features
• Document expiration and Time To Live (TTL)
• Cursors: Efficient Deep Paging
• Export Sorted Result Sets
• SSL support in SolrCloud
• Distributed Pivot Faceting
• Suggester v2
• CollapsingQParserPlugin
• ReRankingQParserPlugin
• Collections API improvements
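Of the features listed above, cursor-based deep paging is the easiest to show in a few lines. The sketch below walks a full result set by starting with cursorMark=* and feeding each response's nextCursorMark into the next request, stopping when the cursor no longer changes. The sort must include the uniqueKey field (assumed here to be "id") as a tiebreaker. The function names, the injectable fetch parameter, and the example URL are assumptions for illustration.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def fetch_page(select_url, params):
    """Issue one /select request and return the parsed JSON response."""
    with urlopen(f"{select_url}?{urlencode(params)}") as resp:
        return json.load(resp)


def iter_all_docs(select_url, query, sort, rows=100, fetch=fetch_page):
    """Yield every matching document using Solr cursors.

    fetch is injectable so the paging logic can be exercised without a server.
    """
    cursor = "*"  # "*" asks Solr to start a new cursor
    while True:
        params = {"q": query, "sort": sort, "rows": rows,
                  "cursorMark": cursor, "wt": "json"}
        page = fetch(select_url, params)
        yield from page["response"]["docs"]
        next_cursor = page["nextCursorMark"]
        if next_cursor == cursor:  # unchanged cursor: no more results
            return
        cursor = next_cursor


# Usage against an assumed local collection:
#   for doc in iter_all_docs("http://localhost:8983/solr/collection1/select",
#                            "*:*", "id asc"):
#       print(doc["id"])
```

Unlike start/rows paging, each request here costs roughly the same no matter how deep into the result set it is, which is what makes cursors suitable for exporting large result sets.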
14. There’s so much more coming up…
• Schema Bulk API
• Distributed IDF
• Query DSL
• Cross Data-center replication
• Cluster Backup and Restore
• SOLR - Make an application, not ‘war’.
15. It’s easy.. and stable!
• Benchmarking
• Tons of users testing it
• Evolving test framework
16. Solr scalability is unmatched.
• 10TB+ Index Size
• 10 Billion+ Documents
• 100 Million+ Daily Requests
17. Where is it headed?
• Download
• See that server directory?
• Use start scripts
• Send a document, or a few…
• Things don’t really look the way they should?
• Use the schema APIs
• Add fields… not enough?
• Add field types and then add fields
• Configure Solr using REST APIs
For Production:
• Use Solr Scale Toolkit to deploy, patch and manage!
• Configure Solr using REST APIs