Apache Solr - 5.0 and beyond
Anshum Gupta
Apache Lucene/Solr PMC Member and Committer
• Anshum Gupta, Apache Lucene/Solr PMC member
and committer, Lucidworks Employee.
• Interested in search and related stuff.
• Apache Lucene since 2006 and Solr since 2010.
• Organizations I am or have been a part of:
Who am I?
• Apache Lucene is a free open source information
retrieval software library
• Originally written in Java by Doug Cutting.
• It is supported by the Apache Software Foundation
and is released under the Apache Software
License.
What is Lucene?
• Solr (pronounced "solar") is an open source
enterprise search platform
• Written in Java,
• For a while now, a part of the Apache Lucene
project.
• Search on Lucene - Replicated (SoLR)
• SolrCloud - Distributed feature set
What is Solr?
Apache Solr is the most widely-used search
solution on the planet.
Solr has tens of thousands of
applications in production.
You use everyday.
8,000,000+
Total downloads
Solr is both established
and growing.
250,000+
Monthly downloads
2,500+
Open Solr jobs and the largest
community of developers.
Apache Solr is also one of the most active open
source projects out there
Activity statistics
30 Day Summary
Mar 14 2015 — Apr 13 2015
12 Month Summary
Apr 13 2014 — Apr 13 2015
160 Commits
23 Contributors
1440 Commits
31 Contributors
Annual commits up +126 (9%)
via https://www.openhub.net/p/solr
Solr Feature Release
Frequency
• Search - Full text, Geo-spatial
• Faceting - Values, Ranges, Pivots, etc.
• Suggestor, highlighting, auto-complete
• Pluggability
• and of course, Speed and Scalability
Solr Essentials
Title TextWhat’s new in Solr 5x?
• Get started in < 5 minutes
• APIs, and more APIs
• Schema
• Config
• Collections
• Auto* - Failover, leader election, addition of replica!
• One of the best official documentation, released almost with
the code.
Ease of Use
• Thousands of collections - Apple
• Billions of Documents - Box
• High throughput and near real time -
Bloomberg
• Impressive indexing performance: 150 k docs/
sec per node
Scalability and Performance
Solr Scalability is unmatched
• Tons of tests and quality code
• Critical systems running in production
• Jepsen tests - Proven again!
• Independent benchmarking and testing
Reliability
• Analytics - Do more with your data!
• Distributed IDF
• It’s an app not a war!
Features and more!
Solr News
• Scalability
• Faster search - SOLR-6810
• Improved indexing - SOLR-6816
• Analytics - HyperLogLog - SOLR-6968
• Security - Authentication and Authorization
framework - SOLR-7230
• And tons more!
What’s coming?
The largest Lucene/Solr conference in the world
OCT 13 - 16, 2015 AUSTIN, TX
CFP is open until May 8, 2015
For more details visit:
http://lucenerevolution.org
Connect @
http://www.twitter.com/anshumgupta
http://www.linkedin.com/in/anshumgupta/
anshum@apache.org

Apache Solr 5.0 and beyond

  • 1.
    Apache Solr -5.0 and beyond Anshum Gupta Apache Lucene/Solr PMC Member and Committer
  • 2.
    • Anshum Gupta,Apache Lucene/Solr PMC member and committer, Lucidworks Employee. • Interested in search and related stuff. • Apache Lucene since 2006 and Solr since 2010. • Organizations I am or have been a part of: Who am I?
  • 3.
    • Apache Luceneis a free open source information retrieval software library • Originally written in Java by Doug Cutting. • It is supported by the Apache Software Foundation and is released under the Apache Software License. What is Lucene?
  • 4.
    • Solr (pronounced"solar") is an open source enterprise search platform • Written in Java, • For a while now, a part of the Apache Lucene project. • Search on Lucene - Replicated (SoLR) • SolrCloud - Distributed feature set What is Solr?
  • 5.
    Apache Solr isthe most widely-used search solution on the planet. Solr has tens of thousands of applications in production. You use everyday. 8,000,000+ Total downloads Solr is both established and growing. 250,000+ Monthly downloads 2,500+ Open Solr jobs and the largest community of developers.
  • 6.
    Apache Solr isalso one of the most active open source projects out there Activity statistics 30 Day Summary Mar 14 2015 — Apr 13 2015 12 Month Summary Apr 13 2014 — Apr 13 2015 160 Commits 23 Contributors 1440 Commits 31 Contributors Annual commits up +126 (9%) via https://www.openhub.net/p/solr Solr Feature Release Frequency
  • 7.
    • Search -Full text, Geo-spatial • Faceting - Values, Ranges, Pivots, etc. • Suggestor, highlighting, auto-complete • Pluggability • and of course, Speed and Scalability Solr Essentials
  • 8.
  • 9.
    • Get startedin < 5 minutes • APIs, and more APIs • Schema • Config • Collections • Auto* - Failover, leader election, addition of replica! • One of the best official documentation, released almost with the code. Ease of Use
  • 10.
    • Thousands ofcollections - Apple • Billions of Documents - Box • High throughput and near real time - Bloomberg • Impressive indexing performance: 150 k docs/ sec per node Scalability and Performance
  • 11.
  • 12.
    • Tons oftests and quality code • Critical systems running in production • Jepsen tests - Proven again! • Independent benchmarking and testing Reliability
  • 13.
    • Analytics -Do more with your data! • Distributed IDF • It’s an app not a war! Features and more!
  • 14.
  • 15.
    • Scalability • Fastersearch - SOLR-6810 • Improved indexing - SOLR-6816 • Analytics - HyperLogLog - SOLR-6968 • Security - Authentication and Authorization framework - SOLR-7230 • And tons more! What’s coming?
  • 16.
    The largest Lucene/Solrconference in the world OCT 13 - 16, 2015 AUSTIN, TX CFP is open until May 8, 2015 For more details visit: http://lucenerevolution.org
  • 17.