Introduction to Solr

Apache Solr serves search requests at enterprises and some of the largest companies around the world. Built on top of the top-notch Apache Lucene library, Solr makes it straightforward to integrate indexing and searching into your applications. Solr provides faceted navigation, spell checking, highlighting, clustering, grouping, and other search features. Solr also scales query volume with replication and collection size with its distributed capabilities. Solr can index rich documents such as PDF, Word, HTML, and other file types.

Come learn how you can get your content into Solr and integrate it into your applications!

Published in: Technology, Education

  1. Introduction to Solr
     erik.hatcher @ lucidimagination.com
     Sunday, October 16, 2011
  2. Abstract
     Apache Solr serves search requests at enterprises and some of the largest companies around the world. Built on top of the top-notch Apache Lucene library, Solr makes it straightforward to integrate indexing and searching into your applications. Solr provides faceted navigation, spell checking, highlighting, clustering, grouping, and other search features. Solr also scales query volume with replication and collection size with its distributed capabilities. Solr can index rich documents such as PDF, Word, HTML, and other file types. Come learn how you can get your content into Solr and integrate it into your applications!
  3. About me...
  4. http://lucene.apache.org/solr/
  5. http://lucene.apache.org/
  6. Solr
     • Simple: easy to use
     • Powerful: feature rich and scales
     • Open Source
     • from the "Lucene people"
     • encapsulates Lucene best practices
  7. Fire It Up
     • cd example
     • java -jar start.jar
     • [cd example/exampledocs; java -jar post.jar *.xml]
  8. Indexing
     • /update[/csv|/json|/extract]
     • stream from local, remote, or POST data
     • tutorial:
       • cd example/exampledocs
       • java -jar post.jar *.xml
     • Tip: java -jar post.jar -help
  9. Indexing JSON
     POST to /update/json
     [
       {"id" : "1", "title" : "Doc One"},
       {"id" : "2", "title" : "Doc Two"}
     ]
  10. Indexing CSV
      curl http://localhost:8983/solr/update/csv --data-binary @data.csv -H 'Content-type:text/plain; charset=utf-8'
  11. Indexing Rich Documents
      http://localhost:8983/solr/update/extract?stream.file=/path/to/file.doc&stream.contentType=application/msword&literal.id=ds1-file.doc
  12. Other conduits
      • DataImportHandler (DIH)
      • APIs: SolrJ, RSolr, (py)solr(.py), etc.
      • It's just data over HTTP
      • "Enterprise"
      • LucidWorks: SharePoint, (split) crawling, S3, HDFS, etc., including access control
  13. Searching
      • http://localhost:8983/solr/select?q=*:*
      • Typical-looking Solr request: http://localhost:8983/solr/select +
        • ?q=ipod
        • &facet=on
        • &facet.field=cat
        • &fq=cat:electronics
        • [&rows=10&start=20]
        • [&fl=id,name,price&sort=price asc]
        • [&wt=xml|json|csv|ruby|python|php|xslt|velocity&indent=on]
        • [&debugQuery=true]
  14. /browse
  15. "Heavy Committing" http://www.apache.org/
  16. ... works search platform www.lucidimagination.com
  17. Events
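
The JSON indexing request on slide 9 can be sketched in a few lines of Python using only the standard library. This is a minimal illustration, not part of the original deck: the helper name `build_json_update_request` and the `SOLR_URL` constant are hypothetical, and the base URL assumes the bundled example server at the address shown on the slides. The request is built but not sent, so the snippet runs without a Solr instance.

```python
import json
import urllib.request

# Assumption: the example Solr server from the slides, at its default address.
SOLR_URL = "http://localhost:8983/solr"


def build_json_update_request(docs, solr_url=SOLR_URL):
    """Build (but do not send) a POST of `docs` to Solr's /update/json handler.

    Hypothetical helper for illustration; `docs` is a list of dicts.
    """
    body = json.dumps(docs).encode("utf-8")
    return urllib.request.Request(
        url=solr_url + "/update/json",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


# The two documents shown on slide 9.
docs = [
    {"id": "1", "title": "Doc One"},
    {"id": "2", "title": "Doc Two"},
]
request = build_json_update_request(docs)

# Sending it requires a running Solr instance:
#     with urllib.request.urlopen(request) as response:
#         print(response.status)
```

Indexed documents usually become searchable only after a commit; one common way to trigger one is to append `?commit=true` to the update URL.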
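
The "typical" search request on slide 13 is just a URL with query parameters, so it can be assembled programmatically. The sketch below is an illustration under assumptions: `build_select_url` and `SOLR_URL` are hypothetical names, the base URL is the example server address from the slides, and `wt=json` is one arbitrary choice among the response formats the slide lists. Nothing is sent over the network.

```python
from urllib.parse import urlencode

# Assumption: the example Solr server from the slides, at its default address.
SOLR_URL = "http://localhost:8983/solr"


def build_select_url(params, solr_url=SOLR_URL):
    """Return a /select URL for a list of (name, value) parameter pairs.

    Hypothetical helper for illustration. urlencode() percent-encodes
    characters such as ':' and ',' and turns spaces into '+'.
    """
    return solr_url + "/select?" + urlencode(params)


# The parameters from slide 13, including the optional ones in brackets.
params = [
    ("q", "ipod"),               # main query
    ("facet", "on"),             # enable faceting
    ("facet.field", "cat"),      # facet on the "cat" field
    ("fq", "cat:electronics"),   # filter query
    ("rows", 10),                # page size
    ("start", 20),               # offset of the first result
    ("fl", "id,name,price"),     # fields to return
    ("sort", "price asc"),       # sort order
    ("wt", "json"),              # response writer
    ("indent", "on"),            # pretty-print the response
]
url = build_select_url(params)
print(url)
```

A list of pairs is used instead of a dict so that a parameter could be repeated, for example several `fq` filters in one request.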
