Apache Solr for TYPO3 at TYPO3 Usergroup Day Netherlands

2,164 views

Published on

Presentation of an extension to integrate Apache Solr for TYPO3. Apache Solr is an enterprise search server, TYPO3 is a mid-to large size enterprise Content Management System; combining both results in great user search experience.

Published in: Technology
0 Comments
2 Likes
Statistics
Notes
  • Be the first to comment

No Downloads
Views
Total views
2,164
On SlideShare
0
From Embeds
0
Number of Embeds
19
Actions
Shares
0
Downloads
17
Comments
0
Likes
2
Embeds 0
No embeds

No notes for slide

Apache Solr for TYPO3 at TYPO3 Usergroup Day Netherlands

  1. 1. Apache Solr & TYPO3 Ingo Renner TYPO3 Core Developer, Release Manager TYPO3 4.2
  2. 2. 3. org ypo er o @t enn i ng gor il @in ma r tw itte
  3. 3. Indexed Search
  4. 4. Indexed Search • Indexing Frontend / Crawler • Respects access rights • Respects languages • Index in Database • Totally OK for smaller websites Slo ooooooooo ooowww
  5. 5. Apache Solr
  6. 6. So what is Apache Solr? • Enterprise Search Server • Based on Lucene Index • Apache Software Foundation Project • Many powerful features • CNet, Netflix, ilocal.nl, Zappos.com
  7. 7. Solr Concepts • Index = Collection of Documents • Document = Data stored in Fields • Field Type defines processing through Analizers, Tokenizers, Filters • Dynamic Fields bi li ty • Copy Fields l ex i F
  8. 8. Why Apache Solr? • Speed: Many times faster than IS • Better search results • Faceted search • Spellchecker: Did you mean ... ? • Similarity search: More like this ... • & Editorial Content / paid search results ed pe • Synonyms, Stopwords S r e • Boosting of specific index fields • P Replication, distributed search o w
  9. 9. How it works • REST like interface • Indexing of XML Documents through HTTP POST • Querying through HTTP GET • Results as XML, JSON, PHP AP I E a sy
  10. 10. Disadvantages • Needs Java rs • We donʻt want to deal with Java Solr shields us from Java once e • e lo p set-up D e v P H P w i th s ta y
  11. 11. Advantages • Multiple times faster than IS • NO database queries • Easy installation / Configuration • Respects access rights • Respects languages • se erful t y to u w Cutomizability as as F E P o
  12. 12. EXT:solr +
  13. 13. Current Status • „Acts like Indexed Search“ • Indexing through Frontend / Crawler • Search • Search Word Highlighting • Sorting • Spellchecker: Did you mean ... ? • Similarity Search: More like this ... • Faceted Search • Suggest / Autocompletion
  14. 14. Outlook • Backend Modul • API, indexing through BE • Related Searches • Last Searches • Smart Reranking through user usage • Editorial Search Results • Editing of Stopwords, Synonyms
  15. 15. Development Model • Private financing of new features • Financing partners get Early Access and Support • Minimum stake of 5 man days • v2.0 end of Q2 next year • Development as Community Project in parrallel
  16. 16. Community Edition • Released v1.0 on TER • Project on TYPO3 Forge • Open Development • Only few differences compared to „our“ version
  17. 17. Showcases
  18. 18. Showcases
  19. 19. Showcases
  20. 20. Showcases
  21. 21. Showcases
  22. 22. Showcases
  23. 23. Making the sun shine on your search
  24. 24. Requirements, Setup • Requires any J2EE container: Tomcat, Jetty, Resin, ... • Run setup scripts provided with EXT:solr • Copy provided configuration files to Solr • config.index_enable = 1
  25. 25. Customization • Indexing of additional Data through hooks, interfaces, TS configuration • Individual index schema • En/Disable features through TS • Individual, flexible rendering of results
  26. 26. More than Solr
  27. 27. Projects around Solr • Lucene - Search Index Library • Tika - Content Extraction from Files • Nutch - Crawl External Sites
  28. 28. Thanks for listening.
  29. 29. 3. org ypo er o @t enn i ng gor il @in ma r tw itte

×