Presentation of an extension to integrate Apache Solr for TYPO3. Apache Solr is an enterprise search server, TYPO3 is a mid-to large size enterprise Content Management System; combining both results in great user search experience.
4. Indexed Search
• Indexing Frontend / Crawler
• Respects access rights
• Respects languages
• Index in Database
• Totally OK for smaller websites
Slo ooooooooo ooowww
6. So what is Apache Solr?
• Enterprise Search Server
• Based on Lucene Index
• Apache Software Foundation Project
• Many powerful features
• CNet, Netflix, ilocal.nl, Zappos.com
7. Solr Concepts
• Index = Collection of Documents
• Document = Data stored in Fields
• Field Type defines processing through
Analizers, Tokenizers, Filters
• Dynamic Fields
bi li ty
• Copy Fields
l ex i
F
8. Why Apache Solr?
• Speed: Many times faster than IS
• Better search results
• Faceted search
• Spellchecker: Did you mean ... ?
• Similarity search: More like this ...
•
&
Editorial Content / paid search results
ed
pe
• Synonyms, Stopwords
S r
e
• Boosting of specific index fields
•
P
Replication, distributed search
o w
9. How it works
• REST like interface
• Indexing of XML Documents through
HTTP POST
• Querying through HTTP GET
• Results as XML, JSON, PHP
AP I
E a sy
10. Disadvantages
• Needs Java
rs
• We donʻt want to deal with Java
Solr shields us from Java once e
•
e lo p set-up
D e v P H P
w i th
s ta y
11. Advantages
• Multiple times faster than IS
• NO database queries
• Easy installation / Configuration
• Respects access rights
• Respects languages
•
se erful
t y to u w
Cutomizability
as as
F E P o
13. Current Status
• „Acts like Indexed Search“
• Indexing through Frontend / Crawler
• Search
• Search Word Highlighting
• Sorting
• Spellchecker: Did you mean ... ?
• Similarity Search: More like this ...
• Faceted Search
• Suggest / Autocompletion
14. Outlook
• Backend Modul
• API, indexing through BE
• Related Searches
• Last Searches
• Smart Reranking through user usage
• Editorial Search Results
• Editing of Stopwords, Synonyms
15. Development Model
• Private financing of new features
• Financing partners get
Early Access and Support
• Minimum stake of 5 man days
• v2.0 end of Q2 next year
• Development as Community
Project in parrallel
16. Community Edition
• Released v1.0 on TER
• Project on TYPO3 Forge
• Open Development
• Only few differences
compared to „our“ version
24. Requirements, Setup
• Requires any J2EE container:
Tomcat, Jetty, Resin, ...
• Run setup scripts provided with EXT:solr
• Copy provided configuration files to Solr
• config.index_enable = 1
25. Customization
• Indexing of additional Data through
hooks, interfaces, TS configuration
• Individual index schema
• En/Disable features through TS
• Individual, flexible rendering of results