4. Current Status
• First Prototype Summer 2008
• Development Kickoff February 2009
• Public Release v1.0 T3CON09
• v1.1 soon
• v2.0 later this year
Saturday, 22 May 2010
5. Development Model
• Initial development by dkd
• Development Partnerships
• Early Access, Trunk Access
• Setup Support
• Development Support
• Development Priorities
6. Development Partners
Eventex
d.k.d Internet Service GmbH
SEB Assetmanagement AG
University of Hohenheim
e-netconsulting KG
Eichborn AG
Cross Content Media
Andreae-Noris Zahn AG
Marketing Factory Consulting GmbH
marit AG
MÜPRO GmbH
Deutsche Lufthansa AG
internezzo AG
Netcreators BV
AOE media GmbH
8. Indexed Search
• Indexing Frontend / Crawler
• Respects access rights
• Respects languages
• Index in Database
• Totally OK for smaller websites
Slow!
10. So what is Apache Solr?
• Enterprise Search Server
• Based on Lucene Index
• Apache Software Foundation Project
• Many powerful features
• CNet, Netflix, ilocal.nl, Zappos.com
11. Solr Concepts
• Index = Collection of Documents
• Document = Data stored in Fields
• Field Type defines processing through
Analyzers, Tokenizers, Filters
• Dynamic Fields
• Copy Fields
Flexibility
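The concepts above can be sketched as a toy model in Python. This is not how Solr implements them; it only illustrates how a field type chains a tokenizer and filters, how a dynamic field pattern picks a type by name, and how a copy field duplicates a value into a second field. The field names (`title_t`, `type_s`, `content_t`), the `*_t` / `*_s` patterns, and the stopword list are assumptions made up for the example.

```python
import re
from fnmatch import fnmatchcase

STOPWORDS = {"the", "a", "of"}  # illustrative stopword list (assumption)

def tokenize(text):
    """Tokenizer: split the raw field value into word tokens."""
    return re.findall(r"\w+", text)

def lowercase(tokens):
    """Filter: normalize tokens to lower case."""
    return [t.lower() for t in tokens]

def remove_stopwords(tokens):
    """Filter: drop tokens that carry little meaning."""
    return [t for t in tokens if t not in STOPWORDS]

# A field type is modeled as a tokenizer plus an ordered list of filters.
FIELD_TYPES = {
    "text": (tokenize, [lowercase, remove_stopwords]),
    "string": (lambda text: [text], []),  # kept as one untouched token
}

# Dynamic fields: a name pattern decides the type of undeclared fields.
DYNAMIC_FIELDS = [("*_t", "text"), ("*_s", "string")]

# Copy fields: the source value is also indexed under the destination.
COPY_FIELDS = [("title_t", "content_t")]

def field_type_for(name):
    for pattern, type_name in DYNAMIC_FIELDS:
        if fnmatchcase(name, pattern):
            return type_name
    raise KeyError(f"no field type matches {name!r}")

def analyze(name, value):
    tokenizer, filters = FIELD_TYPES[field_type_for(name)]
    tokens = tokenizer(value)
    for token_filter in filters:
        tokens = token_filter(tokens)
    return tokens

def index_document(doc):
    """Return the tokens that would be indexed per field."""
    indexed = {}
    for name, value in doc.items():
        indexed.setdefault(name, []).extend(analyze(name, value))
    for source, dest in COPY_FIELDS:
        if source in doc:
            indexed.setdefault(dest, []).extend(analyze(dest, doc[source]))
    return indexed
```

Calling `index_document({"title_t": "The Art of Search", "type_s": "Page"})` analyzes the title as text, keeps the type as a single string token, and copies the title's tokens into `content_t`.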
12. Why Apache Solr?
• Speed: Many times faster than Indexed Search (IS)
• Better search results
• Faceted search
• Spellchecker: Did you mean ... ?
• Similarity search: More like this ...
• Editorial Content / paid search results
• Synonyms, Stopwords, Protected Words
• Boosting of specific index fields
• Replication, distributed search
Speed & Power
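Several of these features are switched on per request through query parameters. The sketch below builds such a request URL in Python without sending it. The parameter names (`facet`, `facet.field`, `spellcheck`, `defType`, `qf`) are standard Solr parameters; the host, the field names, and the boost weights are assumptions for illustration, and spellchecking only works if the request handler on the Solr side has the spellcheck component configured.

```python
from urllib.parse import urlencode

SOLR_SELECT_URL = "http://localhost:8983/solr/select"  # assumed location

def build_search_url(keywords, facet_fields=(), spellcheck=False, boosts=None):
    """Build a Solr select URL; nothing is sent over the network."""
    params = [("q", keywords), ("wt", "json")]
    if facet_fields:
        # Faceted search: ask for value counts per listed field.
        params.append(("facet", "true"))
        params += [("facet.field", field) for field in facet_fields]
    if spellcheck:
        # "Did you mean ...?" suggestions.
        params.append(("spellcheck", "true"))
    if boosts:
        # Field boosting through the DisMax parser: "field^weight" pairs.
        params.append(("defType", "dismax"))
        params.append(("qf", " ".join(f"{f}^{w}" for f, w in boosts.items())))
    return SOLR_SELECT_URL + "?" + urlencode(params)
```

For example, `build_search_url("typo3", facet_fields=["type", "language"], spellcheck=True, boosts={"title": 2, "content": 1})` requests facets on two fields, spelling suggestions, and a title match weighted twice as high as a content match.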
13. How it works
• REST like interface
• Indexing of XML Documents through
HTTP POST
• Querying through HTTP GET
• Results as XML, JSON, PHP
Easy API
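A minimal Python sketch of this interface, using only the standard library. It assumes a Solr instance at `http://localhost:8983/solr` and made-up field names (`id`, `title`). The `<add><doc><field>` message format and the `/update` and `/select` paths are Solr's own; the helper function names are invented for the example.

```python
import json
import xml.etree.ElementTree as ET
from urllib.parse import urlencode
from urllib.request import Request, urlopen

SOLR_URL = "http://localhost:8983/solr"  # assumed location of the Solr server

def build_add_xml(documents):
    """Build the XML update message for a list of {field: value} dicts."""
    add = ET.Element("add")
    for document in documents:
        doc = ET.SubElement(add, "doc")
        for name, value in document.items():
            field = ET.SubElement(doc, "field", name=name)
            field.text = str(value)
    return ET.tostring(add, encoding="unicode")

def post_update(xml_body):
    """Indexing: send an update message via HTTP POST (needs a running Solr)."""
    request = Request(
        SOLR_URL + "/update",
        data=xml_body.encode("utf-8"),  # a request with data is sent as POST
        headers={"Content-Type": "text/xml; charset=utf-8"},
    )
    with urlopen(request) as response:
        return response.read().decode("utf-8")

def search(keywords):
    """Querying: HTTP GET with the result requested as JSON."""
    url = SOLR_URL + "/select?" + urlencode({"q": keywords, "wt": "json"})
    with urlopen(url) as response:
        return json.loads(response.read().decode("utf-8"))
```

Against a running server, `post_update(build_add_xml([{"id": "page-1", "title": "Home"}]))` followed by `post_update("<commit/>")` would make the document searchable, and `search("home")` would fetch matching documents.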
14. Disadvantages
• Needs Java
• We don't want to deal with Java
• Solr shields us from Java once set up
Developers stay with PHP
15. Advantages
• Many times faster than Indexed Search
• NO database queries
• Easy Installation / Configuration
• Respects access restrictions
• Respects languages
• Customizability
Fast, Easy to use, Powerful
16. Inner Workings
• Indexing of XML Documents
• Inverted Index
• Access through GET and POST
(REST like)
• Results as XML, JSON, PHP
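The index underneath is an inverted index: it maps each term to the documents that contain it, so a query looks up terms instead of scanning every document. The following Python sketch is a toy illustration of that idea only, with naive whitespace tokenization; it is not Lucene's actual data structure, and the sample documents are invented.

```python
from collections import defaultdict

def build_inverted_index(documents):
    """Map each lower-cased term to the set of document ids containing it."""
    index = defaultdict(set)
    for doc_id, text in documents.items():
        for term in text.lower().split():
            index[term].add(doc_id)
    return index

def search(index, query):
    """Return ids of documents that contain every term of the query."""
    terms = query.lower().split()
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result
```

With the documents `{1: "TYPO3 search with Solr", 2: "Solr is fast", 3: "Indexed Search"}`, a search for `solr` finds documents 1 and 2, while `search solr` narrows the result to document 1.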
17. Inner Workings
Solr Index
Document Field Field Field Field Field
Document Field Field Field Field Field
Document Field Field Field Field Field
Document Field Field Field Field Field
Document Field Field Field Field Field
Document Field Field Field Field Field
18. Inner Workings
Diagram: Solr architecture
• HTTP Request Servlet, Update Servlet, Admin Interface
• Request Handlers: Standard, DisMax, Custom
• XML Response Writer, XML Update Interface
• Solr Core: Config, Schema, Caching, Update Handler, Analysis, Concurrency
• Replication
• Lucene
Samstag, 22. Mai 2010
19. EXT:solr
Apache Solr for TYPO3
20. Features!
• Content Elevation
• Multi Language
• Backend Search
• Hit Highlighting
• Scheduler
• FE Indexing
• File Indexing
• Spellchecker / Did you mean ...
• TYPO3 4.3
• Search
• Reports
• Score Analyzer
• Auto Suggest
• Faceted Search
• Backend Module
• View Helper
• Statistics
• Search Box
• Index Queue
• Hierarchical Facets
• Logging
• Access Rights
• Install Script
• More Like This
• Hooks, Interfaces
• Simple Form
• Last Searches
• Filter
• Boosting
• Template Engine
• Extbase / Fluid
• Sorting
• Common Searches
• Page Browser
• TYPO3 4.2
21. 1.0 Features! 2.0
TYPO3 4.3 Index Queue
FE Indexing Statistics
Search Search Box Common Searches
Faceted Search Auto Suggest
Install Script Simple Form Score Analyzer
Spellchecker / Did you mean ... Backend Module
Sorting Access Rights Last Searches
Boosting Hit Highlighting Hierarchical Facets
Page Browser Extbase / Fluid
Multi Language Logging File Indexing
Filter Hooks, Interfaces Backend Search
Reports Template Engine More Like This
Scheduler View Helper Content Elevation
22. Current Status
• "Acts like Indexed Search"
• Indexing through Frontend / Crawler
• Search
• Search Word Highlighting
• Sorting
• Last and Common Searches
23. Current Status
• Spellchecker: Did you mean ... ?
• Similarity Search: More like this ...
• Faceted Search, Hierarchical Facets
• Suggest / Autocompletion
• Index Queue
• File Indexing
24. Outlook
• Backend Module
• Related Searches
• Editorial / Paid Search Results
• Editing of Stopwords, Synonyms
• Statistics
• Transition to Extbase / Fluid
34. Making the sun shine on your search
35. Requirements, Setup
• Requires any J2EE container:
Tomcat, Jetty, Resin, ...
• Run setup scripts provided with EXT:solr
• Copy provided configuration files to Solr
• Install EXT:solr, TypoScript
• config.index_enable = 1
36. Customization
• Indexing of additional Data through
hooks, interfaces, TS configuration
• Individual index schema
• En/Disable features through TS
• Individual, flexible rendering of results