Apache Solr for TYPO3
                        Ingo Renner   TYPO3 Core Developer,
                                      Re...
3. org
                                        ypo er
                                    o @t enn
                       ...
Current Status



Samstag, 22. Mai 2010
Current Status

                        •   First Prototype Summer 2008
                        •   Development Kickoff Fe...
Development Model

                        •   Initial development by dkd
                        •   Development Partners...
Development Partners
                                       Eventex
   d.k.d Internet Service GmbH
                       ...
Indexed
                         Search

Samstag, 22. Mai 2010
Indexed Search
                         •   Indexing Frontend / Crawler
                         •   Respects access right...
Apache Solr


Samstag, 22. Mai 2010
So what is Apache Solr?

                        •   Enterprise Search Server
                        •   Based on Lucene ...
Solr Concepts

                •       Index = Collection of Documents
                •       Document = Data stored in F...
Why Apache Solr?
                        •   Speed: Many times faster than IS
                        •   Better search re...
How it works

                •       REST like interface
                •       Indexing of XML Documents through
      ...
Disadvantages


                        •   Needs Java



                                                             rs
...
Advantages

                        •   Multiple times faster than IS
                        •   NO database queries
    ...
Inner Workings

                        •   Indexing of XML Documents
                        •   Reversed Index
         ...
Inner Workings
       Solr Index
          Document      Field   Field   Field   Field   Field

          Document      Fi...
Inner Workings
                                      HTTP Request Servlet                 Update Servlet


        Admin
 ...
EXT:solr
                        Apache Solr for TYPO3

                                    +

Samstag, 22. Mai 2010
Content Elevation
Multi Language
                        Features!     Backend Search
                    Hit Highlighting...
1.0            Features!               2.0
  TYPO3 4.3                             Index Queue
  FE Indexing              ...
Current Status

                  •     „Acts like Indexed Search“
                  •     Indexing through Frontend / Cra...
Current Status

                  •     Spellchecker: Did you mean ... ?
                  •     Similarity Search: More l...
Outlook

                        •   Backend Module
                        •   Related Searches
                        •...
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Making the
                        sun shine on
                        your search


Samstag, 22. Mai 2010
Requirements, Setup

                •       Requires any J2EE container:
                        Tomcat, Jetty, Resin, .....
Customization


                •       Indexing of additional Data through
                        hooks, interfaces, TS ...
Thank you for listening.



Samstag, 22. Mai 2010
3. org
                                        ypo er
                                    o @t enn
                       ...
Upcoming SlideShare
Loading in …5
×

Apache Solr for TYPO3 (@ T3CON10 Dallas, TX)

2,284 views
2,176 views

Published on

An introduction to Apache Solr, what it is and why we use it with TYPO3. Covers Solr, the old Indexed Search, and the new Solr extension.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
2,284
On SlideShare
0
From Embeds
0
Number of Embeds
38
Actions
Shares
0
Downloads
21
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Apache Solr for TYPO3 (@ T3CON10 Dallas, TX)

  1. 1. Apache Solr for TYPO3 Ingo Renner TYPO3 Core Developer, Release Manager TYPO3 4.2 Samstag, 22. Mai 2010
  2. 2. 3. org ypo er o @t enn i ng gor il @in ma r tw itte Samstag, 22. Mai 2010
  3. 3. Current Status Samstag, 22. Mai 2010
  4. 4. Current Status • First Prototype Summer 2008 • Development Kickoff February 2009 • Public Release v1.0 T3CON09 • v1.1 soon • v2.0 later this year Samstag, 22. Mai 2010
  5. 5. Development Model • Initial development by dkd • Development Partnerships • Early Access, Trunk Access • Setup Support • Development Support • Development Priorities Samstag, 22. Mai 2010
  6. 6. Development Partners Eventex d.k.d Internet Service GmbH SEB Assetmanagement AG University of Hohenheim e-netconsulting KG Eichborn AG Cross Content Media Andreae-Noris Zahn AG Marketing Factory Consulting GmbH marit AG MÜPRO GmbH Deutsche Lufthansa AG internezzo AG Netcreators BV AOE media GmbH Samstag, 22. Mai 2010
  7. 7. Indexed Search Samstag, 22. Mai 2010
  8. 8. Indexed Search • Indexing Frontend / Crawler • Respects access rights • Respects languages • Index in Database • Totally OK for smaller websites Slo ooooooooo ooowww Samstag, 22. Mai 2010
  9. 9. Apache Solr Samstag, 22. Mai 2010
  10. 10. So what is Apache Solr? • Enterprise Search Server • Based on Lucene Index • Apache Software Foundation Project • Many powerful features • CNet, Netflix, ilocal.nl, Zappos.com Samstag, 22. Mai 2010
  11. 11. Solr Concepts • Index = Collection of Documents • Document = Data stored in Fields • Field Type defines processing through Analizers, Tokenizers, Filters • Dynamic Fields bi li ty • Copy Fields l ex i F Samstag, 22. Mai 2010
  12. 12. Why Apache Solr? • Speed: Many times faster than IS • Better search results • Faceted search • Spellchecker: Did you mean ... ? • Similarity search: More like this ... • & Editorial Content / paid search results ed pe • Synonyms, Stopwords, Protected Words S r e • Boosting of specific index fields • P Replication, distributed search o w Samstag, 22. Mai 2010
  13. 13. How it works • REST like interface • Indexing of XML Documents through HTTP POST • Querying through HTTP GET • Results as XML, JSON, PHP AP I E a sy Samstag, 22. Mai 2010
  14. 14. Disadvantages • Needs Java rs • We donʻt want to deal with Java Solr shields us from Java once e • e lo p set-up D e v P H P w i th s ta y Samstag, 22. Mai 2010
  15. 15. Advantages • Multiple times faster than IS • NO database queries • Easy Installation / Configuration • Respects access restrictions • Respects languages se erful t y to u w • Cutomizability as as F E P o Samstag, 22. Mai 2010
  16. 16. Inner Workings • Indexing of XML Documents • Reversed Index • Access through GET and POST (REST like) • Results as XML, JSON, PHP Samstag, 22. Mai 2010
  17. 17. Inner Workings Solr Index Document Field Field Field Field Field Document Field Field Field Field Field Document Field Field Field Field Field Document Field Field Field Field Field Document Field Field Field Field Field Document Field Field Field Field Field Samstag, 22. Mai 2010
  18. 18. Inner Workings HTTP Request Servlet Update Servlet Admin Interface Standard DisMax Custom XML Request Request Request Response XML Update Interface Handler Handler Handler Writer Config Caching Schema Update Solr Core Handler Analysis Concurrency Replication Lucene Samstag, 22. Mai 2010
  19. 19. EXT:solr Apache Solr for TYPO3 + Samstag, 22. Mai 2010
  20. 20. Content Elevation Multi Language Features! Backend Search Hit Highlighting Scheduler FE Indexing File Indexing Spellchecker / Did you mean ... TYPO3 4.3 Search Reports Score Analyzer Auto Suggest Facetted Search Backend ModuleView Helper Statistics Search Box Index Queue Hierarchical Facets Logging Access Rights Install Script More Like This Hooks, Interfaces Simple Form Last Searches Filter BoostingTemplate Engine Extbase / Fluid Sorting Common Searches Page Browser TYPO3 4.2 Samstag, 22. Mai 2010
  21. 21. 1.0 Features! 2.0 TYPO3 4.3 Index Queue FE Indexing Statistics Search Search Box Common Searches Facetted Search Auto Suggest Install Script Simple Form Score Analyzer Spellchecker / Did you mean ... Backend Module Sortierung Access Rights Last Searches Boosting Hit Highlighting Hierarchical Facets Page Browser Extbase / Fluid Multi Language Logging File Indexing Filter Hooks, Interfaces Backend Search Reports Template Engine More Like This Scheduler View Helper Content Elevation Samstag, 22. Mai 2010
  22. 22. Current Status • „Acts like Indexed Search“ • Indexing through Frontend / Crawler • Search • Search Word Highlighting • Sorting • Last and Common Searches Samstag, 22. Mai 2010
  23. 23. Current Status • Spellchecker: Did you mean ... ? • Similarity Search: More like this ... • Faceted Search, Hierarchical Facets • Suggest / Autocompletion • Index Queue • File Indexing Samstag, 22. Mai 2010
  24. 24. Outlook • Backend Module • Related Searches • Editorial / Paid Search Results • Editing of Stopwords, Synonyms • Statistics • Transition to Extbase / Fluid Samstag, 22. Mai 2010
  25. 25. Showcases Samstag, 22. Mai 2010
  26. 26. Showcases Samstag, 22. Mai 2010
  27. 27. Showcases Samstag, 22. Mai 2010
  28. 28. Showcases Samstag, 22. Mai 2010
  29. 29. Showcases Samstag, 22. Mai 2010
  30. 30. Showcases Samstag, 22. Mai 2010
  31. 31. Showcases Samstag, 22. Mai 2010
  32. 32. Showcases Samstag, 22. Mai 2010
  33. 33. Showcases Samstag, 22. Mai 2010
  34. 34. Making the sun shine on your search Samstag, 22. Mai 2010
  35. 35. Requirements, Setup • Requires any J2EE container: Tomcat, Jetty, Resin, ... • Run setup scripts provided with EXT:solr • Copy provided configuration files to Solr • Install EXT:solr, TypoScript • config.index_enable = 1 Samstag, 22. Mai 2010
  36. 36. Customization • Indexing of additional Data through hooks, interfaces, TS configuration • Individual index schema • En/Disable features through TS • Individual, flexible rendering of results Samstag, 22. Mai 2010
  37. 37. Thank you for listening. Samstag, 22. Mai 2010
  38. 38. 3. org ypo er o @t enn i ng gor il @in ma r tw itte Samstag, 22. Mai 2010

×