SlideShare a Scribd company logo
1 of 38
Download to read offline
Apache Solr for TYPO3
                        Ingo Renner   TYPO3 Core Developer,
                                      Release Manager TYPO3 4.2




Samstag, 22. Mai 2010
3. org
                                        ypo er
                                    o @t enn
                                i ng gor
                             il @in
                         ma r
                        tw itte




Samstag, 22. Mai 2010
Current Status



Samstag, 22. Mai 2010
Current Status

                        •   First Prototype Summer 2008
                        •   Development Kickoff February 2009
                        •   Public Release v1.0 T3CON09
                        •   v1.1 soon
                        •   v2.0 later this year




Samstag, 22. Mai 2010
Development Model

                        •   Initial development by dkd
                        •   Development Partnerships
                        •   Early Access, Trunk Access
                        •   Setup Support
                        •   Development Support
                        •   Development Priorities




Samstag, 22. Mai 2010
Development Partners
                                       Eventex
   d.k.d Internet Service GmbH
                           SEB Assetmanagement AG
     University of Hohenheim e-netconsulting KG

     Eichborn AG   Cross Content Media
                              Andreae-Noris Zahn AG
   Marketing Factory Consulting GmbH
                                       marit AG
    MÜPRO GmbH Deutsche Lufthansa AG
                                    internezzo AG
              Netcreators BV
                                   AOE media GmbH
Samstag, 22. Mai 2010
Indexed
                         Search

Samstag, 22. Mai 2010
Indexed Search
                         •   Indexing Frontend / Crawler
                         •   Respects access rights
                         •   Respects languages
                         •   Index in Database
                         •   Totally OK for smaller websites



                        Slo ooooooooo ooowww

Samstag, 22. Mai 2010
Apache Solr


Samstag, 22. Mai 2010
So what is Apache Solr?

                        •   Enterprise Search Server
                        •   Based on Lucene Index
                        •   Apache Software Foundation Project
                        •   Many powerful features


                        •   CNet, Netflix, ilocal.nl, Zappos.com




Samstag, 22. Mai 2010
Solr Concepts

                •       Index = Collection of Documents
                •       Document = Data stored in Fields
                •       Field Type defines processing through
                        Analizers, Tokenizers, Filters
                •       Dynamic Fields

                                                         bi li ty
                •       Copy Fields
                                            l ex i
                                           F
Samstag, 22. Mai 2010
Why Apache Solr?
                        •   Speed: Many times faster than IS
                        •   Better search results
                        •   Faceted search
                        •   Spellchecker: Did you mean ... ?
                        •   Similarity search: More like this ...
                        •
                                                                &
                            Editorial Content / paid search results

                                                             ed
                                                       pe
                        •   Synonyms, Stopwords, Protected Words
                                                    S            r
                                                               e
                        •   Boosting of specific index fields
                        •
                                                        P
                            Replication, distributed search
                                                            o w
Samstag, 22. Mai 2010
How it works

                •       REST like interface
                •       Indexing of XML Documents through
                        HTTP POST
                •       Querying through HTTP GET
                •       Results as XML, JSON, PHP
                                                       AP I
                                              E a sy
Samstag, 22. Mai 2010
Disadvantages


                        •   Needs Java



                                                             rs
                        •   We donʻt want to deal with Java
                            Solr shields us from Java once e
                        •
                                                 e lo   p  set-up

                                       D  e   v              P   H P
                                                w    i th
                                    s  ta   y
Samstag, 22. Mai 2010
Advantages

                        •   Multiple times faster than IS
                        •   NO database queries
                        •   Easy Installation / Configuration
                        •   Respects access restrictions
                        •   Respects languages

               se erful
       t y to u w
                        •   Cutomizability

     as as
    F E       P o
Samstag, 22. Mai 2010
Inner Workings

                        •   Indexing of XML Documents
                        •   Reversed Index
                        •   Access through GET and POST
                            (REST like)
                        •   Results as XML, JSON, PHP




Samstag, 22. Mai 2010
Inner Workings
       Solr Index
          Document      Field   Field   Field   Field   Field

          Document      Field   Field   Field   Field   Field

          Document      Field   Field   Field   Field   Field

          Document      Field   Field   Field   Field   Field

          Document      Field   Field   Field   Field   Field

          Document      Field   Field   Field   Field   Field




Samstag, 22. Mai 2010
Inner Workings
                                      HTTP Request Servlet                 Update Servlet


        Admin
       Interface           Standard   DisMax      Custom       XML
                           Request    Request     Request    Response   XML Update Interface
                           Handler    Handler     Handler     Writer




           Config                                             Caching
                                   Schema
                                                                            Update
                                                Solr Core
                                                                            Handler
                        Analysis                      Concurrency

                                                                                               Replication

                                                 Lucene




Samstag, 22. Mai 2010
EXT:solr
                        Apache Solr for TYPO3

                                    +

Samstag, 22. Mai 2010
Content Elevation
Multi Language
                        Features!     Backend Search
                    Hit Highlighting Scheduler
    FE Indexing                           File Indexing
             Spellchecker / Did you mean ...
  TYPO3 4.3
             Search Reports           Score Analyzer
                            Auto Suggest
 Facetted Search
                        Backend ModuleView Helper
   Statistics Search Box
                              Index Queue
 Hierarchical Facets Logging            Access Rights
 Install Script      More Like This Hooks, Interfaces
           Simple Form
 Last Searches        Filter BoostingTemplate Engine
                      Extbase / Fluid
           Sorting
                              Common Searches
                Page Browser              TYPO3 4.2
Samstag, 22. Mai 2010
1.0            Features!               2.0
  TYPO3 4.3                             Index Queue
  FE Indexing                                Statistics
  Search Search Box               Common Searches
  Facetted Search                       Auto Suggest
  Install Script Simple Form          Score Analyzer
  Spellchecker / Did you mean ...   Backend Module
  Sortierung Access Rights             Last Searches
  Boosting Hit Highlighting       Hierarchical Facets
  Page Browser                        Extbase / Fluid
  Multi Language Logging                 File Indexing
  Filter Hooks, Interfaces           Backend Search
  Reports Template Engine             More Like This
  Scheduler View Helper            Content Elevation

Samstag, 22. Mai 2010
Current Status

                  •     „Acts like Indexed Search“
                  •     Indexing through Frontend / Crawler
                  •     Search
                  •     Search Word Highlighting
                  •     Sorting
                  •     Last and Common Searches



Samstag, 22. Mai 2010
Current Status

                  •     Spellchecker: Did you mean ... ?
                  •     Similarity Search: More like this ...
                  •     Faceted Search, Hierarchical Facets
                  •     Suggest / Autocompletion
                  •     Index Queue
                  •     File Indexing



Samstag, 22. Mai 2010
Outlook

                        •   Backend Module
                        •   Related Searches
                        •   Editorial / Paid Search Results
                        •   Editing of Stopwords, Synonyms
                        •   Statistics
                        •   Transition to Extbase / Fluid




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Showcases




Samstag, 22. Mai 2010
Making the
                        sun shine on
                        your search


Samstag, 22. Mai 2010
Requirements, Setup

                •       Requires any J2EE container:
                        Tomcat, Jetty, Resin, ...


                •       Run setup scripts provided with EXT:solr
                •       Copy provided configuration files to Solr
                •       Install EXT:solr, TypoScript
                •       config.index_enable = 1


Samstag, 22. Mai 2010
Customization


                •       Indexing of additional Data through
                        hooks, interfaces, TS configuration
                •       Individual index schema
                •       En/Disable features through TS
                •       Individual, flexible rendering of results




Samstag, 22. Mai 2010
Thank you for listening.



Samstag, 22. Mai 2010
3. org
                                        ypo er
                                    o @t enn
                                i ng gor
                             il @in
                         ma r
                        tw itte




Samstag, 22. Mai 2010

More Related Content

Similar to Apache Solr for TYPO3 (@ T3CON10 Dallas, TX)

BRAINREPUBLIC - Powered by no-SQL
BRAINREPUBLIC - Powered by no-SQLBRAINREPUBLIC - Powered by no-SQL
BRAINREPUBLIC - Powered by no-SQLAndreas Jung
 
Sean coates fifty things and tricks, confoo 2011
Sean coates fifty things and tricks, confoo 2011Sean coates fifty things and tricks, confoo 2011
Sean coates fifty things and tricks, confoo 2011Bachkoutou Toutou
 
Incorporating site level knowledge to extract structured data from web forums...
Incorporating site level knowledge to extract structured data from web forums...Incorporating site level knowledge to extract structured data from web forums...
Incorporating site level knowledge to extract structured data from web forums...George Ang
 
Better Tools, Better Mindset
Better Tools, Better MindsetBetter Tools, Better Mindset
Better Tools, Better MindsetYusup
 
Web micro-framework BATTLE!
Web micro-framework BATTLE!Web micro-framework BATTLE!
Web micro-framework BATTLE!Richard Jones
 
Building OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsBuilding OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsMelanie Courtot
 
JBoss Analyst tour Sept 2003
JBoss Analyst tour Sept 2003JBoss Analyst tour Sept 2003
JBoss Analyst tour Sept 2003runsignup
 
Nuxeo World Session: Mobile ECM Apps with Nuxeo EP
Nuxeo World Session: Mobile ECM Apps with Nuxeo EPNuxeo World Session: Mobile ECM Apps with Nuxeo EP
Nuxeo World Session: Mobile ECM Apps with Nuxeo EPNuxeo
 
2010 10-building-global-listening-platform-with-solr
2010 10-building-global-listening-platform-with-solr2010 10-building-global-listening-platform-with-solr
2010 10-building-global-listening-platform-with-solrLucidworks (Archived)
 
Scaling with swagger
Scaling with swaggerScaling with swagger
Scaling with swaggerTony Tam
 
GGUG:Practical DSL Design
GGUG:Practical DSL DesignGGUG:Practical DSL Design
GGUG:Practical DSL DesignSkills Matter
 
The Reluctant SysAdmin : 360|iDev Austin 2010
The Reluctant SysAdmin : 360|iDev Austin 2010The Reluctant SysAdmin : 360|iDev Austin 2010
The Reluctant SysAdmin : 360|iDev Austin 2010Voxilate
 
Inside Wordnik's Architecture
Inside Wordnik's ArchitectureInside Wordnik's Architecture
Inside Wordnik's ArchitectureTony Tam
 
PyData Texas 2015 Keynote
PyData Texas 2015 KeynotePyData Texas 2015 Keynote
PyData Texas 2015 KeynotePeter Wang
 
PyCon 2011 Scaling Disqus
PyCon 2011 Scaling DisqusPyCon 2011 Scaling Disqus
PyCon 2011 Scaling Disquszeeg
 
Robotics, Search and AI with Solr, MyRobotLab, and Deeplearning4j
Robotics, Search and AI with Solr, MyRobotLab, and Deeplearning4jRobotics, Search and AI with Solr, MyRobotLab, and Deeplearning4j
Robotics, Search and AI with Solr, MyRobotLab, and Deeplearning4jKevin Watters
 
The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...
The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...
The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...Lucidworks
 

Similar to Apache Solr for TYPO3 (@ T3CON10 Dallas, TX) (20)

BRAINREPUBLIC - Powered by no-SQL
BRAINREPUBLIC - Powered by no-SQLBRAINREPUBLIC - Powered by no-SQL
BRAINREPUBLIC - Powered by no-SQL
 
Sean coates fifty things and tricks, confoo 2011
Sean coates fifty things and tricks, confoo 2011Sean coates fifty things and tricks, confoo 2011
Sean coates fifty things and tricks, confoo 2011
 
Incorporating site level knowledge to extract structured data from web forums...
Incorporating site level knowledge to extract structured data from web forums...Incorporating site level knowledge to extract structured data from web forums...
Incorporating site level knowledge to extract structured data from web forums...
 
Better Tools, Better Mindset
Better Tools, Better MindsetBetter Tools, Better Mindset
Better Tools, Better Mindset
 
Web micro-framework BATTLE!
Web micro-framework BATTLE!Web micro-framework BATTLE!
Web micro-framework BATTLE!
 
Building OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web toolsBuilding OBO Foundry ontology using semantic web tools
Building OBO Foundry ontology using semantic web tools
 
JBoss Analyst tour Sept 2003
JBoss Analyst tour Sept 2003JBoss Analyst tour Sept 2003
JBoss Analyst tour Sept 2003
 
App Engine Meetup
App Engine MeetupApp Engine Meetup
App Engine Meetup
 
Nuxeo World Session: Mobile ECM Apps with Nuxeo EP
Nuxeo World Session: Mobile ECM Apps with Nuxeo EPNuxeo World Session: Mobile ECM Apps with Nuxeo EP
Nuxeo World Session: Mobile ECM Apps with Nuxeo EP
 
2010 10-building-global-listening-platform-with-solr
2010 10-building-global-listening-platform-with-solr2010 10-building-global-listening-platform-with-solr
2010 10-building-global-listening-platform-with-solr
 
Oscon 2010
Oscon 2010Oscon 2010
Oscon 2010
 
Scaling with swagger
Scaling with swaggerScaling with swagger
Scaling with swagger
 
GGUG:Practical DSL Design
GGUG:Practical DSL DesignGGUG:Practical DSL Design
GGUG:Practical DSL Design
 
The Reluctant SysAdmin : 360|iDev Austin 2010
The Reluctant SysAdmin : 360|iDev Austin 2010The Reluctant SysAdmin : 360|iDev Austin 2010
The Reluctant SysAdmin : 360|iDev Austin 2010
 
Inside Wordnik's Architecture
Inside Wordnik's ArchitectureInside Wordnik's Architecture
Inside Wordnik's Architecture
 
PyData Texas 2015 Keynote
PyData Texas 2015 KeynotePyData Texas 2015 Keynote
PyData Texas 2015 Keynote
 
PyCon 2011 Scaling Disqus
PyCon 2011 Scaling DisqusPyCon 2011 Scaling Disqus
PyCon 2011 Scaling Disqus
 
SOLR
SOLRSOLR
SOLR
 
Robotics, Search and AI with Solr, MyRobotLab, and Deeplearning4j
Robotics, Search and AI with Solr, MyRobotLab, and Deeplearning4jRobotics, Search and AI with Solr, MyRobotLab, and Deeplearning4j
Robotics, Search and AI with Solr, MyRobotLab, and Deeplearning4j
 
The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...
The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...
The Intersection of Robotics, Search and AI with Solr, MyRobotLab, and Deep L...
 

Recently uploaded

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityIES VE
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI AgeCprime
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxLoriGlavin3
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Alkin Tezuysal
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersRaghuram Pandurangan
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxLoriGlavin3
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationKnoldus Inc.
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.Curtis Poe
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfLoriGlavin3
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .Alan Dix
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsNathaniel Shimoni
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditSkynet Technologies
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPathCommunity
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxLoriGlavin3
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demoHarshalMandlekar2
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch TuesdayIvanti
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersNicole Novielli
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024BookNet Canada
 

Recently uploaded (20)

New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 
Decarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a realityDecarbonising Buildings: Making a net-zero built environment a reality
Decarbonising Buildings: Making a net-zero built environment a reality
 
A Framework for Development in the AI Age
A Framework for Development in the AI AgeA Framework for Development in the AI Age
A Framework for Development in the AI Age
 
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptxThe Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
The Role of FIDO in a Cyber Secure Netherlands: FIDO Paris Seminar.pptx
 
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
Unleashing Real-time Insights with ClickHouse_ Navigating the Landscape in 20...
 
Generative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information DevelopersGenerative AI for Technical Writer or Information Developers
Generative AI for Technical Writer or Information Developers
 
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data PrivacyTrustArc Webinar - How to Build Consumer Trust Through Data Privacy
TrustArc Webinar - How to Build Consumer Trust Through Data Privacy
 
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptxThe Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
The Fit for Passkeys for Employee and Consumer Sign-ins: FIDO Paris Seminar.pptx
 
Data governance with Unity Catalog Presentation
Data governance with Unity Catalog PresentationData governance with Unity Catalog Presentation
Data governance with Unity Catalog Presentation
 
How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.How AI, OpenAI, and ChatGPT impact business and software.
How AI, OpenAI, and ChatGPT impact business and software.
 
Moving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdfMoving Beyond Passwords: FIDO Paris Seminar.pdf
Moving Beyond Passwords: FIDO Paris Seminar.pdf
 
From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .From Family Reminiscence to Scholarly Archive .
From Family Reminiscence to Scholarly Archive .
 
Time Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directionsTime Series Foundation Models - current state and future directions
Time Series Foundation Models - current state and future directions
 
Manual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance AuditManual 508 Accessibility Compliance Audit
Manual 508 Accessibility Compliance Audit
 
UiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to HeroUiPath Community: Communication Mining from Zero to Hero
UiPath Community: Communication Mining from Zero to Hero
 
The State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptxThe State of Passkeys with FIDO Alliance.pptx
The State of Passkeys with FIDO Alliance.pptx
 
Sample pptx for embedding into website for demo
Sample pptx for embedding into website for demoSample pptx for embedding into website for demo
Sample pptx for embedding into website for demo
 
2024 April Patch Tuesday
2024 April Patch Tuesday2024 April Patch Tuesday
2024 April Patch Tuesday
 
A Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software DevelopersA Journey Into the Emotions of Software Developers
A Journey Into the Emotions of Software Developers
 
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
Transcript: New from BookNet Canada for 2024: Loan Stars - Tech Forum 2024
 

Apache Solr for TYPO3 (@ T3CON10 Dallas, TX)

  • 1. Apache Solr for TYPO3 Ingo Renner TYPO3 Core Developer, Release Manager TYPO3 4.2 Samstag, 22. Mai 2010
  • 2. 3. org ypo er o @t enn i ng gor il @in ma r tw itte Samstag, 22. Mai 2010
  • 4. Current Status • First Prototype Summer 2008 • Development Kickoff February 2009 • Public Release v1.0 T3CON09 • v1.1 soon • v2.0 later this year Samstag, 22. Mai 2010
  • 5. Development Model • Initial development by dkd • Development Partnerships • Early Access, Trunk Access • Setup Support • Development Support • Development Priorities Samstag, 22. Mai 2010
  • 6. Development Partners Eventex d.k.d Internet Service GmbH SEB Assetmanagement AG University of Hohenheim e-netconsulting KG Eichborn AG Cross Content Media Andreae-Noris Zahn AG Marketing Factory Consulting GmbH marit AG MÜPRO GmbH Deutsche Lufthansa AG internezzo AG Netcreators BV AOE media GmbH Samstag, 22. Mai 2010
  • 7. Indexed Search Samstag, 22. Mai 2010
  • 8. Indexed Search • Indexing Frontend / Crawler • Respects access rights • Respects languages • Index in Database • Totally OK for smaller websites Slo ooooooooo ooowww Samstag, 22. Mai 2010
  • 10. So what is Apache Solr? • Enterprise Search Server • Based on Lucene Index • Apache Software Foundation Project • Many powerful features • CNet, Netflix, ilocal.nl, Zappos.com Samstag, 22. Mai 2010
  • 11. Solr Concepts • Index = Collection of Documents • Document = Data stored in Fields • Field Type defines processing through Analizers, Tokenizers, Filters • Dynamic Fields bi li ty • Copy Fields l ex i F Samstag, 22. Mai 2010
  • 12. Why Apache Solr? • Speed: Many times faster than IS • Better search results • Faceted search • Spellchecker: Did you mean ... ? • Similarity search: More like this ... • & Editorial Content / paid search results ed pe • Synonyms, Stopwords, Protected Words S r e • Boosting of specific index fields • P Replication, distributed search o w Samstag, 22. Mai 2010
  • 13. How it works • REST like interface • Indexing of XML Documents through HTTP POST • Querying through HTTP GET • Results as XML, JSON, PHP AP I E a sy Samstag, 22. Mai 2010
  • 14. Disadvantages • Needs Java rs • We donʻt want to deal with Java Solr shields us from Java once e • e lo p set-up D e v P H P w i th s ta y Samstag, 22. Mai 2010
  • 15. Advantages • Multiple times faster than IS • NO database queries • Easy Installation / Configuration • Respects access restrictions • Respects languages se erful t y to u w • Cutomizability as as F E P o Samstag, 22. Mai 2010
  • 16. Inner Workings • Indexing of XML Documents • Reversed Index • Access through GET and POST (REST like) • Results as XML, JSON, PHP Samstag, 22. Mai 2010
  • 17. Inner Workings Solr Index Document Field Field Field Field Field Document Field Field Field Field Field Document Field Field Field Field Field Document Field Field Field Field Field Document Field Field Field Field Field Document Field Field Field Field Field Samstag, 22. Mai 2010
  • 18. Inner Workings HTTP Request Servlet Update Servlet Admin Interface Standard DisMax Custom XML Request Request Request Response XML Update Interface Handler Handler Handler Writer Config Caching Schema Update Solr Core Handler Analysis Concurrency Replication Lucene Samstag, 22. Mai 2010
  • 19. EXT:solr Apache Solr for TYPO3 + Samstag, 22. Mai 2010
  • 20. Content Elevation Multi Language Features! Backend Search Hit Highlighting Scheduler FE Indexing File Indexing Spellchecker / Did you mean ... TYPO3 4.3 Search Reports Score Analyzer Auto Suggest Facetted Search Backend ModuleView Helper Statistics Search Box Index Queue Hierarchical Facets Logging Access Rights Install Script More Like This Hooks, Interfaces Simple Form Last Searches Filter BoostingTemplate Engine Extbase / Fluid Sorting Common Searches Page Browser TYPO3 4.2 Samstag, 22. Mai 2010
  • 21. 1.0 Features! 2.0 TYPO3 4.3 Index Queue FE Indexing Statistics Search Search Box Common Searches Facetted Search Auto Suggest Install Script Simple Form Score Analyzer Spellchecker / Did you mean ... Backend Module Sortierung Access Rights Last Searches Boosting Hit Highlighting Hierarchical Facets Page Browser Extbase / Fluid Multi Language Logging File Indexing Filter Hooks, Interfaces Backend Search Reports Template Engine More Like This Scheduler View Helper Content Elevation Samstag, 22. Mai 2010
  • 22. Current Status • „Acts like Indexed Search“ • Indexing through Frontend / Crawler • Search • Search Word Highlighting • Sorting • Last and Common Searches Samstag, 22. Mai 2010
  • 23. Current Status • Spellchecker: Did you mean ... ? • Similarity Search: More like this ... • Faceted Search, Hierarchical Facets • Suggest / Autocompletion • Index Queue • File Indexing Samstag, 22. Mai 2010
  • 24. Outlook • Backend Module • Related Searches • Editorial / Paid Search Results • Editing of Stopwords, Synonyms • Statistics • Transition to Extbase / Fluid Samstag, 22. Mai 2010
  • 34. Making the sun shine on your search Samstag, 22. Mai 2010
  • 35. Requirements, Setup • Requires any J2EE container: Tomcat, Jetty, Resin, ... • Run setup scripts provided with EXT:solr • Copy provided configuration files to Solr • Install EXT:solr, TypoScript • config.index_enable = 1 Samstag, 22. Mai 2010
  • 36. Customization • Indexing of additional Data through hooks, interfaces, TS configuration • Individual index schema • En/Disable features through TS • Individual, flexible rendering of results Samstag, 22. Mai 2010
  • 37. Thank you for listening. Samstag, 22. Mai 2010
  • 38. 3. org ypo er o @t enn i ng gor il @in ma r tw itte Samstag, 22. Mai 2010