Your SlideShare is downloading. ×
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Searching does not mean finding Stuff - Apache Solr for TYPO3
Upcoming SlideShare
Loading in...5
×

Thanks for flagging this SlideShare!

Oops! An error has occurred.

×
Saving this for later? Get the SlideShare app to save on your phone or tablet. Read anywhere, anytime – even offline.
Text the download link to your phone
Standard text messaging rates apply

Searching does not mean finding Stuff - Apache Solr for TYPO3

1,184

Published on

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
1,184
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
9
Comments
0
Likes
0
Embeds 0
No embeds

Report content
Flagged as inappropriate Flag as inappropriate
Flag as inappropriate

Select your reason for flagging this presentation as inappropriate.

Cancel
No notes for slide

Transcript

  • 1. http://www.dkd.de Freitag, 10. Juni 2011
  • 2. d dkdevelopment kommunikation design Freitag, 10. Juni 2011
  • 3. Welcome Olivier Dobberkau CEO dkd Internet Service GmbH Frankfurt am Main, Germany Freitag, 10. Juni 2011
  • 4. Agenda What is search? Search in TYPO3 Search expectations today Apache Solr Why and how? Watch out! Freitag, 10. Juni 2011
  • 5. Aboutme Freitag, 10. Juni 2011
  • 6. OlivierDobberkau Founder of dkd Internet Service GmbH aka „the reverend never-end“ Met TYPO3 with Version 3.2 beta 3 Member of T3A BCC 43 years old olivier.dobberkau@dkd.de Twitter: @T3RevNeverEnd Freitag, 10. Juni 2011
  • 7. WhatisSearch? Freitag, 10. Juni 2011
  • 8. DefinitionofInformationRetrieval Information retrieval (IR) is the area of study concerned with searching for documents, for information within documents, and for metadata about documents, as well as that of searching relational databases and the World Wide Web. Wikipedia: http://en.wikipedia.org/wiki/Information_retrieval Freitag, 10. Juni 2011
  • 9. FactorsinInformationRetrieval Recall Precision Fall-out Scalability Performance Freitag, 10. Juni 2011
  • 10. FactorsinInformationRetrieval Recall Precision Fall-out Scalability Performance Simplicity Flexibility Freitag, 10. Juni 2011
  • 11. Recall Percent of documents that are returned 400 documents 100 containing information 25% recall Freitag, 10. Juni 2011
  • 12. Precision Percentage of documents that are relevant 500 returned, 100 relevant 20% precision Freitag, 10. Juni 2011
  • 13. Best would be: 100% Recall with 100% Precision Freitag, 10. Juni 2011
  • 14. Index The purpose of storing an index is to optimize speed and performance in finding relevant documents for a search query. Freitag, 10. Juni 2011
  • 15. Index Index Document 5 Document 4 Document 3 Document 2 Document 1 Extbase TYPO3 San Baseball My is Francisco is cat T3CON my is a rocks Fort cool Ghetto Mason Sport Freitag, 10. Juni 2011
  • 16. PostingFile Word Document My 1,2 cat 1 is 1,2,5 cool 1 Baseball 2 Sport 2 San 3 Freitag, 10. Juni 2011
  • 17. SearchinTYPO3 Freitag, 10. Juni 2011
  • 18. IndexedSearch Indexed Search since TYPO3 Version 3.5 Frontend Indexing through the Frontend Searches in Pages and in some Filetypes Works with Languages and Accessrights Freitag, 10. Juni 2011
  • 19. IndexedSearch Index in Database Problems with large websites Slow no sorting no Templating OK for small websites Freitag, 10. Juni 2011
  • 20. Search Expectations Freitag, 10. Juni 2011
  • 21. Expectationvs.Experience Users expect „Google-Like“ interface and behaviour in search No one navigates through an online shop up to 30% of users use the search instead of going through text or navigation Search is mediocre on a lot of websites Slow and incomplete Lots of improvement possible Freitag, 10. Juni 2011
  • 22. ApacheSolr Enterprise Search Server Freitag, 10. Juni 2011
  • 23. ApacheSolr Apache Software Foundation Enterprise Search Server uses the Lucene Index Lots of great Features CNet, Netflix, Zappos.com and many more... Freitag, 10. Juni 2011
  • 24. SolrKey-Features Synonyms Stopwords Boosting / Weighting Facetting Paid Content / Elevation Freitag, 10. Juni 2011
  • 25. SolrKey-Features Synonyms Stopwords Boosting / Weighting Facetting Paid Content / Elevation Spellchecking / Did you mean? Freitag, 10. Juni 2011
  • 26. SolrKey-Features Synonyms Stopwords Boosting / Weighting Facetting Paid Content / Elevation Spellchecking / Did you mean? Speed Freitag, 10. Juni 2011
  • 27. Howdoesitwork? REST like Interface Indexing with POST Search with GET Results in XML, JSON, PHP and many more Libraries for many programming languages SolrPhpClient Freitag, 10. Juni 2011
  • 28. Whyandhow? Freitag, 10. Juni 2011
  • 29. ScratchingourItch Why? Indexed Search was too slow misses a lot of now a days requirements Freitag, 10. Juni 2011
  • 30. History Prototype im Summer 2008 Kick-off February 2009 „Acts like Indexed Search“ Early Access Program T3CON September 2009 Version 1.0 Freitag, 10. Juni 2011
  • 31. Components Indexing Search Flexible Templating Analysis and Statistics Administration Freitag, 10. Juni 2011
  • 32. Challenges Page Rendering in TYPO3 Access Rights File Indexing Easy Setup for Non Java People Integrating Solr in general Freitag, 10. Juni 2011
  • 33. Solutions Record Monitor und Indexing Queue Solr Query Parser Plugin Integration of Apache Tika Fully Automated bash Install Script SolrPhpClient Freitag, 10. Juni 2011
  • 34. Features Facetted Search File Indexing Multi-language Support Did you mean Freitag, 10. Juni 2011
  • 35. Features Search Word Highlighting Autocomplete / Suggestions Access Rights Support More to come Freitag, 10. Juni 2011
  • 36. Watchout! Freitag, 10. Juni 2011
  • 37. „I do not have any solution. I admire the problem.“ Ashleight Brillant, Cartonist and Author. Freitag, 10. Juni 2011
  • 38. CommonProblems Relanvancy Perception Trap Assumption: Search should display a certain result like an Employee Name Query: Mike Miller Results: Mill 100% Relanvancy Miller 75% Relanvancy Possible Issue: Stemming on proper Names Solution: Don‘t stemm Fields with Names Freitag, 10. Juni 2011
  • 39. CommonProblems Finding Corpses in your Corpus While Searching you find „interesting“ Results You have forgotten to hide content You have not set the „no search“ Flag You have made copies of records and forgotten them Freitag, 10. Juni 2011
  • 40. CommonProblems Data updates without using the TCE Main You wonder: Why do my new records of table XY not show up You have updated the tables with i.e phpMyAdmin You might have forgotten to add the Language id in the records Freitag, 10. Juni 2011
  • 41. CommonProblems Can‘t access the Solr Server You can not access the Solr Server on another Machine Possible Solution Freitag, 10. Juni 2011
  • 42. CommonProblems Help my Index gets deleted Syntom: Your Index is empty Possible Cause: Your Solr Server is not secured Freitag, 10. Juni 2011
  • 43. CommonProblems My news are not being indexed News that you have in a Sysfolder are not showing up in your Results The Folder in not in the rootline of the Website Configure the PID of the Sysfolder correctly Freitag, 10. Juni 2011
  • 44. Questions? Freitag, 10. Juni 2011
  • 45. d dk development kommunikation design Thankyou. Freitag, 10. Juni 2011

×