Your SlideShare is downloading. ×
0
Building SaaS solutions with        Apache Solr   Alberto Mijares, Canoo Engineering AG   alberto.mijares@canoo.com, 26/05...
Bullet point time!                     3
What I Will Cover§  Practical applications of Apache Solr and    Apache Lucene: how to increase the time    spent by a us...
Who I am§  Alberto Mijares§  Canoo Engineering AG§  Background in web applications and standards:  •  Participated in W...
Who is Canoo§  People:   •  Dirk Koenig: Groovy founder   •  Andres Almiray: Griffon project lead and Java      Champion ...
Canoo FindIThttp://www.canoo.com/videos/FindIT.html                                          7
Stop “bullet-pointing”!                          8
The factsAxel Springer group is a market leader              Bilanz, Handelszeitung and Stocks     In Switzerland financia...
The factsAxel Springer group is a market leader              Bilanz, Handelszeitung and Stocks     In Switzerland financia...
The gapMake the online versions more profitable         Make all newspapers “market leaders”                              ...
The gapMake the online versions more profitable         Make all newspapers “market leaders”                              ...
The howWorkshop                     “Related articles”   “Cross-selling”                                          13
The howWorkshop                     “Related articles”   “Cross-selling”                                          14
The analysisUse Lucene’s “More like this”          Integrate back the suggestions               Implement a selection mech...
The analysisUse Lucene’s “More like this”          Integrate back the suggestions               Implement a selection mech...
The issues “More like this” was “experimental”     Without “semantics” not always makes senseIndexing full pages produces ...
The issues “More like this” was “experimental”     Without “semantics” not always makes senseIndexing full pages produces ...
The key          19
The key          20
The functional requirementsDiscover and index articles                       Extract only content      Simple and flexible...
The functional requirementsDiscover and index articles                       Extract only content      Simple and flexible...
The funding model                    23
The business model        SaaS                     24
The “other” requirementsLucene-based analysis pipeline                           Web oriented platform  Multi-application ...
The “other” requirementsLucene-based analysis pipeline                           Web oriented platform  Multi-application ...
The searchWraps Lucene in a nice way                It is mature and Open Source   Supports scheduling, REST API, DIH,…   ...
The searchWraps Lucene in a nice way                It is mature and Open Source   Supports scheduling, REST API, DIH…    ...
The planFrom POC to PROD in “80 days”                                29
The planFrom POC to PROD in “80 days”                                30
The resultsGoogle analytics                   31
The resultsGoogle analytics                   32
The conclusions                  33
The Q&A Thanks!           34
Sources§  Links   •    http://people.canoo.com/share   •    http://www.canoo.com   •    http://www.canoo.net   •    http:...
Contact§  Alberto Mijares   •  alberto.mijares@canoo.com   •  Twitter: @lemaiol                                  36
Architecture Platform: Apache Solr 1.4.1 Architecture:        Intern access              Extern accessSolr container      ...
Upcoming SlideShare
Loading in...5
×

Building SaaS Solutions for Online Media Using Apache Solr

1,422

Published on

In the last years, the idea of building applications that can be used remotely by mean of the Web, has coined a new concept called "Software as a Service". Such applications, have the advantage of a remote web deployment that can be instantaneously be used by potentially any consumer in internet or of the cost reduction that a Web-based deployment provides.

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
1,422
On Slideshare
0
From Embeds
0
Number of Embeds
0
Actions
Shares
0
Downloads
13
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Transcript of "Building SaaS Solutions for Online Media Using Apache Solr"

  1. 1. Building SaaS solutions with Apache Solr Alberto Mijares, Canoo Engineering AG alberto.mijares@canoo.com, 26/05/2011 Twitter: @lemaiol
  2. 2. Bullet point time! 3
  3. 3. What I Will Cover§  Practical applications of Apache Solr and Apache Lucene: how to increase the time spent by a user in an website and do website “cross-selling”.§  Use case: how Canoo helped Axel Springer Switzerland to increased the page impressions, user permanence time and traffic in their financial online newspapers.§  Key concepts: •  How to achieve this using Lucene & Solr •  How to profit from a SaaS business model 4
  4. 4. Who I am§  Alberto Mijares§  Canoo Engineering AG§  Background in web applications and standards: •  Participated in W3C Semantic Web interest group (SWEO) •  Led web standards compliance tools development in the past (Web Accessibility and Mobile Web) •  Led enterprise information retrieval projects in the recent past •  Actually coaching Google Web Toolkit projects’ development 5
  5. 5. Who is Canoo§  People: •  Dirk Koenig: Groovy founder •  Andres Almiray: Griffon project lead and Java Champion •  Hamlet D’Arcy: Groovy committer and enthusiast •  … almost 40 more top software engineers§  Products: •  WebTest: framework for web functional testing •  RIA Suite (aka ULC): Java based RIA framework •  FindIT: information retrieval and search tools •  WMTrans: language analysis tools 6
  6. 6. Canoo FindIThttp://www.canoo.com/videos/FindIT.html 7
  7. 7. Stop “bullet-pointing”! 8
  8. 8. The factsAxel Springer group is a market leader Bilanz, Handelszeitung and Stocks In Switzerland financials are important!Financial language is German Online media is the future 9
  9. 9. The factsAxel Springer group is a market leader Bilanz, Handelszeitung and Stocks In Switzerland financials are important!Financial language is German Online media is the future 10
  10. 10. The gapMake the online versions more profitable Make all newspapers “market leaders” 11
  11. 11. The gapMake the online versions more profitable Make all newspapers “market leaders” 12
  12. 12. The howWorkshop “Related articles” “Cross-selling” 13
  13. 13. The howWorkshop “Related articles” “Cross-selling” 14
  14. 14. The analysisUse Lucene’s “More like this” Integrate back the suggestions Implement a selection mechanism Find a funding model 15
  15. 15. The analysisUse Lucene’s “More like this” Integrate back the suggestions Implement a selection mechanism Find a funding model 16
  16. 16. The issues “More like this” was “experimental” Without “semantics” not always makes senseIndexing full pages produces noise Works out-of-the-box only in English 17
  17. 17. The issues “More like this” was “experimental” Without “semantics” not always makes senseIndexing full pages produces noise Works out-of-the-box only in English 18
  18. 18. The key 19
  19. 19. The key 20
  20. 20. The functional requirementsDiscover and index articles Extract only content Simple and flexible query service 21
  21. 21. The functional requirementsDiscover and index articles Extract only content Simple and flexible query service 22
  22. 22. The funding model 23
  23. 23. The business model SaaS 24
  24. 24. The “other” requirementsLucene-based analysis pipeline Web oriented platform Multi-application platform Reliable, fast and scalable Plan B? 25
  25. 25. The “other” requirementsLucene-based analysis pipeline Web oriented platform Multi-application platform Reliable, fast and scalable Plan B? 26
  26. 26. The searchWraps Lucene in a nice way It is mature and Open Source Supports scheduling, REST API, DIH,… Scalability out-of-the-boxWell documented and has professional support 27
  27. 27. The searchWraps Lucene in a nice way It is mature and Open Source Supports scheduling, REST API, DIH… Scalability out-of-the-boxWell documented and has professional support 28
  28. 28. The planFrom POC to PROD in “80 days” 29
  29. 29. The planFrom POC to PROD in “80 days” 30
  30. 30. The resultsGoogle analytics 31
  31. 31. The resultsGoogle analytics 32
  32. 32. The conclusions 33
  33. 33. The Q&A Thanks! 34
  34. 34. Sources§  Links •  http://people.canoo.com/share •  http://www.canoo.com •  http://www.canoo.net •  http://www.leo.org •  http://www.bilanz.ch •  http://www.handelszeitung.ch •  http://www.stocks.ch 35
  35. 35. Contact§  Alberto Mijares •  alberto.mijares@canoo.com •  Twitter: @lemaiol 36
  36. 36. Architecture Platform: Apache Solr 1.4.1 Architecture: Intern access Extern accessSolr container Web container Requests Springer Solr Springer WebApp Customer 2 Solr Customer 2 WebApp Customer 3 Solr Customer 3 WebApp
  1. A particular slide catching your eye?

    Clipping is a handy way to collect important slides you want to go back to later.

×