PLAT-4 Understanding the SOLR Integration


Published on

Video that accompanies this presentation at:

Join us for a guided tour of the Alfresco SOLR integration and new search sub-systems. We’ll discuss how it works, the limitations of eventual consistency, guidance for configuration and set-up. We’ll also cover the steps required to migrate, improved PATH performance, in-query ACL evaluation, cross-language support and monitoring as well as performance.

Published in: Technology
  • Be the first to comment

No Downloads
Total views
On SlideShare
From Embeds
Number of Embeds
Embeds 0
No embeds

No notes for slide

PLAT-4 Understanding the SOLR Integration

  1. 1. Understanding The SOLR Integration Andy Hind • Senior Developer • twitter @andy_hind
  2. 2. Agenda•  Why SOLR?•  What is supported?•  Eventual consistency•  Configuration and setup•  How to migrate•  Status/reporting•  Improvements
  3. 3. Why SOLR?•  Issues… o  Cluster – index per node o  Performance •  Permission evaluation •  Structural queries •  In-transaction indexing o  Scale query independently o  Cross-locale support o  Sub-system and dynamic configuration
  4. 4. What is supported?•  Spaces store•  Archive•  Query languages•  NOT o  WCM based on AVM o  Records Management o  All stores o  Multi-tenant o  In transaction (eventually consistent)
  5. 5. Eventual consistency•  SOLR is tracking Alfresco o  Following transactions – a bit like clustering o  Eventual consistency o  Transactions that may take some time to commit o  Two cores •  SpacesStore •  ArchiveStore
  6. 6. Eventual consistency•  Models•  ACLs•  Metadata•  Content•  Ownership•  Structure - PATH
  7. 7. High Level Architecture Search Requests Search Results Repository Async: Solr Index Polling Content Database Store(s) Storage Updates Models Solr Cores: ACLS Solr - Workspace Cores Properties & Content - Archive
  8. 8. Setup•  SOLR is a web app o  zip•  Communicates over SSL o  Generate and configure your certificates …•  Per core configuration in SOLR o  Data location•  Installer default
  9. 9. Configuration•  Search sub-systems o  solr, lucene o  Change configuration without restarting Alfresco•  JMX/Share admin•  Lucene o  Lots – sub-set in share•  SOLR o  Host/port/SSL•  Properties
  10. 10. How to migrate•  Carry on using lucene•  Configure SOLR•  Configure Alfresco o  Support SOLR tracking•  Monitor SOLR tracking•  Switch sub-systems when ready•  You can switch back to lucene o  It will check its state as it does now at start up
  11. 11. Stats and reporting•  JMX/Share o  Later ….•  Direct to SOLR o  https://localhost:8443/solr/admin/cores?action=SUMMARY o  https://localhost:8443/solr/admin/cores?action=REPORT•  Fix o  JMX o  https://localhost:8443/solr/admin/cores?action=FIX
  12. 12. Improvements•  PATH•  Access evaluation o  Query time•  Cross-language/locale support o  Query/Tokenisation o  Sorting•  SOLR o  Query caching o  Facets
  13. 13. Improvements …•  Cross-language o  Standard tokenisation o  Configurable o  Default – SOLR WordDelimiterFilterFactory •  BigWoof-123-A47.txt •  .txt, Big, 123A, 123a47txt, 47, A47, BigWoof123A47txt
  14. 14. Improvements …•  Cross-language o  Sort •  d:text –  en: peach péché pêche sin –  fr: peach pêche péché sin •  d:mltext –  Nearest match
  15. 15. Improvements …•  Indexing Control o  cm:indexControl o  cm:isIndexed (Boolean) •  Enable/disable All indexing (properties & content) o  cm:isContentIndexed (Boolean) •  Enable/disable Content Indexing
  16. 16. Improvements …•  Canned Queries o  How is share affected by eventual consistency? o  DB o  Not lucene/SOLR
  17. 17. Where is SOLR/Lucene used?•  Advanced Search•  Filters•  Tags (not the roll up)•  Categories (facets)•  Dashlets o  E.g. Recently Modified•  People, Groups, Sites will use DB query unless o  Start with *xyz o  Other wildcards
  18. 18. SOLR futures•  SOLR cloud•  SOLR/Lucene improvements o  Performance o  Future 3.4, 4.0, ...•  Geo
  19. 19. Demos ….
  20. 20. Questions?