Smart Content Migration using Apache ManifoldCF Output Connectors

Today, enterprise digital content is scattered across several independent systems and subsystems that provide services such as user authentication, document storage, and search. Centralising enterprise data into a single repository is a growing necessity for organisations.

Piergiorgio Lucidi (TAI Solutions) and Luis Cabaceira (Alfresco) proposed an approach that could be the "silver bullet" and that opens a clear path towards centralising enterprise digital content.

Apache ManifoldCF started out as a crawler that lets you manage content indexes in your search engines; this was the main goal of the product.

We realised that ManifoldCF can also be used to migrate the content itself, and not only its indexes, which makes it a very good migration tool.

This talk focuses on two new output connectors for Apache ManifoldCF that we are developing.

This session was presented at the Alfresco DevCon 2018 in Lisbon.

Please see also the retrospective article available here:
https://www.open4dev.com/journal/2018/1/22/alfresco-devcon-2018-wrapping-up

Published in: Technology

  1. Apache ManifoldCF Output Connectors (Luis Cabaceira, Piergiorgio Lucidi)
  2. Learn. Connect. Collaborate. Agenda: Apache ManifoldCF Overview; CMIS Output Connector; Alfresco BFSI Output Connector
  3. Luis Cabaceira: Solutions Architect @ Alfresco; Alfresco Certified Engineer; Alfresco Certified Administrator; Java Certified Architect; Committer @ ASF (Apache ManifoldCF)
  4. Piergiorgio Lucidi: Chief Technology Evangelist / EIM Specialist @ TAI Solutions; Alfresco Certified Instructor / Engineer / Administrator; Alfresco Forum Moderator / Community Star (OpenPJ); Mentor / PMC Member / Committer @ ASF (Apache ManifoldCF, Apache Community Development); Member of Technical Advisory Group @ Microsoft; Reviewer of toolkits and white papers @ AIIM; Author and Technical Reviewer @ Packt Publishing; Project Leader / Committer @ JBoss Community
  5. Apache ManifoldCF - Overview
  6. What is Apache ManifoldCF? (diagram: Repository 1 to 4, Search Server 1 to 4)
  7. Apache ManifoldCF - Concepts 1/2. Crawling: track injected contents for incremental executions. Repository Connection: fetch from content repositories. Output Connection: inject contents to search servers and repositories. Authority Connection: access tokens.
  8. Apache ManifoldCF - Concepts 2/2. Transformation Connection: manipulate fetched contents before the injection. Notification Connection: send notification messages for ending and error events. Status Reports: inspect the current queues for the executing jobs. History Reports: search and discover transactions for executed jobs.
  9. Apache ManifoldCF - What you can do: schedule your jobs using the UI; configure repositories for getting contents; configure search servers for indexing; configure your transformations; look at the history for each job and connection; configure your target content repositories to migrate contents.
  10. Big swerve: Content Migration. Output Connectors can be used for migrating contents to target repositories. What we are doing / what we need: changes to the core framework; brand new connector implementations; unit and integration tests!!! Scheduled for the next ManifoldCF 2.10 / 3.x.
  11. Content Migration: Fetching, Injecting
  12. CMIS Output Connector - DEMO
  13. Content Migration - CMIS Output Connector
  14. Content Migration - Source repo - Generic CMIS
  15. Content Migration - Source repo - Images
  16. Content Migration - Source repo - Icons
  17. Content Migration - Source repo - Manuals
  18. Content Migration - Source repo - Screenshots
  19. Content Migration - Source repo - Soundtracks
  20. Content Migration - Source repo - Videos
  21. Content Migration - Target repo
  22. Content Migration - Available repository connectors
  23. Content Migration - Repository connection - Fetching
  24. Content Migration - CMIS output connection - Injecting
  25. Content Migration - Jobs - Pipeline
  26. Content Migration - Jobs - CMIS Query - Fetching
  27. Content Migration - Jobs - Crawling
  28. Content Migration - Jobs - Crawling
  29. Content Migration - Jobs - Crawling
  30. Content Migration - Jobs - Crawling
  31. Content Migration - Jobs - Crawling
  32. Content Migration - Jobs - Crawling
  33. Alfresco BFSI Output Connector - DEMO
  34. Content Migration - Alfresco BFSI Output Connector
  35. Content Migration - File System to BFSI
  36. Content Migration - File System to BFSI - Paths
  37. Content Migration - File System to BFSI - Generated
  38. Content Migration - File System to BFSI - Metadata
  39. Content Migration - File System to BFSI - Inplace
  40. Content Migration - File System to BFSI - Migrated
  41. Content Migration - Alfresco WebScripts repository connector
  42. Content Migration - Alfresco WebScripts to BFSI - Job
  43. Content Migration - Alfresco WebScripts to BFSI - Crawling
  44. Content Migration - Alfresco WebScripts to BFSI - Crawling Share sites
  45. Content Migration - Alfresco WebScripts to BFSI - New contents
  46. Content Migration - Alfresco WebScripts to BFSI - Incremental crawling
  47. Content Migration - CMIS to BFSI - Job
  48. Content Migration - CMIS to BFSI - Incremental crawling
  49. Content Migration - Roadmap. Repository connectors: new connectors (Azure Storage); content migration extension support on existing ones. Output Connectors: new connectors (Azure Storage and Amazon S3). Metadata mapping. Bugfixing. Testing. If you want to help us, your involvement is welcome :)
  50. Thank you! Luis Cabaceira (luis.cabaceira@alfresco.com, Twitter: @LuisCabaceira); Piergiorgio Lucidi (p.lucidi@tai.it, Twitter: @pjlucidi). Repository assets in our demos provided by ProgettoSNAPS, the freely available MAME archive: http://www.progettosnaps.net/
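
The slides describe a job as fetching from a repository connection, injecting through an output connection, and tracking what was already injected so that later runs are incremental. The idea can be illustrated with a minimal Python sketch. This is not ManifoldCF code and does not use its API: `Document`, `InMemoryRepository`, `InMemoryTarget` and `MigrationJob` are hypothetical names invented for this illustration, and comparing version strings is only one assumed way of detecting changes.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass(frozen=True)
class Document:
    doc_id: str
    version: str  # any change marker, e.g. a last-modified timestamp
    content: bytes


class InMemoryRepository:
    """Hypothetical stand-in for a repository connection (fetching side)."""

    def __init__(self, docs: List[Document]) -> None:
        self._docs: Dict[str, Document] = {d.doc_id: d for d in docs}

    def put(self, doc: Document) -> None:
        self._docs[doc.doc_id] = doc

    def fetch(self) -> List[Document]:
        return list(self._docs.values())


class InMemoryTarget:
    """Hypothetical stand-in for an output connection (injecting side)."""

    def __init__(self) -> None:
        self.stored: Dict[str, bytes] = {}

    def inject(self, doc: Document) -> None:
        self.stored[doc.doc_id] = doc.content


@dataclass
class MigrationJob:
    source: InMemoryRepository
    target: InMemoryTarget
    # Version last injected per document; this is what makes reruns incremental.
    seen_versions: Dict[str, str] = field(default_factory=dict)

    def run(self) -> List[str]:
        """Inject new or changed documents and return their ids."""
        injected: List[str] = []
        for doc in self.source.fetch():
            if self.seen_versions.get(doc.doc_id) == doc.version:
                continue  # unchanged since the last run, skip it
            self.target.inject(doc)
            self.seen_versions[doc.doc_id] = doc.version
            injected.append(doc.doc_id)
        return injected


source = InMemoryRepository([
    Document("a", "v1", b"alpha"),
    Document("b", "v1", b"beta"),
])
target = InMemoryTarget()
job = MigrationJob(source, target)

first = job.run()    # ['a', 'b']: everything is new on the first run
second = job.run()   # []: nothing changed, nothing is injected again
source.put(Document("a", "v2", b"alpha2"))
third = job.run()    # ['a']: only the modified document is migrated
```

In this sketch the job, not the connectors, owns the record of injected versions, so the same source and target classes could be reused by different jobs without sharing state.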
