Click to edit document name         IMPACT     Interoperability       Framework       Clemens Neudecker,National Library o...
BackgroundIMPACT from a technical perspective:•   > 20 software components for solving specific challenges•   Prototyping ...
ArchitectureIMPACT Interoperability Framework: Technologies• Java• Tomcat• Axis2• Synapse• TavernaAll components open sour...
Generic Web Service Wrapper• Source code available: https://github.com/impactcentre/toolwrapper• Facilitates easy integrat...
Integration stepsCommand line application  Web service  Workflow module
Workflows            • OCR workflow =               data pipeline            • Building blocks =               processing ...
Workflows            • OCR workflow =               data pipeline            • Building blocks =               processing ...
Workflow Management• Web 2.0 style registry: myExperiment• Local client: Taverna Workbench• Remote client: Project website
Local client: Taverna Workbench                                                    Background:                            ...
Remote client: Taverna Server / Workflow Parser• Remote execution of workflows via REST/SOAP API• Client application for w...
Repository integration• Custom WebDAV service for IMPACT:    –   Configurable storage of result sets & provenance    –   F...
Community• Web2.0 style workflow registry• Discover, rate, tag, review• Community of experts• Sharing of resources• Knowle...
Scalability• Enterprise Service Bus   receives requests from   users and distributes   the load to the available   worker ...
Evaluation• Text based comparison of result with ground truth,    using Levenshtein distance method•   Layout based compar...
Outlook• Extending the scope:    – Workflows for linguistic analysis: CLARIN    – Workflows for digital preservation: SCAP...
IMPACT Interoperability Framework - Clemens Neudecker
Upcoming SlideShare
Loading in …5
×

IMPACT Interoperability Framework - Clemens Neudecker

1,071
-1

Published on

0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total Views
1,071
On Slideshare
0
From Embeds
0
Number of Embeds
3
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

IMPACT Interoperability Framework - Clemens Neudecker

  1. 1. Click to edit document name IMPACT Interoperability Framework Clemens Neudecker,National Library of the Netherlands
  2. 2. BackgroundIMPACT from a technical perspective:• > 20 software components for solving specific challenges• Prototyping new algorithms, improving commercial solutions• Different frameworks (C, C++, Java, etc.), platforms (Win/Linux)• Extensible with third party applications  IMPACT Interoperability Framework (IIF)
  3. 3. ArchitectureIMPACT Interoperability Framework: Technologies• Java• Tomcat• Axis2• Synapse• TavernaAll components open source (Apache License 2.0, LGPL 2.1)
  4. 4. Generic Web Service Wrapper• Source code available: https://github.com/impactcentre/toolwrapper• Facilitates easy integration: developers can focus on their application and have to worry less about integration = higher quality software components
  5. 5. Integration stepsCommand line application  Web service  Workflow module
  6. 6. Workflows • OCR workflow = data pipeline • Building blocks = processing modules • Integration = interaction between nodes (mashups) • Collaboration with
  7. 7. Workflows • OCR workflow = data pipeline • Building blocks = processing modules • Integration = interaction between nodes (mashups) • Collaboration with
  8. 8. Workflow Management• Web 2.0 style registry: myExperiment• Local client: Taverna Workbench• Remote client: Project website
  9. 9. Local client: Taverna Workbench Background: • Life Sciences • Developed and maintained by myGrid, UK • Active community Windows/Linux/OSX & source code available: http://www.taverna.org.uk/
  10. 10. Remote client: Taverna Server / Workflow Parser• Remote execution of workflows via REST/SOAP API• Client application for website integration• Source code available: https://github.com/impactcentre/interfaces/taverna
  11. 11. Repository integration• Custom WebDAV service for IMPACT: – Configurable storage of result sets & provenance – Fully interoperable, since HTTP-based – Report API – Source code available• Integration with PRIMA image & ground truth repository• Connectors for Fedora digital object repository
  12. 12. Community• Web2.0 style workflow registry• Discover, rate, tag, review• Community of experts• Sharing of resources• Knowledge exchange Central meeting point for users & researchers, tools & data
  13. 13. Scalability• Enterprise Service Bus receives requests from users and distributes the load to the available worker nodes• Main effects: Process parallelization, Load distribution, Fail over, Monitoring• Tested on Dutch Supercomputing Cloud HPC
  14. 14. Evaluation• Text based comparison of result with ground truth, using Levenshtein distance method• Layout based comparison of result with ground truth, using the Page Analysis And Ground Truth Elements Framework• Example:
  15. 15. Outlook• Extending the scope: – Workflows for linguistic analysis: CLARIN – Workflows for digital preservation: SCAPE• Even better scalability: MapReduce/Hadoop• Supported by a community of developers & practitioners in the Centre of Competence Interested? Get in touch! http://www.digitisation.eu/contact-us/

×