EUDAT receives funding from the European Union's Horizon 2020 programme - DG CONNECT e-Infrastructures. Contract No. 654065 www.eudat.eu
Towards mutually beneficial industrial engagement
with the EUDAT collaborative data infrastructure
Dave Wright, Stefan Zasada
University College London
B2 SERVICE SUITE
http://www.eudat.eu/services
SME Engagement Pilots
Explore collaboration with private stakeholders, integrating EUDAT and
commercial services to pioneer new commercial and sustainability
opportunities.
Two-fold approach:
1) enabling industrial users to store data on EUDAT resources and have
access to research data
2) testing the integration of community resources with commercial services
The first activity will involve running a series of pilot projects in collaboration
with two industrial partners, focusing on real use cases.
The goal is to evaluate EUDAT services for corporate users and understand the
level of QoS and security needed to meet industry expectations and
requirements.
Implementation of the Pilots
● Clear working plan defining concrete goals
● Short interaction cycles of up to 6 months
● Test resources (storage) and manpower offered by
involved EUDAT centres
● Processing of new requirements as part of the main
EUDAT service development process
● Communication aspects:
− Collaborative area, mailing lists, regular meetings
Pilot 1: Feasibility Study
● Focus on technology and application for molecular dynamics
● Main clients: pharmaceutical industry and academia
● SME founded in Spain and UK
● Service provider for pharmaceutical industry
● Around 1000 employees
● Based in Germany and UK
Pilot 1: Feasibility Study
• Purpose: Develop research data management tools for MD data
• Terabytes of trajectory data
• Analysis and visualizations
• Target audience: small biotech companies and academia
• Tasks:
− Assess EUDAT API coverage
− Develop metadata schema for use with MD data in B2SHARE
− Develop EUDAT API to meet company needs
● Three possible use cases considered
1. Molecular simulation data curation
2. Collaboration suite for small biotech companies
3. Combined commercial/public databases
● Task: Evaluate suitability of EUDAT services
Pilot 1 Results
● Pilot ran Oct-Nov 2016
● Both industrial partners examined the use of B2SHARE
● Requirements arising from this pilot were passed to WP5 in EUDAT (Service Building). They included:
− User-controllable Access Control Lists – now added
− Need to assert hierarchy in metadata model – added in B2SHARE v2
− API accessibility was incomplete – partially fixed in B2SHARE v1 and fully fixed in v2
− QoS concerns, availability of documentation, pricing, security
Current/Ongoing Work
• Pilot 2: B2SHARE deployment investigated by Statmodatics Ltd
• UK SME working with German companies on analysis of anonymised data from medical devices
• Sebastian from Statmodatics Ltd will provide details
• The results were incorporated into EUDAT deliverable D7.5 - Pilot Activity Involving Commercial Providers and Private Users
• This work will inform industrial engagement and data use in EOSC-Hub
Challenges for Industrial Engagement
• Desire for ‘own solutions’
• Resistance to ‘open data’
• Sensitive data issues
• Legal security concerns
• Perceived barriers around certification
• This is a challenge for Amazon/Microsoft/Google too
• Need for clear pricing
• Require ‘solutions’-based documentation
