eROSA has received funding from the European
Union’s Horizon 2020 research and innovation
programme under grant agreement No 730988
E-infrastructure for open agri-food
sciences – vision and roadmap
Odile Hologne, INRA, Head of the department of scientific information
eROSA coordinator @Holo_08
Johannes Keizer, e-ROSA consultant
@H2020_eROSA
http://www.erosa.aginfra.eu/
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Agenda
e-ROSA in brief and context
Overview of the landscape
Vision
Conclusion
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
eROSA in brief
Coordination and support action (infrasupp 3 2016)
18 months
Started in January 2017
Consortium: INRA (FR), WUR Alterra (NL), Agroknow (GR)
« Brother » project: AgINFRA+ : prototype new
services
▪ Support small-size foresight roadmaps for research and education communities and
operators of e-infrastructure services.
▪ Identification of potential collaboration from stakeholders across different
geographic areas and scientific domains.
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Objectives
Community building: researchers in agri-food sciences
and ICT specialists; international
Improve the knowledge of the landscape: infra/e-infra,
projects, policies … relevant for an « e-infra for open
science in agriculture »
Roadmap: conception, advocacy
e-ROSA approach
Stakeholder Workshops
① ② ③
We are here !
What we have done so far
Bibliometric analysis
> Initial scoping of the e-ROSA community, updating in progress
Online map
> Cataloguing key stakeholders, initiatives and infrastructures; ongoing open call
First Stakeholder Workshop: 6-7 July 2017 in Montpellier
> Initiate comunity-building and improve knowledge of the current landscape
Second Stakeholder Workshop: 27-28 November 2017 in Wageningen
> Identify scientific and data/ICT needs and discuss the vision
Vision paper
http://www.erosa.aginfra.eu/sites/erosa_deliverables/D1.1.pdf
http://www.aginfra.eu/discover
http://www.erosa.agin
fra.eu/node/47
http://www.erosa.aginfra.eu/node/8
Horizon 2020 research and innovation programme - grant agreement No 730988
Open Harvest – 06/02 2017
European initiatives
build an internet of FAIR data and Services (for
machines)
✓ Improve & federate existing structures (standards,
interoperability, governance, financing) based on user
needs
✓ Incentives for data sharing in science & training
Quick overview of
the actual landscape
http://www.aginfra.eu/discover
http://www.aginfra.eu
AgINFRA/e-ROSA online map
Godan data ecosystem wg
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Data challenges in agricultural sciences
Massive data production in labs (sensors,
robots, models ….) but also in farms (of huge
interest for science)
✓ Big data : more variety than volume
✓ space, time and scale issues
Data silos, poorly documented
Not easy to find, nor to access
Different level of maturity of the practices
about data (management, sharing, analysis)
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
E-infra conceptuel model to understand the ecosystem
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Some of the gems
Data repositories +/- open
Data catalog : http://ring.ciard.net/
Resources catalog: semantics, metadata
http://vest.agrisemantics.org
http://agroportal.lirmm.fr/
http://agrisemantics.org/
GACS
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
How to analyze the technical ecosystem ?
https://www.rd-alliance.org/sites/default/files/recommendation-jan-2017-v8.pdf
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Implementation Status of key services
https://eoscpilot.eu/content/erosa-position-paper
Johannes Keizer, e-ROSA project
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Current landscape: Challenges
Technical challenges
Generic
Content-dependent
(agri-food)
Long-term preservation
Data processing
Authentication and access rights
Data types
……
Semantics - standards
From genes to ecosytem
Public and private stakeholders
Registry of services
….
Cultural challenges
Community engagement and incentives
Policies and regulation
Sustainability and governance
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Potential Governance and business model issues
https://eoscpilot.eu/content/erosa-position-paper
Johannes Keizer, e-ROSA project
From open science to
technological considerations
Vision
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Common Patterns in Revolutionary
Infrastructures and Data
https://www.rd-alliance.org/sites/default/files/Common_Patterns_in_Revolutionising_Infrastructures-
final.pdf
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Vision paper : A collaborative and open
system …
… In the open science based knowledge system, researchers :
✓ openly collaborate with different societal stakeholders to further improve
the functioning of the food system;
✓ deploy a systems approach including the impacts and consequences in the
whole food systems in their research, not studying effects and disciplines in
isolation;
✓ undertake fully data-driven research …. And also hypothesis-driven research;
✓ work impact-based, to place their research in the broader societal context
and show what the implications of the research are.
http://www.erosa.aginfra.eu/node/8
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
An e-infra that supports the Data flows and
open science
The agri-food sector is dealing with an increasing amount and variety of
data due to:
• The multidisciplinary, multiscale nature of agri-food science, which is adopting
a more and more systemic approach;
• The automation of data collection thanks to robots, sensors, etc., as well as
new engineering tools such as in the omics field;
• The development of new types of data sources: e.g. Internet of Things, citizen
science, text-mining
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
But F.A.I.R is not necessary open
Societal priorities imply the maximum of transparency and
access to data;
Business interests can support differing objectives
Personal privacy concerns I want to
protect my
« know
how »I want to
know where
it comes
from
I want to
understand
….
22
Services for the Research
Data Lifecycle
Processing & Analysis
Data Management, Curation &
Preservation
Access, Deposition & Sharing
Federation Services
● B2FIND (data)
● Marketplace (Services)
● Applications on Demand
● Federated HTC & Cloud Compute IaaS &
PaaS
● Processing of sensitive data
● Jupyter Notebook
● Application DB (software & VM)
● B2DROP (data)
● B2Note (data)
● B2SHARE (data)
● DataHub
● Federated AAI. monitoring,
accounting
● SLA and order Management
● Security incident response and
policies
● Technical support & Training
● B2HANDLE
● B2SAFE
● European Certified Trusted
Repository
● Thematic data analytics
● Scientific Workflow Management,
Orchestration (DIRAC, PaaS
Orchestrator)
1
2
3
4
Discover & Reuse
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Semantic : the missing component of EOSC pilot and Hub
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
What we want!
Helping all players in the food system to
access information wherever and in which
way ever it is stored and to process it in the
way they want
Considering our
scattered ecosystem
and multiple
stakeholders, resources
and culture
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
What we need
Extensive and powerful shared semantics
Performant and scalable triple stores
An API factory that produces APIs which link
to any back end and any front end
A powerful dataset/information source
discover/registration mechanism
High Level Architecture
Multiple
Frontends
- Galaxy
- R
- Portals
- D4Science
- Excel
- PDF
- CMS
Whatever
people want
to do
Multiple
Backends
• SQL
• Dataverse
• CKAN
• Text
Whatever
people want
to
do
Temporary
and
permanent
triple stores
Shared Semantics
(Agroportal - GACS- Maps of Standards – RING) V 3.0
APIs to link
to all kind of
backends
APIs to link
to all kind of
front ends
Next steps
Conclusion
Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Conclusion and next steps
Work in progress :
✓ Technical architecture
vision
✓ Roadmap
Input needed from you
Implementation
opportunities
✓ EU and other
CONSORTIUM
WWW.EROSA.AGINFRA.EU
Thank you for your attention!
odile.hologne@inra.fr
@Holo_08
@H2020_eROSA

E-infrastructure for open agri-food sciences: Vision & Roadmap

  • 1.
    eROSA has receivedfunding from the European Union’s Horizon 2020 research and innovation programme under grant agreement No 730988 E-infrastructure for open agri-food sciences – vision and roadmap Odile Hologne, INRA, Head of the department of scientific information eROSA coordinator @Holo_08 Johannes Keizer, e-ROSA consultant @H2020_eROSA http://www.erosa.aginfra.eu/
  • 2.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Agenda e-ROSA in brief and context Overview of the landscape Vision Conclusion
  • 3.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 eROSA in brief Coordination and support action (infrasupp 3 2016) 18 months Started in January 2017 Consortium: INRA (FR), WUR Alterra (NL), Agroknow (GR) « Brother » project: AgINFRA+ : prototype new services ▪ Support small-size foresight roadmaps for research and education communities and operators of e-infrastructure services. ▪ Identification of potential collaboration from stakeholders across different geographic areas and scientific domains.
  • 4.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Objectives Community building: researchers in agri-food sciences and ICT specialists; international Improve the knowledge of the landscape: infra/e-infra, projects, policies … relevant for an « e-infra for open science in agriculture » Roadmap: conception, advocacy
  • 5.
  • 6.
    What we havedone so far Bibliometric analysis > Initial scoping of the e-ROSA community, updating in progress Online map > Cataloguing key stakeholders, initiatives and infrastructures; ongoing open call First Stakeholder Workshop: 6-7 July 2017 in Montpellier > Initiate comunity-building and improve knowledge of the current landscape Second Stakeholder Workshop: 27-28 November 2017 in Wageningen > Identify scientific and data/ICT needs and discuss the vision Vision paper http://www.erosa.aginfra.eu/sites/erosa_deliverables/D1.1.pdf http://www.aginfra.eu/discover http://www.erosa.agin fra.eu/node/47 http://www.erosa.aginfra.eu/node/8
  • 7.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 Open Harvest – 06/02 2017 European initiatives build an internet of FAIR data and Services (for machines) ✓ Improve & federate existing structures (standards, interoperability, governance, financing) based on user needs ✓ Incentives for data sharing in science & training
  • 8.
    Quick overview of theactual landscape
  • 9.
  • 10.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Data challenges in agricultural sciences Massive data production in labs (sensors, robots, models ….) but also in farms (of huge interest for science) ✓ Big data : more variety than volume ✓ space, time and scale issues Data silos, poorly documented Not easy to find, nor to access Different level of maturity of the practices about data (management, sharing, analysis)
  • 11.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 E-infra conceptuel model to understand the ecosystem
  • 12.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Some of the gems Data repositories +/- open Data catalog : http://ring.ciard.net/ Resources catalog: semantics, metadata http://vest.agrisemantics.org http://agroportal.lirmm.fr/ http://agrisemantics.org/ GACS
  • 13.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 How to analyze the technical ecosystem ? https://www.rd-alliance.org/sites/default/files/recommendation-jan-2017-v8.pdf
  • 14.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Implementation Status of key services https://eoscpilot.eu/content/erosa-position-paper Johannes Keizer, e-ROSA project
  • 15.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Current landscape: Challenges Technical challenges Generic Content-dependent (agri-food) Long-term preservation Data processing Authentication and access rights Data types …… Semantics - standards From genes to ecosytem Public and private stakeholders Registry of services …. Cultural challenges Community engagement and incentives Policies and regulation Sustainability and governance
  • 16.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Potential Governance and business model issues https://eoscpilot.eu/content/erosa-position-paper Johannes Keizer, e-ROSA project
  • 17.
    From open scienceto technological considerations Vision
  • 18.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Common Patterns in Revolutionary Infrastructures and Data https://www.rd-alliance.org/sites/default/files/Common_Patterns_in_Revolutionising_Infrastructures- final.pdf
  • 19.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Vision paper : A collaborative and open system … … In the open science based knowledge system, researchers : ✓ openly collaborate with different societal stakeholders to further improve the functioning of the food system; ✓ deploy a systems approach including the impacts and consequences in the whole food systems in their research, not studying effects and disciplines in isolation; ✓ undertake fully data-driven research …. And also hypothesis-driven research; ✓ work impact-based, to place their research in the broader societal context and show what the implications of the research are. http://www.erosa.aginfra.eu/node/8
  • 20.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 An e-infra that supports the Data flows and open science The agri-food sector is dealing with an increasing amount and variety of data due to: • The multidisciplinary, multiscale nature of agri-food science, which is adopting a more and more systemic approach; • The automation of data collection thanks to robots, sensors, etc., as well as new engineering tools such as in the omics field; • The development of new types of data sources: e.g. Internet of Things, citizen science, text-mining
  • 21.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 But F.A.I.R is not necessary open Societal priorities imply the maximum of transparency and access to data; Business interests can support differing objectives Personal privacy concerns I want to protect my « know how »I want to know where it comes from I want to understand ….
  • 22.
    22 Services for theResearch Data Lifecycle Processing & Analysis Data Management, Curation & Preservation Access, Deposition & Sharing Federation Services ● B2FIND (data) ● Marketplace (Services) ● Applications on Demand ● Federated HTC & Cloud Compute IaaS & PaaS ● Processing of sensitive data ● Jupyter Notebook ● Application DB (software & VM) ● B2DROP (data) ● B2Note (data) ● B2SHARE (data) ● DataHub ● Federated AAI. monitoring, accounting ● SLA and order Management ● Security incident response and policies ● Technical support & Training ● B2HANDLE ● B2SAFE ● European Certified Trusted Repository ● Thematic data analytics ● Scientific Workflow Management, Orchestration (DIRAC, PaaS Orchestrator) 1 2 3 4 Discover & Reuse
  • 23.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Semantic : the missing component of EOSC pilot and Hub
  • 24.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 What we want! Helping all players in the food system to access information wherever and in which way ever it is stored and to process it in the way they want Considering our scattered ecosystem and multiple stakeholders, resources and culture
  • 25.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 What we need Extensive and powerful shared semantics Performant and scalable triple stores An API factory that produces APIs which link to any back end and any front end A powerful dataset/information source discover/registration mechanism
  • 26.
    High Level Architecture Multiple Frontends -Galaxy - R - Portals - D4Science - Excel - PDF - CMS Whatever people want to do Multiple Backends • SQL • Dataverse • CKAN • Text Whatever people want to do Temporary and permanent triple stores Shared Semantics (Agroportal - GACS- Maps of Standards – RING) V 3.0 APIs to link to all kind of backends APIs to link to all kind of front ends
  • 27.
  • 28.
    Horizon 2020 researchand innovation programme - grant agreement No 730988 RDA 11th, Berlin, IGAD pre-meeting, 03/20/18 Conclusion and next steps Work in progress : ✓ Technical architecture vision ✓ Roadmap Input needed from you Implementation opportunities ✓ EU and other
  • 29.
    CONSORTIUM WWW.EROSA.AGINFRA.EU Thank you foryour attention! odile.hologne@inra.fr @Holo_08 @H2020_eROSA