E-infrastructure for open agri-food sciences: Vision & Roadmap
1. eROSA has received funding from the European
Union’s Horizon 2020 research and innovation
programme under grant agreement No 730988
E-infrastructure for open agri-food
sciences – vision and roadmap
Odile Hologne, INRA, Head of the department of scientific information
eROSA coordinator @Holo_08
Johannes Keizer, e-ROSA consultant
@H2020_eROSA
http://www.erosa.aginfra.eu/
2. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Agenda
e-ROSA in brief and context
Overview of the landscape
Vision
Conclusion
3. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
eROSA in brief
Coordination and support action (infrasupp 3 2016)
18 months
Started in January 2017
Consortium: INRA (FR), WUR Alterra (NL), Agroknow (GR)
« Brother » project: AgINFRA+ : prototype new
services
▪ Support small-size foresight roadmaps for research and education communities and
operators of e-infrastructure services.
▪ Identification of potential collaboration from stakeholders across different
geographic areas and scientific domains.
4. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Objectives
Community building: researchers in agri-food sciences
and ICT specialists; international
Improve the knowledge of the landscape: infra/e-infra,
projects, policies … relevant for an « e-infra for open
science in agriculture »
Roadmap: conception, advocacy
6. What we have done so far
Bibliometric analysis
> Initial scoping of the e-ROSA community, updating in progress
Online map
> Cataloguing key stakeholders, initiatives and infrastructures; ongoing open call
First Stakeholder Workshop: 6-7 July 2017 in Montpellier
> Initiate comunity-building and improve knowledge of the current landscape
Second Stakeholder Workshop: 27-28 November 2017 in Wageningen
> Identify scientific and data/ICT needs and discuss the vision
Vision paper
http://www.erosa.aginfra.eu/sites/erosa_deliverables/D1.1.pdf
http://www.aginfra.eu/discover
http://www.erosa.agin
fra.eu/node/47
http://www.erosa.aginfra.eu/node/8
7. Horizon 2020 research and innovation programme - grant agreement No 730988
Open Harvest – 06/02 2017
European initiatives
build an internet of FAIR data and Services (for
machines)
✓ Improve & federate existing structures (standards,
interoperability, governance, financing) based on user
needs
✓ Incentives for data sharing in science & training
10. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Data challenges in agricultural sciences
Massive data production in labs (sensors,
robots, models ….) but also in farms (of huge
interest for science)
✓ Big data : more variety than volume
✓ space, time and scale issues
Data silos, poorly documented
Not easy to find, nor to access
Different level of maturity of the practices
about data (management, sharing, analysis)
11. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
E-infra conceptuel model to understand the ecosystem
12. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Some of the gems
Data repositories +/- open
Data catalog : http://ring.ciard.net/
Resources catalog: semantics, metadata
http://vest.agrisemantics.org
http://agroportal.lirmm.fr/
http://agrisemantics.org/
GACS
13. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
How to analyze the technical ecosystem ?
https://www.rd-alliance.org/sites/default/files/recommendation-jan-2017-v8.pdf
14. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Implementation Status of key services
https://eoscpilot.eu/content/erosa-position-paper
Johannes Keizer, e-ROSA project
15. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Current landscape: Challenges
Technical challenges
Generic
Content-dependent
(agri-food)
Long-term preservation
Data processing
Authentication and access rights
Data types
……
Semantics - standards
From genes to ecosytem
Public and private stakeholders
Registry of services
….
Cultural challenges
Community engagement and incentives
Policies and regulation
Sustainability and governance
16. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Potential Governance and business model issues
https://eoscpilot.eu/content/erosa-position-paper
Johannes Keizer, e-ROSA project
18. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Common Patterns in Revolutionary
Infrastructures and Data
https://www.rd-alliance.org/sites/default/files/Common_Patterns_in_Revolutionising_Infrastructures-
final.pdf
19. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Vision paper : A collaborative and open
system …
… In the open science based knowledge system, researchers :
✓ openly collaborate with different societal stakeholders to further improve
the functioning of the food system;
✓ deploy a systems approach including the impacts and consequences in the
whole food systems in their research, not studying effects and disciplines in
isolation;
✓ undertake fully data-driven research …. And also hypothesis-driven research;
✓ work impact-based, to place their research in the broader societal context
and show what the implications of the research are.
http://www.erosa.aginfra.eu/node/8
20. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
An e-infra that supports the Data flows and
open science
The agri-food sector is dealing with an increasing amount and variety of
data due to:
• The multidisciplinary, multiscale nature of agri-food science, which is adopting
a more and more systemic approach;
• The automation of data collection thanks to robots, sensors, etc., as well as
new engineering tools such as in the omics field;
• The development of new types of data sources: e.g. Internet of Things, citizen
science, text-mining
21. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
But F.A.I.R is not necessary open
Societal priorities imply the maximum of transparency and
access to data;
Business interests can support differing objectives
Personal privacy concerns I want to
protect my
« know
how »I want to
know where
it comes
from
I want to
understand
….
22. 22
Services for the Research
Data Lifecycle
Processing & Analysis
Data Management, Curation &
Preservation
Access, Deposition & Sharing
Federation Services
● B2FIND (data)
● Marketplace (Services)
● Applications on Demand
● Federated HTC & Cloud Compute IaaS &
PaaS
● Processing of sensitive data
● Jupyter Notebook
● Application DB (software & VM)
● B2DROP (data)
● B2Note (data)
● B2SHARE (data)
● DataHub
● Federated AAI. monitoring,
accounting
● SLA and order Management
● Security incident response and
policies
● Technical support & Training
● B2HANDLE
● B2SAFE
● European Certified Trusted
Repository
● Thematic data analytics
● Scientific Workflow Management,
Orchestration (DIRAC, PaaS
Orchestrator)
1
2
3
4
Discover & Reuse
23. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Semantic : the missing component of EOSC pilot and Hub
24. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
What we want!
Helping all players in the food system to
access information wherever and in which
way ever it is stored and to process it in the
way they want
Considering our
scattered ecosystem
and multiple
stakeholders, resources
and culture
25. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
What we need
Extensive and powerful shared semantics
Performant and scalable triple stores
An API factory that produces APIs which link
to any back end and any front end
A powerful dataset/information source
discover/registration mechanism
26. High Level Architecture
Multiple
Frontends
- Galaxy
- R
- Portals
- D4Science
- Excel
- PDF
- CMS
Whatever
people want
to do
Multiple
Backends
• SQL
• Dataverse
• CKAN
• Text
Whatever
people want
to
do
Temporary
and
permanent
triple stores
Shared Semantics
(Agroportal - GACS- Maps of Standards – RING) V 3.0
APIs to link
to all kind of
backends
APIs to link
to all kind of
front ends
28. Horizon 2020 research and innovation programme - grant agreement No 730988
RDA 11th, Berlin, IGAD pre-meeting, 03/20/18
Conclusion and next steps
Work in progress :
✓ Technical architecture
vision
✓ Roadmap
Input needed from you
Implementation
opportunities
✓ EU and other