Poznan Supercomputing
and Networking Center
Data Analytics and Semantics Department,
Poznan Supercomputing and Networking Center
(PSNC)
Contact: Raul Palma
rpalma@man.poznan.pl
The Resesarch Object
Management Platform in
support of FAIR and Open
Science
© Poznan Supercomputing and Networking Center
How Scientists usually work
Example from Earth Science
© Poznan Supercomputing and Networking Center
Motivation for FAIR digital objects & Open Science
A. Fouilloux, et. Al. FAIR Research Objects for realising Open Science with RELIANCE EOSC Project
Need: mechanisms to manage data, methods and other resources which could: i) enhance visibility of scientific
breakthroughs; ii) encourage reuse, and iii) foster a broader research accessibility
© Poznan Supercomputing and Networking Center
Research Objects
Goal: Account, describe and share everything about your research,
including how those things are related
© Poznan Supercomputing and Networking Center
Research outcomes and related resources
Each object has its own metadata and repositories
All are first class citizens and are required to make research FAIR
[source RO-Crate: A framework for packaging research products into FAIR Research
Objects]
© Poznan Supercomputing and Networking Center
Encapsulated content and references to external resources
© Poznan Supercomputing and Networking Center
From FAIR data to FAIR Digital Objects
C. Goble, S. Soiland-Reyes. RO-Crate: A framework for packaging research products into FAIR Research Objects
© Poznan Supercomputing and Networking Center
RO-crate as FAIR Digital Object
implementation of FAIR Digital Objects using RO Crate
C. Goble, S. Soiland-Reyes. RO-Crate: A framework for
packaging research products into FAIR Research Objects
© Poznan Supercomputing and Networking Center
ROHub platform
• Holistic solution for research objects management
• Reference platform
• implements natively the RO model and paradigm
• implements RO-Crate as the default RO exchange format
• support different stakeholders, with the primary focus on
scientists, researchers, students and enthusiasts
• provides the backbone to a wealth of RO-centric applications
and interfaces across different scientific communities
• integrates several added value services to deliver a rich VRE
supporting the adoption of Open Science principles
https://www.rohub.org
/
Onboarded and
integrated in EOSC
© Poznan Supercomputing and Networking Center
High-level features
ROHUB enables:
• to create and manage FAIR and high-quality ROs that can be interpreted and reproduced in the future
• to reference, share and preserve scientific studies, campaigns, and observations related resources,
including internal ones, links to external ones as well as other ROs (nested ROs)
• to collaborate with colleagues and to discover new knowledge via advanced exploratory search
interfaces that exploit RO metadata (both explicitly provided and automatically extracted from its
content), as well as via an standard search API OpenSearch with Geo extensions
• to manage the RO evolution including the ability to generate snapshots/releases and to allow others to
fork the RO to reuse it and extend it.
• to publish the associated work and assign it a DOI to allow its citation in scholarly communications
• to monitor and follow a particular RO, getting notifications about its progress or quality changes
• researchers to build reputation by enabling users to rate and favorite ROs created by others
• to find related works or researchers in a a domain, e.g., for possible collaborations or reviews
© Poznan Supercomputing and Networking Center
ROHub with RO added value services
© Poznan Supercomputing and Networking Center
ROHub connections with EOSC Core and Exchange services for building VRE
• ROHub is onboarded in
EOSC marketplace
• ROHub integrates and rely
on different EOSC core
and other EOSC Exchange
services
© Poznan Supercomputing and Networking Center
EOSC services and their role in ROHub for building VRE
● EGI check-in: allows users to access different EOSC services uniformly, using their preferred IdP. As a result, ROHub
users can use the same account to access all services from different providers, e.g., EGI, RELIANCE, etc.
● Zenodo, a EOSC catch-all repository for research results (data, software and publications) that are not part of existing
institutional or subject-based repositories. ROHub users can publish releases of their research object as ro-crates
objects for further dissemination.
● B2share, another EOSC catch-all repository that ROHub users can use to publish releases of their research object as
ro-crates objects for further dissemination.
● B2drop, an EOSC file storage service on the cloud. ROHub users have the possibility to use the default ROHub storage,
or B2drop as their personal storage space where the resources uploaded to their research objects will be stored.
B2drop resources are synchronized with the corresponding research objects in both ways.
● Semantic enrichment and recommendation, EOSC services that extract automatically semantic metadata from ROs
content to enhance their FAIRness, and that provide recommendations of similar objects based their semantic
content.
© Poznan Supercomputing and Networking Center
EOSC services and their role in ROHub for building VRE
● EGI Notebooks, a browser-based tool, based on JupyterHub, for interactive analysis of data using EGI storage and
compute services. ROHub users can open and load the Jupyter notebooks in the ROs automatically in EGI Notebooks
directly from ROHub and execute their methods/processing in an interactive computing environment (eproducible
science).
● EGI Replay, a place, based on Jupyter Binder, to reproduce notebooks and their run-time environment.
Complementing the previous functionality, ROHub users can open and load automatically Jupyter notebooks and
reproduce their associated computing environment with Replay, including any related input datasets, directly from
the ROs in ROHub. (highest reproducible science)
● OpenAire Graph, an open resource that aggregates a collection of research data properties (metadata, links). This
graph is the source for searching and discovering resources in EOSC explore. ROHub resources, particularly ROs,
Jupyter notebooks and data cubes, are harvested in the graph, and thus they are findable directly from the EOSC
marketplace.
© Poznan Supercomputing and Networking Center
EOSC services and their role in ROHub for building VRE
● Argos, an online machine-actionable tool to facilitate creation of Data Management Plans (DMPs). ROHub enables
the creation of ROs from DMPs in Argos, leveraging and representing all the DMP information in machine-readable
format, enabling researchers to shift their DMP into machine actionable DMPs, connected with the datasets
themselves.
● ADAM, EOSC service enabling efficient and scalable structured data access and discovery of Earth Observation Data
via data cubes. ROHub enables the aggregation of data cubes from ADAM by reference, leveraging all the available
metadata available in ADAM to describe them in the RO. ROHub users can open and load data cubes in ADAM directly
from ROHub for their usage and exploration.
• Anne, a scientist from Oslo and her team
want to perform a climate change research
under the atmospheric perspective.
• Anne goes to EOSC resource catalogue where
she searches for existing results and finds an
executable RO from another researcher
• She opens the RO and finds a Jupyter
notebook that was used to analyze the
data.
• Anne clicks on the notebook and it is opened
in EGI notebooks. She uses Data Cubes to
exploit EO data provided by the Copernicus
program, and saves results in EGI DataHub
• Anne creates her own RO (forking the reused RO)
and starts to work on it, i.e., aggregates the new
notebook, the data cube, and other resources. The
new RO will then also appear in EOSC catalogue
• Anne invites colleagues to contribute; the
shared RO will keep track of the provenance
of contributions. They can be notified when
the RO is modified.
• Before publishing, they make a self-
assessment of the FAIR-ness of their
research, and check the quality of the RO
• Once RO is ready, Anne makes a release with
a DOI and publish it in Zenodo/B2Share,
making it also available in EOSC catalogue so
others can find/reuse
Find research work, access and reproduce it, reuse
it in new research, collaborate, assess quality and
publish it leveraging different EOSC services
check-in
EOSC based VRE in support of OS
Replay
Notebook
© Poznan Supercomputing and Networking Center
Research Objects as key enabler of Open Science
https://doi.org/10.24424/nz65-v565
Poznan Supercomputing
and Networking Center
61-139 Poznan
ul. Jana Pawła II 10
phone: (+48 61) 858-20-01
fax: (+48 61) 852-59-54
office@man.poznan.pl
www.psnc.pl
Raul Palma
rpalma@man.poznan.pl

ROHub - RO-Crate Research Object Management Platform General Introduction

  • 1.
    Poznan Supercomputing and NetworkingCenter Data Analytics and Semantics Department, Poznan Supercomputing and Networking Center (PSNC) Contact: Raul Palma rpalma@man.poznan.pl The Resesarch Object Management Platform in support of FAIR and Open Science
  • 2.
    © Poznan Supercomputingand Networking Center How Scientists usually work Example from Earth Science
  • 3.
    © Poznan Supercomputingand Networking Center Motivation for FAIR digital objects & Open Science A. Fouilloux, et. Al. FAIR Research Objects for realising Open Science with RELIANCE EOSC Project Need: mechanisms to manage data, methods and other resources which could: i) enhance visibility of scientific breakthroughs; ii) encourage reuse, and iii) foster a broader research accessibility
  • 4.
    © Poznan Supercomputingand Networking Center Research Objects Goal: Account, describe and share everything about your research, including how those things are related
  • 5.
    © Poznan Supercomputingand Networking Center Research outcomes and related resources Each object has its own metadata and repositories All are first class citizens and are required to make research FAIR [source RO-Crate: A framework for packaging research products into FAIR Research Objects]
  • 6.
    © Poznan Supercomputingand Networking Center Encapsulated content and references to external resources
  • 7.
    © Poznan Supercomputingand Networking Center From FAIR data to FAIR Digital Objects C. Goble, S. Soiland-Reyes. RO-Crate: A framework for packaging research products into FAIR Research Objects
  • 8.
    © Poznan Supercomputingand Networking Center RO-crate as FAIR Digital Object implementation of FAIR Digital Objects using RO Crate C. Goble, S. Soiland-Reyes. RO-Crate: A framework for packaging research products into FAIR Research Objects
  • 9.
    © Poznan Supercomputingand Networking Center ROHub platform • Holistic solution for research objects management • Reference platform • implements natively the RO model and paradigm • implements RO-Crate as the default RO exchange format • support different stakeholders, with the primary focus on scientists, researchers, students and enthusiasts • provides the backbone to a wealth of RO-centric applications and interfaces across different scientific communities • integrates several added value services to deliver a rich VRE supporting the adoption of Open Science principles https://www.rohub.org / Onboarded and integrated in EOSC
  • 10.
    © Poznan Supercomputingand Networking Center High-level features ROHUB enables: • to create and manage FAIR and high-quality ROs that can be interpreted and reproduced in the future • to reference, share and preserve scientific studies, campaigns, and observations related resources, including internal ones, links to external ones as well as other ROs (nested ROs) • to collaborate with colleagues and to discover new knowledge via advanced exploratory search interfaces that exploit RO metadata (both explicitly provided and automatically extracted from its content), as well as via an standard search API OpenSearch with Geo extensions • to manage the RO evolution including the ability to generate snapshots/releases and to allow others to fork the RO to reuse it and extend it. • to publish the associated work and assign it a DOI to allow its citation in scholarly communications • to monitor and follow a particular RO, getting notifications about its progress or quality changes • researchers to build reputation by enabling users to rate and favorite ROs created by others • to find related works or researchers in a a domain, e.g., for possible collaborations or reviews
  • 11.
    © Poznan Supercomputingand Networking Center ROHub with RO added value services
  • 12.
    © Poznan Supercomputingand Networking Center ROHub connections with EOSC Core and Exchange services for building VRE • ROHub is onboarded in EOSC marketplace • ROHub integrates and rely on different EOSC core and other EOSC Exchange services
  • 13.
    © Poznan Supercomputingand Networking Center EOSC services and their role in ROHub for building VRE ● EGI check-in: allows users to access different EOSC services uniformly, using their preferred IdP. As a result, ROHub users can use the same account to access all services from different providers, e.g., EGI, RELIANCE, etc. ● Zenodo, a EOSC catch-all repository for research results (data, software and publications) that are not part of existing institutional or subject-based repositories. ROHub users can publish releases of their research object as ro-crates objects for further dissemination. ● B2share, another EOSC catch-all repository that ROHub users can use to publish releases of their research object as ro-crates objects for further dissemination. ● B2drop, an EOSC file storage service on the cloud. ROHub users have the possibility to use the default ROHub storage, or B2drop as their personal storage space where the resources uploaded to their research objects will be stored. B2drop resources are synchronized with the corresponding research objects in both ways. ● Semantic enrichment and recommendation, EOSC services that extract automatically semantic metadata from ROs content to enhance their FAIRness, and that provide recommendations of similar objects based their semantic content.
  • 14.
    © Poznan Supercomputingand Networking Center EOSC services and their role in ROHub for building VRE ● EGI Notebooks, a browser-based tool, based on JupyterHub, for interactive analysis of data using EGI storage and compute services. ROHub users can open and load the Jupyter notebooks in the ROs automatically in EGI Notebooks directly from ROHub and execute their methods/processing in an interactive computing environment (eproducible science). ● EGI Replay, a place, based on Jupyter Binder, to reproduce notebooks and their run-time environment. Complementing the previous functionality, ROHub users can open and load automatically Jupyter notebooks and reproduce their associated computing environment with Replay, including any related input datasets, directly from the ROs in ROHub. (highest reproducible science) ● OpenAire Graph, an open resource that aggregates a collection of research data properties (metadata, links). This graph is the source for searching and discovering resources in EOSC explore. ROHub resources, particularly ROs, Jupyter notebooks and data cubes, are harvested in the graph, and thus they are findable directly from the EOSC marketplace.
  • 15.
    © Poznan Supercomputingand Networking Center EOSC services and their role in ROHub for building VRE ● Argos, an online machine-actionable tool to facilitate creation of Data Management Plans (DMPs). ROHub enables the creation of ROs from DMPs in Argos, leveraging and representing all the DMP information in machine-readable format, enabling researchers to shift their DMP into machine actionable DMPs, connected with the datasets themselves. ● ADAM, EOSC service enabling efficient and scalable structured data access and discovery of Earth Observation Data via data cubes. ROHub enables the aggregation of data cubes from ADAM by reference, leveraging all the available metadata available in ADAM to describe them in the RO. ROHub users can open and load data cubes in ADAM directly from ROHub for their usage and exploration.
  • 16.
    • Anne, ascientist from Oslo and her team want to perform a climate change research under the atmospheric perspective. • Anne goes to EOSC resource catalogue where she searches for existing results and finds an executable RO from another researcher • She opens the RO and finds a Jupyter notebook that was used to analyze the data. • Anne clicks on the notebook and it is opened in EGI notebooks. She uses Data Cubes to exploit EO data provided by the Copernicus program, and saves results in EGI DataHub • Anne creates her own RO (forking the reused RO) and starts to work on it, i.e., aggregates the new notebook, the data cube, and other resources. The new RO will then also appear in EOSC catalogue • Anne invites colleagues to contribute; the shared RO will keep track of the provenance of contributions. They can be notified when the RO is modified. • Before publishing, they make a self- assessment of the FAIR-ness of their research, and check the quality of the RO • Once RO is ready, Anne makes a release with a DOI and publish it in Zenodo/B2Share, making it also available in EOSC catalogue so others can find/reuse Find research work, access and reproduce it, reuse it in new research, collaborate, assess quality and publish it leveraging different EOSC services check-in EOSC based VRE in support of OS Replay Notebook
  • 17.
    © Poznan Supercomputingand Networking Center Research Objects as key enabler of Open Science https://doi.org/10.24424/nz65-v565
  • 18.
    Poznan Supercomputing and NetworkingCenter 61-139 Poznan ul. Jana Pawła II 10 phone: (+48 61) 858-20-01 fax: (+48 61) 852-59-54 office@man.poznan.pl www.psnc.pl Raul Palma rpalma@man.poznan.pl