SlideShare a Scribd company logo
ELSEVIER | The Research Object Authoring Tool --- CNI 2018 1
FAIR4CURES
A Research Object Authoring Tool for the Data Commons
December 11, 2018
Anita de Waard (she, her)
VP Research Collaborations
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Overview:
1. The NIH Data Commons: a very short introduction
2. The FAIR4CURES Project
3. A Global Unique Identifier Broker
4. Research Objects: a very very short introduction
5. Building a Research Object Authoring Tool on Mendeley Data
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
The NIH Data Commons Pilot Phase aims to
provide a marketplace for tools, data and
workflows
based on existing technologies of commercial and
academic platforms that strive to embody the FAIR
Data principles.
Overview:
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Data Commons Overview:
Goal of the project:
1. Advance the policies and protocols for accessing human subjects data
2. Support global identification, indexing and searching of available data sets;
3. Provide a collection of computational pipelines that can be applied to data sets
4. Utilize standards to globally identify and access data sets, tools and workflows
5. Create policies for data citation, reuse and reproducibility
6. Enable researchers to port their own data and workflows into the cloud
Project structure:
• DCPPC research groups are addressing important Key Capabilities =>
• The Commons will be composed of four stacks, incorporating products from the KCs
Final output:
• Data from three large NIH Databases will be available through all of these systems
• Users can securely access data within all stacks, on multiple cloud providers
• Users have access a basic set of applications that run the same way on all stacks.
https://public.nihdatacommons.us/ExecutiveSummary_4YP/
Key Capabilities:
1: FAIR Guidelines & Metrics
2: Global Unique IDs for FAIR Digital Objects
3: Open Standard APIs
4: Cloud Agnostic Architecture Framework
5: Workspaces for Computation
6: Research Ethics, Privacy, and Security
7: Indexing and Search
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Data Commons Guiding Principles:
• 1. Identifiers for data: Develop and implement an interoperable global unique identifier system for digital
objects.
• 2. Data access: Develop and implement authentication and authorization policies and protocols for controlled
access to digital objects and derivatives.
• 3. Findability: Enable search and indexing of digital objects and data sets.
• 4. Software stacks: The Commons will encompass multiple robust and sustainable software stacks
implementing Commons standards and systems.
• 5. Data use, standards: All tools will be build using standard application interfaces.
• 6. Use cases: The Commons will develop and utilize an extensive use case library.
• 7. Community: The Commons is developed through intense Community engagement and support across
multiple levels of expertise.
• 8. Community: Governance, membership, and coordination will be established with and through the
community.
• 9. Evaluation methods and metrics: We plan a culture of frequent release of products, with small iterations,
routine evaluation and redesign.
• 10. FAIR guidelines and metrics: Once FAIR metrics and rubrics are defined, these will be used to measure the
level of “FAIRness” of repositories, datasets, and other digital objects.
https://public.nihdatacommons.us/executive-summary/
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Team Xenon – Four partner organisations
Findable Accessible Interoperable Reusable
Collaborative Usable Reproducible Extendable Scalable
The FAIR4CURES Collaboration:
Index 3 datasets:
• Trans-omics for Precision Medicine (TOPMed)
• Genotype Tissue Expression (GTEx)
• Model Organisms Database (MODs)
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
The FAIR4CURES PlatformThe FAIR4CURES System:
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
• Identifiers for hosted data files within TOPMed studies, GTEx dataset, and MODs
• Feature for researchers to register identifiers for their derived data files on the
platform, making the content public and searchable
• Selecting types of identifiers to support in the Data Commons ecosystem and the
required identifier metadata
• Open Source tool, connected to the SevenBridges Platform
• Also accessible via Github/SmartAPI
Global Unique Identifier Broker:
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Digital Object Types Identified following the KC2 Metadata Spec:
Seven Bridges
Object Type
DataCite
Resource
Type
Proposed
Schema.Org
CreativeWork Types
Supported Relationships Notes
File Dataset Dataset Source Of a Task (input file)
Derived From a Task
(output file)
Part Of a Collection
One (or more) files packaged with metadata as a dataset
App (Tool) Software SoftwareSourceCode Part of Task or Collection or
Workflow
Same as dataset, but file is source code
App
(Workflow)
Workflow SoftwareSourceCode
(?)
Has Part of Software An aggregation of Tools (Software). File is CWL definition
describing how the tools are chained.
Task Collection Collection Composition of Files and
Apps (Tools or Workflows)
An aggregation of Apps (either tools or workflows), plus files
(input & output) plus a record of all the settings used for each
App.
Collection
(Study)
Collection Collection Composition of any object An aggregation of heterogeneous objects for purpose of
publishing.
https://docs.google.com/document/d/1FD3aXr_uHnPy-YrFhQhuXET73tBVxu7F_Q5uS9TPUZs/edit
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Seven Bridges Data Publication Concept
Requirements Analysis:
1. Landing page URL including GUID
2. URL for page where file can be accessed (downloaded)
3. Metadata for object
4. Reference to the Task (zero or one) that this dataset was Derived From
5. Reference to the Task(s) (zero, one or more) that this dataset is the Source Of
1
2
3
4
5
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Seven Bridges Workflow Configuration (CWL)
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Standards-based metadata framework for
logically and physically bundling resources
with context
http://researchobject.org
What are Research Objects?
Aggregates
link things together
Annotations
about things & their
relationships
Container
Packaging content & links:
Zip files, BagIt, Docker images
Identification
locate things
regardless where
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Research Objects can be used to capture outputs in a wide range of scopes
• Profiles help define the shape and form of a research object.
• A profile defines the general purpose of that type of Research Objects:
• A format (e.g. Research Object Bundle),
• An expectation of what kind of resources should be expected,
• A link to any specific vocabularies that should be used in its annotations.
Applications of Research Objects include BDBags (Big Data Bags):
• In digital libraries, preservation of source artifacts commonly use the BagIt format for archive serialization, capturing
digital resources like audio recordings, document scans and their transcriptions, provenance and annotations.
• The Research Object BagIt archive is a profile for describing a BagIt archive and its content as a Research Object to
structure the metadata and relate the captured resources
• The NIH-funded Big Data for Discovery Science (BDDS) project captures Big Data bags (BDBag) of large complex datasets
from genomics workflows (https://doi.org/10.1109/BigData.2016.7840618).
• A key aspect of BDBag is the ability to use Minimal Viable Identifiers (minid) for referencing potentially large data sources
held in multiple remote repositories, effectively making a “Big Data” Research Object for large-scale workflows
(https://doi.org/10.1101/268755).
• A bag of bags (minid:b9vx04) is a metadata skeleton which may be completed with tools like bdbag to download the big
data
• The bags’ Research Object manifests can be consumed independently, linking to the remote resources.
Research Objects and BDBags:
http://www.researchobject.org/scopes/
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Moving from Datasets to Research Objects in Mendeley Data:
In Mendeley Data Repository, datasets are lists of files (stored in our S3 bucket) with metadata packaging (e.g. Titles,
Description, Categories, License) and a persistent identifier DOI).
We will introduce:
• Collections as an aggregation of Datasets. Similar to a Dataset, BUT, the contents are other datasets, not files.
• Software and Workflow as different types of Digital Objects. Similar to a Dataset, BUT files are source code or
workflow specifications (e.g. CWL) and metadata properties could be a bit different.
This forms the foundation for Research Objects, which are:
• Collections or aggregations of different types of Digital Objects (not just datasets)
• References to digital objects on other platforms, based on standard identifiers (e.g. DOIs or ARKs)
• A manifest which lists and describes the contents of the Research Object
• Exposed in JSON-LD:
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
GUID Broker (API Only)
Seven Bridges
Fair4CURES Platform
Phase 1
Pilot Project
(Apr – Sep 2018)
Register Datasets (Data Files)
Register Software Objects
Register Workflow Objects
Uses
Register a Collection as a list of
digital objects (data, sw, wf)
In Summary:
Objective 1 – support “Task” type
Research Objects on Seven Bridges
platform.
Objective 2 - support configurable
Research Objects on Mendeley Data
platform.
Phase 2
Project
(Oct 2018 - 2019) Add annotation and relationships
to collection to describe a research
object
Research Object Composer
Serialise Research Object in
standard format based on BDBags
and RO standards Mendeley Data
Platform
Uses Re-uses
http://smart-api.info/ui/
bf9abe9c17c9c78c432832382ef9e16a#/
ELSEVIER | The Research Object Authoring Tool --- CNI 2018
Acknowledgements:
• This work is supported by the NIH Data Commons Pilot Phase under the Research Opportunity
Announcement (ROA) RM-17-026 https://commonfund.nih.gov/commons/:
• NIH Data Commons - 1 OT3 OD025463-01
• NHLBI STAGE Project - 1 OT3 HL142478-01
• The FAIR4CURES Project lead by SevenBridges (Alison Leaf, Brandi Davis-Dusenbury and Sarper Avcil)
• We partner in the Project with Repositive UK and the US Dept of Veteran’s Affairs
• The metadata standards development was done by KC2, lead by Team Sodium (esp. Merce Crosas, Tim
Clark, Trisha Cruse and Martin Fenner)
• The Research Objects Authoring Tool work is lead by the University of Manchester, who pioneered work
on Research Objects (Stian Soiland-Reyes and Carole Goble)
• The Mendeley Data team has built the GUID Broker Prototype (Gabriel Oscares, Gareth Harvey

More Related Content

What's hot

FAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the FutureFAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the Future
dgarijo
 
Wikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudWikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloud
Joachim Neubert
 
Leverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platformLeverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platform
Andrea Bollini
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
Carole Goble
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishing
Varsha Khodiyar
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
Carole Goble
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRIS
Andrea Bollini
 
DSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformDSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platform
Andrea Bollini
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Peter Haase
 
Dataverse opportunities
Dataverse opportunitiesDataverse opportunities
Dataverse opportunities
vty
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
Carole Goble
 
6.15.17 DSpace-Cris Webinar Presentation Slides
6.15.17 DSpace-Cris Webinar Presentation Slides6.15.17 DSpace-Cris Webinar Presentation Slides
6.15.17 DSpace-Cris Webinar Presentation Slides
DuraSpace
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Anita de Waard
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
Peter Haase
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Jian Qin
 
PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013Frauke Ziedorn
 
Putting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAMPutting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAM
4Science
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble
 
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterpriseThe Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
Peter Haase
 
Building COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhyBuilding COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhy
vty
 

What's hot (20)

FAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the FutureFAIR Workflows: A step closer to the Scientific Paper of the Future
FAIR Workflows: A step closer to the Scientific Paper of the Future
 
Wikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloudWikidata as a hub for the linked data cloud
Wikidata as a hub for the linked data cloud
 
Leverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platformLeverage DSpace for an enterprise, mission critical platform
Leverage DSpace for an enterprise, mission critical platform
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishing
 
RO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRIS
 
DSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platformDSpace-CRIS: a CRIS enhanced repository platform
DSpace-CRIS: a CRIS enhanced repository platform
 
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the CloudBuilding Enterprise-Ready Knowledge Graph Applications in the Cloud
Building Enterprise-Ready Knowledge Graph Applications in the Cloud
 
Dataverse opportunities
Dataverse opportunitiesDataverse opportunities
Dataverse opportunities
 
A Big Picture in Research Data Management
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
 
6.15.17 DSpace-Cris Webinar Presentation Slides
6.15.17 DSpace-Cris Webinar Presentation Slides6.15.17 DSpace-Cris Webinar Presentation Slides
6.15.17 DSpace-Cris Webinar Presentation Slides
 
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
Research Object Composer: A Tool for Publishing Complex Data Objects in the C...
 
ESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge GraphsESWC 2017 Tutorial Knowledge Graphs
ESWC 2017 Tutorial Knowledge Graphs
 
Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08Data repositories -- Xiamen University 2012 06-08
Data repositories -- Xiamen University 2012 06-08
 
PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013PIDs and DOI registration with DataCite - IATUL Workshop 2013
PIDs and DOI registration with DataCite - IATUL Workshop 2013
 
Putting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAMPutting Historical Data in Context: how to use DSpace-GLAM
Putting Historical Data in Context: how to use DSpace-GLAM
 
FAIR Computational Workflows
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
 
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the EnterpriseThe Information Workbench - Linked Data and Semantic Wikis in the Enterprise
The Information Workbench - Linked Data and Semantic Wikis in the Enterprise
 
Building COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhyBuilding COVID-19 Knowledge Graph at CoronaWhy
Building COVID-19 Knowledge Graph at CoronaWhy
 

Similar to CNI 2018: A Research Object Authoring Tool for the Data Commons

DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
OpenAIRE
 
UKSG 2018 Lightning Talk - Annotations as research objects: findable, indexab...
UKSG 2018 Lightning Talk - Annotations as research objects: findable, indexab...UKSG 2018 Lightning Talk - Annotations as research objects: findable, indexab...
UKSG 2018 Lightning Talk - Annotations as research objects: findable, indexab...
UKSG: connecting the knowledge community
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRIS
4Science
 
Trm Trusted Repositories
Trm Trusted RepositoriesTrm Trusted Repositories
Trm Trusted Repositories
DigitalPreservationEurope
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
Sarah Anna Stewart
 
Access the world’s research outputs through the CORE API
Access the world’s research outputs through the CORE API Access the world’s research outputs through the CORE API
Access the world’s research outputs through the CORE API
Matteo Cancellieri
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Anita de Waard
 
FDO as building block for digitization technology stacks
FDO as building block for digitization technology stacksFDO as building block for digitization technology stacks
FDO as building block for digitization technology stacks
Raul Palma
 
Data management
Data management Data management
Data management
Graça Gabriel
 
DSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: SlidesDSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: Slides
Andrea Bollini
 
OAI-PMH
OAI-PMHOAI-PMH
OAI-PMH
CSMeena1
 
Metadata-powered dissemination of content
Metadata-powered dissemination of contentMetadata-powered dissemination of content
Metadata-powered dissemination of content
Nikos Manouselis
 
NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016
Susanna-Assunta Sansone
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Pistoia Alliance
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
lagoze
 
NDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficeNDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) Office
Philip Bourne
 
Publishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecyclePublishing the Full Research Data Lifecycle
Publishing the Full Research Data Lifecycle
Anita de Waard
 
Vellino presentationtocisti
Vellino presentationtocistiVellino presentationtocisti
Vellino presentationtocistiAndre Vellino
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
Carole Goble
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013Frauke Ziedorn
 

Similar to CNI 2018: A Research Object Authoring Tool for the Data Commons (20)

DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
DataCite – Bridging the gap and helping to find, access and reuse data – Herb...
 
UKSG 2018 Lightning Talk - Annotations as research objects: findable, indexab...
UKSG 2018 Lightning Talk - Annotations as research objects: findable, indexab...UKSG 2018 Lightning Talk - Annotations as research objects: findable, indexab...
UKSG 2018 Lightning Talk - Annotations as research objects: findable, indexab...
 
DSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRISDSpace standard Data model and DSpace-CRIS
DSpace standard Data model and DSpace-CRIS
 
Trm Trusted Repositories
Trm Trusted RepositoriesTrm Trusted Repositories
Trm Trusted Repositories
 
Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...Research Data (and Software) Management at Imperial: (Everything you need to ...
Research Data (and Software) Management at Imperial: (Everything you need to ...
 
Access the world’s research outputs through the CORE API
Access the world’s research outputs through the CORE API Access the world’s research outputs through the CORE API
Access the world’s research outputs through the CORE API
 
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and ReuseMendeley Data: Enhancing Data Discovery, Sharing and Reuse
Mendeley Data: Enhancing Data Discovery, Sharing and Reuse
 
FDO as building block for digitization technology stacks
FDO as building block for digitization technology stacksFDO as building block for digitization technology stacks
FDO as building block for digitization technology stacks
 
Data management
Data management Data management
Data management
 
DSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: SlidesDSpace-CRIS Workshop OR2015: Slides
DSpace-CRIS Workshop OR2015: Slides
 
OAI-PMH
OAI-PMHOAI-PMH
OAI-PMH
 
Metadata-powered dissemination of content
Metadata-powered dissemination of contentMetadata-powered dissemination of content
Metadata-powered dissemination of content
 
NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016NIH BD2K DataMed metadata model - Force11, 2016
NIH BD2K DataMed metadata model - Force11, 2016
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
Open Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and ExchangeOpen Archives Initiative Object Reuse and Exchange
Open Archives Initiative Object Reuse and Exchange
 
NDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) OfficeNDS Relevant Update from the NIH Data Science (ADDS) Office
NDS Relevant Update from the NIH Data Science (ADDS) Office
 
Publishing the Full Research Data Lifecycle
Publishing the Full Research Data LifecyclePublishing the Full Research Data Lifecycle
Publishing the Full Research Data Lifecycle
 
Vellino presentationtocisti
Vellino presentationtocistiVellino presentationtocisti
Vellino presentationtocisti
 
FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
 
DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013DataCite and its DOI infrastructure - IASSIST 2013
DataCite and its DOI infrastructure - IASSIST 2013
 

More from Anita de Waard

Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
Anita de Waard
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
Anita de Waard
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
Anita de Waard
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
Anita de Waard
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
Anita de Waard
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
Anita de Waard
 
History of the future
History of the futureHistory of the future
History of the future
Anita de Waard
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
Anita de Waard
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
Anita de Waard
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Anita de Waard
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Anita de Waard
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
Anita de Waard
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
Anita de Waard
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Anita de Waard
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Anita de Waard
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
Anita de Waard
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
Anita de Waard
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
Anita de Waard
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to Reuse
Anita de Waard
 
Collaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and softwareCollaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and software
Anita de Waard
 

More from Anita de Waard (20)

Why would a publisher care about open data?
Why would a publisher care about open data?Why would a publisher care about open data?
Why would a publisher care about open data?
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
Enabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring GuidelinesEnabling FAIR Data: TAG B Authoring Guidelines
Enabling FAIR Data: TAG B Authoring Guidelines
 
Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.Scientific facts are myths, told through fairytales and spread by gossip.
Scientific facts are myths, told through fairytales and spread by gossip.
 
Data, Data Everywhere: What's A Publisher to Do?
Data, Data Everywhere: What's  A Publisher to Do?Data, Data Everywhere: What's  A Publisher to Do?
Data, Data Everywhere: What's A Publisher to Do?
 
Talk on Research Data Management
Talk on Research Data ManagementTalk on Research Data Management
Talk on Research Data Management
 
History of the future
History of the futureHistory of the future
History of the future
 
Networked Science, And Integrating with Dataverse
Networked Science, And Integrating with DataverseNetworked Science, And Integrating with Dataverse
Networked Science, And Integrating with Dataverse
 
Big Data and the Future of Publishing
Big Data and the Future of PublishingBig Data and the Future of Publishing
Big Data and the Future of Publishing
 
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data EcosystemsReal-World Data Challenges: Moving Towards Richer Data Ecosystems
Real-World Data Challenges: Moving Towards Richer Data Ecosystems
 
Data Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost RecoveryData Repositories: Recommendation, Certification and Models for Cost Recovery
Data Repositories: Recommendation, Certification and Models for Cost Recovery
 
The Economics of Data Sharing
The Economics of Data SharingThe Economics of Data Sharing
The Economics of Data Sharing
 
Public Identifiers in Scholarly Publishing
Public Identifiers in Scholarly PublishingPublic Identifiers in Scholarly Publishing
Public Identifiers in Scholarly Publishing
 
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne UlitmatumElsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
Elsevier‘s RDM Program: Habits of Effective Data and the Bourne Ulitmatum
 
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective DataElsevier‘s RDM Program: Ten Habits of Highly Effective Data
Elsevier‘s RDM Program: Ten Habits of Highly Effective Data
 
Charleston Conference 2016
Charleston Conference 2016Charleston Conference 2016
Charleston Conference 2016
 
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
The Narrative Structure of Research Articles, or, Why Science is Like a Fairy...
 
RDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest GroupRDA-WDS Publishing Data Interest Group
RDA-WDS Publishing Data Interest Group
 
The Rocky Road to Reuse
The Rocky Road to ReuseThe Rocky Road to Reuse
The Rocky Road to Reuse
 
Collaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and softwareCollaboratively creating a network of ideas, data and software
Collaboratively creating a network of ideas, data and software
 

Recently uploaded

AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
Product School
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
Thijs Feryn
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
RTTS
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
Prayukth K V
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Ramesh Iyer
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
Product School
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Product School
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
Ralf Eggert
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
Laura Byrne
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 

Recently uploaded (20)

AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
AI for Every Business: Unlocking Your Product's Universal Potential by VP of ...
 
Accelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish CachingAccelerate your Kubernetes clusters with Varnish Caching
Accelerate your Kubernetes clusters with Varnish Caching
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
JMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and GrafanaJMeter webinar - integration with InfluxDB and Grafana
JMeter webinar - integration with InfluxDB and Grafana
 
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 previewState of ICS and IoT Cyber Threat Landscape Report 2024 preview
State of ICS and IoT Cyber Threat Landscape Report 2024 preview
 
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
Builder.ai Founder Sachin Dev Duggal's Strategic Approach to Create an Innova...
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
De-mystifying Zero to One: Design Informed Techniques for Greenfield Innovati...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...Designing Great Products: The Power of Design and Leadership by Chief Designe...
Designing Great Products: The Power of Design and Leadership by Chief Designe...
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)PHP Frameworks: I want to break free (IPC Berlin 2024)
PHP Frameworks: I want to break free (IPC Berlin 2024)
 
The Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and SalesThe Art of the Pitch: WordPress Relationships and Sales
The Art of the Pitch: WordPress Relationships and Sales
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdfFIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
FIDO Alliance Osaka Seminar: FIDO Security Aspects.pdf
 
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdfFIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
FIDO Alliance Osaka Seminar: Passkeys and the Road Ahead.pdf
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 

CNI 2018: A Research Object Authoring Tool for the Data Commons

  • 1. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 1 FAIR4CURES A Research Object Authoring Tool for the Data Commons December 11, 2018 Anita de Waard (she, her) VP Research Collaborations
  • 2. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Overview: 1. The NIH Data Commons: a very short introduction 2. The FAIR4CURES Project 3. A Global Unique Identifier Broker 4. Research Objects: a very very short introduction 5. Building a Research Object Authoring Tool on Mendeley Data
  • 3. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 The NIH Data Commons Pilot Phase aims to provide a marketplace for tools, data and workflows based on existing technologies of commercial and academic platforms that strive to embody the FAIR Data principles. Overview:
  • 4. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Data Commons Overview: Goal of the project: 1. Advance the policies and protocols for accessing human subjects data 2. Support global identification, indexing and searching of available data sets; 3. Provide a collection of computational pipelines that can be applied to data sets 4. Utilize standards to globally identify and access data sets, tools and workflows 5. Create policies for data citation, reuse and reproducibility 6. Enable researchers to port their own data and workflows into the cloud Project structure: • DCPPC research groups are addressing important Key Capabilities => • The Commons will be composed of four stacks, incorporating products from the KCs Final output: • Data from three large NIH Databases will be available through all of these systems • Users can securely access data within all stacks, on multiple cloud providers • Users have access a basic set of applications that run the same way on all stacks. https://public.nihdatacommons.us/ExecutiveSummary_4YP/ Key Capabilities: 1: FAIR Guidelines & Metrics 2: Global Unique IDs for FAIR Digital Objects 3: Open Standard APIs 4: Cloud Agnostic Architecture Framework 5: Workspaces for Computation 6: Research Ethics, Privacy, and Security 7: Indexing and Search
  • 5. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Data Commons Guiding Principles: • 1. Identifiers for data: Develop and implement an interoperable global unique identifier system for digital objects. • 2. Data access: Develop and implement authentication and authorization policies and protocols for controlled access to digital objects and derivatives. • 3. Findability: Enable search and indexing of digital objects and data sets. • 4. Software stacks: The Commons will encompass multiple robust and sustainable software stacks implementing Commons standards and systems. • 5. Data use, standards: All tools will be build using standard application interfaces. • 6. Use cases: The Commons will develop and utilize an extensive use case library. • 7. Community: The Commons is developed through intense Community engagement and support across multiple levels of expertise. • 8. Community: Governance, membership, and coordination will be established with and through the community. • 9. Evaluation methods and metrics: We plan a culture of frequent release of products, with small iterations, routine evaluation and redesign. • 10. FAIR guidelines and metrics: Once FAIR metrics and rubrics are defined, these will be used to measure the level of “FAIRness” of repositories, datasets, and other digital objects. https://public.nihdatacommons.us/executive-summary/
  • 6. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Team Xenon – Four partner organisations Findable Accessible Interoperable Reusable Collaborative Usable Reproducible Extendable Scalable The FAIR4CURES Collaboration: Index 3 datasets: • Trans-omics for Precision Medicine (TOPMed) • Genotype Tissue Expression (GTEx) • Model Organisms Database (MODs)
  • 7. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 The FAIR4CURES PlatformThe FAIR4CURES System:
  • 8. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 • Identifiers for hosted data files within TOPMed studies, GTEx dataset, and MODs • Feature for researchers to register identifiers for their derived data files on the platform, making the content public and searchable • Selecting types of identifiers to support in the Data Commons ecosystem and the required identifier metadata • Open Source tool, connected to the SevenBridges Platform • Also accessible via Github/SmartAPI Global Unique Identifier Broker:
  • 9. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Digital Object Types Identified following the KC2 Metadata Spec: Seven Bridges Object Type DataCite Resource Type Proposed Schema.Org CreativeWork Types Supported Relationships Notes File Dataset Dataset Source Of a Task (input file) Derived From a Task (output file) Part Of a Collection One (or more) files packaged with metadata as a dataset App (Tool) Software SoftwareSourceCode Part of Task or Collection or Workflow Same as dataset, but file is source code App (Workflow) Workflow SoftwareSourceCode (?) Has Part of Software An aggregation of Tools (Software). File is CWL definition describing how the tools are chained. Task Collection Collection Composition of Files and Apps (Tools or Workflows) An aggregation of Apps (either tools or workflows), plus files (input & output) plus a record of all the settings used for each App. Collection (Study) Collection Collection Composition of any object An aggregation of heterogeneous objects for purpose of publishing. https://docs.google.com/document/d/1FD3aXr_uHnPy-YrFhQhuXET73tBVxu7F_Q5uS9TPUZs/edit
  • 10. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Seven Bridges Data Publication Concept Requirements Analysis: 1. Landing page URL including GUID 2. URL for page where file can be accessed (downloaded) 3. Metadata for object 4. Reference to the Task (zero or one) that this dataset was Derived From 5. Reference to the Task(s) (zero, one or more) that this dataset is the Source Of 1 2 3 4 5
  • 11. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Seven Bridges Workflow Configuration (CWL)
  • 12. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Standards-based metadata framework for logically and physically bundling resources with context http://researchobject.org What are Research Objects? Aggregates link things together Annotations about things & their relationships Container Packaging content & links: Zip files, BagIt, Docker images Identification locate things regardless where
  • 13. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Research Objects can be used to capture outputs in a wide range of scopes • Profiles help define the shape and form of a research object. • A profile defines the general purpose of that type of Research Objects: • A format (e.g. Research Object Bundle), • An expectation of what kind of resources should be expected, • A link to any specific vocabularies that should be used in its annotations. Applications of Research Objects include BDBags (Big Data Bags): • In digital libraries, preservation of source artifacts commonly use the BagIt format for archive serialization, capturing digital resources like audio recordings, document scans and their transcriptions, provenance and annotations. • The Research Object BagIt archive is a profile for describing a BagIt archive and its content as a Research Object to structure the metadata and relate the captured resources • The NIH-funded Big Data for Discovery Science (BDDS) project captures Big Data bags (BDBag) of large complex datasets from genomics workflows (https://doi.org/10.1109/BigData.2016.7840618). • A key aspect of BDBag is the ability to use Minimal Viable Identifiers (minid) for referencing potentially large data sources held in multiple remote repositories, effectively making a “Big Data” Research Object for large-scale workflows (https://doi.org/10.1101/268755). • A bag of bags (minid:b9vx04) is a metadata skeleton which may be completed with tools like bdbag to download the big data • The bags’ Research Object manifests can be consumed independently, linking to the remote resources. Research Objects and BDBags: http://www.researchobject.org/scopes/
  • 14. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Moving from Datasets to Research Objects in Mendeley Data: In Mendeley Data Repository, datasets are lists of files (stored in our S3 bucket) with metadata packaging (e.g. Titles, Description, Categories, License) and a persistent identifier DOI). We will introduce: • Collections as an aggregation of Datasets. Similar to a Dataset, BUT, the contents are other datasets, not files. • Software and Workflow as different types of Digital Objects. Similar to a Dataset, BUT files are source code or workflow specifications (e.g. CWL) and metadata properties could be a bit different. This forms the foundation for Research Objects, which are: • Collections or aggregations of different types of Digital Objects (not just datasets) • References to digital objects on other platforms, based on standard identifiers (e.g. DOIs or ARKs) • A manifest which lists and describes the contents of the Research Object • Exposed in JSON-LD:
  • 15. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 GUID Broker (API Only) Seven Bridges Fair4CURES Platform Phase 1 Pilot Project (Apr – Sep 2018) Register Datasets (Data Files) Register Software Objects Register Workflow Objects Uses Register a Collection as a list of digital objects (data, sw, wf) In Summary: Objective 1 – support “Task” type Research Objects on Seven Bridges platform. Objective 2 - support configurable Research Objects on Mendeley Data platform. Phase 2 Project (Oct 2018 - 2019) Add annotation and relationships to collection to describe a research object Research Object Composer Serialise Research Object in standard format based on BDBags and RO standards Mendeley Data Platform Uses Re-uses http://smart-api.info/ui/ bf9abe9c17c9c78c432832382ef9e16a#/
  • 16. ELSEVIER | The Research Object Authoring Tool --- CNI 2018 Acknowledgements: • This work is supported by the NIH Data Commons Pilot Phase under the Research Opportunity Announcement (ROA) RM-17-026 https://commonfund.nih.gov/commons/: • NIH Data Commons - 1 OT3 OD025463-01 • NHLBI STAGE Project - 1 OT3 HL142478-01 • The FAIR4CURES Project lead by SevenBridges (Alison Leaf, Brandi Davis-Dusenbury and Sarper Avcil) • We partner in the Project with Repositive UK and the US Dept of Veteran’s Affairs • The metadata standards development was done by KC2, lead by Team Sodium (esp. Merce Crosas, Tim Clark, Trisha Cruse and Martin Fenner) • The Research Objects Authoring Tool work is lead by the University of Manchester, who pioneered work on Research Objects (Stian Soiland-Reyes and Carole Goble) • The Mendeley Data team has built the GUID Broker Prototype (Gabriel Oscares, Gareth Harvey