Let’s go on a FAIR safari!

Let’s go on a
FAIR Safari!
Prof Carole Goble
The FAIRDOM Consortium
ELIXIR UK Head of Node
BioComputeObject Partnership
The University of Manchester, UK
carole.goble@manchester.ac.uk
COMBINE 2019, EU-STANDS4PM, Heidelberg,Germany 18 July 2019
A European standardization framework for data
integration and data-driven in silico models for
personalised medicine
harmonised transnational standards, recommendations
and guidelines that allow a broad application of
predictive in silico methodologies in personalised
medicine across Europe.
A European standardization framework for data
integration and data-driven in silico models for
personalised medicine
Scientific Data 3, 160018 (2016)
doi:10.1038/sdata.2016.18
A potted history
Many went before
2014 - Lorentz workshop
2015 - BioHackathon
2016 - Published
Went bananas
Grassroots activity that has
become a top down one.
sharing/publishing assets in public archives…
Data Models
*top three most popular
The evolution of standards and data management practices in systems biology
(2015). Stanford et al, Molecular Systems Biology, 11(12):851
… model reuse is tricky…
Stanford et alThe evolution of standards and data management practices in systems biology,
Molecular Systems Biology (2015) 11: 851 DOI 10.15252/msb.20156053
COMBINE sessions on Reproducibility
... different repositories, owners,
sovereignties, infrastructure, platforms …
The evolution of standards and data management practices in systems biology (2015).
Stanford et al, Molecular Systems Biology, 11(12):851
A jungle
An ecosystem
http://genexplain.com/mypathsem/
FAIR for the widest possible use from EHR to Research …
Just how feasible is it to interoperate (integrate)
and reuse data collected for another purpose in
another domain?
The FAIR Jungle
The FAIR Hype
Clarity
Infrastructure
Methodologies
Incentives
Cutting a path through the jungle….
PEST – political, economic, social, technical
What does it mean to be FAIR?
What is the cost / benefit analysis
Lets
examine
FAIR more
closely….
FAIR principles in the paper…
some people seem to have taken as the law of the jungle…
FAIR Principles
machine-actionable data and metadata
Findable Accessible Interoperable Reusable
Find: with
machine
readable
metadata
Locate and id:
with standard
identification
mechanism
Available and
obtainable
Human &
machine
Metadata
always
STANDARDS
Semantically
encoded,
syntactically
parsable
References
Sufficiently
described
Provenance
Least restrictive
licenses
Community
compliant
Increase exchange, integration and reuse
Across disciplines and borders
FAIR Principles reality check
• An aspiration, a journey.
• A call for machine actionability
of data and metadata.
• Ambiguous.
• Work in progress.
• A subset of indicators:
– ROI, impact, community
need, sustainability of
repository, quality of
service….
Are Are not
• A standard.
• Strict.
• Just about humans being able to
find, access, reformat and finally
reuse data.
• Technology specific.
• Domain specific.
• Tablets of stone
Mons et al Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open
ScienceCloud. Information Services & Use. 37. 1-8. 10.3233/ISU-170824.
Dunning et al Are the FAIR Data Principles fair? IDCC17
Lets measure it!
Framework for metrics
Automated services
Manual services
Authorities
Wilkinson et al, Evaluating FAIR MaturityThrough a Scalable, Automated,Community-Governed Framework
https://doi.org/10.1101/649202
Lets measure it!
Dunkelziffer
“Not everything that
can be counted counts.
Not everything that
counts can be counted”
[William Bruce Cameron]
Compliance
Awareness
Expectation
setting
Self-evaluation
Reporting
By Providers,
Users &
Community
Certification
Judgement
Regulation
ByWhom???
Comparison
Monitoring
Review
Quality
By Community
Contract
http://blog.ukdataser
vice.ac.uk/fair-data-
assessment-tool/
https://fairshake.cloud/
Indicators: Robustness,
Humility,Transparency,
Diversity, Reflexivity*
Context dependency
Community standards
Incremental
Matrix of metrics
Maturity levels for each
+
*The MetricTide, https://responsiblemetrics.org/the-metric-tide/
F and A are not so bad
I and R are hard
A FAIR Ecosystem means....
FAIR indicators, models and trust
Transparent
evaluation
Capability Maturity Model
of entities & their capabilities
Indicators and metrics
measuring levels
Foundational
Components
FAIRification
Process
Awareness and Policy
Standards and Guidelines
People
Infrastructure
Value Based Assessment
Selection
Goal Setting
Process planning
Modelling
Transformation
Publishing
Impl. Outcome:
Dataset
Persistent Identification
Data Set Discovery
Machine Readability
Data Access and Usage
Preservation and Sustainability
RDA FAIR Data
Maturity Model
Working Group
Cataloguing the FAIR ecosystem
What do we mean by a Maturity Model?
[Susheel Varma]
Only way more
elaborate ….
[Wilkinson et al, 2019]
FAIR Evaluator Workflows
Rubrics, Indicators andTests
are FAIR objects and community decisions
https://doi.org/10.1101/649202
Scale up and scale out
automation of
indicators and their
evaluation…
FAIRification Pipelines
Rare Diseasehttps://www.go-fair.org/fair-
principles/fairification-process/
[Marco Roos]
FAIRification Cookbooks … for models?
https://fairplus-project.eu/
More than just data
Software, models, workflows, SOPs, Lab Protocols….
FAIR Digital Objects
FAIR Models
properties of data + software
FAIR Software
FAIR Workflows*
Maintainability
Testing
Portability
Composite structure
Forms (spec or code?)
Versioning
Executability
Maturity models
Contributor policy
Identity
Copyright
Licenses Documentation
Sustainability
Model
Reproducibility
& Exchange
*FAIR ComputationalWorkflows https://doi.org/10.5281/zenodo.3268653
FAIR Precision
Medicine
Models …
Indicators &
Maturity
Model eXchange
Standards
Identifiers
AAI & Licensing
Repositories
Search
DMP
Policies Governance
Cloud of
registries
Federation
Scale out mark-up for federation
https://eosc-edmi.github.io/
http://bioschemas.org
EOSC Dataset
Minimum
Information
FAIR OPEN
SAFE
privacy preservation, regulatory rigour
crossing domain and sovereignty boundaries
Privacy Preservation of data
data book keeping
https://f1000research.com/posters/7-1036
https://www.monarc.lu
[Pinar Alper]
Privacy Preservation of analysis
take (distributed) analysis to the (distributed) data
https://www.health-ri.org/
Personal Health Train
Collect privacy sensitive data using mobile containers
Regulatory Practice
robust, safe exchange and reuse of
HTS computational analytical
workflows
http://biocomputeobject.org
IEEE P2791
BioCompute
Working Group
[Vahan Simonyan]
BioCompute Framework
to advance Regulatory Science to support NGS analysis
Emphasis on robust, safe reuse.
Describe and validate the
metadata of packages, and
their contents, both inside
and outside
Standardise data formats and
elements and exchange of
Electronic Health Records
Describe and
validate analysis
workflows, to be
portable and
interoperable
Standardise and support
sharing and analysis of
Genomic data
Alterovitz, Dean II, Goble, Crusoe, Soiland-Reyes et al “Enabling
Precision Medicine via standard communication of NGS provenance,
analysis, and results” PLOS Biology 2018
Bechhofer et al (2013)Why linked data is not enough for scientists https://doi.org/10.1016/j.future.2011.08.004
Bechhofer et al (2010) Research Objects:Towards Exchange and Reuse of Digital Knowledge, https://eprints.soton.ac.uk/268555/
Self-describing machine processable
metadata in common and specific to
different object types.
bundle together references or
the objects themselves. Relate
digital resources
snapshot | cite | exchange
Research Object
Framework
COMBINE was early to the party….
Combine Archive
Scharm M,Wendland F, Peters M,Wolfien M,TheileT,Waltemath D SEMS, University of Rostock zip-like file with a manifest & metadata
- Bundle files - Keep provenance
- Exchange data - Ship results
Bergmann, F.T. (2014). COMBINE archive and OMEX format: one file to share all information
to reproduce a modeling project. BMC bioinformatics,15(1), 1.
https://sems.unirostock.de/projects/combinearchive/
Big data distributed over multiple locations,
Efficiently and safely moved on demand
ROs are verified collections of references
[Chard, et al 2016]
FAIR Research Objects
The KnowledgeObject Reference Ontology (KORO): A formalism to support management and sharing of computable
biomedical knowledge for learning health systems
Flynn, Friedman, Boisvert, Landis‐Lewis, Lagoze (2018), https://doi.org/10.1002/lrh2.10054
Graphs of ROs
Track ROs
Combine and enrich ROs
Learning Health Systems
and Research Objects
EOSC-Life: FAIR data and tools (workflows, models) for
cloud use
RI data (distributed over
facilities)
Ecosystem of innovative
tools in EOSC
Publish FAIR life
science data in EOSC
Data Catalogues
Tools Catalogues
Workflow Catalogues
Service Catalogues
[Niklas Blomberg]
Let’s go on a FAIR safari!
FAIR Challenges
for Projects
Track collection of data and metadata X X X
Maintain experimental context X X
Find and exchange assets X X X X
Long-term retain results beyond a project X X X
Share, disseminate and publish assets sensitively X X X
Consistently report for interpretation, interoperability
& comparison
X X
Promote standardised metadata practices. X X
Organise and link assets X X X
Reuse tools and community archives X X
Integrate with other data stores and platforms X X X
Support reproducible publications X X X X
Credit owners X X
Public Project
Commons
Platform
Service hosted at HITS
50+
installations
140+
projects
Support
A Commons
Project
Investigations and Assets
Simulate model
Launch workflows
Models
SOPs
People Projects
Publications
Documents
Presentations
Workflows
Data
Events
Federated Catalogue, Integrated view
interlinked objects, structured organisation, resources ecosystem
Investigations
Studies
Assays/Analyses
Workflows
Federated Catalogue, Integrated view
interlinked objects, structured organisation, resources ecosystem
Stores Archives
FAIR Membrane
Investigations
Studies
Assays/Analyses
Federated Catalogue, Integrated view
A Commons is only as FAIR as the content (inside and outside)
FAIR Membrane
FAIR(ish) after death ….
https://fairdom
hub.org/projec
ts/129
https://wellcomeopenresearch.org/articles/4-104/v1
Zielinski, Hay, Millar, The grant is dead, long live the data - migration as a pragmatic exit strategy
for research data preservation,
Data Sovereignty: FAIR but not yet Open
A Project
Commons
not an
integrated
data
warehouse
e.g. (Pillar III)
in-house in-house
All LiSyM
Patient-related
clinical data
Aggregated data
API
External Tools
API
Data Sovereignty: FAIR but never Open
[Mueller]
Data Sovereignty: Personal Health Tram
Less automatic, more transparent, when partners cannot share
Share table structure
Share common code
Share summaries
FAIR at the First Mile
[Christian R Bauer]
FAIR at the First Mile
Project Commons Integrated Data Warehouse[Christian R Bauer]
EU-STAND4PM: First and Last Mile
Neylon, Knowledge Exchange Report: http://www.knowledge-exchange.info/event/ke-approach-open-scholarship
FAIR at last mile
FAIR at first
mile / source
FAIR Protected
Data/Compute
FAIR
Objects
FAIRification
EU-STANDS4PM
FAIR path through the jungle
Indicators and Maturity Models
obtainable & understandable
Technical infrastructure & Stewardship Skills
possible
Communities & Culture
easy (or at least feasible)
User Experience
normative
rewarding
Incentives
required
Policies
Based on Matt Spritzer’s figure, COS
Acknowledgements
FAIRDOM Team
– http://www.fair-dom.org
Research Object Team
– http://www.researchobject.org
BioComputeObject
– http://biocomputeobject.org/
FAIR folks, esp. FAIRplus and FAIR Metrics
– https://fairplus-project.eu/
– http://www.fairmetrics.org
CommonWorkflow Language
– http://www.commonwl.org
ELIXIR
– http://www.elixir-europe.org
BioExcel
– http://bioexcel.eu
Acknowledgements
1 of 58

Recommended

Introduction to FAIRDOM by
Introduction to FAIRDOMIntroduction to FAIRDOM
Introduction to FAIRDOMCarole Goble
1.3K views41 slides
Reproducible and citable data and models: an introduction. by
Reproducible and citable data and models: an introduction.Reproducible and citable data and models: an introduction.
Reproducible and citable data and models: an introduction.FAIRDOM
4.2K views15 slides
How are we Faring with FAIR? (and what FAIR is not) by
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)Carole Goble
814 views36 slides
FAIRy stories: tales from building the FAIR Research Commons by
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research CommonsCarole Goble
1.4K views59 slides
Research Objects, SEEK and FAIRDOM by
Research Objects, SEEK and FAIRDOMResearch Objects, SEEK and FAIRDOM
Research Objects, SEEK and FAIRDOMCarole Goble
1.7K views13 slides
FAIR data and model management for systems biology. by
FAIR data and model management for systems biology.FAIR data and model management for systems biology.
FAIR data and model management for systems biology.FAIRDOM
1.6K views21 slides

More Related Content

What's hot

FAIR Data and Model Management for Systems Biology (and SOPs too!) by
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)Carole Goble
1.1K views50 slides
FAIRer Research by
FAIRer ResearchFAIRer Research
FAIRer ResearchCarole Goble
1K views40 slides
FAIR Computational Workflows by
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
982 views49 slides
FAIR Data, Operations and Model management for Systems Biology and Systems Me... by
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...Carole Goble
1.5K views47 slides
Capturing the context: one small(ish step for modellers, one giant leap for m... by
Capturing the context: one small(ish step for modellers, one giant leap for m...Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...FAIRDOM
3.4K views26 slides
Reproducible Research: how could Research Objects help by
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects helpCarole Goble
605 views30 slides

What's hot(20)

FAIR Data and Model Management for Systems Biology (and SOPs too!) by Carole Goble
FAIR Data and Model Management for Systems Biology(and SOPs too!)FAIR Data and Model Management for Systems Biology(and SOPs too!)
FAIR Data and Model Management for Systems Biology (and SOPs too!)
Carole Goble1.1K views
FAIR Computational Workflows by Carole Goble
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble982 views
FAIR Data, Operations and Model management for Systems Biology and Systems Me... by Carole Goble
FAIR Data, Operations and Model management for Systems Biology and Systems Me...FAIR Data, Operations and Model management for Systems Biology and Systems Me...
FAIR Data, Operations and Model management for Systems Biology and Systems Me...
Carole Goble1.5K views
Capturing the context: one small(ish step for modellers, one giant leap for m... by FAIRDOM
Capturing the context: one small(ish step for modellers, one giant leap for m...Capturing the context: one small(ish step for modellers, one giant leap for m...
Capturing the context: one small(ish step for modellers, one giant leap for m...
FAIRDOM3.4K views
Reproducible Research: how could Research Objects help by Carole Goble
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
Carole Goble605 views
Citing data in research articles: principles, implementation, challenges - an... by FAIRDOM
Citing data in research articles: principles, implementation, challenges - an...Citing data in research articles: principles, implementation, challenges - an...
Citing data in research articles: principles, implementation, challenges - an...
FAIRDOM3.8K views
FAIRy stories: the FAIR Data principles in theory and in practice by Carole Goble
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practice
Carole Goble246 views
Making your data good enough for sharing. by FAIRDOM
Making your data good enough for sharing.Making your data good enough for sharing.
Making your data good enough for sharing.
FAIRDOM4.8K views
The FAIRDOM Commons for Systems Biology by FAIRDOM
The FAIRDOM Commons for Systems BiologyThe FAIRDOM Commons for Systems Biology
The FAIRDOM Commons for Systems Biology
FAIRDOM2.3K views
Being FAIR: FAIR data and model management SSBSS 2017 Summer School by Carole Goble
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Carole Goble978 views
Open Science: how to serve the needs of the researcher? by Carole Goble
Open Science: how to serve the needs of the researcher? Open Science: how to serve the needs of the researcher?
Open Science: how to serve the needs of the researcher?
Carole Goble222 views
The swings and roundabouts of a decade of fun and games with Research Objects by Carole Goble
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
Carole Goble168 views
FAIR Workflows and Research Objects get a Workout by Carole Goble
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout
Carole Goble480 views
Data management, data sharing: the SysMO-SEEK Story by Carole Goble
Data management, data sharing: the SysMO-SEEK StoryData management, data sharing: the SysMO-SEEK Story
Data management, data sharing: the SysMO-SEEK Story
Carole Goble1.2K views
FAIR History and the Future by Carole Goble
FAIR History and the FutureFAIR History and the Future
FAIR History and the Future
Carole Goble308 views
What is Reproducibility? The R* brouhaha (and how Research Objects can help) by Carole Goble
What is Reproducibility? The R* brouhaha (and how Research Objects can help)What is Reproducibility? The R* brouhaha (and how Research Objects can help)
What is Reproducibility? The R* brouhaha (and how Research Objects can help)
Carole Goble1.5K views
RO-Crate: A framework for packaging research products into FAIR Research Objects by Carole Goble
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
Carole Goble425 views
FAIR Data Bridging from researcher data management to ELIXIR archives in the... by Carole Goble
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
Carole Goble120 views
Crediting informatics and data folks in life science teams by Carole Goble
Crediting informatics and data folks in life science teamsCrediting informatics and data folks in life science teams
Crediting informatics and data folks in life science teams
Carole Goble1.1K views

Similar to Let’s go on a FAIR safari!

The FAIR Principles and FAIRsharing by
The FAIR Principles and FAIRsharingThe FAIR Principles and FAIRsharing
The FAIR Principles and FAIRsharingSusanna-Assunta Sansone
454 views33 slides
FAIR data and model management for systems biology (and SOPs too!) by
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)FAIRDOM
6.6K views50 slides
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021 by
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021dkNET
313 views60 slides
FAIR Computational Workflows by
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
630 views49 slides
FAIRsharing presentation at the Japan Science and Technology Agency by
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology AgencyPeter McQuilton
556 views38 slides
VODAN Africa IN.pptx by
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptxGetu Tadele
4 views30 slides

Similar to Let’s go on a FAIR safari!(20)

FAIR data and model management for systems biology (and SOPs too!) by FAIRDOM
FAIR data and model management for systems biology (and SOPs too!)FAIR data and model management for systems biology (and SOPs too!)
FAIR data and model management for systems biology (and SOPs too!)
FAIRDOM6.6K views
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021 by dkNET
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET Webinar: FAIR Data & Software in the Research Life Cycle 01/22/2021
dkNET313 views
FAIR Computational Workflows by Carole Goble
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble630 views
FAIRsharing presentation at the Japan Science and Technology Agency by Peter McQuilton
FAIRsharing presentation at the Japan Science and Technology AgencyFAIRsharing presentation at the Japan Science and Technology Agency
FAIRsharing presentation at the Japan Science and Technology Agency
Peter McQuilton556 views
FAIR Computational Workflows by Carole Goble
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble493 views
FAIR data: what it means, how we achieve it, and the role of RDA by Sarah Jones
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDA
Sarah Jones871 views
NIH Data Summit - The NIH Data Commons by Vivien Bonazzi
NIH Data Summit - The NIH Data CommonsNIH Data Summit - The NIH Data Commons
NIH Data Summit - The NIH Data Commons
Vivien Bonazzi768 views
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook by Susanna-Assunta Sansone
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve by Kees van Bochove
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The HyveOpen Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Open Insights Harvard DBMI - Personal Health Train - Kees van Bochove - The Hyve
Kees van Bochove260 views
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo... by Carole Goble
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
Carole Goble45 views
Making Data FAIR (Findable, Accessible, Interoperable, Reusable) by Tom Plasterer
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Tom Plasterer769 views
A Big Picture in Research Data Management by Carole Goble
A Big Picture in Research Data ManagementA Big Picture in Research Data Management
A Big Picture in Research Data Management
Carole Goble666 views

More from Carole Goble

Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research... by
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...Carole Goble
38 views33 slides
Research Software Sustainability takes a Village by
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a VillageCarole Goble
40 views29 slides
FAIR Computational Workflows by
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
193 views29 slides
Open Research: Manchester leading and learning by
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learningCarole Goble
143 views17 slides
RDMkit, a Research Data Management Toolkit. Built by the Community for the ... by
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
712 views38 slides
FAIR Computational Workflows by
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
415 views14 slides

More from Carole Goble(13)

Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research... by Carole Goble
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Carole Goble38 views
Research Software Sustainability takes a Village by Carole Goble
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a Village
Carole Goble40 views
FAIR Computational Workflows by Carole Goble
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble193 views
Open Research: Manchester leading and learning by Carole Goble
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learning
Carole Goble143 views
RDMkit, a Research Data Management Toolkit. Built by the Community for the ... by Carole Goble
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
Carole Goble712 views
FAIR Computational Workflows by Carole Goble
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble415 views
EOSC-Life Workflow Collaboratory by Carole Goble
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
Carole Goble132 views
What is Reproducibility? The R* brouhaha and how Research Objects can help by Carole Goble
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can help
Carole Goble258 views
ELIXIR UK Node presentation to the ELIXIR Board by Carole Goble
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR Board
Carole Goble501 views
Reflections on a (slightly unusual) multi-disciplinary academic career by Carole Goble
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
Carole Goble482 views
Better Software, Better Research by Carole Goble
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
Carole Goble657 views
Research Object Community Update by Carole Goble
Research Object Community UpdateResearch Object Community Update
Research Object Community Update
Carole Goble196 views
Being FAIR: Enabling Reproducible Data Science by Carole Goble
Being FAIR: Enabling Reproducible Data ScienceBeing FAIR: Enabling Reproducible Data Science
Being FAIR: Enabling Reproducible Data Science
Carole Goble1.2K views

Recently uploaded

별헤는 사람들 2023년 12월호 전명원 교수 자료 by
별헤는 사람들 2023년 12월호 전명원 교수 자료별헤는 사람들 2023년 12월호 전명원 교수 자료
별헤는 사람들 2023년 12월호 전명원 교수 자료sciencepeople
63 views30 slides
Experimental animal Guinea pigs.pptx by
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptxMansee Arya
38 views16 slides
RemeOs science and clinical evidence by
RemeOs science and clinical evidenceRemeOs science and clinical evidence
RemeOs science and clinical evidencePetrusViitanen1
53 views96 slides
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe... by
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Anmol Vishnu Gupta
26 views12 slides
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...InsideScientific
105 views62 slides
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...ILRI
5 views6 slides

Recently uploaded(20)

별헤는 사람들 2023년 12월호 전명원 교수 자료 by sciencepeople
별헤는 사람들 2023년 12월호 전명원 교수 자료별헤는 사람들 2023년 12월호 전명원 교수 자료
별헤는 사람들 2023년 12월호 전명원 교수 자료
sciencepeople63 views
Experimental animal Guinea pigs.pptx by Mansee Arya
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptx
Mansee Arya38 views
RemeOs science and clinical evidence by PetrusViitanen1
RemeOs science and clinical evidenceRemeOs science and clinical evidence
RemeOs science and clinical evidence
PetrusViitanen153 views
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe... by Anmol Vishnu Gupta
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by InsideScientific
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
InsideScientific105 views
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by ILRI
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
ILRI5 views
2. Natural Sciences and Technology Author Siyavula.pdf by ssuser821efa
2. Natural Sciences and Technology Author Siyavula.pdf2. Natural Sciences and Technology Author Siyavula.pdf
2. Natural Sciences and Technology Author Siyavula.pdf
ssuser821efa10 views
Effect of Integrated Nutrient Management on Growth and Yield of Solanaceous F... by SwagatBehera9
Effect of Integrated Nutrient Management on Growth and Yield of Solanaceous F...Effect of Integrated Nutrient Management on Growth and Yield of Solanaceous F...
Effect of Integrated Nutrient Management on Growth and Yield of Solanaceous F...
SwagatBehera95 views
Light Pollution for LVIS students by CWBarthlmew
Light Pollution for LVIS studentsLight Pollution for LVIS students
Light Pollution for LVIS students
CWBarthlmew12 views
Applications of Large Language Models in Materials Discovery and Design by Anubhav Jain
Applications of Large Language Models in Materials Discovery and DesignApplications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and Design
Anubhav Jain13 views
Factors affecting fluorescence and phosphorescence.pptx by SamarthGiri1
Factors affecting fluorescence and phosphorescence.pptxFactors affecting fluorescence and phosphorescence.pptx
Factors affecting fluorescence and phosphorescence.pptx
SamarthGiri17 views
application of genetic engineering 2.pptx by SankSurezz
application of genetic engineering 2.pptxapplication of genetic engineering 2.pptx
application of genetic engineering 2.pptx
SankSurezz14 views
ELECTRON TRANSPORT CHAIN by DEEKSHA RANI
ELECTRON TRANSPORT CHAINELECTRON TRANSPORT CHAIN
ELECTRON TRANSPORT CHAIN
DEEKSHA RANI10 views
Note on the Riemann Hypothesis by vegafrank2
Note on the Riemann HypothesisNote on the Riemann Hypothesis
Note on the Riemann Hypothesis
vegafrank27 views
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy... by Anmol Vishnu Gupta
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...
Evaluation and Standardization of the Marketed Polyherbal drug Patanjali Divy...
How to be(come) a successful PhD student by Tom Mens
How to be(come) a successful PhD studentHow to be(come) a successful PhD student
How to be(come) a successful PhD student
Tom Mens537 views
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by ILRI
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
ILRI8 views

Let’s go on a FAIR safari!

  • 1. Let’s go on a FAIR Safari! Prof Carole Goble The FAIRDOM Consortium ELIXIR UK Head of Node BioComputeObject Partnership The University of Manchester, UK carole.goble@manchester.ac.uk COMBINE 2019, EU-STANDS4PM, Heidelberg,Germany 18 July 2019
  • 2. A European standardization framework for data integration and data-driven in silico models for personalised medicine harmonised transnational standards, recommendations and guidelines that allow a broad application of predictive in silico methodologies in personalised medicine across Europe.
  • 3. A European standardization framework for data integration and data-driven in silico models for personalised medicine
  • 4. Scientific Data 3, 160018 (2016) doi:10.1038/sdata.2016.18 A potted history Many went before 2014 - Lorentz workshop 2015 - BioHackathon 2016 - Published Went bananas Grassroots activity that has become a top down one.
  • 5. sharing/publishing assets in public archives… Data Models *top three most popular The evolution of standards and data management practices in systems biology (2015). Stanford et al, Molecular Systems Biology, 11(12):851
  • 6. … model reuse is tricky… Stanford et alThe evolution of standards and data management practices in systems biology, Molecular Systems Biology (2015) 11: 851 DOI 10.15252/msb.20156053 COMBINE sessions on Reproducibility
  • 7. ... different repositories, owners, sovereignties, infrastructure, platforms … The evolution of standards and data management practices in systems biology (2015). Stanford et al, Molecular Systems Biology, 11(12):851 A jungle An ecosystem
  • 9. FAIR for the widest possible use from EHR to Research …
  • 10. Just how feasible is it to interoperate (integrate) and reuse data collected for another purpose in another domain?
  • 13. Cutting a path through the jungle…. PEST – political, economic, social, technical What does it mean to be FAIR? What is the cost / benefit analysis
  • 15. FAIR principles in the paper… some people seem to have taken as the law of the jungle…
  • 16. FAIR Principles machine-actionable data and metadata Findable Accessible Interoperable Reusable Find: with machine readable metadata Locate and id: with standard identification mechanism Available and obtainable Human & machine Metadata always STANDARDS Semantically encoded, syntactically parsable References Sufficiently described Provenance Least restrictive licenses Community compliant Increase exchange, integration and reuse Across disciplines and borders
  • 17. FAIR Principles reality check • An aspiration, a journey. • A call for machine actionability of data and metadata. • Ambiguous. • Work in progress. • A subset of indicators: – ROI, impact, community need, sustainability of repository, quality of service…. Are Are not • A standard. • Strict. • Just about humans being able to find, access, reformat and finally reuse data. • Technology specific. • Domain specific. • Tablets of stone Mons et al Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open ScienceCloud. Information Services & Use. 37. 1-8. 10.3233/ISU-170824. Dunning et al Are the FAIR Data Principles fair? IDCC17
  • 18. Lets measure it! Framework for metrics Automated services Manual services Authorities Wilkinson et al, Evaluating FAIR MaturityThrough a Scalable, Automated,Community-Governed Framework https://doi.org/10.1101/649202
  • 19. Lets measure it! Dunkelziffer “Not everything that can be counted counts. Not everything that counts can be counted” [William Bruce Cameron]
  • 21. Indicators: Robustness, Humility,Transparency, Diversity, Reflexivity* Context dependency Community standards Incremental Matrix of metrics Maturity levels for each + *The MetricTide, https://responsiblemetrics.org/the-metric-tide/ F and A are not so bad I and R are hard A FAIR Ecosystem means.... FAIR indicators, models and trust Transparent evaluation
  • 22. Capability Maturity Model of entities & their capabilities Indicators and metrics measuring levels Foundational Components FAIRification Process Awareness and Policy Standards and Guidelines People Infrastructure Value Based Assessment Selection Goal Setting Process planning Modelling Transformation Publishing Impl. Outcome: Dataset Persistent Identification Data Set Discovery Machine Readability Data Access and Usage Preservation and Sustainability RDA FAIR Data Maturity Model Working Group Cataloguing the FAIR ecosystem
  • 23. What do we mean by a Maturity Model? [Susheel Varma] Only way more elaborate ….
  • 24. [Wilkinson et al, 2019] FAIR Evaluator Workflows Rubrics, Indicators andTests are FAIR objects and community decisions https://doi.org/10.1101/649202 Scale up and scale out automation of indicators and their evaluation…
  • 26. FAIRification Cookbooks … for models? https://fairplus-project.eu/
  • 27. More than just data Software, models, workflows, SOPs, Lab Protocols…. FAIR Digital Objects
  • 28. FAIR Models properties of data + software FAIR Software FAIR Workflows* Maintainability Testing Portability Composite structure Forms (spec or code?) Versioning Executability Maturity models Contributor policy Identity Copyright Licenses Documentation Sustainability Model Reproducibility & Exchange *FAIR ComputationalWorkflows https://doi.org/10.5281/zenodo.3268653
  • 31. Scale out mark-up for federation https://eosc-edmi.github.io/ http://bioschemas.org EOSC Dataset Minimum Information
  • 32. FAIR OPEN SAFE privacy preservation, regulatory rigour crossing domain and sovereignty boundaries
  • 33. Privacy Preservation of data data book keeping https://f1000research.com/posters/7-1036 https://www.monarc.lu [Pinar Alper]
  • 34. Privacy Preservation of analysis take (distributed) analysis to the (distributed) data https://www.health-ri.org/ Personal Health Train Collect privacy sensitive data using mobile containers
  • 35. Regulatory Practice robust, safe exchange and reuse of HTS computational analytical workflows http://biocomputeobject.org IEEE P2791 BioCompute Working Group [Vahan Simonyan]
  • 36. BioCompute Framework to advance Regulatory Science to support NGS analysis Emphasis on robust, safe reuse. Describe and validate the metadata of packages, and their contents, both inside and outside Standardise data formats and elements and exchange of Electronic Health Records Describe and validate analysis workflows, to be portable and interoperable Standardise and support sharing and analysis of Genomic data Alterovitz, Dean II, Goble, Crusoe, Soiland-Reyes et al “Enabling Precision Medicine via standard communication of NGS provenance, analysis, and results” PLOS Biology 2018
  • 37. Bechhofer et al (2013)Why linked data is not enough for scientists https://doi.org/10.1016/j.future.2011.08.004 Bechhofer et al (2010) Research Objects:Towards Exchange and Reuse of Digital Knowledge, https://eprints.soton.ac.uk/268555/ Self-describing machine processable metadata in common and specific to different object types. bundle together references or the objects themselves. Relate digital resources snapshot | cite | exchange Research Object Framework
  • 38. COMBINE was early to the party…. Combine Archive Scharm M,Wendland F, Peters M,Wolfien M,TheileT,Waltemath D SEMS, University of Rostock zip-like file with a manifest & metadata - Bundle files - Keep provenance - Exchange data - Ship results Bergmann, F.T. (2014). COMBINE archive and OMEX format: one file to share all information to reproduce a modeling project. BMC bioinformatics,15(1), 1. https://sems.unirostock.de/projects/combinearchive/
  • 39. Big data distributed over multiple locations, Efficiently and safely moved on demand ROs are verified collections of references [Chard, et al 2016] FAIR Research Objects
  • 40. The KnowledgeObject Reference Ontology (KORO): A formalism to support management and sharing of computable biomedical knowledge for learning health systems Flynn, Friedman, Boisvert, Landis‐Lewis, Lagoze (2018), https://doi.org/10.1002/lrh2.10054 Graphs of ROs Track ROs Combine and enrich ROs Learning Health Systems and Research Objects
  • 41. EOSC-Life: FAIR data and tools (workflows, models) for cloud use RI data (distributed over facilities) Ecosystem of innovative tools in EOSC Publish FAIR life science data in EOSC Data Catalogues Tools Catalogues Workflow Catalogues Service Catalogues [Niklas Blomberg]
  • 43. FAIR Challenges for Projects Track collection of data and metadata X X X Maintain experimental context X X Find and exchange assets X X X X Long-term retain results beyond a project X X X Share, disseminate and publish assets sensitively X X X Consistently report for interpretation, interoperability & comparison X X Promote standardised metadata practices. X X Organise and link assets X X X Reuse tools and community archives X X Integrate with other data stores and platforms X X X Support reproducible publications X X X X Credit owners X X
  • 44. Public Project Commons Platform Service hosted at HITS 50+ installations 140+ projects Support
  • 45. A Commons Project Investigations and Assets Simulate model Launch workflows
  • 46. Models SOPs People Projects Publications Documents Presentations Workflows Data Events Federated Catalogue, Integrated view interlinked objects, structured organisation, resources ecosystem Investigations Studies Assays/Analyses
  • 47. Workflows Federated Catalogue, Integrated view interlinked objects, structured organisation, resources ecosystem Stores Archives FAIR Membrane Investigations Studies Assays/Analyses
  • 48. Federated Catalogue, Integrated view A Commons is only as FAIR as the content (inside and outside) FAIR Membrane
  • 49. FAIR(ish) after death …. https://fairdom hub.org/projec ts/129 https://wellcomeopenresearch.org/articles/4-104/v1 Zielinski, Hay, Millar, The grant is dead, long live the data - migration as a pragmatic exit strategy for research data preservation,
  • 50. Data Sovereignty: FAIR but not yet Open A Project Commons not an integrated data warehouse
  • 51. e.g. (Pillar III) in-house in-house All LiSyM Patient-related clinical data Aggregated data API External Tools API Data Sovereignty: FAIR but never Open [Mueller]
  • 52. Data Sovereignty: Personal Health Tram Less automatic, more transparent, when partners cannot share Share table structure Share common code Share summaries
  • 53. FAIR at the First Mile [Christian R Bauer]
  • 54. FAIR at the First Mile Project Commons Integrated Data Warehouse[Christian R Bauer]
  • 55. EU-STAND4PM: First and Last Mile Neylon, Knowledge Exchange Report: http://www.knowledge-exchange.info/event/ke-approach-open-scholarship FAIR at last mile FAIR at first mile / source FAIR Protected Data/Compute FAIR Objects FAIRification
  • 56. EU-STANDS4PM FAIR path through the jungle Indicators and Maturity Models obtainable & understandable Technical infrastructure & Stewardship Skills possible Communities & Culture easy (or at least feasible) User Experience normative rewarding Incentives required Policies Based on Matt Spritzer’s figure, COS
  • 57. Acknowledgements FAIRDOM Team – http://www.fair-dom.org Research Object Team – http://www.researchobject.org BioComputeObject – http://biocomputeobject.org/ FAIR folks, esp. FAIRplus and FAIR Metrics – https://fairplus-project.eu/ – http://www.fairmetrics.org CommonWorkflow Language – http://www.commonwl.org ELIXIR – http://www.elixir-europe.org BioExcel – http://bioexcel.eu