How are we Faring with FAIR? (and what FAIR is not)

How are we
Faring with FAIR
(and what FAIR is not)
Carole Goble
The University of Manchester
FAIRDOM
ELIXIR, EOSC-Life, IBISBA, BioExcel CoE, FAIRplus
carole.goble@manchester.ac.uk
The views expressed in this talk are my own
Workshop: FAIRe Data Infrastructures, 15 October 2020
Data discovery and reuse at scale
through good data management
2016
A set of PRINCIPLES to enhance the
value of all digital resources and their
reuse by PEOPLE and by MACHINES
Scientific Data 3, 160018 (2016) doi:10.1038/sdata.2016.18 [Credit: Susanna Sansone]
Branding a trend, stimulating a movement …
2014
2016
2015
Open Science
Data Sharing
Recognition & Credit
Reproducibility Data-driven Science
Automation, AI
How are we Faring with FAIR? (and what FAIR is not)
The compulsory COVID-19 reference
“COVID is a good example .. there must be loads of legacy data.
We’re desperately trying to go back and look at what we knew from
SARS 10 years ago” – Pharma manager, FAIRplus project
https://www.covid19dataportal.org/
https://www.nature.com/articles/s41431-020-0635-7
https://www.rd-alliance.org/group/rda-covid19-rda-covid19-omics-rda-covid19-epidemiology-rda-covid19-
clinical-rda-covid19-1
https://doi.org/10.15497/rda00052
The compulsory COVID-19 reference
+ve • Data sharing boost –
• Impossible becomes normal
• Data infrastructure investments
• Mobilising rapid response
-ve • Political, technical and territorial issues
• Licensing, access to datasets, quality …
• Short-term vs long term sustainability
• Collection and governance bottlenecks
https://covid19.galaxyproject.org/
The compulsory what are the FAIR principles slide
… in a break out box,
without explanation or justification.
Aspirational, not a standard.
Relaunch a dialogue within the
research and policy communities.
Reboot a journey to wider
accessibility and reusability of data.
Prepare the community for
automation-readiness by supporting
FAIR for machines.
In the paper… 15 overlapping and ambiguous ….
Jacobsen et al FAIR Principles: Interpretations and
Implementation Considerations, J Data Intelligence (2020)
Mons et al Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding
principles for the European Open Science Cloud. Information Services &
Use. 37. 1-8. 10.3233/ISU-170824 (2017)
FAIR Principles in spirit
identifiers, metadata, availability, standards
[adapted from Susanna Sansone]
Findable
Accessible
Interoperable
Reusable
Globally unique, resolvable, and persistent identifiers
▪ To retrieve and connect data
Community defined descriptive metadata that is catalogued / searchable
▪ To enhance discoverability and reusability
Common terminologies and standards
▪ To use the same terms and they mean the same thing
Detailed provenance
▪ To contextualize the data and facilitate reproducibility
Terms of access
▪ Open as possible, closed as necessary
Terms of use
▪ Clear licences, ideally to enable innovation and reuse
Automation
FAIR Services in practice
identifiers, metadata, availability, standards
Findable
Accessible
Interoperable
Reusable
Persistent Identifiers
Metadata Services
Access Control
Repositories & registries
Data applications
“the internet of data and services”
Not one size fits all
FAIR is a set of guiding principles that
provide for a contract of expectation
between data providers and users.
A continuum of features, attributes and
behaviours, via many different
implementations for different use cases.
Communities will need to develop their own FAIR profile:
• for their portfolios of data, processes, governance, policies, assessment
A limited pan-discipline profile.
Funders
Public Health
Educators
WHY?WHO?
Data integration
Reannotation
Credit to:
A global movement rather quickly…a FAIR frenzy!
Many of these
projects will
speak today.
Researcher,
clinician
confusion.
Community
coordination
cacophony
Movement Start up Phase:
Picking apart the Principles, Inventing Indicators, Assessment and Maturity Frameworks
1. (Re)define the principles and what they mean
2. Measure the “FAIRness level” of data
The target, before & after levels of data FAIRness in various
“FAIRification” processes
3. Measure an organisation’s capability & performance
for FAIR data generation & management
Support strategic investment decisions, cost/benefit analysis, processes &
monitoring, capacity building, change management for FAIR by Design
DOI: 10.15497/RDA0050
Dataset maturity model
Data Management
Infrastructure maturity model
What were the principles about?
Contract
Compliance
Certification?
Judgement
Endorsement
Regulation
Trusted Repositories
Commitment of a community of providers
Federation of FAIR data, registries and services
Comparison, Monitoring, Review,Quality
Assessment
Expectation setting of a data provider
Self-evaluation
Awareness, Reporting
Community respectful
Context aware
Health data - Regulation
What were the principles about?
Contract
Compliance
Certification?
EOSC Strategic Research &
Innovation Agenda consultation 2020:
metrics & certification least popular
action area
FAIRware
The Tyranny of Metrics
FAIR is a Spectrum
Not all data are equal, not all will be worth it
Spectrum of FAIR indicators
Different levels of maturity and importance to
different stakeholders and communities
Communities define levels, depths, coverage
[Barend Mons]
FAIRify
• just in case
• just in time
• just enough
Dataset portfolio
FAIR is not
Not Fuzzy “enhancing the ability of machines to automatically find and use data or any
digital object, and support its reuse by individuals” INCF Statement
Not Free
Not Fast
Not Simple
Needs experts, stewards, infrastructure, processes, maintenance …
“FAIR is non-trivial, and domain specific at anything other than the most
superficial level” - MarkWilkinson
From high effort high gain to low effort light gain
All require consensus, process change and maintenance.
• Mons et al Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open Science Cloud. Information Services & Use. 37. 1-8. 10.3233/ISU-170824.
• Dunning et alAre the FAIR Data Principles fair? IDCC17
Not One Approach Not about turning everything into RDF
Coverage and Implementation ChoicesVary
Findable
Accessible
Interoperable
Reusable
Shallow wide,
low cost,
loose federation
restricted value
Deep narrow,
tight federation
harmonisation
high cost, high value
https://fairplus.github.io/cookbook-dev/intro
https://bioschemas.org/
https://www.go-fair.org/2020/07/08/a-
three-point-framework-for-fairification/
FAIR by Design
At the start of a collection, built in throughout the life cycle
change management, capacity building
FAIRifying Retrospectively
Legacy datasets, build a cohort,
cost benefit and FAIR readiness over a collection of datasets
Other FAIRVariants
FA(I)R Interoperability is
the hardest and
most costly to
define, implement
& maintain
Interoperability
is usually for a
purpose not
“just in case”
FAIR is not about
harmonising all
metadata to one
schema
FAIR+R
Reproducibility is
not the same as
Reusability
FAIR for all digital
objects – software,
workflows, SOPs,
models, containers,
training materials …
Depends on
availability and
metadata
Containers
FAIR++ Business and
change analysis.
Cost Benefit
Analysis.
Scientific /
BusinessValue
Quality control
Impact
Process maturity
Sustainability
FAIR for all digital
objects – software,
workflows, SOPs,
models, containers,
training materials …
EC Report:
Turning FAIR into Reality, 2018
https://www.natureindex.com/news-blog/what-
scientists-need-to-know-about-fair-data, 2019
47% respondents
needed greater
clarification
Step back…
Why do we need FAIR?
KnowledgeTurning, Information Flow Josh Sommer, Chordoma Foundation, 2011
Promote information flow
• groups and disciplines
• organisational boundaries at all levels
• technical infrastructure
• enable federation
Biomedical flow
• needs to control the information flow
– Ethical & governance frameworks,GDPR, consent
• federation feasibility varies
Fragmented and independent resources,
infrastructures, governance
Community enclaves.
Churn which leads to knowledge loss.
Scattered Fragmentation and
knowledge churn
FAIR is not synonymous with Open
respect authority and governance frameworks
Federated System of Catalogues and Repos
federation means common agreements and compliance
respecting accountability and responsibility
• Connected by PIDs
• Moving metadata around
• Common vocabularies
• Common cataloguing data elements
• Term cross-walks and mappings
• Shared API standards.
FAIR Profile for Biomedical Community
Findable
Accessible
Interoperable
Reusable
Awareness of data
Thin metadata sparingly shared
Data visiting
Not open but access controlled
Limited exchange under strict governance
Authentication and authorisation protocols
Data visiting
Ethics, consent, privacy preservation
Compliance , Governance
Standards and Regulation are different things
Federated analysis
Combining clinical, research and
public health data. Different scales,
collected for different purposes
Standards for Usage Access
• Beacon API for genomic variants
• Data Use Ontology – usage restrictions
• GA4GH Passports – access policies
Community standards
Trusted Research Environments
Community implementation profiles
GO-FAIR
implementation
networks
federated
FAIR Data Management for Projects
FAIR by Design because not everything can be fixed at the end
Platform to build Project Product Hubs
Projects to collaborate and get their
results organised and retained
• Metadata cataloguing: collection &
organisation
• PIDs, ontologies, metadata for machines
• Controlled sharing and access
• Plug into ecosystem of other registries
and repositories
• Data submission brokering to community
repositories
• Credit to contributors
FAIR itself
• FAIR Services are not necessarily FAIR
themselves (e.g. COVID Data Portal)
http://fair-dom.org
Commons,
Separated Groups
Investigation
Study Analysis
Data
Model
SOPAssay
https://fairdomhub.org/investigations/56
Metadata
Provenance
Standards
Federated Catalogue, IntegratedView
PIDs
fairdomhub.org
https://www.nature.com/articles/s41597-020-0477-8
https://covid.pages.uni.lu/
Curating FAIR models
Mixture of shared and private objects
organised by metadata
https://www.health-atlas.de/
e.g. (Pillar III)
in-house data
in-house data
All LiSyM
Patient-related
clinical data
Aggregated data
API
External Tools
API
LiSyM: German Liver Systems Medicine Network
FAIR but never Open
[Wolfgang Mueller, HITS]
Share table
structure
Share
common
code
Share summaries
Embed into Practices
National Data
Infrastructure
Authorised Access
Secure data
Spreadsheets
FAIR along the Pipeline
Understanding the pipeline,
moving metadata across
resources
FAIR stewardship effort and
pipeline design
ELIXIR RDMToolkit
The FAIR data infrastructure needed ….
tools, services, registries & catalogues, repositories…
Federation of fair repositories & registries
with mixed authority models
FAIR services – metadata services,
ontology servers, search engines, PID
services, validators, integrators,
annotators, assessors, brokers …
Interfaces - shared APIs, common terms,
cross-walks.
FAIR digital objects.
FAIRsFAIR confusagram, FAIR Ecosystem Components:Vision, V2.0
10.5281/zenodo.3734273 (missing the processing of data…)https://www.eoscsecretariat.eu/working-groups/fair-working-group
More than indicators, metrics and tech Infrastructure
“digital technologies (hardware, software), resources (data, services, digital libraries,
standards), comms (protocols, access rights, networks), people and organisational structures”
Data stewardship
Software Engs
Professionalisation
POSSIBLE
Processes
Organisational &
Cultural change
NORMATIVE
Incentives
Cost Benefit
REWARDING
Governance
Regulatory
Frameworks
ACCEPTABLE
Policy
REQUIRED
Education
Training
UNDERSTOOD
Sustainability -TRUSTED
How is FAIR faring? Maturing…
• Shifted the conversation
• Concept & mobilisation frenzy
• Rush to measure & certify
• Community specific
• FAIR not FOIR
It’s now the phase to embed, sustain and yield benefits for users
A marathon journey not a sprint
It is not simple, but it is no longer optional
Acknowledgements
Special thanks to
• Susanna Sansone (University of Oxford, OERC)
• Frederik Coppens (VIB)
• Barend Mons (GO-FAIR)
• Oya Deniz Beyan (Fraunhofer)
• Ibrahim Emam (Imperial College)
• FAIRDOM, GO-FAIR, FAIRplus and ELIXIR colleagues
FAIRDOM is sponsored by
1 of 36

Recommended

FAIRy stories: the FAIR Data principles in theory and in practice by
FAIRy stories: the FAIR Data principles in theory and in practiceFAIRy stories: the FAIR Data principles in theory and in practice
FAIRy stories: the FAIR Data principles in theory and in practiceCarole Goble
246 views52 slides
FAIR Computational Workflows by
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows Carole Goble
630 views49 slides
FAIR Computational Workflows by
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
983 views49 slides
Towards Knowledge Graph based Representation, Augmentation and Exploration of... by
Towards Knowledge Graph based Representation, Augmentation and Exploration of...Towards Knowledge Graph based Representation, Augmentation and Exploration of...
Towards Knowledge Graph based Representation, Augmentation and Exploration of...Sören Auer
1.5K views90 slides
FAIR Workflows and Research Objects get a Workout by
FAIR Workflows and Research Objects get a Workout FAIR Workflows and Research Objects get a Workout
FAIR Workflows and Research Objects get a Workout Carole Goble
480 views31 slides
Introduction to RDF & SPARQL by
Introduction to RDF & SPARQLIntroduction to RDF & SPARQL
Introduction to RDF & SPARQLOpen Data Support
12.2K views43 slides

More Related Content

What's hot

Property graph vs. RDF Triplestore comparison in 2020 by
Property graph vs. RDF Triplestore comparison in 2020Property graph vs. RDF Triplestore comparison in 2020
Property graph vs. RDF Triplestore comparison in 2020Ontotext
17K views33 slides
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo... by
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...Neo4j
155 views33 slides
Fair by design by
Fair by designFair by design
Fair by designPistoia Alliance
279 views94 slides
FAIR-4-GSC-Sansone-Aug23.pdf by
FAIR-4-GSC-Sansone-Aug23.pdfFAIR-4-GSC-Sansone-Aug23.pdf
FAIR-4-GSC-Sansone-Aug23.pdfSusanna-Assunta Sansone
54 views25 slides
Introduction to Knowledge Graphs and Semantic AI by
Introduction to Knowledge Graphs and Semantic AIIntroduction to Knowledge Graphs and Semantic AI
Introduction to Knowledge Graphs and Semantic AISemantic Web Company
7.9K views89 slides
Debunking some “RDF vs. Property Graph” Alternative Facts by
Debunking some “RDF vs. Property Graph” Alternative FactsDebunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative FactsNeo4j
6.5K views38 slides

What's hot(20)

Property graph vs. RDF Triplestore comparison in 2020 by Ontotext
Property graph vs. RDF Triplestore comparison in 2020Property graph vs. RDF Triplestore comparison in 2020
Property graph vs. RDF Triplestore comparison in 2020
Ontotext17K views
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo... by Neo4j
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Knowledge Graphs & Graph Data Science, More Context, Better Predictions - Neo...
Neo4j155 views
Debunking some “RDF vs. Property Graph” Alternative Facts by Neo4j
Debunking some “RDF vs. Property Graph” Alternative FactsDebunking some “RDF vs. Property Graph” Alternative Facts
Debunking some “RDF vs. Property Graph” Alternative Facts
Neo4j6.5K views
Introduction of Knowledge Graphs by Jeff Z. Pan
Introduction of Knowledge GraphsIntroduction of Knowledge Graphs
Introduction of Knowledge Graphs
Jeff Z. Pan1.9K views
Introdution to Dataops and AIOps (or MLOps) by Adrien Blind
Introdution to Dataops and AIOps (or MLOps)Introdution to Dataops and AIOps (or MLOps)
Introdution to Dataops and AIOps (or MLOps)
Adrien Blind1.2K views
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021 by Tristan Baker
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Intuit's Data Mesh - Data Mesh Leaning Community meetup 5.13.2021
Tristan Baker903 views
Build a car with Graphs, Fabien Batejat, Volvo Cars by Neo4j
Build a car with Graphs, Fabien Batejat, Volvo CarsBuild a car with Graphs, Fabien Batejat, Volvo Cars
Build a car with Graphs, Fabien Batejat, Volvo Cars
Neo4j6.2K views
Supply Chain and Logistics Management with Graph & AI by TigerGraph
Supply Chain and Logistics Management with Graph & AISupply Chain and Logistics Management with Graph & AI
Supply Chain and Logistics Management with Graph & AI
TigerGraph193 views
Neo4j Popular use case by Neo4j
Neo4j Popular use case Neo4j Popular use case
Neo4j Popular use case
Neo4j540 views
Introduction to Open Science and EOSC by Sarah Jones
Introduction to Open Science and EOSCIntroduction to Open Science and EOSC
Introduction to Open Science and EOSC
Sarah Jones331 views
The future of FAIR by Sarah Jones
The future of FAIRThe future of FAIR
The future of FAIR
Sarah Jones301 views
Knowledge Graph Introduction by Sören Auer
Knowledge Graph IntroductionKnowledge Graph Introduction
Knowledge Graph Introduction
Sören Auer616 views

Similar to How are we Faring with FAIR? (and what FAIR is not)

FAIR data: what it means, how we achieve it, and the role of RDA by
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDASarah Jones
873 views29 slides
FAIR, FAIRplus and the FAIR Cookbook by
FAIR, FAIRplus and the FAIR Cookbook FAIR, FAIRplus and the FAIR Cookbook
FAIR, FAIRplus and the FAIR Cookbook Susanna-Assunta Sansone
69 views53 slides
My FAIR share of the work - Diamond Light Source - Dec 2018 by
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018Susanna-Assunta Sansone
237 views56 slides
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data by
Turning FAIR into Reality: Briefing on the EC’s report on FAIR dataTurning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR datadri_ireland
295 views29 slides
FAIR play? by
FAIR play? FAIR play?
FAIR play? Sarah Jones
861 views39 slides
FAIR History and the Future by
FAIR History and the FutureFAIR History and the Future
FAIR History and the FutureCarole Goble
308 views56 slides

Similar to How are we Faring with FAIR? (and what FAIR is not)(20)

FAIR data: what it means, how we achieve it, and the role of RDA by Sarah Jones
FAIR data: what it means, how we achieve it, and the role of RDAFAIR data: what it means, how we achieve it, and the role of RDA
FAIR data: what it means, how we achieve it, and the role of RDA
Sarah Jones873 views
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data by dri_ireland
Turning FAIR into Reality: Briefing on the EC’s report on FAIR dataTurning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data
dri_ireland295 views
FAIR History and the Future by Carole Goble
FAIR History and the FutureFAIR History and the Future
FAIR History and the Future
Carole Goble308 views
Let’s go on a FAIR safari! by Carole Goble
Let’s go on a FAIR safari!Let’s go on a FAIR safari!
Let’s go on a FAIR safari!
Carole Goble1.4K views
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D... by Sarah Jones
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Sarah Jones1.2K views
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook by Susanna-Assunta Sansone
FAIRification is a Team Sport: FAIRsharing and the FAIR CookbookFAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
FAIRification is a Team Sport: FAIRsharing and the FAIR Cookbook
Making Data FAIR (Findable, Accessible, Interoperable, Reusable) by Tom Plasterer
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Tom Plasterer769 views
Results from the FAIR Expert Group Stakeholder Consultation on the FAIR Data ... by EOSCpilot .eu
Results from the FAIR Expert Group Stakeholder Consultation on the FAIR Data ...Results from the FAIR Expert Group Stakeholder Consultation on the FAIR Data ...
Results from the FAIR Expert Group Stakeholder Consultation on the FAIR Data ...
EOSCpilot .eu821 views

More from Carole Goble

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo... by
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...Carole Goble
45 views23 slides
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research... by
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...Carole Goble
38 views33 slides
Research Software Sustainability takes a Village by
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a VillageCarole Goble
40 views29 slides
FAIR Computational Workflows by
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational WorkflowsCarole Goble
193 views29 slides
Open Research: Manchester leading and learning by
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learningCarole Goble
143 views17 slides
RDMkit, a Research Data Management Toolkit. Built by the Community for the ... by
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...Carole Goble
712 views38 slides

More from Carole Goble(20)

The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo... by Carole Goble
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
The ELIXIR FAIR Knowledge Ecosystem for practical know-how: RDMkit and FAIRCo...
Carole Goble45 views
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research... by Carole Goble
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science,  a Digital Research...
Can’t Pay, Won’t Pay, Don’t Pay: Delivering open science, a Digital Research...
Carole Goble38 views
Research Software Sustainability takes a Village by Carole Goble
Research Software Sustainability takes a VillageResearch Software Sustainability takes a Village
Research Software Sustainability takes a Village
Carole Goble40 views
FAIR Computational Workflows by Carole Goble
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble193 views
Open Research: Manchester leading and learning by Carole Goble
Open Research: Manchester leading and learningOpen Research: Manchester leading and learning
Open Research: Manchester leading and learning
Carole Goble143 views
RDMkit, a Research Data Management Toolkit. Built by the Community for the ... by Carole Goble
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
Carole Goble712 views
FAIR Computational Workflows by Carole Goble
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble415 views
EOSC-Life Workflow Collaboratory by Carole Goble
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
Carole Goble132 views
FAIR Computational Workflows by Carole Goble
FAIR Computational WorkflowsFAIR Computational Workflows
FAIR Computational Workflows
Carole Goble493 views
FAIR Data Bridging from researcher data management to ELIXIR archives in the... by Carole Goble
FAIR Data Bridging from researcher data management to ELIXIR archives in the...FAIR Data Bridging from researcher data management to ELIXIR archives in the...
FAIR Data Bridging from researcher data management to ELIXIR archives in the...
Carole Goble120 views
RO-Crate: A framework for packaging research products into FAIR Research Objects by Carole Goble
RO-Crate: A framework for packaging research products into FAIR Research ObjectsRO-Crate: A framework for packaging research products into FAIR Research Objects
RO-Crate: A framework for packaging research products into FAIR Research Objects
Carole Goble425 views
The swings and roundabouts of a decade of fun and games with Research Objects by Carole Goble
The swings and roundabouts of a decade of fun and games with Research Objects The swings and roundabouts of a decade of fun and games with Research Objects
The swings and roundabouts of a decade of fun and games with Research Objects
Carole Goble168 views
What is Reproducibility? The R* brouhaha and how Research Objects can help by Carole Goble
What is Reproducibility? The R* brouhaha and how Research Objects can helpWhat is Reproducibility? The R* brouhaha and how Research Objects can help
What is Reproducibility? The R* brouhaha and how Research Objects can help
Carole Goble258 views
ELIXIR UK Node presentation to the ELIXIR Board by Carole Goble
ELIXIR UK Node presentation to the ELIXIR BoardELIXIR UK Node presentation to the ELIXIR Board
ELIXIR UK Node presentation to the ELIXIR Board
Carole Goble501 views
FAIRy stories: tales from building the FAIR Research Commons by Carole Goble
FAIRy stories: tales from building the FAIR Research CommonsFAIRy stories: tales from building the FAIR Research Commons
FAIRy stories: tales from building the FAIR Research Commons
Carole Goble1.4K views
Reproducible Research: how could Research Objects help by Carole Goble
Reproducible Research: how could Research Objects helpReproducible Research: how could Research Objects help
Reproducible Research: how could Research Objects help
Carole Goble605 views
Reflections on a (slightly unusual) multi-disciplinary academic career by Carole Goble
Reflections on a (slightly unusual) multi-disciplinary academic careerReflections on a (slightly unusual) multi-disciplinary academic career
Reflections on a (slightly unusual) multi-disciplinary academic career
Carole Goble482 views
Better Software, Better Research by Carole Goble
Better Software, Better ResearchBetter Software, Better Research
Better Software, Better Research
Carole Goble657 views
Reproducibility (and the R*) of Science: motivations, challenges and trends by Carole Goble
Reproducibility (and the R*) of Science: motivations, challenges and trendsReproducibility (and the R*) of Science: motivations, challenges and trends
Reproducibility (and the R*) of Science: motivations, challenges and trends
Carole Goble1.8K views
Research Object Community Update by Carole Goble
Research Object Community UpdateResearch Object Community Update
Research Object Community Update
Carole Goble197 views

Recently uploaded

vitamine B1.pptx by
vitamine B1.pptxvitamine B1.pptx
vitamine B1.pptxajithkilpart
29 views22 slides
Krishna VSC 692 Credit Seminar.pptx by
Krishna VSC 692 Credit Seminar.pptxKrishna VSC 692 Credit Seminar.pptx
Krishna VSC 692 Credit Seminar.pptxKrishnaSharma682993
11 views54 slides
Indian council for child welfare by
Indian council for child welfareIndian council for child welfare
Indian council for child welfareRenuWaghmare2
7 views21 slides
ALGAL PRODUCTS.pptx by
ALGAL PRODUCTS.pptxALGAL PRODUCTS.pptx
ALGAL PRODUCTS.pptxRASHMI M G
7 views17 slides
Applications of Large Language Models in Materials Discovery and Design by
Applications of Large Language Models in Materials Discovery and DesignApplications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and DesignAnubhav Jain
14 views17 slides
ELECTRON TRANSPORT CHAIN by
ELECTRON TRANSPORT CHAINELECTRON TRANSPORT CHAIN
ELECTRON TRANSPORT CHAINDEEKSHA RANI
16 views16 slides

Recently uploaded(20)

Indian council for child welfare by RenuWaghmare2
Indian council for child welfareIndian council for child welfare
Indian council for child welfare
RenuWaghmare27 views
Applications of Large Language Models in Materials Discovery and Design by Anubhav Jain
Applications of Large Language Models in Materials Discovery and DesignApplications of Large Language Models in Materials Discovery and Design
Applications of Large Language Models in Materials Discovery and Design
Anubhav Jain14 views
ELECTRON TRANSPORT CHAIN by DEEKSHA RANI
ELECTRON TRANSPORT CHAINELECTRON TRANSPORT CHAIN
ELECTRON TRANSPORT CHAIN
DEEKSHA RANI16 views
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio... by Trustlife
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...
Discovery of therapeutic agents targeting PKLR for NAFLD using drug repositio...
Trustlife207 views
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance... by InsideScientific
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
A Ready-to-Analyze High-Plex Spatial Signature Development Workflow for Cance...
InsideScientific121 views
Best Hybrid Event Platform.pptx by Harriet Davis
Best Hybrid Event Platform.pptxBest Hybrid Event Platform.pptx
Best Hybrid Event Platform.pptx
Harriet Davis10 views
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe... by Anmol Vishnu Gupta
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Study on Drug Drug Interaction Through Prescription Analysis of Type II Diabe...
Exploring the nature and synchronicity of early cluster formation in the Larg... by Sérgio Sacani
Exploring the nature and synchronicity of early cluster formation in the Larg...Exploring the nature and synchronicity of early cluster formation in the Larg...
Exploring the nature and synchronicity of early cluster formation in the Larg...
Sérgio Sacani1.5K views
Experimental animal Guinea pigs.pptx by Mansee Arya
Experimental animal Guinea pigs.pptxExperimental animal Guinea pigs.pptx
Experimental animal Guinea pigs.pptx
Mansee Arya42 views
Presentation on experimental laboratory animal- Hamster by Kanika13641
Presentation on experimental laboratory animal- HamsterPresentation on experimental laboratory animal- Hamster
Presentation on experimental laboratory animal- Hamster
Kanika136416 views
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor... by Trustlife
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...
Ellagic Acid and Its Metabolites as Potent and Selective Allosteric Inhibitor...
Trustlife154 views
Note on the Riemann Hypothesis by vegafrank2
Note on the Riemann HypothesisNote on the Riemann Hypothesis
Note on the Riemann Hypothesis
vegafrank28 views
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ... by ILRI
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
Small ruminant keepers’ knowledge, attitudes and practices towards peste des ...
ILRI9 views
A giant thin stellar stream in the Coma Galaxy Cluster by Sérgio Sacani
A giant thin stellar stream in the Coma Galaxy ClusterA giant thin stellar stream in the Coma Galaxy Cluster
A giant thin stellar stream in the Coma Galaxy Cluster
Sérgio Sacani20 views
Vegetable grafting: A new crop improvement approach.pptx by Himul Suthar
Vegetable grafting: A new crop improvement approach.pptxVegetable grafting: A new crop improvement approach.pptx
Vegetable grafting: A new crop improvement approach.pptx
Himul Suthar8 views

How are we Faring with FAIR? (and what FAIR is not)

  • 1. How are we Faring with FAIR (and what FAIR is not) Carole Goble The University of Manchester FAIRDOM ELIXIR, EOSC-Life, IBISBA, BioExcel CoE, FAIRplus carole.goble@manchester.ac.uk The views expressed in this talk are my own Workshop: FAIRe Data Infrastructures, 15 October 2020
  • 2. Data discovery and reuse at scale through good data management 2016 A set of PRINCIPLES to enhance the value of all digital resources and their reuse by PEOPLE and by MACHINES Scientific Data 3, 160018 (2016) doi:10.1038/sdata.2016.18 [Credit: Susanna Sansone]
  • 3. Branding a trend, stimulating a movement … 2014 2016 2015 Open Science Data Sharing Recognition & Credit Reproducibility Data-driven Science Automation, AI
  • 5. The compulsory COVID-19 reference “COVID is a good example .. there must be loads of legacy data. We’re desperately trying to go back and look at what we knew from SARS 10 years ago” – Pharma manager, FAIRplus project https://www.covid19dataportal.org/ https://www.nature.com/articles/s41431-020-0635-7 https://www.rd-alliance.org/group/rda-covid19-rda-covid19-omics-rda-covid19-epidemiology-rda-covid19- clinical-rda-covid19-1 https://doi.org/10.15497/rda00052
  • 6. The compulsory COVID-19 reference +ve • Data sharing boost – • Impossible becomes normal • Data infrastructure investments • Mobilising rapid response -ve • Political, technical and territorial issues • Licensing, access to datasets, quality … • Short-term vs long term sustainability • Collection and governance bottlenecks https://covid19.galaxyproject.org/
  • 7. The compulsory what are the FAIR principles slide … in a break out box, without explanation or justification. Aspirational, not a standard. Relaunch a dialogue within the research and policy communities. Reboot a journey to wider accessibility and reusability of data. Prepare the community for automation-readiness by supporting FAIR for machines. In the paper… 15 overlapping and ambiguous …. Jacobsen et al FAIR Principles: Interpretations and Implementation Considerations, J Data Intelligence (2020) Mons et al Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open Science Cloud. Information Services & Use. 37. 1-8. 10.3233/ISU-170824 (2017)
  • 8. FAIR Principles in spirit identifiers, metadata, availability, standards [adapted from Susanna Sansone] Findable Accessible Interoperable Reusable Globally unique, resolvable, and persistent identifiers ▪ To retrieve and connect data Community defined descriptive metadata that is catalogued / searchable ▪ To enhance discoverability and reusability Common terminologies and standards ▪ To use the same terms and they mean the same thing Detailed provenance ▪ To contextualize the data and facilitate reproducibility Terms of access ▪ Open as possible, closed as necessary Terms of use ▪ Clear licences, ideally to enable innovation and reuse Automation
  • 9. FAIR Services in practice identifiers, metadata, availability, standards Findable Accessible Interoperable Reusable Persistent Identifiers Metadata Services Access Control Repositories & registries Data applications “the internet of data and services”
  • 10. Not one size fits all FAIR is a set of guiding principles that provide for a contract of expectation between data providers and users. A continuum of features, attributes and behaviours, via many different implementations for different use cases. Communities will need to develop their own FAIR profile: • for their portfolios of data, processes, governance, policies, assessment A limited pan-discipline profile.
  • 12. Credit to: A global movement rather quickly…a FAIR frenzy! Many of these projects will speak today. Researcher, clinician confusion. Community coordination cacophony
  • 13. Movement Start up Phase: Picking apart the Principles, Inventing Indicators, Assessment and Maturity Frameworks 1. (Re)define the principles and what they mean 2. Measure the “FAIRness level” of data The target, before & after levels of data FAIRness in various “FAIRification” processes 3. Measure an organisation’s capability & performance for FAIR data generation & management Support strategic investment decisions, cost/benefit analysis, processes & monitoring, capacity building, change management for FAIR by Design DOI: 10.15497/RDA0050 Dataset maturity model Data Management Infrastructure maturity model
  • 14. What were the principles about? Contract Compliance Certification? Judgement Endorsement Regulation Trusted Repositories Commitment of a community of providers Federation of FAIR data, registries and services Comparison, Monitoring, Review,Quality Assessment Expectation setting of a data provider Self-evaluation Awareness, Reporting Community respectful Context aware Health data - Regulation
  • 15. What were the principles about? Contract Compliance Certification? EOSC Strategic Research & Innovation Agenda consultation 2020: metrics & certification least popular action area FAIRware The Tyranny of Metrics
  • 16. FAIR is a Spectrum Not all data are equal, not all will be worth it Spectrum of FAIR indicators Different levels of maturity and importance to different stakeholders and communities Communities define levels, depths, coverage [Barend Mons] FAIRify • just in case • just in time • just enough Dataset portfolio
  • 17. FAIR is not Not Fuzzy “enhancing the ability of machines to automatically find and use data or any digital object, and support its reuse by individuals” INCF Statement Not Free Not Fast Not Simple Needs experts, stewards, infrastructure, processes, maintenance … “FAIR is non-trivial, and domain specific at anything other than the most superficial level” - MarkWilkinson From high effort high gain to low effort light gain All require consensus, process change and maintenance. • Mons et al Cloudy, increasingly FAIR; Revisiting the FAIR Data guiding principles for the European Open Science Cloud. Information Services & Use. 37. 1-8. 10.3233/ISU-170824. • Dunning et alAre the FAIR Data Principles fair? IDCC17 Not One Approach Not about turning everything into RDF
  • 18. Coverage and Implementation ChoicesVary Findable Accessible Interoperable Reusable Shallow wide, low cost, loose federation restricted value Deep narrow, tight federation harmonisation high cost, high value https://fairplus.github.io/cookbook-dev/intro https://bioschemas.org/ https://www.go-fair.org/2020/07/08/a- three-point-framework-for-fairification/
  • 19. FAIR by Design At the start of a collection, built in throughout the life cycle change management, capacity building FAIRifying Retrospectively Legacy datasets, build a cohort, cost benefit and FAIR readiness over a collection of datasets
  • 20. Other FAIRVariants FA(I)R Interoperability is the hardest and most costly to define, implement & maintain Interoperability is usually for a purpose not “just in case” FAIR is not about harmonising all metadata to one schema FAIR+R Reproducibility is not the same as Reusability FAIR for all digital objects – software, workflows, SOPs, models, containers, training materials … Depends on availability and metadata Containers FAIR++ Business and change analysis. Cost Benefit Analysis. Scientific / BusinessValue Quality control Impact Process maturity Sustainability FAIR for all digital objects – software, workflows, SOPs, models, containers, training materials … EC Report: Turning FAIR into Reality, 2018 https://www.natureindex.com/news-blog/what- scientists-need-to-know-about-fair-data, 2019 47% respondents needed greater clarification
  • 21. Step back… Why do we need FAIR? KnowledgeTurning, Information Flow Josh Sommer, Chordoma Foundation, 2011 Promote information flow • groups and disciplines • organisational boundaries at all levels • technical infrastructure • enable federation Biomedical flow • needs to control the information flow – Ethical & governance frameworks,GDPR, consent • federation feasibility varies Fragmented and independent resources, infrastructures, governance Community enclaves. Churn which leads to knowledge loss. Scattered Fragmentation and knowledge churn
  • 22. FAIR is not synonymous with Open respect authority and governance frameworks
  • 23. Federated System of Catalogues and Repos federation means common agreements and compliance respecting accountability and responsibility • Connected by PIDs • Moving metadata around • Common vocabularies • Common cataloguing data elements • Term cross-walks and mappings • Shared API standards.
  • 24. FAIR Profile for Biomedical Community Findable Accessible Interoperable Reusable Awareness of data Thin metadata sparingly shared Data visiting Not open but access controlled Limited exchange under strict governance Authentication and authorisation protocols Data visiting Ethics, consent, privacy preservation Compliance , Governance Standards and Regulation are different things Federated analysis Combining clinical, research and public health data. Different scales, collected for different purposes Standards for Usage Access • Beacon API for genomic variants • Data Use Ontology – usage restrictions • GA4GH Passports – access policies Community standards Trusted Research Environments Community implementation profiles GO-FAIR implementation networks federated
  • 25. FAIR Data Management for Projects FAIR by Design because not everything can be fixed at the end Platform to build Project Product Hubs Projects to collaborate and get their results organised and retained • Metadata cataloguing: collection & organisation • PIDs, ontologies, metadata for machines • Controlled sharing and access • Plug into ecosystem of other registries and repositories • Data submission brokering to community repositories • Credit to contributors FAIR itself • FAIR Services are not necessarily FAIR themselves (e.g. COVID Data Portal) http://fair-dom.org
  • 29. Mixture of shared and private objects organised by metadata https://www.health-atlas.de/
  • 30. e.g. (Pillar III) in-house data in-house data All LiSyM Patient-related clinical data Aggregated data API External Tools API LiSyM: German Liver Systems Medicine Network FAIR but never Open [Wolfgang Mueller, HITS] Share table structure Share common code Share summaries
  • 31. Embed into Practices National Data Infrastructure Authorised Access Secure data Spreadsheets
  • 32. FAIR along the Pipeline Understanding the pipeline, moving metadata across resources FAIR stewardship effort and pipeline design ELIXIR RDMToolkit
  • 33. The FAIR data infrastructure needed …. tools, services, registries & catalogues, repositories… Federation of fair repositories & registries with mixed authority models FAIR services – metadata services, ontology servers, search engines, PID services, validators, integrators, annotators, assessors, brokers … Interfaces - shared APIs, common terms, cross-walks. FAIR digital objects. FAIRsFAIR confusagram, FAIR Ecosystem Components:Vision, V2.0 10.5281/zenodo.3734273 (missing the processing of data…)https://www.eoscsecretariat.eu/working-groups/fair-working-group
  • 34. More than indicators, metrics and tech Infrastructure “digital technologies (hardware, software), resources (data, services, digital libraries, standards), comms (protocols, access rights, networks), people and organisational structures” Data stewardship Software Engs Professionalisation POSSIBLE Processes Organisational & Cultural change NORMATIVE Incentives Cost Benefit REWARDING Governance Regulatory Frameworks ACCEPTABLE Policy REQUIRED Education Training UNDERSTOOD Sustainability -TRUSTED
  • 35. How is FAIR faring? Maturing… • Shifted the conversation • Concept & mobilisation frenzy • Rush to measure & certify • Community specific • FAIR not FOIR It’s now the phase to embed, sustain and yield benefits for users A marathon journey not a sprint It is not simple, but it is no longer optional
  • 36. Acknowledgements Special thanks to • Susanna Sansone (University of Oxford, OERC) • Frederik Coppens (VIB) • Barend Mons (GO-FAIR) • Oya Deniz Beyan (Fraunhofer) • Ibrahim Emam (Imperial College) • FAIRDOM, GO-FAIR, FAIRplus and ELIXIR colleagues FAIRDOM is sponsored by

Editor's Notes

  1. https://www.gmds.de/aktivitaeten/medizinische-informatik/projektgruppenseiten/faire-dateninfrastrukturen-fuer-die-biomedizinische-informatik/workshop-2020/ perform in a specified way in a particular situation or over a particular period. Personal reflections with a keynote presentation about the FAIR guiding principles for scientific data management and stewardship and their implementation in the life sciences, Remarkably it was only in 2016 that the ‘FAIR Guiding Principles for scientific data management and stewardship’ appeared in Scientific Data. The paper was intended to launch a dialogue within the research and policy communities: to start a journey to wider accessibility and reusability of data and prepare for automation-readiness by supporting findability, accessibility, interoperability and reusability for machines. Many of the authors (including myself) came from biomedical and associated communities.  The paper succeeded in its aim, at least at the policy, enterprise and professional data infrastructure level. Whether FAIR has impacted the researcher at the bench or bedside is open to doubt. It certainly inspired a great deal of activity, many projects, a lot of positioning of interests and raised awareness. COVID has injected impetus and urgency to the FAIR cause (good) and also highlighted its politicisation (not so good). In this talk I’ll make some personal reflections on how we are faring with FAIR: as one of the original principles authors; as a participant in many current FAIR initiatives (particularly in the biomedical sector and for research objects other than data) and as a veteran of FAIR before we had the principles.
  2. 3000 google scholar citations
  3. The paper was intended to launch a dialogue within the research and policy communities: to start a journey to wider accessibility and reusability of data and prepare for automation-readiness by supporting findability, accessibility, interoperability and reusability for machines.
  4. Hidden slide
  5. Linking and moving metadata around Common metadata and ids, shared APIs and cross-walks
  6. HIDDEN SLIDE
  7. Like a goldrush
  8. HIDDEN SLIDE
  9. Not everything that can be counted counts. Not everything that counts can be counted – Einstein & William Bruce Cameron FAIRsFAIR will partner with the Research on Research Institute (RoRI) in FAIRware, a new initiative to build open source software tools and systems with the potential to transform uptake of the FAIR principles by researchers.   https://www.eoscsecretariat.eu/open-consultation-eosc-strategic-research-and-innovation-agenda https://www.fairsfair.eu/fairsfair-data-object-assessment-metrics-request-comments
  10. There will be a "mixed economy" of FAIR data in a community. Something may be FAIR but not useful or high quality. Something may be essential but not fully FAIR.
  11. Simple word but is not simple to do. And be incremental Not one approach - RDF
  12. https://www.nature.com/articles/s41431-020-0635-7 https://www.go-fair.org/implementation-networks/overview/vodan/ ‘recipes’ for making different types of data FAIR
  13. Different stages where the FAIR goes The steps towards FAIR
  14. FAIR is not about a resource’s Quality or Impact or Scientific value or Business value
  15. across and within groups and disciplines across and within organisational boundaries at all levels: the lab, the project, the organisation, the community, the country across all technical infrastructure platforms, repositories, registries, tools enable federation Fit for purpose scholarly comms Fit for purpose publishing while collaborating to compete in churning
  16. Secure data sharing
  17. Different levels FAIR Accountability and Responsibility Links in interfaces, common vocabularies, common minimum cataloguing terms and cross-walks, shared API standards. Generic approaches (e.g. using schema.org), Generic infrastructure (e.g. Ontology Lookup Service), Generic catalogues (e.g. WorkflowHub) Federated System of Systems ecosystem of catalogues, repositories, of different types. Degree to which you can get compliance National and regional level… Tiered levels…. Registries federated or not Degree you can get everyone working together Neylon, Knowledge Exchange Report: http://www.knowledge-exchange.info/event/ke-approach-open-scholarship Zoo of catalogues and registries. LOTS of catalogies Some flock around 1 Some rely on federated metadata Moving between stages, moving things through boundaries. NIH meeting for institutional repositories https://www.scgcorp.com/repositories2020/regclosed. in the Commons goods Micro, meso and macro players. OpenAIRE repository – helps find, access and reuse. Not meant to be interoperable.
  18. What is special about biomedical EOSCLife recommendations Heterogeneous, fragmented
  19. But its content may vary in its FAIRness!
  20. Curating retrospectively….
  21. HIDDEN SLIDE
  22. LiSyM Data ManagementShare table structure Create & share common code Make it easy-to-install Create and share summaries
  23. HIDDEN SLIDE Basic research data management solution, simplifying daily study management tasks • GUI effective in engaging users in the platform • Main structure modeled after biological research • Next steps – Testing large data files – Adapting labels to be more suitable for our enviroment – Establish more automation facilitating the RESTful service – Looking into advanced features of SEEK
  24. FAIRDOM SEEK + NeLS First mile - FAIR data management for projects at the “first mile” Last mile - Organise data / models to deposit / link to (ELIXIR) datasets FAIR across the pipeline, across infrastructures, across platforms The FAIR chain of custody Provenance metadata propagation Synch registered SOPs and data transformations in NeLS Permissions and AAI through NeLS Portal and FAIRDOM SEEK FAIR stewardship effort and pipeline design Manual and automated Ramps and skills (see Filip Pattyn, TuesdayLots about “my silo is fair” SEEK register data and make it available in samples, so they can look at the samples when they are made public with no problem. There is an issue if they want to access the original data as only Norwegian researchers can log in to the NeLS portal Technically SOPs recorded in SEEK can/should carry information about data transformations that happened in NeLS However, that relies on rese Provenance of data analysis is provided by our Galaxy instances, and we would like to capture some of that into NeLS and SBI (for possibly later reopening in Galaxy or elsewhere), but that is not implemented yet. If you think of more generic data provenance over time, what is added to a dataset when etc, NeLS does not support that.archers adding it, which means it practice it won't exist es, we register data and make it avaialble in samples, so they can look at the samples when they are made public with no problem. There is an issue if they want to access the original data as only Norwegian researchers can log in to the NeLS portal so we can track provenance from the registration point in SEEK Given NeLS is run by ELIXIR Norway I would expect them at the very least to have a plan, f not some things implemented, but we will see
  25. Moving DOs across FAIR digital objects – moving around Links in interfaces, common vocabularies, common minimum cataloguing terms and cross-walks, shared API standards. Generic approaches (e.g. using schema.org), Generic infrastructure (e.g. Ontology Lookup Service), Generic catalogues (e.g. WorkflowHub) Globally unique, resolvable, and persistent identifiers Community defined descriptive metadata that is catalogued / searchable Common terminologies and standards Detailed provenance Terms of access Terms of use
  26. HIDDEN SLIDE Was 7 recommendations, now 6 Mentions certification but does not argue for it.
  27. PEOPLE MATTER! Its about community building and consensus
  28. BTW: Majority of researchers have never heard of FAIR https://www.natureindex.com/news-blog/what-scientists-need-to-know-about-fair-data
  29. HIDDE SLIDE