SlideShare a Scribd company logo
1 of 29
S a r a h J o n e s
A s s o c i a t e D i r e c t o r , D i g i t a l C u r a t i o n C e n t r e
R a p p o r t e u r o f F A I R d a t a e x p e r t g r o u p
s a r a h . j o n e s @ g l a s g o w . a c . u k
T w i t t e r : @ s j D C C
FAIR data: what it means, how we
achieve it, and the role of RDA
What is FAIR?
A set of principles that describe the attributes
data need to have to enable and enhance reuse,
by humans and machines
Image CC-BY-SA by SangyaPundir
Origins of FAIR
• Emerged from a workshop held in Leiden in 2014
• Come from life sciences but intended for all data
• Issued by FORCE11 community
• Echo previous principles on open data & curation
OECD Principles and
Guidelines for Access to
Research Data from
Public Funding (2007)
Science as an Open Enterprise (2012)
notion of ‘intelligent openness’ where
data are accessible, intelligible,
assessable and useable G8 Science Ministers
Statement (2013)
How do Open and FAIR intersect?
Open
FAIR data
Managed data
Internal
Self-interest
External
Community
benefit
Open data and FAIR data
FAIR
data
Open
data
FAIR and Open are
not synonymous.
Data can be both,
one or neither.
Degrees of Open and FAIR
And both are on a scale
Implementing FAIR – the EC expert group
Simon Hodson, CODATA
Chair of FAIR Data EG
Rūta Petrauskaité,
Vytautas Magnus University
Peter Wittenburg, Max Planck
Computing & Data Facility
Sarah Jones, Digital Curation
Centre (DCC), Rapporteur
Daniel Mietchen, Data
Science Institute,
University of Virginia
Françoise Genova,
Observatoire Astronomique
de Strasbourg
Leif Laaksonen, CSC-IT
Centre for Science
Natalie Harrower,
Digital Repository of
Ireland – year 2 only
Sandra Collins,
National Library of
Ireland – year 1 only
Remit of the FAIR data expert group
1. To develop recommendations on what needs to be done to turn each
component of the FAIR data principles into reality
2. To propose indicators to measure progress on each of the FAIR
components
3. To provide input to the proposed European Open Science Cloud
(EOSC) action plan on how to make data FAIR
4. To contribute to the evaluation of the Horizon 2020 Data
Management Plan (DMP) template and development of associated
sector / discipline-specific guidance
5. To provide input on the issue of costing and financing data
management activities
http://tinyurl.com/FAIR-EG
Report framework
1. Concepts – why FAIR?
2. Creating a culture of FAIR data
3. Creating a technical ecosystem for FAIR data
4. Skills and capacity building
5. Measuring change
6. Funding and sustaining FAIR data
7. FAIR Data Action Plan
Culture and technology for FAIR
• Science/research is a cultural
system with considerable
technological dependency.
• Culture and technology for
FAIR data are deeply
interrelated.
• Fundamentally important to
address cultural and
technological requirements
for FAIR data holistically.
Image CC-BY by Nicolas Raymond https://www.flickr.com/photos/80497449@N04/8691983876
How the Action Plan is structured
A short tweetable recommendation
• Underpinned by practical and specific action points
• Action points to be allocated to stakeholders
FAIR Data Action Plan applies to EC, member states,
and international level, but we also place in context of
EOSC to inform this roadmap
Primary recommendations and actions
Step 1: Define and apply FAIR appropriately
Rec. 1: Definitions of FAIR
Rec. 2: Mandates and boundaries for Open
Rec. 3: A model for FAIR Data Objects
FAIR and Open
 Ensure FAIR also covers appropriate openness,
assessability, long-term stewardship, timely sharing,
data selection and legal interoperability.
 Recognise practices varies. Communities need to self-
organise and define what FAIR data means.
 Make Open Data mandate explicit using the maxim
‘as open as possible as closed as necessary’ but apply
proportionately with genuine best efforts to share
FAIR Data Objects
DATA
The core bits
At its most basic level, data is a bitstream or binary sequence. For data
to have meaning and to be FAIR, it needs to be represented in standard
formats and be accompanied by Persistent Identifiers (PIDs), metadata
and code. These layers of meaning enrich the data and enable reuse.
IDENTIFIERS
Persistent and unique (PIDs)
Data should be assigned a unique and persistent identifier such as a DOI
or URN. This enables stable links to the object and supports citation and
reuse to be tracked. Identifiers should also be applied to other related
concepts such as the data authors (ORCIDs), projects (RAIDs), funders
and associated research resources (RRIDs).
STANDARDS & CODE
Open, documented formats
Data should be represented in common and ideally open file formats.
This enables others to reuse the data as the format is in widespread use
and software is available to read the files. Open and well-documented
formats are easier to preserve . Data also need to be accompanied by
the code use to process and analyse the data.
METADATA
Contextual documentation
In order for data to be assessable and reusable, it should be accompanied
by sufficient metadata and documentation. Basic metadata will enable
data discovery, but much richer information and provenance is required
to understand how, why, when and by whom the data were created. To
enable the broadest reuse, data should be accompanied by a 'plurality of
relevant attributes' and a clear and accessible data usage license.
Primary recommendations and actions
Step 2: Develop and support a sustainable FAIR data
ecosystem
Rec. 4: Components of a FAIR data ecosystem
Rec. 5: Sustainable funding for FAIR components
Rec. 6: Strategic and evidence-based funding
FAIR data ecosystem
Standards
Skills
Rewards
Metrics
Investment
FAIR Data Objects stored
in Trusted repositories &
Cloud Services
Data policies
People: researchers, funders,
publishers, data stewards…
PIDs
Define & regulate
Provide hub of
info on FAIR
data objects
Assigned to
Create & use FAIR components
Motivated by outside drivers
DMP
• Components include:
policies, DMPs,
identifiers, standards
and repositories.
• Need registries
cataloguing each
component of the
ecosystem
• Require automated
workflows between
components
• Cultural aspects (skills,
metrics, rewards, €€)
drive the system
Sustainable & strategic funding
 The components of the FAIR ecosystem need to be
maintained at a professional service level with
sustainable funding.
 Funders of research data services should consolidate
and build on existing investments in infrastructure and
tools, where they demonstrate impact and community
adoption.
 Funding should be tied to metrics to ensure
interoperable data services and reusable data.
Primary recommendation and actions
Step 3: Ensure robust FAIR data and trusted services
Rec. 7: Disciplinary interoperability frameworks
Rec. 8: Cross-disciplinary FAIRness
Rec. 9: Develop robust FAIR data metrics
Rec. 10: Trusted Digital Repositories
Rec. 11: Develop metrics to assess and certify services
Interoperability frameworks
 Research communities must be supported to develop and
maintain their disciplinary interoperability frameworks.
 These include principles and practices for data management
and sharing, community agreements, data formats, metadata
standards, tools and data infrastructure.
 Interoperability frameworks should be articulated in common
ways and adopt global standards where possible to enable
interdisciplinary research.
 Important role for international, cross-disciplinary initiatives
and fora in development of practices, protocols and standards.
Metrics: for data & services
 A set of metrics for FAIR Data Objects should be developed
and implemented, starting from the basic common core of
descriptive metadata, PIDs and access.
 Metrics and certification are needed that assess services.
Such schemes should not compete with existing repository
accreditation, but should measure and accredit components
such as identifier services, standards and vocabularies.
Primary recommendations and actions
Step 4: Embed a culture of FAIR in research practice
Rec. 12: Data Management via DMPs
Rec. 13: Professionalise data science and stewardship roles
Rec. 14: Recognise and reward FAIR data and FAIR
stewardship
Data management via DMPs
 Any research project should include data management as
a core element necessary for the delivery of its scientific
objectives, addressing this in a Data Management Plan.
 The DMP should be regularly updated to provide a hub of
information on the FAIR data objects.
 Also: use information held in Data Management Plans –
structure data to enable information exchange across the
FAIR data ecosystem
Roles: data science & data stewardship
 Two cohorts of professionals to support FAIR data:
– data scientists embedded in those research projects
– data stewards who will ensure the curation of FAIR data
 Coordinate, systematise and accelerate the pedagogy
 Accreditation of courses to professionalise roles
 Support skills transfer schemes
Recognition & rewards: data & services
 Data should be recognised as a core research output and
included in the assessment of research contributions and
career progression.
 The provision of data-related infrastructure and services
must also be recognised as an essential part of the FAIR
ecosystem.
 Require evidence of FAIR data and accredited services and
reward accordingly.
Next steps for the Expert Group
 Launch interim report next week at EOSC Summit
 Run consultations over summer via
– Online platform
– Webinars
– Events e.g. FORCE18, International Data Week
 Revise and finalise report and Action Plan
 Formally launch at Austrian Presidency event on 23 Nov
Role of the RDA in FAIR
For FAIR to really succeed, global agreements need to be in
place. RDA is an ideal international forum to:
 Clarify definitions of FAIR
 Support research communities to develop disciplinary
interoperability frameworks
 Identify commonalities across FAIR implementation to
promote the formation and adoption of global standards
 Promote the exchange of good practice and lessons
 Collect use cases and examples…
Explore / define FAIR implementations
 What does FAIR mean in different disciplines?
 What are the best practices in data sharing?
 What are the appropriate boundaries of Open?
 How do we recognise skills and formalise career pathways?
 How do we certify FAIR data and services?
 What metrics denote which services should be sustained?
 How do registries interact and know of each other?
 How do we test that the ecosystem is fit-for-purpose?
Offer guidance and support
 Pointing researchers to the most appropriate services for
them (Re3data, FAIRsharing…)
 Raising awareness and adoption of standards (Metadata
Standards Catalogue, DCAT)
 Training researchers in data science (CODATA / RDA
schools)
 Professionalising data stewardship (Community of Practice,
accreditation…)
 Supporting certification of repositories & other services
(CoreTrustSeal)
Context specific FAIR Action Plans
 Support the definition of
more detailed FAIR Data
Action Plans at research
community and member
state level
 Potential role for
European nodes?
Thanks - questions?
http://tinyurl.com/FAIR-EG
www.force11.org/group/fairgroup/fairprinciples

More Related Content

What's hot

Designing and developing vocabularies in RDF
Designing and developing vocabularies in RDFDesigning and developing vocabularies in RDF
Designing and developing vocabularies in RDF
Open Data Support
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)
Carole Goble
 

What's hot (20)

European Open Science Cloud architecture future view
European Open Science Cloud architecture future viewEuropean Open Science Cloud architecture future view
European Open Science Cloud architecture future view
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
CKAN as an open-source data management solution for open data
CKAN as an open-source data management solution for open data CKAN as an open-source data management solution for open data
CKAN as an open-source data management solution for open data
 
Big data in education
Big data in educationBig data in education
Big data in education
 
Security issues associated with big data in cloud
Security issues associated  with big data in cloudSecurity issues associated  with big data in cloud
Security issues associated with big data in cloud
 
Neo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time AnalyticsNeo4j – The Fastest Path to Scalable Real-Time Analytics
Neo4j – The Fastest Path to Scalable Real-Time Analytics
 
Open, FAIR data and RDM
Open, FAIR data and RDMOpen, FAIR data and RDM
Open, FAIR data and RDM
 
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...
Slides: Data Monetization — Demonstrating Quantifiable Financial Benefits fro...
 
What it means to be FAIR
What it means to be FAIRWhat it means to be FAIR
What it means to be FAIR
 
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
Denodo’s Data Catalog: Bridging the Gap between Data and Business (APAC)
 
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...
Session 1 - Introduction to i4Trust Data Spaces, building blocks, and roles |...
 
Designing and developing vocabularies in RDF
Designing and developing vocabularies in RDFDesigning and developing vocabularies in RDF
Designing and developing vocabularies in RDF
 
220222.FederatiefDataDelen.TopsectorLogistiek.pdf
220222.FederatiefDataDelen.TopsectorLogistiek.pdf220222.FederatiefDataDelen.TopsectorLogistiek.pdf
220222.FederatiefDataDelen.TopsectorLogistiek.pdf
 
Evolution of Data Spaces
Evolution of Data SpacesEvolution of Data Spaces
Evolution of Data Spaces
 
A chart of the big data ecosystem
A chart of the big data ecosystemA chart of the big data ecosystem
A chart of the big data ecosystem
 
Data discovery through federated dataset catalogs
Data discovery through federated dataset catalogsData discovery through federated dataset catalogs
Data discovery through federated dataset catalogs
 
How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)How are we Faring with FAIR? (and what FAIR is not)
How are we Faring with FAIR? (and what FAIR is not)
 
Setting up a data repository, what does it entail?
Setting up a data repository, what does it entail?Setting up a data repository, what does it entail?
Setting up a data repository, what does it entail?
 
Creative Interactive Browser Visualizations with Bokeh by Bryan Van de ven
Creative Interactive Browser Visualizations with Bokeh by Bryan Van de venCreative Interactive Browser Visualizations with Bokeh by Bryan Van de ven
Creative Interactive Browser Visualizations with Bokeh by Bryan Van de ven
 
Big Data and Machine Learning with FIWARE
Big Data and Machine Learning with FIWAREBig Data and Machine Learning with FIWARE
Big Data and Machine Learning with FIWARE
 

Similar to FAIR data: what it means, how we achieve it, and the role of RDA

Similar to FAIR data: what it means, how we achieve it, and the role of RDA (20)

Results from the FAIR Expert Group Stakeholder Consultation on the FAIR Data ...
Results from the FAIR Expert Group Stakeholder Consultation on the FAIR Data ...Results from the FAIR Expert Group Stakeholder Consultation on the FAIR Data ...
Results from the FAIR Expert Group Stakeholder Consultation on the FAIR Data ...
 
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
Turning FAIR into Reality: Final outcomes from the European Commission FAIR D...
 
Turning FAIR data into reality
Turning FAIR data into realityTurning FAIR data into reality
Turning FAIR data into reality
 
FAIR Data Interim Report and Action Plan
FAIR Data Interim Report and Action PlanFAIR Data Interim Report and Action Plan
FAIR Data Interim Report and Action Plan
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries Turning FAIR into Reality - Role for Libraries
Turning FAIR into Reality - Role for Libraries
 
LIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into RealityLIBER Webinar: Turning FAIR Data Into Reality
LIBER Webinar: Turning FAIR Data Into Reality
 
FAIR play?
FAIR play? FAIR play?
FAIR play?
 
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR dataTurning FAIR into Reality: Briefing on the EC’s report on FAIR data
Turning FAIR into Reality: Briefing on the EC’s report on FAIR data
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
PARTHENOS Common Policies and Implementation Strategies
PARTHENOS Common Policies and Implementation StrategiesPARTHENOS Common Policies and Implementation Strategies
PARTHENOS Common Policies and Implementation Strategies
 
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Framework and Roadmap towards an Open Science Infrastructure/Simon HodsonFramework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
H2020 data pilot openaire
H2020 data pilot openaireH2020 data pilot openaire
H2020 data pilot openaire
 
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
The Horizon 2020 Open Data Pilot - OpenAIRE webinar (Oct. 21 2014) by Sarah J...
 
Datajalostamo-seminaari 5.6.2014: Tutkimusdatan avoimuus – globaalit tutkimus...
Datajalostamo-seminaari 5.6.2014: Tutkimusdatan avoimuus – globaalit tutkimus...Datajalostamo-seminaari 5.6.2014: Tutkimusdatan avoimuus – globaalit tutkimus...
Datajalostamo-seminaari 5.6.2014: Tutkimusdatan avoimuus – globaalit tutkimus...
 
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
Big Data Europe: SC6 Workshop 3: The European Research Data Landscape: Opport...
 
My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018My FAIR share of the work - Diamond Light Source - Dec 2018
My FAIR share of the work - Diamond Light Source - Dec 2018
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
EUDAT & OpenAIRE Webinar: How to write a Data Management Plan - July 7, 2016|...
 

More from Sarah Jones

More from Sarah Jones (20)

Data training tips and tricks
Data training tips and tricksData training tips and tricks
Data training tips and tricks
 
EOSC and libraries
EOSC and librariesEOSC and libraries
EOSC and libraries
 
EOSC Association priorities and activities
EOSC Association priorities and activitiesEOSC Association priorities and activities
EOSC Association priorities and activities
 
Managing and sharing data: lessons from the European context
Managing and sharing data: lessons from the European contextManaging and sharing data: lessons from the European context
Managing and sharing data: lessons from the European context
 
Reflections on Open Science
Reflections on Open ScienceReflections on Open Science
Reflections on Open Science
 
MAR comments analysis
MAR comments analysisMAR comments analysis
MAR comments analysis
 
EOSC-MAR-update.pptx
EOSC-MAR-update.pptxEOSC-MAR-update.pptx
EOSC-MAR-update.pptx
 
Intro-EOSC.pptx
Intro-EOSC.pptxIntro-EOSC.pptx
Intro-EOSC.pptx
 
Why is EOSC so hard?
Why is EOSC so hard?Why is EOSC so hard?
Why is EOSC so hard?
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
Is Europe ready for Open Science
Is Europe ready for Open ScienceIs Europe ready for Open Science
Is Europe ready for Open Science
 
DMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessonsDMPonline: 10 years, 10 lessons
DMPonline: 10 years, 10 lessons
 
Do & don't of supporting Open Science
Do & don't of supporting Open ScienceDo & don't of supporting Open Science
Do & don't of supporting Open Science
 
Why institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIRWhy institutions need to raise their capabilities to support FAIR
Why institutions need to raise their capabilities to support FAIR
 
It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commons
 
DMPTuuli - what's new?
DMPTuuli - what's new?DMPTuuli - what's new?
DMPTuuli - what's new?
 
DCC and FAIR initiatives
DCC and FAIR initiativesDCC and FAIR initiatives
DCC and FAIR initiatives
 
Reflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDCReflections on EOSC through the mirror of ARDC
Reflections on EOSC through the mirror of ARDC
 
Future EOSC roadmap
Future EOSC roadmapFuture EOSC roadmap
Future EOSC roadmap
 
Global Open Research Commons IG
Global Open Research Commons IGGlobal Open Research Commons IG
Global Open Research Commons IG
 

Recently uploaded

Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Victor Rentea
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Safe Software
 

Recently uploaded (20)

DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 AmsterdamDEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
DEV meet-up UiPath Document Understanding May 7 2024 Amsterdam
 
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot ModelMcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
Mcleodganj Call Girls 🥰 8617370543 Service Offer VIP Hot Model
 
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
Navigating the Deluge_ Dubai Floods and the Resilience of Dubai International...
 
Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)Introduction to Multilingual Retrieval Augmented Generation (RAG)
Introduction to Multilingual Retrieval Augmented Generation (RAG)
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
ICT role in 21st century education and its challenges
ICT role in 21st century education and its challengesICT role in 21st century education and its challenges
ICT role in 21st century education and its challenges
 
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024Finding Java's Hidden Performance Traps @ DevoxxUK 2024
Finding Java's Hidden Performance Traps @ DevoxxUK 2024
 
MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024MINDCTI Revenue Release Quarter One 2024
MINDCTI Revenue Release Quarter One 2024
 
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
Apidays New York 2024 - Passkeys: Developing APIs to enable passwordless auth...
 
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​Elevate Developer Efficiency & build GenAI Application with Amazon Q​
Elevate Developer Efficiency & build GenAI Application with Amazon Q​
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
Emergent Methods: Multi-lingual narrative tracking in the news - real-time ex...
 
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers:  A Deep Dive into Serverless Spatial Data and FMECloud Frontiers:  A Deep Dive into Serverless Spatial Data and FME
Cloud Frontiers: A Deep Dive into Serverless Spatial Data and FME
 
Six Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal OntologySix Myths about Ontologies: The Basics of Formal Ontology
Six Myths about Ontologies: The Basics of Formal Ontology
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
Biography Of Angeliki Cooney | Senior Vice President Life Sciences | Albany, ...
 
DBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor PresentationDBX First Quarter 2024 Investor Presentation
DBX First Quarter 2024 Investor Presentation
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWEREMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
EMPOWERMENT TECHNOLOGY GRADE 11 QUARTER 2 REVIEWER
 
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemkeProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke
 

FAIR data: what it means, how we achieve it, and the role of RDA

  • 1. S a r a h J o n e s A s s o c i a t e D i r e c t o r , D i g i t a l C u r a t i o n C e n t r e R a p p o r t e u r o f F A I R d a t a e x p e r t g r o u p s a r a h . j o n e s @ g l a s g o w . a c . u k T w i t t e r : @ s j D C C FAIR data: what it means, how we achieve it, and the role of RDA
  • 2. What is FAIR? A set of principles that describe the attributes data need to have to enable and enhance reuse, by humans and machines Image CC-BY-SA by SangyaPundir
  • 3. Origins of FAIR • Emerged from a workshop held in Leiden in 2014 • Come from life sciences but intended for all data • Issued by FORCE11 community • Echo previous principles on open data & curation OECD Principles and Guidelines for Access to Research Data from Public Funding (2007) Science as an Open Enterprise (2012) notion of ‘intelligent openness’ where data are accessible, intelligible, assessable and useable G8 Science Ministers Statement (2013)
  • 4. How do Open and FAIR intersect? Open FAIR data Managed data Internal Self-interest External Community benefit
  • 5. Open data and FAIR data FAIR data Open data FAIR and Open are not synonymous. Data can be both, one or neither. Degrees of Open and FAIR And both are on a scale
  • 6. Implementing FAIR – the EC expert group Simon Hodson, CODATA Chair of FAIR Data EG Rūta Petrauskaité, Vytautas Magnus University Peter Wittenburg, Max Planck Computing & Data Facility Sarah Jones, Digital Curation Centre (DCC), Rapporteur Daniel Mietchen, Data Science Institute, University of Virginia Françoise Genova, Observatoire Astronomique de Strasbourg Leif Laaksonen, CSC-IT Centre for Science Natalie Harrower, Digital Repository of Ireland – year 2 only Sandra Collins, National Library of Ireland – year 1 only
  • 7. Remit of the FAIR data expert group 1. To develop recommendations on what needs to be done to turn each component of the FAIR data principles into reality 2. To propose indicators to measure progress on each of the FAIR components 3. To provide input to the proposed European Open Science Cloud (EOSC) action plan on how to make data FAIR 4. To contribute to the evaluation of the Horizon 2020 Data Management Plan (DMP) template and development of associated sector / discipline-specific guidance 5. To provide input on the issue of costing and financing data management activities http://tinyurl.com/FAIR-EG
  • 8. Report framework 1. Concepts – why FAIR? 2. Creating a culture of FAIR data 3. Creating a technical ecosystem for FAIR data 4. Skills and capacity building 5. Measuring change 6. Funding and sustaining FAIR data 7. FAIR Data Action Plan
  • 9. Culture and technology for FAIR • Science/research is a cultural system with considerable technological dependency. • Culture and technology for FAIR data are deeply interrelated. • Fundamentally important to address cultural and technological requirements for FAIR data holistically. Image CC-BY by Nicolas Raymond https://www.flickr.com/photos/80497449@N04/8691983876
  • 10. How the Action Plan is structured A short tweetable recommendation • Underpinned by practical and specific action points • Action points to be allocated to stakeholders FAIR Data Action Plan applies to EC, member states, and international level, but we also place in context of EOSC to inform this roadmap
  • 11. Primary recommendations and actions Step 1: Define and apply FAIR appropriately Rec. 1: Definitions of FAIR Rec. 2: Mandates and boundaries for Open Rec. 3: A model for FAIR Data Objects
  • 12. FAIR and Open  Ensure FAIR also covers appropriate openness, assessability, long-term stewardship, timely sharing, data selection and legal interoperability.  Recognise practices varies. Communities need to self- organise and define what FAIR data means.  Make Open Data mandate explicit using the maxim ‘as open as possible as closed as necessary’ but apply proportionately with genuine best efforts to share
  • 13. FAIR Data Objects DATA The core bits At its most basic level, data is a bitstream or binary sequence. For data to have meaning and to be FAIR, it needs to be represented in standard formats and be accompanied by Persistent Identifiers (PIDs), metadata and code. These layers of meaning enrich the data and enable reuse. IDENTIFIERS Persistent and unique (PIDs) Data should be assigned a unique and persistent identifier such as a DOI or URN. This enables stable links to the object and supports citation and reuse to be tracked. Identifiers should also be applied to other related concepts such as the data authors (ORCIDs), projects (RAIDs), funders and associated research resources (RRIDs). STANDARDS & CODE Open, documented formats Data should be represented in common and ideally open file formats. This enables others to reuse the data as the format is in widespread use and software is available to read the files. Open and well-documented formats are easier to preserve . Data also need to be accompanied by the code use to process and analyse the data. METADATA Contextual documentation In order for data to be assessable and reusable, it should be accompanied by sufficient metadata and documentation. Basic metadata will enable data discovery, but much richer information and provenance is required to understand how, why, when and by whom the data were created. To enable the broadest reuse, data should be accompanied by a 'plurality of relevant attributes' and a clear and accessible data usage license.
  • 14. Primary recommendations and actions Step 2: Develop and support a sustainable FAIR data ecosystem Rec. 4: Components of a FAIR data ecosystem Rec. 5: Sustainable funding for FAIR components Rec. 6: Strategic and evidence-based funding
  • 15. FAIR data ecosystem Standards Skills Rewards Metrics Investment FAIR Data Objects stored in Trusted repositories & Cloud Services Data policies People: researchers, funders, publishers, data stewards… PIDs Define & regulate Provide hub of info on FAIR data objects Assigned to Create & use FAIR components Motivated by outside drivers DMP • Components include: policies, DMPs, identifiers, standards and repositories. • Need registries cataloguing each component of the ecosystem • Require automated workflows between components • Cultural aspects (skills, metrics, rewards, €€) drive the system
  • 16. Sustainable & strategic funding  The components of the FAIR ecosystem need to be maintained at a professional service level with sustainable funding.  Funders of research data services should consolidate and build on existing investments in infrastructure and tools, where they demonstrate impact and community adoption.  Funding should be tied to metrics to ensure interoperable data services and reusable data.
  • 17. Primary recommendation and actions Step 3: Ensure robust FAIR data and trusted services Rec. 7: Disciplinary interoperability frameworks Rec. 8: Cross-disciplinary FAIRness Rec. 9: Develop robust FAIR data metrics Rec. 10: Trusted Digital Repositories Rec. 11: Develop metrics to assess and certify services
  • 18. Interoperability frameworks  Research communities must be supported to develop and maintain their disciplinary interoperability frameworks.  These include principles and practices for data management and sharing, community agreements, data formats, metadata standards, tools and data infrastructure.  Interoperability frameworks should be articulated in common ways and adopt global standards where possible to enable interdisciplinary research.  Important role for international, cross-disciplinary initiatives and fora in development of practices, protocols and standards.
  • 19. Metrics: for data & services  A set of metrics for FAIR Data Objects should be developed and implemented, starting from the basic common core of descriptive metadata, PIDs and access.  Metrics and certification are needed that assess services. Such schemes should not compete with existing repository accreditation, but should measure and accredit components such as identifier services, standards and vocabularies.
  • 20. Primary recommendations and actions Step 4: Embed a culture of FAIR in research practice Rec. 12: Data Management via DMPs Rec. 13: Professionalise data science and stewardship roles Rec. 14: Recognise and reward FAIR data and FAIR stewardship
  • 21. Data management via DMPs  Any research project should include data management as a core element necessary for the delivery of its scientific objectives, addressing this in a Data Management Plan.  The DMP should be regularly updated to provide a hub of information on the FAIR data objects.  Also: use information held in Data Management Plans – structure data to enable information exchange across the FAIR data ecosystem
  • 22. Roles: data science & data stewardship  Two cohorts of professionals to support FAIR data: – data scientists embedded in those research projects – data stewards who will ensure the curation of FAIR data  Coordinate, systematise and accelerate the pedagogy  Accreditation of courses to professionalise roles  Support skills transfer schemes
  • 23. Recognition & rewards: data & services  Data should be recognised as a core research output and included in the assessment of research contributions and career progression.  The provision of data-related infrastructure and services must also be recognised as an essential part of the FAIR ecosystem.  Require evidence of FAIR data and accredited services and reward accordingly.
  • 24. Next steps for the Expert Group  Launch interim report next week at EOSC Summit  Run consultations over summer via – Online platform – Webinars – Events e.g. FORCE18, International Data Week  Revise and finalise report and Action Plan  Formally launch at Austrian Presidency event on 23 Nov
  • 25. Role of the RDA in FAIR For FAIR to really succeed, global agreements need to be in place. RDA is an ideal international forum to:  Clarify definitions of FAIR  Support research communities to develop disciplinary interoperability frameworks  Identify commonalities across FAIR implementation to promote the formation and adoption of global standards  Promote the exchange of good practice and lessons  Collect use cases and examples…
  • 26. Explore / define FAIR implementations  What does FAIR mean in different disciplines?  What are the best practices in data sharing?  What are the appropriate boundaries of Open?  How do we recognise skills and formalise career pathways?  How do we certify FAIR data and services?  What metrics denote which services should be sustained?  How do registries interact and know of each other?  How do we test that the ecosystem is fit-for-purpose?
  • 27. Offer guidance and support  Pointing researchers to the most appropriate services for them (Re3data, FAIRsharing…)  Raising awareness and adoption of standards (Metadata Standards Catalogue, DCAT)  Training researchers in data science (CODATA / RDA schools)  Professionalising data stewardship (Community of Practice, accreditation…)  Supporting certification of repositories & other services (CoreTrustSeal)
  • 28. Context specific FAIR Action Plans  Support the definition of more detailed FAIR Data Action Plans at research community and member state level  Potential role for European nodes?