SlideShare a Scribd company logo
Assessing FAIR data principles
against the 5-star Open data
principles
Ali Hasnain and Dietrich Rebholz-Schuhmann
June 2019
Outline
• Motivation
• Linked Open Data
• FAIR Data
• 5 Open Data and FAIR Data
• Conclusions
2
Motivation:
• Linked Open Data has been defined and designed
to join up data in the public space
• FAIR data principles have been defined to make
data reusable
• Both share principles of
– Interoperability of data, and
– Findability of data
• How do both principled approaches differ?
• How does LOD relate to FAIR?
3
Data driven world
4
The Linked Data Cloud
5
Heterogeneous Data – Multiple Datasources
DrugBank
DailyMed
CheBI, KEGG
Reactome
Sider
BioPax
Medicare
6
Size & type of data we talk about
Dataset Category Year* Topic Size/ Coverage
Drugbank LODD 2010 Drugs 766920 triples, 4800 drugs
LinkedCT LODD X Clinical Trials 25 M triples, 106000 trials
DailyMed LODD 2010 Drugs 1604983 triples, >36K products
Dbpedia LODD 2009 Drugs/ Diseases/Proteins 218M triples, 2300 drugs, 2200 proteins
Diseasome LODD 2010 Diseases/ Genes 91182 triples, 2600 genes
SIDER LODD 2010 Diseases/ Side Effects 192515 triples, 63K effects, 1737 genes
STITCH LODD 2010 Chemicals/ Proteins 7.5 M chemicals, 0.5 M proteins
ChEMBLE LODD 2010 Assay/ Proteins/ Organisms 130 M triples
Affymetrix Bio2RDF 2014 Microarrays 8694237 triples, 6679943 entities
BioModels Bio2RDF 2014 Biological/ mathematical models 2380009 triples, 188308 entities
BioPortal Bio2RDF 2014 Biological/ biomedical entities 19920395 triples, 2199594 entities
KEGG Bio2RDF 2014 Genes 50197150 triples, 6533307 entities
PharmaG-KB Bio2RDF 2014 Genotypes/ Phenotypes 278049209 triples, 25325504 entities
PubMed Bio2RDF 2014 Citations 5005343905 triples, 412593720 entities
Taxonomy Bio2RDF 2014 Taxonomy 21310356 triples, 1147211 entities
LLD LLD 2014 Drugs, Chromosomes 10192641644 statements
*Statistics as of Aug 2016 (source DataHub) - Year specify the time when the last- most recent version is
produced. “X” means information not available.
7
5 ★ OPEN DATA (I.E. THE LOD)
8
Source:http://5stardata.info/en/
Open data, open science and free
software• Open in contrast to not fenced
• open library (“just walk in”), open society (“allow religious
diversity”), open skies (“allow fly-overs”), open university
(“studies for all”)
• Free software:
• different levels of use and reuse, free like “free speech”,
• not “free beer” and not “free puppy”
• Creative Commons license right parameters
• to govern Copyright:
• Attribution – Non-commercial – Share-like – No Derivatives
• In manufacturing:
• Open Design, Open Source Hardware
• Open means “open access”, e.g., scholarly literature,
data, meta-data, but includes “free” and “public
access”
• Open science: OA to all parts of the scientific process,
but access without use?03/06/2018 INSIGHT@NUIG
Open data, more specifically
• Open Education:
• Reuse of all data, for educational purposes
• Open Government: public access to governmental
data, achieving transparency and participation (Open
Data Governance Board)
• Open Data Institute: Breeding openness through
institutional structures, Insight is part of it
• Openness requires standards for more openness
• Agricultural openness: Open Source Seed Initiative,
others
• But also: “Open source Beer” and “Free Beer
Foundation”
03/06/2018 INSIGHT@NUIG
Data Interoperability, Integration and Reuse is
the Key
nih:EGFR
epidermal growth
factor receptor
Homo
sapiens
CCCCGGCGCAGCGCGGCCGCAGCA
GCCTCCGCCCCCCGCACGGTGTGA
GCGCCCGACGCGGCCGAGGCGG …
nih:EGF
nci:has_description
nih:sequence
nih:organism
nih:interacts
nih:organism
rea:EGFR
rea:Membrane
rea:Receptor
rea:Transferase
rea:keyword
rea:keyword
rea:keyword
NCBI Reactome
sameAs
11
12
Source:
FAIR Data
Findable
– (Meta)data is assigned a globally unique and persistent identifier
– Data is described with rich metadata
– (Meta)data is registered or indexed in a searchable resource
– (Meta)data specifies the data identifier
Accessible
– (Meta)data is retrieved by the identifier using a standardized protocol
– The protocol is open, free and universally implementable
– Protocol allows for an authentication and authorization procedure when
necessary
– The (meta)data remains accessible even when the data is not
Interoperable
– (Meta)data uses a formal, accessible, shared and broadly applicable language for
knowledge representation
– (Meta)data uses vocabularies that follow the Fair Principles
– (Meta)data includes qualified references to other metadata
Reusable
– (Meta)data must have a plurality of accurate and relevant attributes
– (Meta)data is released with a clear and accessible data usage license
– (Meta)data is associated with its provenance
– (Meta)data must meet domain-relevant community standards 13
To be Findable:
• F1. (meta)data are assigned a globally unique and
persistent identifier
– usage of URIs
• F2. data are described with rich metadata
– requires structured representation
– relate to linkage to other data for context
• F3. metadata clearly and explicitly include the
identifier of the data it describes
– corresponds to the usage of URIs
• F4. (meta)data are registered or indexed in a
searchable resource
– about technologies making use of the data, which is the
consequence of data access over the Web
14
To be Accessible:
• A1. (meta)data are retrievable by their identifier using a
standardised communications protocol
– usage of URIs, although other identifiers could be used as well
• A1.1 the protocol is open, free, and universally
implementable
– URIs and related open data standards
• A1.2 the protocol allows for an authentication and
authorisation procedure, where necessary
– the LOD only covers the open access part as the primary
requirement
• A2. metadata are accessible, even when the data are no
longer available
– At best, this corresponds to the URIs,
– but long-term acquisition is not a matter of data standards but
relates to IT infrastructures.
15
To be Interoperable:
• I1. (meta)data use a formal, accessible, shared, and
broadly applicable language for knowledge
representation
– use of ”non-proprietary open formats”
• I2. (meta)data use vocabularies that follow FAIR
principles
– refers to contextual standardised data
• I3. (meta)data include qualified references to other
(meta)data
– refers to contextual standardised data
16
To be Reusable:
• – R1. meta(data) are richly described with a plurality of
accurate and relevant attributes.
– this refers to the structured data, but makes a stronger reference
to contextual information .
• – R1.1. (meta)data are released with a clear and
accessible data usage license
– a statement about the delivery of (meta)data, which is meant to be
open according to LOD .
• – R1.2. (meta)data are associated with detailed
provenance
– provenance is part of the contextual data.
• – R1.3. (meta)data meet domain-relevant community
standards
– community standards are part of the contextual data and should be
openly available. 17
FAIR principles mapping to LOD 5-star
scheme
18
Distinctions and Overlaps
• LOD and non-data assets
– The LOD 5 stars scheme is for the Open Data.
– FAIR data must not be openly available and requires that access to
the license agreement is made available.
– FAIR asks for meta-data to be provided with the data to improve
interoperability, which relates to the contextual information from the
LOD principles through interoperable data linked as one or several
interoperable open data sources.
• Tool and technology independent
– FAIR Guiding Principles are independent of implementation choices,
and do not suggest any specific technology, standard, or
implementation solution.
– FAIR principles are not, themselves, a standard or a specification but
a guide to data publishers and consumers to assist regarding
Findable, Accessible, Interoperable, and Reusable implementations
and resources.
– LOD 5-star scheme are not themselves a standard or a specification
but provides the guideline for choosing the tools and technologies to19
Conclusions
• FAIR principles are focused to user needs around a wide
range of existing data, whereas the LOD principles form an
idealistic approach to a world using open data.
• LOD principles appear to be more stringent in its own
principles, whereas the FAIR data principles can be seen
as reusing the LOD principles without making explicit
reference and with added aspects of separating data into
the core data and the meta-data, including license
considerations.
• The future will tell, whether the FAIR data principles lead to
the reusability of data under the consideration that the
access to meta-data and to license agreements open the
avenues of reusabiltiy, or whether the LOD principles are
the gatekeepers to reusability, since -- a priori -- the data
should be openly available anyways.
20
Thanks for your attention…

More Related Content

What's hot

It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commons
Sarah Jones
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
Sarah Jones
 
Fair data principles for AOASG
Fair data principles for AOASGFair data principles for AOASG
Fair data principles for AOASG
Keith Russell
 
"Cool" metadata for FAIR data
"Cool" metadata for FAIR data"Cool" metadata for FAIR data
"Cool" metadata for FAIR data
Research Data Alliance
 
Rda in a_nutshell_february_2017_updated
Rda in a_nutshell_february_2017_updatedRda in a_nutshell_february_2017_updated
Rda in a_nutshell_february_2017_updated
Research Data Alliance
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
African Open Science Platform
 
FAIR play?
FAIR play? FAIR play?
FAIR play?
Sarah Jones
 
D4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data infrastructure: a facilitator for a FAIR data managementD4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data infrastructure: a facilitator for a FAIR data management
Research Data Alliance
 
Overcoming obstacles to sharing data about human subjects
Overcoming obstacles to sharing data about human subjectsOvercoming obstacles to sharing data about human subjects
Overcoming obstacles to sharing data about human subjects
Robin Rice
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
Carole Goble
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
African Open Science Platform
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Pistoia Alliance
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
Kristi Holmes
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Susanna-Assunta Sansone
 
FAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessFAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRness
Susanna-Assunta Sansone
 
FAIRsharing: what we do for policies
FAIRsharing: what we do for policiesFAIRsharing: what we do for policies
FAIRsharing: what we do for policies
Susanna-Assunta Sansone
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
ARDC
 
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Pistoia Alliance
 
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Framework and Roadmap towards an Open Science Infrastructure/Simon HodsonFramework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
African Open Science Platform
 
Burton - Security, Privacy and Trust
Burton - Security, Privacy and TrustBurton - Security, Privacy and Trust
Burton - Security, Privacy and Trust
National Information Standards Organization (NISO)
 

What's hot (20)

It takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commonsIt takes more than a village: lessons on building global research commons
It takes more than a village: lessons on building global research commons
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
Fair data principles for AOASG
Fair data principles for AOASGFair data principles for AOASG
Fair data principles for AOASG
 
"Cool" metadata for FAIR data
"Cool" metadata for FAIR data"Cool" metadata for FAIR data
"Cool" metadata for FAIR data
 
Rda in a_nutshell_february_2017_updated
Rda in a_nutshell_february_2017_updatedRda in a_nutshell_february_2017_updated
Rda in a_nutshell_february_2017_updated
 
A coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon HodsonA coordinated framework for open data open science in Botswana/Simon Hodson
A coordinated framework for open data open science in Botswana/Simon Hodson
 
FAIR play?
FAIR play? FAIR play?
FAIR play?
 
D4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data infrastructure: a facilitator for a FAIR data managementD4Science Data infrastructure: a facilitator for a FAIR data management
D4Science Data infrastructure: a facilitator for a FAIR data management
 
Overcoming obstacles to sharing data about human subjects
Overcoming obstacles to sharing data about human subjectsOvercoming obstacles to sharing data about human subjects
Overcoming obstacles to sharing data about human subjects
 
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...RDMkit, a Research Data Management Toolkit.  Built by the Community for the ...
RDMkit, a Research Data Management Toolkit. Built by the Community for the ...
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...
 
Linked Open Data_mlanet13
Linked Open Data_mlanet13Linked Open Data_mlanet13
Linked Open Data_mlanet13
 
Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.Managing Big Data - Berlin, July 9-10, 201.
Managing Big Data - Berlin, July 9-10, 201.
 
FAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRnessFAIRsharing: how we assist with FAIRness
FAIRsharing: how we assist with FAIRness
 
FAIRsharing: what we do for policies
FAIRsharing: what we do for policiesFAIRsharing: what we do for policies
FAIRsharing: what we do for policies
 
Findable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) dataFindable, Accessible, Interoperable and Reusable (FAIR) data
Findable, Accessible, Interoperable and Reusable (FAIR) data
 
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018
 
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Framework and Roadmap towards an Open Science Infrastructure/Simon HodsonFramework and Roadmap towards an Open Science Infrastructure/Simon Hodson
Framework and Roadmap towards an Open Science Infrastructure/Simon Hodson
 
Burton - Security, Privacy and Trust
Burton - Security, Privacy and TrustBurton - Security, Privacy and Trust
Burton - Security, Privacy and Trust
 

Similar to Fair data vs 5 star open data final

FAIR data and data management
FAIR data and data managementFAIR data and data management
FAIR data and data management
Hugo Besemer
 
Kr slides fair astronomy 20181019
Kr slides fair astronomy 20181019Kr slides fair astronomy 20181019
Kr slides fair astronomy 20181019
ARDC
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Tom Plasterer
 
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
Open Science Fair
 
CARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practice
CARARE
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basics
OpenAIRE
 
Fair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortFair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevort
Data Science NIH
 
David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access
DataSciSIG
 
NIH Data Science Special Interest Group
NIH Data Science Special Interest GroupNIH Data Science Special Interest Group
NIH Data Science Special Interest Group
Yaffa Rubinstien
 
DTL Integrator's meeting
DTL Integrator's meetingDTL Integrator's meeting
DTL Integrator's meeting
Luiz Olavo Bonino da Silva Santos
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptx
Getu Tadele
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
Michel Dumontier
 
Webinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management PlanningWebinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management Planning
AIMS (Agricultural Information Management Standards)
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE
 
Essentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data SupportEssentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data Support
Ellen Verbakel
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
Anita de Waard
 
OSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data setsOSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data sets
Open Science Fair
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
Research Data Alliance
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
Research Data Alliance
 
04 findable imming
04 findable imming04 findable imming
04 findable imming
ShareCareX
 

Similar to Fair data vs 5 star open data final (20)

FAIR data and data management
FAIR data and data managementFAIR data and data management
FAIR data and data management
 
Kr slides fair astronomy 20181019
Kr slides fair astronomy 20181019Kr slides fair astronomy 20181019
Kr slides fair astronomy 20181019
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
OSFair2017 workshop | Monitoring the FAIRness of data sets - Introducing the ...
 
CARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practiceCARARE: Can I use this data? FAIR into practice
CARARE: Can I use this data? FAIR into practice
 
FAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basicsFAIR Ddata in trustworthy repositories: the basics
FAIR Ddata in trustworthy repositories: the basics
 
Fair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortFair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevort
 
David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access
 
NIH Data Science Special Interest Group
NIH Data Science Special Interest GroupNIH Data Science Special Interest Group
NIH Data Science Special Interest Group
 
DTL Integrator's meeting
DTL Integrator's meetingDTL Integrator's meeting
DTL Integrator's meeting
 
VODAN Africa IN.pptx
VODAN Africa IN.pptxVODAN Africa IN.pptx
VODAN Africa IN.pptx
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
Webinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management PlanningWebinar@AIMS_FAIR Principles and Data Management Planning
Webinar@AIMS_FAIR Principles and Data Management Planning
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
Essentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data SupportEssentials 4 Data Support: a fine course in FAIR Data Support
Essentials 4 Data Support: a fine course in FAIR Data Support
 
NFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR DataNFAIS Talk on Enabling FAIR Data
NFAIS Talk on Enabling FAIR Data
 
OSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data setsOSFair2017 Training | FAIR metrics - Starring your data sets
OSFair2017 Training | FAIR metrics - Starring your data sets
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
04 findable imming
04 findable imming04 findable imming
04 findable imming
 

More from Syed Muhammad Ali Hasnain

Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...
Syed Muhammad Ali Hasnain
 
SHARP: Harmonizing cross-workflow Provenance
SHARP: Harmonizing cross-workflow ProvenanceSHARP: Harmonizing cross-workflow Provenance
SHARP: Harmonizing cross-workflow Provenance
Syed Muhammad Ali Hasnain
 
SHARP: Harmonizing Galaxy and Taverna workflow provenance
SHARP: Harmonizing Galaxy and Taverna workflow provenanceSHARP: Harmonizing Galaxy and Taverna workflow provenance
SHARP: Harmonizing Galaxy and Taverna workflow provenance
Syed Muhammad Ali Hasnain
 
Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Doc...
Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Doc...Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Doc...
Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Doc...
Syed Muhammad Ali Hasnain
 
An Approach for Discovering and Exploring Semantic Relationships between Genes
An Approach for Discovering and Exploring Semantic Relationships between GenesAn Approach for Discovering and Exploring Semantic Relationships between Genes
An Approach for Discovering and Exploring Semantic Relationships between Genes
Syed Muhammad Ali Hasnain
 
Federated Query Formulation and Processing through BioFed
Federated Query Formulation and Processing through BioFedFederated Query Formulation and Processing through BioFed
Federated Query Formulation and Processing through BioFed
Syed Muhammad Ali Hasnain
 
Processing Life Science Data at Scale - using Semantic Web Technologies
Processing Life Science Data at Scale - using Semantic Web TechnologiesProcessing Life Science Data at Scale - using Semantic Web Technologies
Processing Life Science Data at Scale - using Semantic Web Technologies
Syed Muhammad Ali Hasnain
 
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
A Provenance assisted Roadmap for Life Sciences Linked Open Data CloudA Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
Syed Muhammad Ali Hasnain
 
Improving discovery in Life Sciences Linked Open Data Cloud
Improving discovery in Life Sciences Linked Open Data CloudImproving discovery in Life Sciences Linked Open Data Cloud
Improving discovery in Life Sciences Linked Open Data Cloud
Syed Muhammad Ali Hasnain
 
Knowledge Processing with Big Data and Semantic Web Technologies
Knowledge Processing with Big Data and  Semantic Web TechnologiesKnowledge Processing with Big Data and  Semantic Web Technologies
Knowledge Processing with Big Data and Semantic Web Technologies
Syed Muhammad Ali Hasnain
 
FedViz: A Visual Interface for SPARQL Queries Formulation and Execution
FedViz: A Visual Interface for SPARQL Queries Formulation and ExecutionFedViz: A Visual Interface for SPARQL Queries Formulation and Execution
FedViz: A Visual Interface for SPARQL Queries Formulation and Execution
Syed Muhammad Ali Hasnain
 

More from Syed Muhammad Ali Hasnain (11)

Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...Quantifying the content of biomedical semantic resources as a core for drug d...
Quantifying the content of biomedical semantic resources as a core for drug d...
 
SHARP: Harmonizing cross-workflow Provenance
SHARP: Harmonizing cross-workflow ProvenanceSHARP: Harmonizing cross-workflow Provenance
SHARP: Harmonizing cross-workflow Provenance
 
SHARP: Harmonizing Galaxy and Taverna workflow provenance
SHARP: Harmonizing Galaxy and Taverna workflow provenanceSHARP: Harmonizing Galaxy and Taverna workflow provenance
SHARP: Harmonizing Galaxy and Taverna workflow provenance
 
Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Doc...
Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Doc...Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Doc...
Exploiting Cognitive Computing and Frame Semantic Features for Biomedical Doc...
 
An Approach for Discovering and Exploring Semantic Relationships between Genes
An Approach for Discovering and Exploring Semantic Relationships between GenesAn Approach for Discovering and Exploring Semantic Relationships between Genes
An Approach for Discovering and Exploring Semantic Relationships between Genes
 
Federated Query Formulation and Processing through BioFed
Federated Query Formulation and Processing through BioFedFederated Query Formulation and Processing through BioFed
Federated Query Formulation and Processing through BioFed
 
Processing Life Science Data at Scale - using Semantic Web Technologies
Processing Life Science Data at Scale - using Semantic Web TechnologiesProcessing Life Science Data at Scale - using Semantic Web Technologies
Processing Life Science Data at Scale - using Semantic Web Technologies
 
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
A Provenance assisted Roadmap for Life Sciences Linked Open Data CloudA Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
A Provenance assisted Roadmap for Life Sciences Linked Open Data Cloud
 
Improving discovery in Life Sciences Linked Open Data Cloud
Improving discovery in Life Sciences Linked Open Data CloudImproving discovery in Life Sciences Linked Open Data Cloud
Improving discovery in Life Sciences Linked Open Data Cloud
 
Knowledge Processing with Big Data and Semantic Web Technologies
Knowledge Processing with Big Data and  Semantic Web TechnologiesKnowledge Processing with Big Data and  Semantic Web Technologies
Knowledge Processing with Big Data and Semantic Web Technologies
 
FedViz: A Visual Interface for SPARQL Queries Formulation and Execution
FedViz: A Visual Interface for SPARQL Queries Formulation and ExecutionFedViz: A Visual Interface for SPARQL Queries Formulation and Execution
FedViz: A Visual Interface for SPARQL Queries Formulation and Execution
 

Recently uploaded

Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
Ana-Maria Mihalceanu
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Thierry Lestable
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
Dorra BARTAGUIZ
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
Elena Simperl
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
DianaGray10
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Inflectra
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Albert Hoitingh
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
James Anderson
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
Product School
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
KatiaHIMEUR1
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
Kari Kakkonen
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
DianaGray10
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
UiPathCommunity
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
Paul Groth
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
Alan Dix
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
Product School
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Product School
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
BookNet Canada
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Tobias Schneck
 

Recently uploaded (20)

Monitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR EventsMonitoring Java Application Security with JDK Tools and JFR Events
Monitoring Java Application Security with JDK Tools and JFR Events
 
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
Empowering NextGen Mobility via Large Action Model Infrastructure (LAMI): pav...
 
Elevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object CalisthenicsElevating Tactical DDD Patterns Through Object Calisthenics
Elevating Tactical DDD Patterns Through Object Calisthenics
 
Knowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and backKnowledge engineering: from people to machines and back
Knowledge engineering: from people to machines and back
 
UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3UiPath Test Automation using UiPath Test Suite series, part 3
UiPath Test Automation using UiPath Test Suite series, part 3
 
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered QualitySoftware Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
Software Delivery At the Speed of AI: Inflectra Invests In AI-Powered Quality
 
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
Encryption in Microsoft 365 - ExpertsLive Netherlands 2024
 
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
GDG Cloud Southlake #33: Boule & Rebala: Effective AppSec in SDLC using Deplo...
 
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdfFIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
FIDO Alliance Osaka Seminar: The WebAuthn API and Discoverable Credentials.pdf
 
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
From Daily Decisions to Bottom Line: Connecting Product Work to Revenue by VP...
 
Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !Securing your Kubernetes cluster_ a step-by-step guide to success !
Securing your Kubernetes cluster_ a step-by-step guide to success !
 
DevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA ConnectDevOps and Testing slides at DASA Connect
DevOps and Testing slides at DASA Connect
 
Connector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a buttonConnector Corner: Automate dynamic content and events by pushing a button
Connector Corner: Automate dynamic content and events by pushing a button
 
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
Dev Dives: Train smarter, not harder – active learning and UiPath LLMs for do...
 
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMsTo Graph or Not to Graph Knowledge Graph Architectures and LLMs
To Graph or Not to Graph Knowledge Graph Architectures and LLMs
 
Epistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI supportEpistemic Interaction - tuning interfaces to provide information for AI support
Epistemic Interaction - tuning interfaces to provide information for AI support
 
How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...How world-class product teams are winning in the AI era by CEO and Founder, P...
How world-class product teams are winning in the AI era by CEO and Founder, P...
 
Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...Mission to Decommission: Importance of Decommissioning Products to Increase E...
Mission to Decommission: Importance of Decommissioning Products to Increase E...
 
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...Transcript: Selling digital books in 2024: Insights from industry leaders - T...
Transcript: Selling digital books in 2024: Insights from industry leaders - T...
 
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
Kubernetes & AI - Beauty and the Beast !?! @KCD Istanbul 2024
 

Fair data vs 5 star open data final

  • 1. Assessing FAIR data principles against the 5-star Open data principles Ali Hasnain and Dietrich Rebholz-Schuhmann June 2019
  • 2. Outline • Motivation • Linked Open Data • FAIR Data • 5 Open Data and FAIR Data • Conclusions 2
  • 3. Motivation: • Linked Open Data has been defined and designed to join up data in the public space • FAIR data principles have been defined to make data reusable • Both share principles of – Interoperability of data, and – Findability of data • How do both principled approaches differ? • How does LOD relate to FAIR? 3
  • 5. The Linked Data Cloud 5
  • 6. Heterogeneous Data – Multiple Datasources DrugBank DailyMed CheBI, KEGG Reactome Sider BioPax Medicare 6
  • 7. Size & type of data we talk about Dataset Category Year* Topic Size/ Coverage Drugbank LODD 2010 Drugs 766920 triples, 4800 drugs LinkedCT LODD X Clinical Trials 25 M triples, 106000 trials DailyMed LODD 2010 Drugs 1604983 triples, >36K products Dbpedia LODD 2009 Drugs/ Diseases/Proteins 218M triples, 2300 drugs, 2200 proteins Diseasome LODD 2010 Diseases/ Genes 91182 triples, 2600 genes SIDER LODD 2010 Diseases/ Side Effects 192515 triples, 63K effects, 1737 genes STITCH LODD 2010 Chemicals/ Proteins 7.5 M chemicals, 0.5 M proteins ChEMBLE LODD 2010 Assay/ Proteins/ Organisms 130 M triples Affymetrix Bio2RDF 2014 Microarrays 8694237 triples, 6679943 entities BioModels Bio2RDF 2014 Biological/ mathematical models 2380009 triples, 188308 entities BioPortal Bio2RDF 2014 Biological/ biomedical entities 19920395 triples, 2199594 entities KEGG Bio2RDF 2014 Genes 50197150 triples, 6533307 entities PharmaG-KB Bio2RDF 2014 Genotypes/ Phenotypes 278049209 triples, 25325504 entities PubMed Bio2RDF 2014 Citations 5005343905 triples, 412593720 entities Taxonomy Bio2RDF 2014 Taxonomy 21310356 triples, 1147211 entities LLD LLD 2014 Drugs, Chromosomes 10192641644 statements *Statistics as of Aug 2016 (source DataHub) - Year specify the time when the last- most recent version is produced. “X” means information not available. 7
  • 8. 5 ★ OPEN DATA (I.E. THE LOD) 8 Source:http://5stardata.info/en/
  • 9. Open data, open science and free software• Open in contrast to not fenced • open library (“just walk in”), open society (“allow religious diversity”), open skies (“allow fly-overs”), open university (“studies for all”) • Free software: • different levels of use and reuse, free like “free speech”, • not “free beer” and not “free puppy” • Creative Commons license right parameters • to govern Copyright: • Attribution – Non-commercial – Share-like – No Derivatives • In manufacturing: • Open Design, Open Source Hardware • Open means “open access”, e.g., scholarly literature, data, meta-data, but includes “free” and “public access” • Open science: OA to all parts of the scientific process, but access without use?03/06/2018 INSIGHT@NUIG
  • 10. Open data, more specifically • Open Education: • Reuse of all data, for educational purposes • Open Government: public access to governmental data, achieving transparency and participation (Open Data Governance Board) • Open Data Institute: Breeding openness through institutional structures, Insight is part of it • Openness requires standards for more openness • Agricultural openness: Open Source Seed Initiative, others • But also: “Open source Beer” and “Free Beer Foundation” 03/06/2018 INSIGHT@NUIG
  • 11. Data Interoperability, Integration and Reuse is the Key nih:EGFR epidermal growth factor receptor Homo sapiens CCCCGGCGCAGCGCGGCCGCAGCA GCCTCCGCCCCCCGCACGGTGTGA GCGCCCGACGCGGCCGAGGCGG … nih:EGF nci:has_description nih:sequence nih:organism nih:interacts nih:organism rea:EGFR rea:Membrane rea:Receptor rea:Transferase rea:keyword rea:keyword rea:keyword NCBI Reactome sameAs 11
  • 13. FAIR Data Findable – (Meta)data is assigned a globally unique and persistent identifier – Data is described with rich metadata – (Meta)data is registered or indexed in a searchable resource – (Meta)data specifies the data identifier Accessible – (Meta)data is retrieved by the identifier using a standardized protocol – The protocol is open, free and universally implementable – Protocol allows for an authentication and authorization procedure when necessary – The (meta)data remains accessible even when the data is not Interoperable – (Meta)data uses a formal, accessible, shared and broadly applicable language for knowledge representation – (Meta)data uses vocabularies that follow the Fair Principles – (Meta)data includes qualified references to other metadata Reusable – (Meta)data must have a plurality of accurate and relevant attributes – (Meta)data is released with a clear and accessible data usage license – (Meta)data is associated with its provenance – (Meta)data must meet domain-relevant community standards 13
  • 14. To be Findable: • F1. (meta)data are assigned a globally unique and persistent identifier – usage of URIs • F2. data are described with rich metadata – requires structured representation – relate to linkage to other data for context • F3. metadata clearly and explicitly include the identifier of the data it describes – corresponds to the usage of URIs • F4. (meta)data are registered or indexed in a searchable resource – about technologies making use of the data, which is the consequence of data access over the Web 14
  • 15. To be Accessible: • A1. (meta)data are retrievable by their identifier using a standardised communications protocol – usage of URIs, although other identifiers could be used as well • A1.1 the protocol is open, free, and universally implementable – URIs and related open data standards • A1.2 the protocol allows for an authentication and authorisation procedure, where necessary – the LOD only covers the open access part as the primary requirement • A2. metadata are accessible, even when the data are no longer available – At best, this corresponds to the URIs, – but long-term acquisition is not a matter of data standards but relates to IT infrastructures. 15
  • 16. To be Interoperable: • I1. (meta)data use a formal, accessible, shared, and broadly applicable language for knowledge representation – use of ”non-proprietary open formats” • I2. (meta)data use vocabularies that follow FAIR principles – refers to contextual standardised data • I3. (meta)data include qualified references to other (meta)data – refers to contextual standardised data 16
  • 17. To be Reusable: • – R1. meta(data) are richly described with a plurality of accurate and relevant attributes. – this refers to the structured data, but makes a stronger reference to contextual information . • – R1.1. (meta)data are released with a clear and accessible data usage license – a statement about the delivery of (meta)data, which is meant to be open according to LOD . • – R1.2. (meta)data are associated with detailed provenance – provenance is part of the contextual data. • – R1.3. (meta)data meet domain-relevant community standards – community standards are part of the contextual data and should be openly available. 17
  • 18. FAIR principles mapping to LOD 5-star scheme 18
  • 19. Distinctions and Overlaps • LOD and non-data assets – The LOD 5 stars scheme is for the Open Data. – FAIR data must not be openly available and requires that access to the license agreement is made available. – FAIR asks for meta-data to be provided with the data to improve interoperability, which relates to the contextual information from the LOD principles through interoperable data linked as one or several interoperable open data sources. • Tool and technology independent – FAIR Guiding Principles are independent of implementation choices, and do not suggest any specific technology, standard, or implementation solution. – FAIR principles are not, themselves, a standard or a specification but a guide to data publishers and consumers to assist regarding Findable, Accessible, Interoperable, and Reusable implementations and resources. – LOD 5-star scheme are not themselves a standard or a specification but provides the guideline for choosing the tools and technologies to19
  • 20. Conclusions • FAIR principles are focused to user needs around a wide range of existing data, whereas the LOD principles form an idealistic approach to a world using open data. • LOD principles appear to be more stringent in its own principles, whereas the FAIR data principles can be seen as reusing the LOD principles without making explicit reference and with added aspects of separating data into the core data and the meta-data, including license considerations. • The future will tell, whether the FAIR data principles lead to the reusability of data under the consideration that the access to meta-data and to license agreements open the avenues of reusabiltiy, or whether the LOD principles are the gatekeepers to reusability, since -- a priori -- the data should be openly available anyways. 20
  • 21. Thanks for your attention…

Editor's Notes

  1. We live in a world of data!
  2. EGFR: Epidermal growth factor receptor
  3. Thank you for your attention and I open the floor for any questions and comments.