Accelerating biomedical discovery
with an Internet of FAIR data and services
@micheldumontier::Semantics:2019-09-10
Michel Dumontier, Ph.D.
Distinguished Professor of Data Science
Director, Institute of Data Science
An increasing number of discoveries
are made using already available data
A common rejection module (CRM) for acute rejection across multiple organs identifies novel
therapeutics for organ transplantation
Khatri et al. JEM. 210 (11): 2205
DOI: 10.1084/jem.20122709
Main Findings:
1. CRM genes predicted future injury to a graft
2. Mice treated with drugs against the CRM genes extended graft survival
3. Retrospective EHR analysis supports treatment prediction
Key Observations:
1. Meta-analysis offers a more reliable estimate of the magnitude of the effect
2. Data can be used to generate and support/dispute new hypotheses
However, significant effort is
still needed to find the right
dataset(s), make sense of them,
and use them for a new purpose
How do you find your data?
• Datasets you learned about in yesterday’s tutorials and
workshops
• Datasets used in the lab or organization
• Datasets associated with a paper you read or were told about
• A search in a specific repository
• A search on the internet
Poor quality metadata frustrates reuse
Vast number of lexically unique keys (and values)
Our ability to reproduce landmark studies is surprisingly low:
39% (39/100) in psychology [1]
21% (14/67) in pharmacology [2]
11% (6/53) in cancer [3]
unsatisfactory in machine learning [4]
[1] doi:10.1038/nature.2015.17433
[2] doi:10.1038/nrd3439-c1
[3] doi:10.1038/483531a
[4] https://openreview.net/pdf?id=By4l2PbQ-
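The percentages quoted above follow directly from the counts given next to them. A minimal sketch in Python that recomputes them, using only the numbers on the slide (the names `replications` and `reproduction_rate` are just for illustration):

```python
# Reproduced / attempted counts as quoted on the slide.
replications = {
    "psychology": (39, 100),
    "pharmacology": (14, 67),
    "cancer": (6, 53),
}


def reproduction_rate(reproduced: int, attempted: int) -> float:
    """Return the share of studies reproduced, as a percentage."""
    return 100.0 * reproduced / attempted


for field, (reproduced, attempted) in replications.items():
    rate = reproduction_rate(reproduced, attempted)
    print(f"{field}: {reproduced}/{attempted} = {rate:.0f}%")
```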
Most published research findings are false.
- John Ioannidis, Stanford University
PLoS Med 2005;2(8): e124.
What hope do we really have to realize
?
Poor quality (meta)data → Reproducibility Crisis → Translational Failure
Broken windows theory:
visible signs of crime, anti-social behavior, and civil disorder create an urban
environment that encourages further crime and disorder, including serious crimes
Inadequate reusability theory:
Poor quality metadata and the inaccessibility of original research results make it
less likely to reproduce original work, resulting in an ineffective translation of
research into useful applications
It’s time to completely rethink
how we perform research
Lambin et al. Radiother Oncol. 2013. 109(1):159-64. doi: 10.1016/j.radonc.2013.07.007
Human Machine collaboration
will be crucial to our future success
We need a new social contract, supported
by legal and technological infrastructure
to make digital resources available in a
responsible manner
An international, bottom-up paradigm for
the discovery and reuse of digital content
for the machines that people use
http://www.nature.com/articles/sdata201618
FAIR: Impact
FAIR in a nutshell
FAIR aims to create social and economic impact by facilitating the
discovery and reuse of digital resources through a set of requirements:
– unique identifiers to retrieve all forms of digital content and knowledge
– high quality (meta)data to enhance discovery of digital resources
– use of common vocabularies to share terms and facilitate query
– establishment of community standards for more facile knowledge utilisation
– detailed provenance to provide context and reproducibility
– registered in appropriate repositories with high quality metadata for future
content seekers
– social and technological commitments to realize reliable access
– simpler terms of use to clarify expectations and intensify innovation
FAIR != Open
As open as possible,
as closed as necessary
FAIR Hype Curve
Credit: Carole Goble
Why Should *I* Go FAIR?
• Makes it easier for me to use my own data for a new purpose
• Makes it easier for other people to find, use and cite my data,
and for them to understand what I expect in return
• Makes it easier/possible for people to verify my work
• Ensure that the data are available in the future, especially as I
may not want the responsibility
• Satisfy expectations around data management from
institution, funding agency, journal, my peers
Let’s build the Internet of FAIR data and services
https://lod-cloud.net/
The Semantic Web
is a portal to the web of knowledge
standards for publishing, sharing and querying facts, expert knowledge and services
scalable approach for the discovery of independently constructed,
collaboratively described, distributed knowledge (in principle)
Linked Data for the Life Sciences
Bio2RDF is an open source project that uses semantic web
technologies to make it easier to reuse biomedical data
• 30+ biomedical data sources
• 10B+ interlinked statements
• EBI, SIB, NCBI, DBCLS, NCBO, and many others produce this content
chemicals/drugs/formulations
genomes/genes/proteins, domains
interactions, complexes & pathways
animal models and phenotypes
disease, genetic markers, treatments
terminologies & publications
Alison Callahan, Jose Cruz-Toledo, Peter Ansell, Michel Dumontier:
Bio2RDF Release 2: Improved Coverage, Interoperability and
Provenance of Life Science Linked Data. ESWC 2013: 200-212
Federated query over the biological web of data
Phenotypes of knock-out mouse models for the targets of a selected drug (Imatinib)
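The query pattern here is a join across independently published sources: drug to targets in one source, gene to knock-out phenotypes in another. A toy sketch of that join in Python follows. The two dictionaries stand in for remote endpoints, and every identifier in them is a made-up placeholder, not real Bio2RDF content:

```python
from typing import Dict, List, Set

# Stand-ins for two independently maintained sources. All identifiers
# are hypothetical placeholders, not real drug, gene or phenotype data.
DRUG_TARGETS: Dict[str, List[str]] = {
    "drug:example": ["gene:A", "gene:B"],
}
KNOCKOUT_PHENOTYPES: Dict[str, List[str]] = {
    "gene:A": ["phenotype:1", "phenotype:2"],
    "gene:B": ["phenotype:2", "phenotype:3"],
    "gene:C": ["phenotype:4"],
}


def phenotypes_for_drug(drug: str) -> Set[str]:
    """Join the two sources on the shared gene identifier."""
    phenotypes: Set[str] = set()
    for gene in DRUG_TARGETS.get(drug, []):
        phenotypes.update(KNOCKOUT_PHENOTYPES.get(gene, []))
    return phenotypes


print(sorted(phenotypes_for_drug("drug:example")))
```

The join only works because both sources use the same identifier for a gene, which is what shared identifiers and vocabularies are meant to guarantee.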
Reproduce original research
Original: AUC 0.91 across all therapeutic indications
Scripts not available. Feature tables available.
Result: ROC AUC 0.83 … doesn’t quite match
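For readers comparing the two numbers: ROC AUC is the probability that a randomly chosen positive example is scored above a randomly chosen negative one, with ties counted as one half. A minimal pure-Python sketch of that definition follows. The function name and the toy labels and scores are illustrative assumptions, not the study's data:

```python
from typing import Sequence


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """ROC AUC via pairwise comparison of positive and negative scores."""
    positives = [s for y, s in zip(labels, scores) if y == 1]
    negatives = [s for y, s in zip(labels, scores) if y == 0]
    if not positives or not negatives:
        raise ValueError("need at least one positive and one negative")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(positives) * len(negatives))


# Toy example: three of the four positive/negative pairs are ranked correctly.
print(roc_auc([0, 0, 1, 1], [0.1, 0.4, 0.35, 0.8]))
```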
Efficiently explore the web of data
Finding melanoma drugs by exploring a probabilistic semantic knowledge graph,
and validating them against pipelines for drug discovery
Finding melanoma drugs through a probabilistic knowledge graph.
PeerJ Computer Science. 2017. 3:e106 https://doi.org/10.7717/peerj-cs.106
Search registries for relevant datasets
Success depends on quality of metadata
Metadata identifier
Resource identifier
Standardized, machine readable format
Use of community vocabularies
License?
Provenance?
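The checklist above can be read as a set of tests on a dataset description. A minimal sketch in Python, assuming the description is available as a flat dictionary. The key names (`metadata_id`, `resource_id`, `format`, `vocabularies`, `license`, `provenance`) and the example values are hypothetical and not taken from any standard:

```python
from typing import Dict, List

# Hypothetical keys, one per item on the checklist.
REQUIRED_KEYS: List[str] = [
    "metadata_id",   # identifier of the metadata record itself
    "resource_id",   # identifier of the described resource
    "format",        # standardized, machine-readable format
    "vocabularies",  # community vocabularies used
    "license",
    "provenance",
]


def missing_items(description: Dict[str, object]) -> List[str]:
    """Return checklist keys that are absent or empty in the description."""
    return [key for key in REQUIRED_KEYS if not description.get(key)]


# Illustrative record that lacks a license and provenance.
example = {
    "metadata_id": "https://example.org/meta/1",
    "resource_id": "https://example.org/data/1",
    "format": "text/turtle",
    "vocabularies": ["http://purl.org/dc/terms/"],
}
print(missing_items(example))
```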
http://www.w3.org/TR/hcls-dataset/
The standard is registered in FAIRsharing
Conformance to the (meta)data standard
should be machine actionable
http://hw-swel.github.io/Validata/
RDF constraint validation tool that is configurable
to any profile
Declarative reusable schema description
Shape Expression (ShEx) based, but with extension
https://github.com/micheldumontier/hcls-shex
Now compliant with ShEx
Convertible to SHACL
Working on conversion to JSON-Schema
• 14 universal metrics covering each of the FAIR sub-principles. The metrics don’t dictate
any particular standards. They simply demand evidence (using protocols of the Web)
that the resource has met community expectations.
• Digital resource providers must provide at least one web-accessible document with
machine-readable metadata (FM-F2, FM-F3), resource management plan (FM-A2),
and any additional authorization procedures (FM-A1.2).
• They must use publicly registered: identifier schemes (FM-F1A), (secure) access
protocols (FM-A1.1), knowledge representation languages (FM-I1), licenses (FM-R1.1),
provenance specifications (FM-R1.2), and community standards (FM-R1.3)
• They must evidence that their resource can be located in search results (FM-F4), that
it provides links to other (FAIR) resources (FM-I3; FM-I2), and it validates against
community standards (FM-R1.3)
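Since each metric asks for evidence rather than a particular standard, an assessment can be pictured as a lookup from metric identifier to a piece of evidence. A rough sketch in Python, using only the metric identifiers named in the bullets above. The `evidence` mapping and its example URLs are hypothetical, and this is not the evaluator used by the FAIR metrics group:

```python
from typing import Dict, List, Optional

# Only the metric identifiers that are named in the text above.
METRICS: List[str] = [
    "FM-F1A", "FM-F2", "FM-F3", "FM-F4",
    "FM-A1.1", "FM-A1.2", "FM-A2",
    "FM-I1", "FM-I2", "FM-I3",
    "FM-R1.1", "FM-R1.2", "FM-R1.3",
]


def unmet_metrics(evidence: Dict[str, Optional[str]]) -> List[str]:
    """Return the metrics for which no evidence was supplied."""
    return [m for m in METRICS if not evidence.get(m)]


# Hypothetical evidence: a URL per metric, or nothing at all.
evidence = {
    "FM-F1A": "https://example.org/identifier-scheme",
    "FM-R1.1": "https://example.org/license",
}
missing = unmet_metrics(evidence)
print(f"{len(METRICS) - len(missing)} of {len(METRICS)} metrics have evidence")
```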
http://fairmetrics.org
http://w3id.org/AmIFAIR
To appear in Nature Scientific Data
FAIR Data Maturity Model Working Group
• Bring together stakeholders to build on existing approaches and expertise
• Establish core assessment criteria for FAIRness
• Explore a FAIR data maturity model & toolset
• Produce an RDA Recommendation
• Develop FAIR data checklist
Your invitation to participate
https://osf.io/n7uwp/
erik.schultes@go-fair.org
The Internet of FAIR data and services
must enable seamless traversal of heterogeneous digital resources
API
API
(agents)
A community building a shared infrastructure…
Mine distributed, access restricted FAIR datasets
in a privacy preserving manner
Maastricht Study + MUMC CBS
Goal is to learn high confidence determinants of health in a privacy preserving manner
over vertically partitioned data from the Maastricht Study and Statistics Netherlands.
The data are made available through FAIR data stations that provide access to
allowable subsets of data to authorized users of approved algorithms.
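One way to picture a data station: it holds its slice of the data locally and only runs algorithms from an approved list for authorized users, returning results rather than rows. A toy sketch in Python follows. The class, its method names, and the example records are all hypothetical and do not describe the actual Maastricht Study or Statistics Netherlands setup:

```python
from typing import Callable, Dict, List, Set

Record = Dict[str, float]
Algorithm = Callable[[List[Record]], float]


class DataStation:
    """Toy station: data stays inside, only approved algorithms run."""

    def __init__(self, records: List[Record], users: Set[str]) -> None:
        self._records = records          # never returned to callers
        self._users = users              # authorized user names
        self._approved: Dict[str, Algorithm] = {}

    def approve(self, name: str, algorithm: Algorithm) -> None:
        """Register an algorithm that is allowed to run on the data."""
        self._approved[name] = algorithm

    def run(self, user: str, name: str) -> float:
        """Run an approved algorithm for an authorized user."""
        if user not in self._users:
            raise PermissionError(f"user {user!r} is not authorized")
        if name not in self._approved:
            raise PermissionError(f"algorithm {name!r} is not approved")
        return self._approved[name](self._records)


def mean_age(records: List[Record]) -> float:
    """Aggregate that reveals no individual record."""
    return sum(r["age"] for r in records) / len(records)


# Hypothetical records and user, for illustration only.
station = DataStation([{"age": 40.0}, {"age": 60.0}], users={"alice"})
station.approve("mean_age", mean_age)
print(station.run("alice", "mean_age"))
```

A real deployment would also need a protocol for combining results across vertically partitioned sources, which this sketch leaves out.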
Establish a new social, legal, ethical and technological infrastructure for discovery
science in and across health and non-health settings, including scalable governance
and flexible consent to underpin the responsible use of Big Data.
Summary
FAIR represents a global initiative to enhance the discovery and reuse of all kinds
of digital resources. It is a work in progress!
It demands a new social, legal, ethical, scientific and technological
infrastructure that does not yet exist as a whole, but has to be built for and
adopted by digitally savvy communities! It must answer the questions:
– How can we share data and perform analyses in a responsible manner?
– What incentives, rewards and penalties are needed to maximize trust, participation,
legality, and utility?
Semantics, coupled with AI technologies, may enable humans, aided by
intelligent machine agents, to exploit the Internet of FAIR data and services,
and hence to accelerate discovery in biomedicine and in other disciplines.
FAIR is a part of the solution that will enable
arbitrary machines to work with each other
Large Scale, Autonomous Scientific Discovery
Tim Berners-Lee: Semantic Web
Ross King: Robot Science
Acknowledgements
FAIR FAIR metrics
Dumontier Lab (Maastricht University, Stanford University, Carleton University)
MU: Seun Adekunle, Remzi Celebi, Dorina Claessens, Ricardo De Miranda Azevedo, Pedro Hernandez Serrano, Massimiliano Grassi, Andine Havelange,
Lianne Ippel, Alexander Malic, Kody Moodley, Stuti Nayak, Nadine Rouleaux, Claudia van open, Chang Sun, Amrapali Zaveri
SU: Sandeep Ayyar, Remzi Celebi, Shima Dastgheib, Maulik Kamdar, David Odgers, Maryam Panahiazar, Amrapali Zaveri
CU: Alison Callahan, Jose Cruz-Toledo, Natalia Villanueva-Rosales
michel.dumontier@maastrichtuniversity.nl
Website: http://maastrichtuniversity.nl/ids
The mission of the Institute of Data Science at Maastricht University is to foster a
collaborative environment for multi-disciplinary data science research,
interdisciplinary training, and data-driven innovation.
We tackle key scientific, technical, social, legal, and ethical issues that advance our
understanding across a variety of disciplines and strengthen our communities in the
face of these developments.

More Related Content

What's hot

Towards enrichment of the open government data: a stakeholder-centered determ...
Towards enrichment of the open government data: a stakeholder-centered determ...Towards enrichment of the open government data: a stakeholder-centered determ...
Towards enrichment of the open government data: a stakeholder-centered determ...Anastasija Nikiforova
 
BioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageBioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageTom Plasterer
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataTom Plasterer
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences Pistoia Alliance
 
Towards FAIR principles for research software @ FAIR Software Session, Nation...
Towards FAIR principles for research software @ FAIR Software Session, Nation...Towards FAIR principles for research software @ FAIR Software Session, Nation...
Towards FAIR principles for research software @ FAIR Software Session, Nation...annalenalamprecht
 
Analysis of open health data quality using data object-driven approach to dat...
Analysis of open health data quality using data object-driven approach to dat...Analysis of open health data quality using data object-driven approach to dat...
Analysis of open health data quality using data object-driven approach to dat...Anastasija Nikiforova
 
DataCite and its Members: Connecting Research and Identifying Knowledge
DataCite and its Members: Connecting Research and Identifying KnowledgeDataCite and its Members: Connecting Research and Identifying Knowledge
DataCite and its Members: Connecting Research and Identifying KnowledgeETH-Bibliothek
 
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERSOPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERSAnastasija Nikiforova
 
Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...
Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...
Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...Anastasija Nikiforova
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for BiopharmaTom Plasterer
 
Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Valery Tkachenko
 
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...Dr. Haxel Consult
 
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Pistoia Alliance
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryCarole Goble
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Pistoia Alliance
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020Pistoia Alliance
 
Darwin ai covid-net mitre
Darwin ai   covid-net mitreDarwin ai   covid-net mitre
Darwin ai covid-net mitreianmitch
 

What's hot (20)

Towards enrichment of the open government data: a stakeholder-centered determ...
Towards enrichment of the open government data: a stakeholder-centered determ...Towards enrichment of the open government data: a stakeholder-centered determ...
Towards enrichment of the open government data: a stakeholder-centered determ...
 
BioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageBioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative Advantage
 
Dataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* DataDataset Catalogs as a Foundation for FAIR* Data
Dataset Catalogs as a Foundation for FAIR* Data
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
Towards FAIR principles for research software @ FAIR Software Session, Nation...
Towards FAIR principles for research software @ FAIR Software Session, Nation...Towards FAIR principles for research software @ FAIR Software Session, Nation...
Towards FAIR principles for research software @ FAIR Software Session, Nation...
 
Analysis of open health data quality using data object-driven approach to dat...
Analysis of open health data quality using data object-driven approach to dat...Analysis of open health data quality using data object-driven approach to dat...
Analysis of open health data quality using data object-driven approach to dat...
 
DataCite and its Members: Connecting Research and Identifying Knowledge
DataCite and its Members: Connecting Research and Identifying KnowledgeDataCite and its Members: Connecting Research and Identifying Knowledge
DataCite and its Members: Connecting Research and Identifying Knowledge
 
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERSOPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
OPEN DATA: ECOSYSTEM, CURRENT AND FUTURE TRENDS, SUCCESS STORIES AND BARRIERS
 
Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...
Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...
Stakeholder-centred Identification of Data Quality Issues: Knowledge that Can...
 
Linked Data for Biopharma
Linked Data for BiopharmaLinked Data for Biopharma
Linked Data for Biopharma
 
Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...Big data supporting drug discovery - cautionary tales from the world of chemi...
Big data supporting drug discovery - cautionary tales from the world of chemi...
 
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
AI-SDV 2020: AI in bioinformatics: An analysis of the patent landscape Anna M...
 
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018
 
IC-SDV 2019: OntoChem
IC-SDV 2019: OntoChemIC-SDV 2019: OntoChem
IC-SDV 2019: OntoChem
 
Pad 500
Pad 500Pad 500
Pad 500
 
Cis 500
Cis 500Cis 500
Cis 500
 
EOSC-Life Workflow Collaboratory
EOSC-Life Workflow CollaboratoryEOSC-Life Workflow Collaboratory
EOSC-Life Workflow Collaboratory
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020
 
Darwin ai covid-net mitre
Darwin ai   covid-net mitreDarwin ai   covid-net mitre
Darwin ai covid-net mitre
 

Similar to Acclerating biomedical discovery with an internet of FAIR data and services - keynote @ Semantics2019

Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Michel Dumontier
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...Michel Dumontier
 
Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...Platform Linked Data Netherlands (PLDN)
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...Michel Dumontier
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsMichel Dumontier
 
IRJET- A Novel Approach for Accomplishing Data Reliability and Isolation Safe...
IRJET- A Novel Approach for Accomplishing Data Reliability and Isolation Safe...IRJET- A Novel Approach for Accomplishing Data Reliability and Isolation Safe...
IRJET- A Novel Approach for Accomplishing Data Reliability and Isolation Safe...IRJET Journal
 
IRJET- Fake News Detection
IRJET- Fake News DetectionIRJET- Fake News Detection
IRJET- Fake News DetectionIRJET Journal
 
TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014MOTC Qatar
 
Fake News and Message Detection
Fake News and Message DetectionFake News and Message Detection
Fake News and Message DetectionIRJET Journal
 
Big data analytics and its impact on internet users
Big data analytics and its impact on internet usersBig data analytics and its impact on internet users
Big data analytics and its impact on internet usersStruggler Ever
 
FEDERAL LEARNING BASED SOLUTIONS FOR PRIVACY AND ANONYMITY IN INTERNET OF MED...
FEDERAL LEARNING BASED SOLUTIONS FOR PRIVACY AND ANONYMITY IN INTERNET OF MED...FEDERAL LEARNING BASED SOLUTIONS FOR PRIVACY AND ANONYMITY IN INTERNET OF MED...
FEDERAL LEARNING BASED SOLUTIONS FOR PRIVACY AND ANONYMITY IN INTERNET OF MED...IRJET Journal
 
NIC Linked Data: the OHIO project
NIC Linked Data:   the OHIO projectNIC Linked Data:   the OHIO project
NIC Linked Data: the OHIO projectMichael Wilkinson
 
How life sciences can win with blockchain
How life sciences can win with blockchainHow life sciences can win with blockchain
How life sciences can win with blockchainToni Borges
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Michel Dumontier
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...johnmutiso245
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...johnmutiso245
 

Similar to Acclerating biomedical discovery with an internet of FAIR data and services - keynote @ Semantics2019 (20)

Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
 
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
 
Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...
 
Are we FAIR yet?
Are we FAIR yet?Are we FAIR yet?
Are we FAIR yet?
 
The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...The Future of FAIR Data: An international social, legal and technological inf...
The Future of FAIR Data: An international social, legal and technological inf...
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
 
IRJET- A Novel Approach for Accomplishing Data Reliability and Isolation Safe...
IRJET- A Novel Approach for Accomplishing Data Reliability and Isolation Safe...IRJET- A Novel Approach for Accomplishing Data Reliability and Isolation Safe...
IRJET- A Novel Approach for Accomplishing Data Reliability and Isolation Safe...
 
IRJET- Fake News Detection
IRJET- Fake News DetectionIRJET- Fake News Detection
IRJET- Fake News Detection
 
TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014
 
EHLP - July 2015 pg 6-8
EHLP - July 2015 pg 6-8EHLP - July 2015 pg 6-8
EHLP - July 2015 pg 6-8
 
Fake News and Message Detection
Fake News and Message DetectionFake News and Message Detection
Fake News and Message Detection
 
Big data analytics and its impact on internet users
Big data analytics and its impact on internet usersBig data analytics and its impact on internet users
Big data analytics and its impact on internet users
 
Connecting the Data Wires
Connecting the Data WiresConnecting the Data Wires
Connecting the Data Wires
 
FEDERAL LEARNING BASED SOLUTIONS FOR PRIVACY AND ANONYMITY IN INTERNET OF MED...
FEDERAL LEARNING BASED SOLUTIONS FOR PRIVACY AND ANONYMITY IN INTERNET OF MED...FEDERAL LEARNING BASED SOLUTIONS FOR PRIVACY AND ANONYMITY IN INTERNET OF MED...
FEDERAL LEARNING BASED SOLUTIONS FOR PRIVACY AND ANONYMITY IN INTERNET OF MED...
 
NIC Linked Data: the OHIO project
NIC Linked Data:   the OHIO projectNIC Linked Data:   the OHIO project
NIC Linked Data: the OHIO project
 
Cloud computing in healthcare
Cloud computing in healthcare Cloud computing in healthcare
Cloud computing in healthcare
 
How life sciences can win with blockchain
How life sciences can win with blockchainHow life sciences can win with blockchain
How life sciences can win with blockchain
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
 


Accelerating biomedical discovery with an internet of FAIR data and services - keynote @ Semantics2019

  • 1. Accelerating biomedical discovery with an Internet of FAIR data and services. Michel Dumontier, Ph.D., Distinguished Professor of Data Science; Director, Institute of Data Science. @micheldumontier::Semantics:2019-09-10
  • 2. An increasing number of discoveries are made using already available data
  • 3. A common rejection module (CRM) for acute rejection across multiple organs identifies novel therapeutics for organ transplantation. Khatri et al. JEM. 210 (11): 2205. DOI: 10.1084/jem.20122709. Main Findings: 1. CRM genes predicted future injury to a graft 2. Treating mice with drugs against the CRM genes extended graft survival 3. Retrospective EHR analysis supports treatment prediction. Key Observations: 1. Meta-analysis offers a more reliable estimate of the magnitude of the effect 2. Data can be used to generate and support/dispute new hypotheses
  • 4. However, significant effort is still needed to find the right dataset(s), make sense of them, and use them for a new purpose
  • 5. How do you find your data? • Datasets you learned about in yesterday’s tutorials and workshops • Datasets used in the lab or organization • Datasets associated with a paper you read or were told about • A search in a specific repository • A search on the internet
  • 6. Poor quality metadata frustrates reuse: vast number of lexically unique keys (and values)
  • 7. Our ability to reproduce landmark studies is surprisingly low: 39% (39/100) in psychology [1]; 21% (14/67) in pharmacology [2]; 11% (6/53) in cancer [3]; unsatisfactory in machine learning [4]. [1] doi:10.1038/nature.2015.17433 [2] doi:10.1038/nrd3439-c1 [3] doi:10.1038/483531a [4] https://openreview.net/pdf?id=By4l2PbQ- "Most published research findings are false." (John Ioannidis, Stanford University. PLoS Med 2005;2(8): e124.)
  • 9. What hope do we really have to realize ?
  • 10. Poor quality (meta)data; Reproducibility Crisis; Translational Failure. Broken windows theory: visible signs of crime, anti-social behavior, and civil disorder create an urban environment that encourages further crime and disorder, including serious crimes. Inadequate reusability theory: poor quality metadata and the inaccessibility of original research results make it less likely that original work can be reproduced, resulting in an ineffective translation of research into useful applications.
  • 11. It’s time to completely rethink how we perform research
  • 12. Lambin et al. Radiother Oncol. 2013. 109(1):159-64. doi: 10.1016/j.radonc.2013.07.007
  • 13. Human-machine collaboration will be crucial to our future success
  • 14. We need a new social contract, supported by legal and technological infrastructure, to make digital resources available in a responsible manner
  • 16. An international, bottom-up paradigm for the discovery and reuse of digital content for the machines that people use
  • 19. FAIR in a nutshell: FAIR aims to create social and economic impact by facilitating the discovery and reuse of digital resources through a set of requirements: – unique identifiers to retrieve all forms of digital content and knowledge – high quality (meta)data to enhance discovery of digital resources – use of common vocabularies to share terms and facilitate query – establishment of community standards for more facile knowledge utilisation – detailed provenance to provide context and reproducibility – registered in appropriate repositories with high quality metadata for future content seekers – social and technological commitments to realize reliable access – simpler terms of use to clarify expectations and intensify innovation
  • 23. Why Should *I* Go FAIR? • Makes it easier for me to use my own data for a new purpose • Makes it easier for other people to find, use and cite my data, and for them to understand what I expect in return • Makes it easier/possible for people to verify my work • Ensures that the data are available in the future, especially as I may not want the responsibility • Satisfies expectations around data management from institution, funding agency, journal, my peers
  • 24. Let’s build the Internet of FAIR data and services
  • 26. The Semantic Web is a portal to the web of knowledge: standards for publishing, sharing and querying facts, expert knowledge and services; scalable approach for the discovery of independently constructed, collaboratively described, distributed knowledge (in principle)
  • 27. Linked Data for the Life Sciences. Bio2RDF is an open source project that uses semantic web technologies to make it easier to reuse biomedical data. • 30+ biomedical data sources • 10B+ interlinked statements • EBI, SIB, NCBI, DBCLS, NCBO, and many others produce this content. Coverage: chemicals/drugs/formulations; genomes/genes/proteins, domains; interactions, complexes & pathways; animal models and phenotypes; disease, genetic markers, treatments; terminologies & publications. Alison Callahan, Jose Cruz-Toledo, Peter Ansell, Michel Dumontier: Bio2RDF Release 2: Improved Coverage, Interoperability and Provenance of Life Science Linked Data. ESWC 2013: 200-212
  • 28. Federated query over the biological web of data: phenotypes of knock-out mouse models for the targets of a selected drug (Imatinib)
  • 29. Reproduce original research: AUC 0.91 across all therapeutic indications. Scripts not available. Feature tables available. Result: ROC AUC 0.83 … doesn’t quite match
  • 30. Efficiently explore the web of data by exploring a probabilistic semantic knowledge graph. And validate them against pipelines for drug discovery. Finding melanoma drugs through a probabilistic knowledge graph. PeerJ Computer Science. 2017. 3:e106 https://doi.org/10.7717/peerj-cs.106
  • 31. Search registries for relevant datasets. Success depends on quality of metadata
  • 32. Metadata identifier; resource identifier; standardized, machine readable format; use of community vocabularies; license? provenance?
  • 34. Conformance to the (meta)data standard should be machine actionable. http://hw-swel.github.io/Validata/ : RDF constraint validation tool that is configurable to any profile; declarative reusable schema description; Shape Expression (ShEx) based, but with extension. https://github.com/micheldumontier/hcls-shex : now compliant with ShEx; convertible to SHACL; working on conversion to JSON-Schema
  • 35. • 14 universal metrics covering each of the FAIR sub-principles. The metrics don’t dictate any particular standards. They simply demand evidence (using protocols of the Web) that the resource has met community expectations. • Digital resource providers must provide at least one web-accessible document with machine-readable metadata (FM-F2, FM-F3), resource management plan (FM-A2), and any additional authorization procedures (FM-A1.2). • They must use publicly registered: identifier schemes (FM-F1A), (secure) access protocols (FM-A1.1), knowledge representation languages (FM-I1), licenses (FM-R1.1), provenance specifications (FM-R1.2), and community standards (FM-R1.3) • They must evidence that their resource can be located in search results (FM-F4), that it provides links to other (FAIR) resources (FM-I3; FM-I2), and it validates against community standards (FM-R1.3) http://fairmetrics.org
  • 37. http://w3id.org/AmIFAIR To appear in Nature Scientific Data
  • 39. FAIR Data Maturity Model Working Group • Bring together stakeholders to build on existing approaches and expertise • Establish core assessment criteria for FAIRness • Explore a FAIR data maturity model & toolset • Produce an RDA Recommendation • Develop FAIR data checklist
  • 40. Your invitation to participate: https://osf.io/n7uwp/ erik.schultes@go-fair.org
  • 41. The Internet of FAIR data and services must enable seamless traversal of heterogeneous digital resources
  • 43. Mine distributed, access restricted FAIR datasets in a privacy preserving manner (Maastricht Study + MUMC; CBS). Goal is to learn high confidence determinants of health in a privacy preserving manner over vertically partitioned data from the Maastricht Study and Statistics Netherlands. The data are made available through FAIR data stations that provide access to allowable subsets of data to authorized users of approved algorithms. Establish a new social, legal, ethical and technological infrastructure for discovery science in and across health and non-health settings, including scalable governance and flexible consent to underpin the responsible use of Big Data.
  • 44. Summary: FAIR represents a global initiative to enhance the discovery and reuse of all kinds of digital resources. It is a work in progress! It demands a new social, legal, ethical, scientific and technological infrastructure that currently doesn’t exist in whole, but has to be built for and adopted by digital savvy communities! It must answer the questions: – How can we share data and perform analyses in a responsible manner? – What incentives, rewards and penalties are needed to maximize trust, participation, legality, and utility? Semantics, coupled with AI technologies, may enable humans, aided by intelligent machine agents, to exploit the Internet of FAIR data and services, and hence to accelerate discovery in biomedicine and in other disciplines.
  • 45. FAIR is a part of the solution that will enable arbitrary machines to work with each other. Tim Berners-Lee: Semantic Web. Ross King: Robot Science. Large Scale, Autonomous Scientific Discovery
  • 46. Acknowledgements: FAIR; FAIR metrics; Dumontier Lab (Maastricht University, Stanford University, Carleton University). MU: Seun Adekunle, Remzi Celebi, Dorina Claessens, Ricardo De Miranda Azevedo, Pedro Hernandez Serrano, Massimiliano Grassi, Andine Havelange, Lianne Ippel, Alexander Malic, Kody Moodley, Stuti Nayak, Nadine Rouleaux, Claudia van open, Chang Sun, Amrapali Zaveri. SU: Sandeep Ayyar, Remzi Celebi, Shima Dastgheib, Maulik Kamdar, David Odgers, Maryam Panahiazar, Amrapali Zaveri. CU: Alison Callahan, Jose Toledo-Cruz, Natalia Villaneuva-Rosales
  • 47. michel.dumontier@maastrichtuniversity.nl Website: http://maastrichtuniversity.nl/ids The mission of the Institute of Data Science at Maastricht University is to foster a collaborative environment for multi-disciplinary data science research, interdisciplinary training, and data-driven innovation. We tackle key scientific, technical, social, legal, and ethical issues that advance our understanding across a variety of disciplines and strengthen our communities in the face of these developments.
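
Slides 32 and 34 list the elements a dataset description should carry and ask that conformance be machine actionable. A minimal sketch of that idea in Python follows; it is not the Validata or ShEx approach named on slide 34, only a plain completeness check. The field names in `REQUIRED_FIELDS` and the example record are hypothetical, chosen to mirror the slide's list.

```python
# Hypothetical field names mirroring slide 32; the slides do not define a schema.
REQUIRED_FIELDS = (
    "metadata_identifier",
    "resource_identifier",
    "format",
    "vocabularies",
    "license",
    "provenance",
)


def missing_fields(record: dict) -> list[str]:
    """Return the required fields that are absent or empty in a metadata record."""
    return [field for field in REQUIRED_FIELDS if not record.get(field)]


# Illustrative record with placeholder identifiers (example.org is not a real registry).
example = {
    "metadata_identifier": "https://example.org/meta/1",
    "resource_identifier": "https://example.org/data/1",
    "format": "text/turtle",
    "vocabularies": ["http://purl.org/dc/terms/"],
}

print(missing_fields(example))
```

A record that omits license and provenance, the two items slide 32 marks with question marks, is reported as incomplete. A real validator would also check the form of each value, which is what shape languages such as ShEx or SHACL are for.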

Editor's Notes

  1. Abstract Using meta-analysis of eight independent transplant datasets (236 graft biopsy samples) from four organs, we identified a common rejection module (CRM) consisting of 11 genes that were significantly overexpressed in acute rejection (AR) across all transplanted organs. The CRM genes could diagnose AR with high specificity and sensitivity in three additional independent cohorts (794 samples). In another two independent cohorts (151 renal transplant biopsies), the CRM genes correlated with the extent of graft injury and predicted future injury to a graft using protocol biopsies. Inferred drug mechanisms from the literature suggested that two FDA-approved drugs (atorvastatin and dasatinib), approved for nontransplant indications, could regulate specific CRM genes and reduce the number of graft-infiltrating cells during AR. We treated mice with HLA-mismatched mouse cardiac transplant with atorvastatin and dasatinib and showed reduction of the CRM genes, significant reduction of graft-infiltrating cells, and extended graft survival. We further validated the beneficial effect of atorvastatin on graft survival by retrospective analysis of electronic medical records of a single-center cohort of 2,515 renal transplant patients followed for up to 22 yr. In conclusion, we identified a CRM in transplantation that provides new opportunities for diagnosis, drug repositioning, and rational drug design.
  2. G20: http://europa.eu/rapid/press-release_STATEMENT-16-2967_en.htm EOSC: https://ec.europa.eu/research/openscience/pdf/realising_the_european_open_science_cloud_2016.pdf H2020: https://goo.gl/Strjua
  3. https://www.gov.uk/government/publications/g8-science-ministers-statement-london-12-june-2013
  4. The Bio2RDF project transforms silos of life science data into a globally distributed network of linked data for biological knowledge discovery.
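
The federated query on slide 28 (phenotypes of knock-out mouse models for the targets of a drug) can be sketched with the SPARQL 1.1 `SERVICE` keyword. The Python helper below only assembles the query text. The endpoint URLs, the `ex:` vocabulary, and its predicates are hypothetical placeholders, not Bio2RDF's actual schema, and the label is interpolated without escaping, so this is for illustration only.

```python
def build_federated_query(drug_label: str, drug_endpoint: str, phenotype_endpoint: str) -> str:
    """Assemble a SPARQL 1.1 federated query joining two endpoints on ?target.

    The ex: predicates are hypothetical stand-ins for whatever the real
    datasets use to link drugs to targets and mouse models to phenotypes.
    """
    return f"""PREFIX ex: <https://example.org/vocab/>
PREFIX rdfs: <http://www.w3.org/2000/01/rdf-schema#>
SELECT DISTINCT ?target ?phenotype WHERE {{
  SERVICE <{drug_endpoint}> {{
    ?drug rdfs:label "{drug_label}" ;
          ex:target ?target .
  }}
  SERVICE <{phenotype_endpoint}> {{
    ?model ex:knockoutOf ?target ;
           ex:phenotype ?phenotype .
  }}
}}"""


# Placeholder endpoints; substitute real SPARQL endpoint URLs before running a query.
query = build_federated_query(
    "Imatinib",
    "https://example.org/drugs/sparql",
    "https://example.org/mouse/sparql",
)
print(query)
```

Each `SERVICE` block is evaluated at its own endpoint, and the shared `?target` variable joins the two result sets, which is what allows independently published datasets to be queried together.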