SlideShare a Scribd company logo
Accelerating Discovery Science
with an Internet of FAIR Data and Services
@micheldumontier::CIKM:2020-10-211
Michel Dumontier, Ph.D.
Distinguished Professor of Data Science
Director, Institute of Data Science
@micheldumontier::CIKM:2020-10-212
The world is awash with vast amounts of data
@micheldumontier::CIKM:2020-10-213
4
A common rejection module (CRM) for acute rejection across multiple organs identifies novel
therapeutics for organ transplantation
Khatri et al. JEM. 210 (11): 2205
DOI: 10.1084/jem.20122709
@micheldumontier::CIKM:2020-10-21
Main Findings:
1. CRM of 11 overexpressed genes predicted future injury to a graft
2. Mice treated with existing drugs against specific CRM genes extended graft survival
3. Retrospective EHR data analysis supports treatment prediction
Key Observations:
1. Meta-analysis offers a more reliable estimate of the magnitude of the effect
2. Data can be used to generate and support/dispute new hypotheses
However, significant effort is
still needed to find the right
dataset(s), make sense of them,
and use for a new purpose
@micheldumontier::CIKM:2020-10-215
@micheldumontier::CIKM:2020-10-216
7 @micheldumontier::CIKM:2020-10-21
Our ability to reproduce landmark studies is surprisingly low:
39% (39/100) in psychology1
21% (14/67) in pharmacology2
11% (6/53) in cancer3
unsatisfactory in machine learning4
1doi:10.1038/nature.2015.17433 2doi:10.1038/nrd3439-c1 3doi:10.1038/483531a 4https://openreview.net/pdf?id=By4l2PbQ-
Most published research findings are false.
- John Ioannidis, Stanford University
PLoS Med 2005;2(8): e124.
@micheldumontier::CIKM:2020-10-218
9
What hope do we really have to realize
?
@micheldumontier::CIKM:2020-10-2110
It’s time to completely rethink
how we perform research
@micheldumontier::CIKM:2020-10-2111
Poor quality
(meta)data Reproducibility
Crisis
Translational
Failure
Broken windows theory
Inadequate reusability theory
visible signs of crime, anti-
social behavior, and civil
disorder create an
environment that
encourages more serious
crimes
Poor quality metadata and the
inaccessibility of original research
results make it less likely to
reproduce original work, resulting
in an ineffective translation of
research into useful applications
@micheldumontier::CIKM:2020-10-2112
It’s time to completely rethink
how we perform research
(and how we document and report it)
@micheldumontier::CIKM:2020-10-2113
Lambin et al. Radiother Oncol. 2013. 109(1):159-64. doi: 10.1016/j.radonc.2013.07.007
Rethinking Publishing Scientific Research
@micheldumontier::CIKM:2020-10-2114
Data Science. 2017 1(1-2):139-154. DOI: 10.3233/DS-170010
http://www.tkuhn.org/pub/sempub/
De-centralized knowledge graphs
@micheldumontier::CIKM:2020-10-2115
Kuhn T., Chichester C., Krauthammer M., Dumontier M. (2015) Publishing
Without Publishers: A Decentralized Approach to Dissemination, Retrieval, and
Archiving of Data. In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015.
ISWC 2015. Lecture Notes in Computer Science, vol 9366. Springer, Cham
We need a new social contract, supported
by legal and technological infrastructure
to make digital resources available in a
responsible manner
@micheldumontier::CIKM:2020-10-2116
Human Machine collaboration
will be crucial to our future success
@micheldumontier::CIKM:2020-10-2117
@micheldumontier::CIKM:2020-10-2118
An international, bottom-up paradigm for
the discovery and reuse of digital content
for the machines that people use
@micheldumontier::CIKM:2020-10-2119
http://www.nature.com/articles/sdata201618
@micheldumontier::CIKM:2020-10-2120
FAIR in a nutshell
FAIR aims to enhance social and economic outcomes by facilitating the
discovery and reuse of digital resources through key requirements:
– unique identifiers to distinguish and retrieve all forms of digital content and
knowledge
– high quality meta(data) to enhance discovery of relevant digital resources
– use of common vocabularies to facilitate query and statistical analysis
– establishment of community standards to reduce the effort in data reuse
– detailed provenance to provide adequate context and to enable reproducibility
– registered in appropriate repositories to fulfill a promise to future content seekers
– simpler terms of use to clarify expectations and intensify innovation
– social and technological commitments to make data ready for intelligent applications
@micheldumontier::CIKM:2020-10-2121
@micheldumontier::CIKM:2020-10-2122
@micheldumontier::CIKM:2020-10-2123
The lack of FAIR data costs the European Economy a minimum of €10.2bn per year
EC:DG R&I; PWC 2018 Report: Cost-benefit analysis for FAIR research data
Why Should *you* Go FAIR?
• Makes it easier for to use your own data for a new purpose
• Makes it easier for other people to find, use and cite your
data, and for them to understand what you expect in return
• Makes it easier/possible for people to verify your work
• Ensure that the data are available in the future, especially as
you may not want the responsibility
• Satisfy the expectations around data management from
institution, funding agency, journal, my peers
@micheldumontier::CIKM:2020-10-2124
Let’s build and use the
Internet of FAIR data and services
@micheldumontier::CIKM:2020-10-2125
FAIRification process
@micheldumontier::CIKM:2020-10-2126
GO FAIR Fairification: https://www.go-fair.org/fair-principles/fairification-process/
FAIRplus FAIR cookbook: https://fairplus.github.io/cookbook-dev/intro.html
Utrecht FAIR: https://www.uu.nl/en/research/research-data-management/guides/how-to-make-your-data-fair
EC H2020 Guidelines: https://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
@micheldumontier::CIKM:2020-10-2127
28
http://w3id.org/AmIFAIR
Other schemes: https://fairassist.org
29
The Semantic Web
is a portal to the web of knowledge
30 @micheldumontier::CIKM:2020-10-21
standards for publishing, sharing and querying
facts, expert knowledge and services
scalable approach for the discovery
of independently constructed,
collaboratively described,
distributed knowledge
(in principle)
@micheldumontier::CIKM:2020-10-2131
https://lod-cloud.net/
@micheldumontier::PLDN:2020-01-2232
Success depends on quality of metadataSearch registries for relevant datasets
@micheldumontier::PLDN:2020-01-2233
Metadata identifier
Resource identifier
Standardized, machine readable format
Use of community vocabularies
License?
Provenance?
@micheldumontier::PLDN:2020-01-2234
http://www.w3.org/TR/hcls-dataset/
standard is
registered in
FAIRsharing
• 30+ biomedical data sources
• 10B+ interlinked statements
• EBI, SIB, NCBI, DBCLS, NCBO, and many others
produce this content
chemicals/drugs/formulations,
genomes/genes/proteins, domains
Interactions, complexes & pathways
animal models and phenotypes
Disease, genetic markers, treatments
Terminologies & publications
35
Alison Callahan, Jose Cruz-Toledo, Peter Ansell, Michel Dumontier:
Bio2RDF Release 2: Improved Coverage, Interoperability and
Provenance of Life Science Linked Data. ESWC 2013: 200-212
Linked Data for the Life Sciences
Bio2RDF is an open source project that uses semantic web
technologies to make it easier to reuse biomedical data
@micheldumontier::CIKM:2020-10-21
Query multiple databases on the biological web of data
@micheldumontier::CIKM:2020-10-2136
Phenotypes of
knock-out
mouse models
for the targets
of a selected
drug (Imatinib)
@micheldumontier::CIKM:2020-10-2137
Explore we know, and formulate hypotheses about what we don’t
Finding melanoma drugs through a probabilistic knowledge graph.
PeerJ Computer Science. 2017. 3:e106 https://doi.org/10.7717/peerj-cs.106
by exploring a probabilistic
semantic knowledge graph
And validate them against
pipelines for drug discovery
Reproduce original research
@micheldumontier::CIKM:2020-10-2138
AUC 0.91 across all therapeutic indications Scripts not available. Feature tables available.
Result: AUC 0.83 … doesn’t match! (but now you can see what exactly we did)
Towards FAIR protocols and workflows: the OpenPREDICT use case. 2020. PeerJ Computer Science 6:e281
https://doi.org/10.7717/peerj-cs.281
Explore disease pathophysiology and treatment
@micheldumontier::CIKM:2020-10-2139
Mine distributed, access restricted FAIR datasets
in a privacy preserving manner
Maastricht Study + MUMC CBS
Goal is to learn high confidence determinants of health in a privacy preserving manner
over vertically partitioned data from the Maastricht Study and Statistics Netherlands.
The data are made available through FAIR data stations that provide access to
allowable subsets of data to authorized users of approved algorithms.
Establish a new social, legal, ethical and technological infrastructure for discovery
science in and across health and non-health settings, including scalable governance
and flexible consent to underpin the responsible use of Big Data.
@micheldumontier::CIKM:2020-10-2140
s
FAIR data and services
to accelerate discovery science
@micheldumontier::CIKM:2020-10-2141
Summary
FAIR represents a global initiative to enhance the discovery and reuse of all kinds of
digital resources. It is a work in progress and it needs you!
FAIR requires new social, legal, ethical, scientific and technological infrastructure:
– How does your research group or community make their data/findings FAIR?
– What support does your organization provide you?
– Are you making use of all the data and findings that you could?
– What is responsible data science and artificial intelligence?
Semantics, coupled with AI technologies, may enable humans, aided by intelligent
machine agents, to exploit the Internet of FAIR data and services, and hence to
accelerate discovery in biomedicine and in other disciplines.
@micheldumontier::CIKM:2020-10-2142
Acknowledgements
@micheldumontier::CIKM:2020-10-2143
FAIR
Dumontier Lab (Maastricht University, Stanford University, Carleton University)
MU: Seun Adekunle, Thales Bertaglia, Remzi Celebi, Yenisel Calana, Ricardo De Miranda Azevedo, Vincent Emonet, Lars Jacobs, Andreea Grigoriu,
Carlos Guerrero, Tim Hendriks, Massimiliano Grassi, Andine Havelange, Pedro Hernandez Serrano, Vikas Jaiman, Parveen Kumar, Lianne Ippel,
Alexander Malic, Helder Monteiro, Stefan Meier, Kody Moodley, Stuti Nayak, Hercules Panoutsopoulos, Linda Rieswijk, Carola Roubin, Nadine
Rouleaux, Claudia van open, Chang Sun, Johan van Soest, Binosha Weerarathna, Turgay Saba, Weiwei Wang, Jinzhou Yang, Amrapali Zaveri, Leto Peel,
Rohan Nanda, Visara Urovi, Andre Dekker, David Townend, Gijs van Dijck, Christopher Brewster
SU: Sandeep Ayyar, Remzi Celebi, Shima Dastgheib, Maulik Kamdar, David Odgers, Maryam Panahiazar, Amrapali Zaveri
CU: Alison Callahan, Jose Toledo-Cruz, Natalia Villaneuva-Rosales
michel.dumontier@maastrichtuniversity.nl
Website: http://maastrichtuniversity.nl/ids
44 @micheldumontier::CIKM:2020-10-21
The mission of the Institute of Data Science at Maastricht University is to foster a
collaborative environment for multi-disciplinary data science research,
interdisciplinary training, and data-driven innovation .
We tackle key scientific, technical, social, legal, ethical issues that advance our
understanding across a variety of disciplines and strengthen our communities in the
face of these developments.

More Related Content

What's hot

Blockchain in Health Research Overview - Manion
Blockchain in Health Research Overview - ManionBlockchain in Health Research Overview - Manion
Blockchain in Health Research Overview - Manion
Sean Manion PhD
 
Big Data Analytics government healthcare
Big Data Analytics government healthcareBig Data Analytics government healthcare
Big Data Analytics government healthcare
Data Science Thailand
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
Michel Dumontier
 
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Pistoia Alliance
 
LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)
Wolfgang Greller
 
Blockchain and Patient-Centered Outcomes Measures - Goldwater
Blockchain and Patient-Centered Outcomes Measures - GoldwaterBlockchain and Patient-Centered Outcomes Measures - Goldwater
Blockchain and Patient-Centered Outcomes Measures - Goldwater
Sean Manion PhD
 
Data safe havens: A future EOSC service?
Data safe havens: A future EOSC service?Data safe havens: A future EOSC service?
Data safe havens: A future EOSC service?
EUDAT
 
Big Data technology
Big Data technologyBig Data technology
Big Data technology
Nicolae Sfetcu
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019
Kees van Bochove
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Michel Dumontier
 
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian networkImpact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
IJECEIAES
 
Promoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analyticsPromoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analytics
Jisc
 
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
Edward Curry
 
Open science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The HyveOpen science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The Hyve
Kees van Bochove
 
How Big Data Ecosystems Work
How Big Data Ecosystems WorkHow Big Data Ecosystems Work
How Big Data Ecosystems Work
Edward Curry
 
Distributed Ledger Tech Applications - Health Report V1.6
Distributed Ledger Tech Applications - Health Report V1.6Distributed Ledger Tech Applications - Health Report V1.6
Distributed Ledger Tech Applications - Health Report V1.6
Sean Manion PhD
 
Key Technology Trends for Big Data in Europe
Key Technology Trends for Big Data in EuropeKey Technology Trends for Big Data in Europe
Key Technology Trends for Big Data in Europe
Edward Curry
 
Differential Privacy for Information Retrieval
Differential Privacy for Information RetrievalDifferential Privacy for Information Retrieval
Differential Privacy for Information Retrieval
Grace Hui Yang
 
Big Data: Beyond the hype, Delivering value
Big Data: Beyond the hype, Delivering valueBig Data: Beyond the hype, Delivering value
Big Data: Beyond the hype, Delivering value
Edward Curry
 
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Databricks
 

What's hot (20)

Blockchain in Health Research Overview - Manion
Blockchain in Health Research Overview - ManionBlockchain in Health Research Overview - Manion
Blockchain in Health Research Overview - Manion
 
Big Data Analytics government healthcare
Big Data Analytics government healthcareBig Data Analytics government healthcare
Big Data Analytics government healthcare
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
 
Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018Data quality supporting AI in Life Sciences webinar 10 dec 2018
Data quality supporting AI in Life Sciences webinar 10 dec 2018
 
LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)LAK16 privacy and analytics (2016)
LAK16 privacy and analytics (2016)
 
Blockchain and Patient-Centered Outcomes Measures - Goldwater
Blockchain and Patient-Centered Outcomes Measures - GoldwaterBlockchain and Patient-Centered Outcomes Measures - Goldwater
Blockchain and Patient-Centered Outcomes Measures - Goldwater
 
Data safe havens: A future EOSC service?
Data safe havens: A future EOSC service?Data safe havens: A future EOSC service?
Data safe havens: A future EOSC service?
 
Big Data technology
Big Data technologyBig Data technology
Big Data technology
 
Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019Clinical Data Models - The Hyve - Bio IT World April 2019
Clinical Data Models - The Hyve - Bio IT World April 2019
 
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
Accelerating Biomedical Research with the Emerging Internet of FAIR Data and ...
 
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian networkImpact of big data congestion in IT: An adaptive knowledgebased Bayesian network
Impact of big data congestion in IT: An adaptive knowledgebased Bayesian network
 
Promoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analyticsPromoting an ethical and GDPR-compliant approach to learning analytics
Promoting an ethical and GDPR-compliant approach to learning analytics
 
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
From Data Platforms to Dataspaces: Enabling Data Ecosystems for Intelligent S...
 
Open science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The HyveOpen science and medical evidence generation - Kees van Bochove - The Hyve
Open science and medical evidence generation - Kees van Bochove - The Hyve
 
How Big Data Ecosystems Work
How Big Data Ecosystems WorkHow Big Data Ecosystems Work
How Big Data Ecosystems Work
 
Distributed Ledger Tech Applications - Health Report V1.6
Distributed Ledger Tech Applications - Health Report V1.6Distributed Ledger Tech Applications - Health Report V1.6
Distributed Ledger Tech Applications - Health Report V1.6
 
Key Technology Trends for Big Data in Europe
Key Technology Trends for Big Data in EuropeKey Technology Trends for Big Data in Europe
Key Technology Trends for Big Data in Europe
 
Differential Privacy for Information Retrieval
Differential Privacy for Information RetrievalDifferential Privacy for Information Retrieval
Differential Privacy for Information Retrieval
 
Big Data: Beyond the hype, Delivering value
Big Data: Beyond the hype, Delivering valueBig Data: Beyond the hype, Delivering value
Big Data: Beyond the hype, Delivering value
 
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
Apache Spark + AI Helps and FDA Protects the Nation with Jonathan Chu and Kun...
 

Similar to CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR data and services

Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...
Michel Dumontier
 
Business with Big data
Business with Big dataBusiness with Big data
Business with Big data
Bruno Curtarelli
 
Vision 2030: A Connected Future
Vision 2030: A Connected FutureVision 2030: A Connected Future
Vision 2030: A Connected Future
Wipro Digital
 
Vision 2030: A Connected Future
Vision 2030: A Connected FutureVision 2030: A Connected Future
Vision 2030: A Connected Future
Wipro Digital
 
TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014
MOTC Qatar
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
Michel Dumontier
 
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...Fatemeh Ahmadi
 
IRJET- Building a Big Data Provenance with its Applications for Smart Cities
IRJET- Building a Big Data Provenance with its Applications for Smart CitiesIRJET- Building a Big Data Provenance with its Applications for Smart Cities
IRJET- Building a Big Data Provenance with its Applications for Smart Cities
IRJET Journal
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor networkparry prabhu
 
Cloud Based Services and their Security Evaluation in the Hospitals
Cloud Based Services and their Security Evaluation in the HospitalsCloud Based Services and their Security Evaluation in the Hospitals
Cloud Based Services and their Security Evaluation in the Hospitals
ijtsrd
 
Power from big data - Are Europe's utilities ready for the age of data?
Power from big data - Are Europe's utilities ready for the age of data?Power from big data - Are Europe's utilities ready for the age of data?
Power from big data - Are Europe's utilities ready for the age of data?Steve Bray
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
johnmutiso245
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
johnmutiso245
 
Data driven innovation for growth and well being
Data driven innovation for growth and well beingData driven innovation for growth and well being
Data driven innovation for growth and well being
innovationoecd
 
Healthcare in Digital Age
Healthcare in Digital Age Healthcare in Digital Age
Healthcare in Digital Age
ict moph
 
Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8
Carlo Vaccari
 
Operational Research Society - annual analytics summit 2017
Operational Research Society - annual analytics summit 2017Operational Research Society - annual analytics summit 2017
Operational Research Society - annual analytics summit 2017
Peter Wells
 

Similar to CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR data and services (20)

Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...
 
Business with Big data
Business with Big dataBusiness with Big data
Business with Big data
 
Vision 2030: A Connected Future
Vision 2030: A Connected FutureVision 2030: A Connected Future
Vision 2030: A Connected Future
 
Vision 2030: A Connected Future
Vision 2030: A Connected FutureVision 2030: A Connected Future
Vision 2030: A Connected Future
 
TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
 
Lecture week 5 -
Lecture week 5 -Lecture week 5 -
Lecture week 5 -
 
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
Open Data-Driven Innovation and Smart Cities_Open Data Business Model and Pat...
 
IRJET- Building a Big Data Provenance with its Applications for Smart Cities
IRJET- Building a Big Data Provenance with its Applications for Smart CitiesIRJET- Building a Big Data Provenance with its Applications for Smart Cities
IRJET- Building a Big Data Provenance with its Applications for Smart Cities
 
wireless sensor network
wireless sensor networkwireless sensor network
wireless sensor network
 
Cloud Based Services and their Security Evaluation in the Hospitals
Cloud Based Services and their Security Evaluation in the HospitalsCloud Based Services and their Security Evaluation in the Hospitals
Cloud Based Services and their Security Evaluation in the Hospitals
 
Power from big data - Are Europe's utilities ready for the age of data?
Power from big data - Are Europe's utilities ready for the age of data?Power from big data - Are Europe's utilities ready for the age of data?
Power from big data - Are Europe's utilities ready for the age of data?
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
 
big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...big data on science of analytics and innovativeness among udergraduate studen...
big data on science of analytics and innovativeness among udergraduate studen...
 
Data driven innovation for growth and well being
Data driven innovation for growth and well beingData driven innovation for growth and well being
Data driven innovation for growth and well being
 
Healthcare in Digital Age
Healthcare in Digital Age Healthcare in Digital Age
Healthcare in Digital Age
 
Big Data a Catalunya
Big Data a CatalunyaBig Data a Catalunya
Big Data a Catalunya
 
Big Data a Catalunya
Big Data a CatalunyaBig Data a Catalunya
Big Data a Catalunya
 
Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8Sharing Advisory Board newsletter #8
Sharing Advisory Board newsletter #8
 
Operational Research Society - annual analytics summit 2017
Operational Research Society - annual analytics summit 2017Operational Research Society - annual analytics summit 2017
Operational Research Society - annual analytics summit 2017
 

More from Michel Dumontier

FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
Michel Dumontier
 
A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
Michel Dumontier
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health System
Michel Dumontier
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University Dinner
Michel Dumontier
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star Lecture
Michel Dumontier
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
Michel Dumontier
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
Michel Dumontier
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
Michel Dumontier
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
Michel Dumontier
 
Ontologies
OntologiesOntologies
Ontologies
Michel Dumontier
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
Michel Dumontier
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
Michel Dumontier
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
Michel Dumontier
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Michel Dumontier
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
Michel Dumontier
 
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMaking the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Michel Dumontier
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description Guidelines
Michel Dumontier
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Michel Dumontier
 
1st Network-of-BioThings Hackathon
1st Network-of-BioThings Hackathon1st Network-of-BioThings Hackathon
1st Network-of-BioThings Hackathon
Michel Dumontier
 

More from Michel Dumontier (19)

FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
 
A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health System
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University Dinner
 
The future of science and business - a UM Star Lecture
The future of science and business - a UM Star LectureThe future of science and business - a UM Star Lecture
The future of science and business - a UM Star Lecture
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 
Ontologies
OntologiesOntologies
Ontologies
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
 
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMaking the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discovery
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description Guidelines
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
 
1st Network-of-BioThings Hackathon
1st Network-of-BioThings Hackathon1st Network-of-BioThings Hackathon
1st Network-of-BioThings Hackathon
 

Recently uploaded

原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
3ipehhoa
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Sanjeev Rampal
 
Latest trends in computer networking.pptx
Latest trends in computer networking.pptxLatest trends in computer networking.pptx
Latest trends in computer networking.pptx
JungkooksNonexistent
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
ufdana
 
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC
 
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
3ipehhoa
 
This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!
nirahealhty
 
guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...
Rogerio Filho
 
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
keoku
 
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shopHistory+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
laozhuseo02
 
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptxInternet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
VivekSinghShekhawat2
 
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdfJAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
Javier Lasa
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
eutxy
 
Comptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guideComptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guide
GTProductions1
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Brad Spiegel Macon GA
 
1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...
JeyaPerumal1
 
BASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptxBASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptx
natyesu
 
The+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptxThe+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptx
laozhuseo02
 
test test test test testtest test testtest test testtest test testtest test ...
test test  test test testtest test testtest test testtest test testtest test ...test test  test test testtest test testtest test testtest test testtest test ...
test test test test testtest test testtest test testtest test testtest test ...
Arif0071
 
How to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptxHow to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptx
Gal Baras
 

Recently uploaded (20)

原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
原版仿制(uob毕业证书)英国伯明翰大学毕业证本科学历证书原版一模一样
 
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and GuidelinesMulti-cluster Kubernetes Networking- Patterns, Projects and Guidelines
Multi-cluster Kubernetes Networking- Patterns, Projects and Guidelines
 
Latest trends in computer networking.pptx
Latest trends in computer networking.pptxLatest trends in computer networking.pptx
Latest trends in computer networking.pptx
 
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
一比一原版(CSU毕业证)加利福尼亚州立大学毕业证成绩单专业办理
 
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
APNIC Foundation, presented by Ellisha Heppner at the PNG DNS Forum 2024
 
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
1比1复刻(bath毕业证书)英国巴斯大学毕业证学位证原版一模一样
 
This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!This 7-second Brain Wave Ritual Attracts Money To You.!
This 7-second Brain Wave Ritual Attracts Money To You.!
 
guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...guildmasters guide to ravnica Dungeons & Dragons 5...
guildmasters guide to ravnica Dungeons & Dragons 5...
 
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
一比一原版(SLU毕业证)圣路易斯大学毕业证成绩单专业办理
 
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shopHistory+of+E-commerce+Development+in+China-www.cfye-commerce.shop
History+of+E-commerce+Development+in+China-www.cfye-commerce.shop
 
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptxInternet-Security-Safeguarding-Your-Digital-World (1).pptx
Internet-Security-Safeguarding-Your-Digital-World (1).pptx
 
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdfJAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
JAVIER LASA-EXPERIENCIA digital 1986-2024.pdf
 
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
一比一原版(LBS毕业证)伦敦商学院毕业证成绩单专业办理
 
Comptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guideComptia N+ Standard Networking lesson guide
Comptia N+ Standard Networking lesson guide
 
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptxBridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
Bridging the Digital Gap Brad Spiegel Macon, GA Initiative.pptx
 
1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...1.Wireless Communication System_Wireless communication is a broad term that i...
1.Wireless Communication System_Wireless communication is a broad term that i...
 
BASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptxBASIC C++ lecture NOTE C++ lecture 3.pptx
BASIC C++ lecture NOTE C++ lecture 3.pptx
 
The+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptxThe+Prospects+of+E-Commerce+in+China.pptx
The+Prospects+of+E-Commerce+in+China.pptx
 
test test test test testtest test testtest test testtest test testtest test ...
test test  test test testtest test testtest test testtest test testtest test ...test test  test test testtest test testtest test testtest test testtest test ...
test test test test testtest test testtest test testtest test testtest test ...
 
How to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptxHow to Use Contact Form 7 Like a Pro.pptx
How to Use Contact Form 7 Like a Pro.pptx
 

CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR data and services

  • 1. Accelerating Discovery Science with an Internet of FAIR Data and Services @micheldumontier::CIKM:2020-10-211 Michel Dumontier, Ph.D. Distinguished Professor of Data Science Director, Institute of Data Science
  • 2. @micheldumontier::CIKM:2020-10-212 The world is awash with vast amounts of data
  • 4. 4 A common rejection module (CRM) for acute rejection across multiple organs identifies novel therapeutics for organ transplantation Khatri et al. JEM. 210 (11): 2205 DOI: 10.1084/jem.20122709 @micheldumontier::CIKM:2020-10-21 Main Findings: 1. CRM of 11 overexpressed genes predicted future injury to a graft 2. Mice treated with existing drugs against specific CRM genes extended graft survival 3. Retrospective EHR data analysis supports treatment prediction Key Observations: 1. Meta-analysis offers a more reliable estimate of the magnitude of the effect 2. Data can be used to generate and support/dispute new hypotheses
  • 5. However, significant effort is still needed to find the right dataset(s), make sense of them, and use for a new purpose @micheldumontier::CIKM:2020-10-215
  • 7. 7 @micheldumontier::CIKM:2020-10-21 Our ability to reproduce landmark studies is surprisingly low: 39% (39/100) in psychology1 21% (14/67) in pharmacology2 11% (6/53) in cancer3 unsatisfactory in machine learning4 1doi:10.1038/nature.2015.17433 2doi:10.1038/nrd3439-c1 3doi:10.1038/483531a 4https://openreview.net/pdf?id=By4l2PbQ- Most published research findings are false. - John Ioannidis, Stanford University PLoS Med 2005;2(8): e124.
  • 9. 9 What hope do we really have to realize ?
  • 10. @micheldumontier::CIKM:2020-10-2110 It’s time to completely rethink how we perform research
  • 11. @micheldumontier::CIKM:2020-10-2111 Poor quality (meta)data Reproducibility Crisis Translational Failure Broken windows theory Inadequate reusability theory visible signs of crime, anti- social behavior, and civil disorder create an environment that encourages more serious crimes Poor quality metadata and the inaccessibility of original research results make it less likely to reproduce original work, resulting in an ineffective translation of research into useful applications
  • 12. @micheldumontier::CIKM:2020-10-2112 It’s time to completely rethink how we perform research (and how we document and report it)
  • 13. @micheldumontier::CIKM:2020-10-2113 Lambin et al. Radiother Oncol. 2013. 109(1):159-64. doi: 10.1016/j.radonc.2013.07.007
  • 14. Rethinking Publishing Scientific Research @micheldumontier::CIKM:2020-10-2114 Data Science. 2017 1(1-2):139-154. DOI: 10.3233/DS-170010 http://www.tkuhn.org/pub/sempub/
  • 15. De-centralized knowledge graphs @micheldumontier::CIKM:2020-10-2115 Kuhn T., Chichester C., Krauthammer M., Dumontier M. (2015) Publishing Without Publishers: A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data. In: Arenas M. et al. (eds) The Semantic Web - ISWC 2015. ISWC 2015. Lecture Notes in Computer Science, vol 9366. Springer, Cham
  • 16. We need a new social contract, supported by legal and technological infrastructure to make digital resources available in a responsible manner @micheldumontier::CIKM:2020-10-2116
  • 17. Human Machine collaboration will be crucial to our future success @micheldumontier::CIKM:2020-10-2117
  • 19. An international, bottom-up paradigm for the discovery and reuse of digital content for the machines that people use @micheldumontier::CIKM:2020-10-2119
  • 21. FAIR in a nutshell FAIR aims to enhance social and economic outcomes by facilitating the discovery and reuse of digital resources through key requirements: – unique identifiers to distinguish and retrieve all forms of digital content and knowledge – high quality meta(data) to enhance discovery of relevant digital resources – use of common vocabularies to facilitate query and statistical analysis – establishment of community standards to reduce the effort in data reuse – detailed provenance to provide adequate context and to enable reproducibility – registered in appropriate repositories to fulfill a promise to future content seekers – simpler terms of use to clarify expectations and intensify innovation – social and technological commitments to make data ready for intelligent applications @micheldumontier::CIKM:2020-10-2121
  • 23. @micheldumontier::CIKM:2020-10-2123 The lack of FAIR data costs the European Economy a minimum of €10.2bn per year EC:DG R&I; PWC 2018 Report: Cost-benefit analysis for FAIR research data
  • 24. Why Should *you* Go FAIR? • Makes it easier for to use your own data for a new purpose • Makes it easier for other people to find, use and cite your data, and for them to understand what you expect in return • Makes it easier/possible for people to verify your work • Ensure that the data are available in the future, especially as you may not want the responsibility • Satisfy the expectations around data management from institution, funding agency, journal, my peers @micheldumontier::CIKM:2020-10-2124
  • 25. Let’s build and use the Internet of FAIR data and services @micheldumontier::CIKM:2020-10-2125
  • 26. FAIRification process @micheldumontier::CIKM:2020-10-2126 GO FAIR Fairification: https://www.go-fair.org/fair-principles/fairification-process/ FAIRplus FAIR cookbook: https://fairplus.github.io/cookbook-dev/intro.html Utrecht FAIR: https://www.uu.nl/en/research/research-data-management/guides/how-to-make-your-data-fair EC H2020 Guidelines: https://ec.europa.eu/research/participants/data/ref/h2020/grants_manual/hi/oa_pilot/h2020-hi-oa-data-mgt_en.pdf
  • 29. 29
  • 30. The Semantic Web is a portal to the web of knowledge 30 @micheldumontier::CIKM:2020-10-21 standards for publishing, sharing and querying facts, expert knowledge and services scalable approach for the discovery of independently constructed, collaboratively described, distributed knowledge (in principle)
  • 32. @micheldumontier::PLDN:2020-01-2232 Success depends on quality of metadataSearch registries for relevant datasets
  • 33. @micheldumontier::PLDN:2020-01-2233 Metadata identifier Resource identifier Standardized, machine readable format Use of community vocabularies License? Provenance?
  • 35. • 30+ biomedical data sources • 10B+ interlinked statements • EBI, SIB, NCBI, DBCLS, NCBO, and many others produce this content chemicals/drugs/formulations, genomes/genes/proteins, domains Interactions, complexes & pathways animal models and phenotypes Disease, genetic markers, treatments Terminologies & publications 35 Alison Callahan, Jose Cruz-Toledo, Peter Ansell, Michel Dumontier: Bio2RDF Release 2: Improved Coverage, Interoperability and Provenance of Life Science Linked Data. ESWC 2013: 200-212 Linked Data for the Life Sciences Bio2RDF is an open source project that uses semantic web technologies to make it easier to reuse biomedical data @micheldumontier::CIKM:2020-10-21
  • 36. Query multiple databases on the biological web of data @micheldumontier::CIKM:2020-10-2136 Phenotypes of knock-out mouse models for the targets of a selected drug (Imatinib)
  • 37. @micheldumontier::CIKM:2020-10-2137 Explore we know, and formulate hypotheses about what we don’t Finding melanoma drugs through a probabilistic knowledge graph. PeerJ Computer Science. 2017. 3:e106 https://doi.org/10.7717/peerj-cs.106 by exploring a probabilistic semantic knowledge graph And validate them against pipelines for drug discovery
  • 38. Reproduce original research @micheldumontier::CIKM:2020-10-2138 AUC 0.91 across all therapeutic indications Scripts not available. Feature tables available. Result: AUC 0.83 … doesn’t match! (but now you can see what exactly we did) Towards FAIR protocols and workflows: the OpenPREDICT use case. 2020. PeerJ Computer Science 6:e281 https://doi.org/10.7717/peerj-cs.281
  • 39. Explore disease pathophysiology and treatment @micheldumontier::CIKM:2020-10-2139
  • 40. Mine distributed, access restricted FAIR datasets in a privacy preserving manner Maastricht Study + MUMC CBS Goal is to learn high confidence determinants of health in a privacy preserving manner over vertically partitioned data from the Maastricht Study and Statistics Netherlands. The data are made available through FAIR data stations that provide access to allowable subsets of data to authorized users of approved algorithms. Establish a new social, legal, ethical and technological infrastructure for discovery science in and across health and non-health settings, including scalable governance and flexible consent to underpin the responsible use of Big Data. @micheldumontier::CIKM:2020-10-2140 s
  • 41. FAIR data and services to accelerate discovery science @micheldumontier::CIKM:2020-10-2141
  • 42. Summary FAIR represents a global initiative to enhance the discovery and reuse of all kinds of digital resources. It is a work in progress and it needs you! FAIR requires new social, legal, ethical, scientific and technological infrastructure: – How does your research group or community make their data/findings FAIR? – What support does your organization provide you? – Are you making use of all the data and findings that you could? – What is responsible data science and artificial intelligence? Semantics, coupled with AI technologies, may enable humans, aided by intelligent machine agents, to exploit the Internet of FAIR data and services, and hence to accelerate discovery in biomedicine and in other disciplines. @micheldumontier::CIKM:2020-10-2142
  • 43. Acknowledgements @micheldumontier::CIKM:2020-10-2143 FAIR Dumontier Lab (Maastricht University, Stanford University, Carleton University) MU: Seun Adekunle, Thales Bertaglia, Remzi Celebi, Yenisel Calana, Ricardo De Miranda Azevedo, Vincent Emonet, Lars Jacobs, Andreea Grigoriu, Carlos Guerrero, Tim Hendriks, Massimiliano Grassi, Andine Havelange, Pedro Hernandez Serrano, Vikas Jaiman, Parveen Kumar, Lianne Ippel, Alexander Malic, Helder Monteiro, Stefan Meier, Kody Moodley, Stuti Nayak, Hercules Panoutsopoulos, Linda Rieswijk, Carola Roubin, Nadine Rouleaux, Claudia van open, Chang Sun, Johan van Soest, Binosha Weerarathna, Turgay Saba, Weiwei Wang, Jinzhou Yang, Amrapali Zaveri, Leto Peel, Rohan Nanda, Visara Urovi, Andre Dekker, David Townend, Gijs van Dijck, Christopher Brewster SU: Sandeep Ayyar, Remzi Celebi, Shima Dastgheib, Maulik Kamdar, David Odgers, Maryam Panahiazar, Amrapali Zaveri CU: Alison Callahan, Jose Toledo-Cruz, Natalia Villaneuva-Rosales
  • 44. michel.dumontier@maastrichtuniversity.nl Website: http://maastrichtuniversity.nl/ids 44 @micheldumontier::CIKM:2020-10-21 The mission of the Institute of Data Science at Maastricht University is to foster a collaborative environment for multi-disciplinary data science research, interdisciplinary training, and data-driven innovation . We tackle key scientific, technical, social, legal, ethical issues that advance our understanding across a variety of disciplines and strengthen our communities in the face of these developments.

Editor's Notes

  1. Abstract Using meta-analysis of eight independent transplant datasets (236 graft biopsy samples) from four organs, we identified a common rejection module (CRM) consisting of 11 genes that were significantly overexpressed in acute rejection (AR) across all transplanted organs. The CRM genes could diagnose AR with high specificity and sensitivity in three additional independent cohorts (794 samples). In another two independent cohorts (151 renal transplant biopsies), the CRM genes correlated with the extent of graft injury and predicted future injury to a graft using protocol biopsies. Inferred drug mechanisms from the literature suggested that two FDA-approved drugs (atorvastatin and dasatinib), approved for nontransplant indications, could regulate specific CRM genes and reduce the number of graft-infiltrating cells during AR. We treated mice with HLA-mismatched mouse cardiac transplant with atorvastatin and dasatinib and showed reduction of the CRM genes, significant reduction of graft-infiltrating cells, and extended graft survival. We further validated the beneficial effect of atorvastatin on graft survival by retrospective analysis of electronic medical records of a single-center cohort of 2,515 renal transplant patients followed for up to 22 yr. In conclusion, we identified a CRM in transplantation that provides new opportunities for diagnosis, drug repositioning, and rational drug design.
  2. Cost-benefit analysis for FAIR research data https://op.europa.eu/en/publication-detail/-/publication/d375368c-1a0a-11e9-8d04-01aa75ed71a1/language-en/format-PDF/source-161880070
  3. The Bio2RDF project transforms silos of life science data into a globally distributed network of linked data for biological knowledge discovery.