SlideShare a Scribd company logo
1 of 42
Accelerating Biomedical Research
with the Emerging Internet of FAIR Data and Services
@micheldumontier::Montpellier:2019-05-271
Michel Dumontier, Ph.D.
Distinguished Professor of Data Science
Director, Institute of Data Science
An increasing number of discoveries
are data-driven
@micheldumontier::Montpellier:2019-05-272
3
A common rejection module (CRM) for acute rejection across multiple organs identifies novel
therapeutics for organ transplantation
Khatri et al. JEM. 210 (11): 2205
DOI: 10.1084/jem.20122709
@micheldumontier::Montpellier:2019-05-27
Main Findings:
1. CRM genes predicted future injury to a graft
2. Mice treated with drugs against the CRM genes extended graft survival
3. Retrospective EHR analysis supports treatment prediction
Key Observations:
1. Meta-analysis offers a more reliable estimate of the magnitude of the effect
2. Data can be used to generate and support/dispute new hypotheses
However, significant effort is
still needed to find the right
datasets, make sense of them,
and ultimately use them for a
new purpose
@micheldumontier::Montpellier:2019-05-274
metadata is key to find and evaluate content
@micheldumontier::Montpellier:2019-05-275
@micheldumontier::Montpellier:2019-05-276
Poor quality experimental metadata frustrates reuse
7 @micheldumontier::Montpellier:2019-05-27
Reproducing landmark studies remains challenging:
39% (39/100) in psychology1
21% (14/67) in pharmacology2
11% (6/53) in cancer3
1doi:10.1038/nature.2015.17433 2doi:10.1038/nrd3439-c1 3doi:10.1038/483531a
@micheldumontier::Montpellier:2019-05-278
@micheldumontier::Montpellier:2019-05-279
we need to completely rethink
how we perform
biomedical research
@micheldumontier::Montpellier:2019-05-2710
Lambin et al. Radiother Oncol. 2013. 109(1):159-64. doi: 10.1016/j.radonc.2013.07.007
The Future is Human Machine Collaboration
@micheldumontier::Montpellier:2019-05-2711
12
We need a new social contract,
supported by legal and technological
infrastructure to make digital
resources available to
people and the machines they use
@micheldumontier::Montpellier:2019-05-2713
@micheldumontier::Montpellier:2019-05-2714
An international, bottom-up paradigm for
the discovery and reuse of digital content
for people and the machines that they use
@micheldumontier::Montpellier:2019-05-2715
@micheldumontier::Montpellier:2019-05-2716
http://www.nature.com/articles/sdata201618
@micheldumontier::Montpellier:2019-05-2717
FAIR: Impact
FAIR in a nutshell
FAIR aims to create social and economic impact by facilitating the
discovery and reuse of digital resources through a set of basic
requirements:
– unique identifiers to retrieve all forms of digital content and knowledge
– high quality meta(data) to enhance discovery of digital resources
– use of common vocabularies to create shared meaning and facilitate search
– adherence to community standards for common representations
– detailed provenance to provide context and facilitate reproducibility
– registered in appropriate repositories to make sure they can be found
– social and technological commitments to realize reliable access
– simpler terms of use to clarify expectations and intensify innovation
@micheldumontier::Montpellier:2019-05-2718
@micheldumontier::Montpellier:2019-05-2719
FAIR does not imply Open.
Open as possible
closed as is necessary
Improving the FAIRness of digital resources will
increase their potential for reuse
@micheldumontier::Montpellier:2019-05-2720
Let’s build the Internet of FAIR data and services
@micheldumontier::Montpellier:2019-05-2721
Your invitation to participate
https://osf.io/n7uwp/
erik.schultes@go-fair.org
22
@micheldumontier::Montpellier:2019-05-2723
@micheldumontier::Montpellier:2019-05-2724
The Semantic Web
is a portal to the web of knowledge
25 @micheldumontier::Montpellier:2019-05-27
standards for publishing, sharing and querying
facts, expert knowledge and services
scalable approach for the discovery
of independently constructed,
collaboratively described,
distributed knowledge
The semantic web community has built a massive
open and decentralized knowledge graph
26 @micheldumontier::Montpellier:2019-05-27
• 30+ biomedical data sources
• 10B+ interlinked statements
• EBI, SIB, NCBI, DBCLS, NCBO, and many others
produce this content
chemicals/drugs/formulations,
genomes/genes/proteins, domains
Interactions, complexes & pathways
animal models and phenotypes
Disease, genetic markers, treatments
Terminologies & publications
27
Alison Callahan, Jose Cruz-Toledo, Peter Ansell, Michel Dumontier:
Bio2RDF Release 2: Improved Coverage, Interoperability and
Provenance of Life Science Linked Data. ESWC 2013: 200-212
Linked Data for the Life Sciences
Bio2RDF is an open source project that uses semantic web
technologies to make it easier to reuse biomedical data
@micheldumontier::Montpellier:2019-05-27
Query the distributed web of data
@micheldumontier::Montpellier:2019-05-2728
Phenotypes of
knock-out
mouse models
for the targets
of a selected
drug (Imatinib)
Find and explore data with effective user interfaces
@micheldumontier::Montpellier:2019-05-2729
Disclosure: I’m an advisor to OntoForce
Examine the provenance behind the facts
@micheldumontier::Montpellier:2019-05-2730
Disclosure: I’m an advisor to OntoForce
Make your work easier to reproduce
@micheldumontier::Montpellier:2019-05-2731
AUC 0.91 across all therapeutic indications
Scripts not available. Feature tables available.
Result: ROCAUC 0.831 doesn’t quite match
@micheldumontier::Montpellier:2019-05-2732
@micheldumontier::Montpellier:2019-05-2733
Find new uses for existing drugs
Finding melanoma drugs through a probabilistic knowledge graph.
PeerJ Computer Science. 2017. 3:e106 https://doi.org/10.7717/peerj-cs.106
by exploring a probabilistic
semantic knowledge graph
And validate them against
pipelines for drug discovery
Analyzing partitioned FAIR health data responsibly
Maastricht Study + MUMC CBS
Goal is to learn high confidence determinants of health in a privacy preserving
manner over vertically partitioned FAIR data from the Maastricht Study and
Statistics Netherlands.
Establish a new social, legal, ethical and technological infrastructure for discovery
science in and across health and non-health settings, including scalable
governance and flexible consent to underpin the responsible use of Big Data.
@micheldumontier::Montpellier:2019-05-2734
Unifying API data
with Linked Open Data
35 @micheldumontier::Montpellier:2019-05-27
API
API
@micheldumontier::Montpellier:2019-05-2736
@micheldumontier::Montpellier:2019-05-2737
Towards Genuine Semantic Publishing
@micheldumontier::Montpellier:2019-05-2738
Automated FAIRness Assessments
• Powered using smartAPI and
semantic web technologies
• Harvests a diverse set of
metadata through HTTP
operations and links in
documents
• Open source and extensible!
39
http://W3id.org/AmIFAIR
Things to think about
• Making data FAIR suffers from a lack of incentives. Maybe data needs to be
stored, before it can be analyzed? How can data generators readily see the
impact of their contributions?
• Making data FAIR is time consuming. To what extent can we automate
this? Can non-expert workers reduce the time? Can we make more data
FAIR at the moment it is generated?
• Making data FAIR requires collaboration. How can we more efficiently
create and sustain communities to establish and disseminate best
practices?
• Making data FAIR is expensive. Some funding agencies (e.g. Horizon2020)
are exploring how to make research data management a budget line item
@micheldumontier::Montpellier:2019-05-2740
Summary
• FAIR represents a global initiative to enhance the discovery and reuse of all
kinds of digital resources which will also help address the reproducibility crisis
• It demands a new social, legal and technological infrastructure that currently
doesn’t exist in whole, but has to be built for and tested by various
communities!
• The FAIR concept is transforming into new processes, behaviours and
platforms.
• Huge benefits to be had, particularly in augmenting existing research
programs and in automated machine processing, but needs to be coupled
with the proper technical and ethical training.
@micheldumontier::FAIR:2019-05-2441
michel.dumontier@maastrichtuniversity.nl
Website: http://maastrichtuniversity.nl/ids
42 @micheldumontier::FAIR:2019-05-24
The mission of the Institute of Data Science at Maastricht University is to foster a
collaborative environment for multi-disciplinary data science research,
interdisciplinary training, and data-driven innovation .
We tackle key scientific, technical, social, legal, ethical issues that advance our
understanding across a variety of disciplines and strengthen our communities in the
face of these developments.

More Related Content

Similar to Accelerating Biomedical Research with the Emerging Internet of FAIR Data and Services

Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...Platform Linked Data Netherlands (PLDN)
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Michel Dumontier
 
Fake News Detection Using Machine Learning
Fake News Detection Using Machine LearningFake News Detection Using Machine Learning
Fake News Detection Using Machine LearningIRJET Journal
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesMichel Dumontier
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsMichel Dumontier
 
acatech_STUDY_Internet_Privacy_WEB
acatech_STUDY_Internet_Privacy_WEBacatech_STUDY_Internet_Privacy_WEB
acatech_STUDY_Internet_Privacy_WEBJaina Hirai
 
Why B2B should embrace IoE
Why B2B should embrace IoEWhy B2B should embrace IoE
Why B2B should embrace IoEJerome Petit
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRMichel Dumontier
 
Mining Social Media Data for Understanding Drugs Usage
Mining Social Media Data for Understanding Drugs  UsageMining Social Media Data for Understanding Drugs  Usage
Mining Social Media Data for Understanding Drugs UsageIRJET Journal
 
Sgd emerging -manufacturing-12-oct 2018
Sgd emerging -manufacturing-12-oct 2018 Sgd emerging -manufacturing-12-oct 2018
Sgd emerging -manufacturing-12-oct 2018 Sanjeev Deshmukh
 
Good Practices and Recommendations on the Security and Resilience of Big Data...
Good Practices and Recommendations on the Security and Resilience of Big Data...Good Practices and Recommendations on the Security and Resilience of Big Data...
Good Practices and Recommendations on the Security and Resilience of Big Data...Eftychia Chalvatzi
 
CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECAProject
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET Journal
 
Innovation series 112318
Innovation series 112318Innovation series 112318
Innovation series 112318Tim Maurer
 
TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014MOTC Qatar
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euEUDAT
 

Similar to Accelerating Biomedical Research with the Emerging Internet of FAIR Data and Services (20)

Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...
 
Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?Are we FAIR yet? And will it be worth it?
Are we FAIR yet? And will it be worth it?
 
Fake News Detection Using Machine Learning
Fake News Detection Using Machine LearningFake News Detection Using Machine Learning
Fake News Detection Using Machine Learning
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resources
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
 
acatech_STUDY_Internet_Privacy_WEB
acatech_STUDY_Internet_Privacy_WEBacatech_STUDY_Internet_Privacy_WEB
acatech_STUDY_Internet_Privacy_WEB
 
Why B2B should embrace IoE
Why B2B should embrace IoEWhy B2B should embrace IoE
Why B2B should embrace IoE
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 
Mining Social Media Data for Understanding Drugs Usage
Mining Social Media Data for Understanding Drugs  UsageMining Social Media Data for Understanding Drugs  Usage
Mining Social Media Data for Understanding Drugs Usage
 
Sgd emerging -manufacturing-12-oct 2018
Sgd emerging -manufacturing-12-oct 2018 Sgd emerging -manufacturing-12-oct 2018
Sgd emerging -manufacturing-12-oct 2018
 
Good Practices and Recommendations on the Security and Resilience of Big Data...
Good Practices and Recommendations on the Security and Resilience of Big Data...Good Practices and Recommendations on the Security and Resilience of Big Data...
Good Practices and Recommendations on the Security and Resilience of Big Data...
 
CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...CINECA webinar slides: Open science through fair health data networks dream o...
CINECA webinar slides: Open science through fair health data networks dream o...
 
Are we FAIR yet?
Are we FAIR yet?Are we FAIR yet?
Are we FAIR yet?
 
Big Data a Catalunya
Big Data a CatalunyaBig Data a Catalunya
Big Data a Catalunya
 
Big Data a Catalunya
Big Data a CatalunyaBig Data a Catalunya
Big Data a Catalunya
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial Domain
 
Innovation series 112318
Innovation series 112318Innovation series 112318
Innovation series 112318
 
TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014TOP TEN: Big Data_ Issue 16 _ Dec 2014
TOP TEN: Big Data_ Issue 16 _ Dec 2014
 
Data management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.euData management plans – EUDAT Best practices and case study | www.eudat.eu
Data management plans – EUDAT Best practices and case study | www.eudat.eu
 
Διαχείριση Ανοικτών Ερευνητικών Δεδομένων Υγείας - Π. Μπαμίδης
Διαχείριση Ανοικτών Ερευνητικών Δεδομένων Υγείας - Π. ΜπαμίδηςΔιαχείριση Ανοικτών Ερευνητικών Δεδομένων Υγείας - Π. Μπαμίδης
Διαχείριση Ανοικτών Ερευνητικών Δεδομένων Υγείας - Π. Μπαμίδης
 

More from Michel Dumontier

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsMichel Dumontier
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemMichel Dumontier
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemMichel Dumontier
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerMichel Dumontier
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationMichel Dumontier
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessMichel Dumontier
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Michel Dumontier
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked DataMichel Dumontier
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge DiscoveryMichel Dumontier
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMichel Dumontier
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataMichel Dumontier
 
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMaking the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMichel Dumontier
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesMichel Dumontier
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Michel Dumontier
 
1st Network-of-BioThings Hackathon
1st Network-of-BioThings Hackathon1st Network-of-BioThings Hackathon
1st Network-of-BioThings HackathonMichel Dumontier
 
Powering Scientific Discovery with the Semantic Web (VanBUG 2014)
Powering Scientific Discovery with the Semantic Web (VanBUG 2014)Powering Scientific Discovery with the Semantic Web (VanBUG 2014)
Powering Scientific Discovery with the Semantic Web (VanBUG 2014)Michel Dumontier
 

More from Michel Dumontier (20)

A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
 
Evaluating FAIRness
Evaluating FAIRnessEvaluating FAIRness
Evaluating FAIRness
 
The Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health SystemThe Role of the FAIR Guiding Principles for an effective Learning Health System
The Role of the FAIR Guiding Principles for an effective Learning Health System
 
The role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health SystemThe role of the FAIR Guiding Principles in a Learning Health System
The role of the FAIR Guiding Principles in a Learning Health System
 
Keynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University DinnerKeynote at the 2018 Maastricht University Dinner
Keynote at the 2018 Maastricht University Dinner
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 
Ontologies
OntologiesOntologies
Ontologies
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
 
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMaking the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discovery
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description Guidelines
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
 
1st Network-of-BioThings Hackathon
1st Network-of-BioThings Hackathon1st Network-of-BioThings Hackathon
1st Network-of-BioThings Hackathon
 
Powering Scientific Discovery with the Semantic Web (VanBUG 2014)
Powering Scientific Discovery with the Semantic Web (VanBUG 2014)Powering Scientific Discovery with the Semantic Web (VanBUG 2014)
Powering Scientific Discovery with the Semantic Web (VanBUG 2014)
 

Recently uploaded

Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Silpa
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONrouseeyyy
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptxAlMamun560346
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedDelhi Call girls
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...ssuser79fe74
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Servicemonikaservice1
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryAlex Henderson
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Mohammad Khajehpour
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .Poonam Aher Patil
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)Areesha Ahmad
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxBhagirath Gogikar
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)Areesha Ahmad
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learninglevieagacer
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...chandars293
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformationAreesha Ahmad
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxFarihaAbdulRasheed
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Bookingroncy bisnoi
 

Recently uploaded (20)

Clean In Place(CIP).pptx .
Clean In Place(CIP).pptx                 .Clean In Place(CIP).pptx                 .
Clean In Place(CIP).pptx .
 
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
Locating and isolating a gene, FISH, GISH, Chromosome walking and jumping, te...
 
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATIONSTS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
STS-UNIT 4 CLIMATE CHANGE POWERPOINT PRESENTATION
 
Seismic Method Estimate velocity from seismic data.pptx
Seismic Method Estimate velocity from seismic  data.pptxSeismic Method Estimate velocity from seismic  data.pptx
Seismic Method Estimate velocity from seismic data.pptx
 
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verifiedConnaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
Connaught Place, Delhi Call girls :8448380779 Model Escorts | 100% verified
 
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
Chemical Tests; flame test, positive and negative ions test Edexcel Internati...
 
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts ServiceJustdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
Justdial Call Girls In Indirapuram, Ghaziabad, 8800357707 Escorts Service
 
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and SpectrometryFAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
FAIRSpectra - Enabling the FAIRification of Spectroscopy and Spectrometry
 
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
Dopamine neurotransmitter determination using graphite sheet- graphene nano-s...
 
Factory Acceptance Test( FAT).pptx .
Factory Acceptance Test( FAT).pptx       .Factory Acceptance Test( FAT).pptx       .
Factory Acceptance Test( FAT).pptx .
 
GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)GBSN - Microbiology (Unit 3)
GBSN - Microbiology (Unit 3)
 
Introduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptxIntroduction,importance and scope of horticulture.pptx
Introduction,importance and scope of horticulture.pptx
 
GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)GBSN - Biochemistry (Unit 1)
GBSN - Biochemistry (Unit 1)
 
module for grade 9 for distance learning
module for grade 9 for distance learningmodule for grade 9 for distance learning
module for grade 9 for distance learning
 
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
High Class Escorts in Hyderabad ₹7.5k Pick Up & Drop With Cash Payment 969456...
 
Conjugation, transduction and transformation
Conjugation, transduction and transformationConjugation, transduction and transformation
Conjugation, transduction and transformation
 
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptxCOST ESTIMATION FOR A RESEARCH PROJECT.pptx
COST ESTIMATION FOR A RESEARCH PROJECT.pptx
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance BookingCall Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
Call Girls Alandi Call Me 7737669865 Budget Friendly No Advance Booking
 

Accelerating Biomedical Research with the Emerging Internet of FAIR Data and Services

  • 1. Accelerating Biomedical Research with the Emerging Internet of FAIR Data and Services @micheldumontier::Montpellier:2019-05-271 Michel Dumontier, Ph.D. Distinguished Professor of Data Science Director, Institute of Data Science
  • 2. An increasing number of discoveries are data-driven @micheldumontier::Montpellier:2019-05-272
  • 3. 3 A common rejection module (CRM) for acute rejection across multiple organs identifies novel therapeutics for organ transplantation Khatri et al. JEM. 210 (11): 2205 DOI: 10.1084/jem.20122709 @micheldumontier::Montpellier:2019-05-27 Main Findings: 1. CRM genes predicted future injury to a graft 2. Mice treated with drugs against the CRM genes extended graft survival 3. Retrospective EHR analysis supports treatment prediction Key Observations: 1. Meta-analysis offers a more reliable estimate of the magnitude of the effect 2. Data can be used to generate and support/dispute new hypotheses
  • 4. However, significant effort is still needed to find the right datasets, make sense of them, and ultimately use them for a new purpose @micheldumontier::Montpellier:2019-05-274
  • 5. metadata is key to find and evaluate content @micheldumontier::Montpellier:2019-05-275
  • 7. 7 @micheldumontier::Montpellier:2019-05-27 Reproducing landmark studies remains challenging: 39% (39/100) in psychology1 21% (14/67) in pharmacology2 11% (6/53) in cancer3 1doi:10.1038/nature.2015.17433 2doi:10.1038/nrd3439-c1 3doi:10.1038/483531a
  • 9. @micheldumontier::Montpellier:2019-05-279 we need to completely rethink how we perform biomedical research
  • 10. @micheldumontier::Montpellier:2019-05-2710 Lambin et al. Radiother Oncol. 2013. 109(1):159-64. doi: 10.1016/j.radonc.2013.07.007
  • 11. The Future is Human Machine Collaboration @micheldumontier::Montpellier:2019-05-2711
  • 12. 12
  • 13. We need a new social contract, supported by legal and technological infrastructure to make digital resources available to people and the machines they use @micheldumontier::Montpellier:2019-05-2713
  • 15. An international, bottom-up paradigm for the discovery and reuse of digital content for people and the machines that they use @micheldumontier::Montpellier:2019-05-2715
  • 18. FAIR in a nutshell FAIR aims to create social and economic impact by facilitating the discovery and reuse of digital resources through a set of basic requirements: – unique identifiers to retrieve all forms of digital content and knowledge – high quality meta(data) to enhance discovery of digital resources – use of common vocabularies to create shared meaning and facilitate search – adherence to community standards for common representations – detailed provenance to provide context and facilitate reproducibility – registered in appropriate repositories to make sure they can be found – social and technological commitments to realize reliable access – simpler terms of use to clarify expectations and intensify innovation @micheldumontier::Montpellier:2019-05-2718
  • 19. @micheldumontier::Montpellier:2019-05-2719 FAIR does not imply Open. Open as possible closed as is necessary
  • 20. Improving the FAIRness of digital resources will increase their potential for reuse @micheldumontier::Montpellier:2019-05-2720
  • 21. Let’s build the Internet of FAIR data and services @micheldumontier::Montpellier:2019-05-2721
  • 22. Your invitation to participate https://osf.io/n7uwp/ erik.schultes@go-fair.org 22
  • 25. The Semantic Web is a portal to the web of knowledge 25 @micheldumontier::Montpellier:2019-05-27 standards for publishing, sharing and querying facts, expert knowledge and services scalable approach for the discovery of independently constructed, collaboratively described, distributed knowledge
  • 26. The semantic web community has built a massive open and decentralized knowledge graph 26 @micheldumontier::Montpellier:2019-05-27
  • 27. • 30+ biomedical data sources • 10B+ interlinked statements • EBI, SIB, NCBI, DBCLS, NCBO, and many others produce this content chemicals/drugs/formulations, genomes/genes/proteins, domains Interactions, complexes & pathways animal models and phenotypes Disease, genetic markers, treatments Terminologies & publications 27 Alison Callahan, Jose Cruz-Toledo, Peter Ansell, Michel Dumontier: Bio2RDF Release 2: Improved Coverage, Interoperability and Provenance of Life Science Linked Data. ESWC 2013: 200-212 Linked Data for the Life Sciences Bio2RDF is an open source project that uses semantic web technologies to make it easier to reuse biomedical data @micheldumontier::Montpellier:2019-05-27
  • 28. Query the distributed web of data @micheldumontier::Montpellier:2019-05-2728 Phenotypes of knock-out mouse models for the targets of a selected drug (Imatinib)
  • 29. Find and explore data with effective user interfaces @micheldumontier::Montpellier:2019-05-2729 Disclosure: I’m an advisor to OntoForce
  • 30. Examine the provenance behind the facts @micheldumontier::Montpellier:2019-05-2730 Disclosure: I’m an advisor to OntoForce
  • 31. Make your work easier to reproduce @micheldumontier::Montpellier:2019-05-2731 AUC 0.91 across all therapeutic indications Scripts not available. Feature tables available.
  • 32. Result: ROCAUC 0.831 doesn’t quite match @micheldumontier::Montpellier:2019-05-2732
  • 33. @micheldumontier::Montpellier:2019-05-2733 Find new uses for existing drugs Finding melanoma drugs through a probabilistic knowledge graph. PeerJ Computer Science. 2017. 3:e106 https://doi.org/10.7717/peerj-cs.106 by exploring a probabilistic semantic knowledge graph And validate them against pipelines for drug discovery
  • 34. Analyzing partitioned FAIR health data responsibly Maastricht Study + MUMC CBS Goal is to learn high confidence determinants of health in a privacy preserving manner over vertically partitioned FAIR data from the Maastricht Study and Statistics Netherlands. Establish a new social, legal, ethical and technological infrastructure for discovery science in and across health and non-health settings, including scalable governance and flexible consent to underpin the responsible use of Big Data. @micheldumontier::Montpellier:2019-05-2734
  • 35. Unifying API data with Linked Open Data 35 @micheldumontier::Montpellier:2019-05-27 API API
  • 38. Towards Genuine Semantic Publishing @micheldumontier::Montpellier:2019-05-2738
  • 39. Automated FAIRness Assessments • Powered using smartAPI and semantic web technologies • Harvests a diverse set of metadata through HTTP operations and links in documents • Open source and extensible! 39 http://W3id.org/AmIFAIR
  • 40. Things to think about • Making data FAIR suffers from a lack of incentives. Maybe data needs to be stored, before it can be analyzed? How can data generators readily see the impact of their contributions? • Making data FAIR is time consuming. To what extent can we automate this? Can non-expert workers reduce the time? Can we make more data FAIR at the moment it is generated? • Making data FAIR requires collaboration. How can we more efficiently create and sustain communities to establish and disseminate best practices? • Making data FAIR is expensive. Some funding agencies (e.g. Horizon2020) are exploring how to make research data management a budget line item @micheldumontier::Montpellier:2019-05-2740
  • 41. Summary • FAIR represents a global initiative to enhance the discovery and reuse of all kinds of digital resources which will also help address the reproducibility crisis • It demands a new social, legal and technological infrastructure that currently doesn’t exist in whole, but has to be built for and tested by various communities! • The FAIR concept is transforming into new processes, behaviours and platforms. • Huge benefits to be had, particularly in augmenting existing research programs and in automated machine processing, but needs to be coupled with the proper technical and ethical training. @micheldumontier::FAIR:2019-05-2441
  • 42. michel.dumontier@maastrichtuniversity.nl Website: http://maastrichtuniversity.nl/ids 42 @micheldumontier::FAIR:2019-05-24 The mission of the Institute of Data Science at Maastricht University is to foster a collaborative environment for multi-disciplinary data science research, interdisciplinary training, and data-driven innovation . We tackle key scientific, technical, social, legal, ethical issues that advance our understanding across a variety of disciplines and strengthen our communities in the face of these developments.

Editor's Notes

  1. Abstract Using meta-analysis of eight independent transplant datasets (236 graft biopsy samples) from four organs, we identified a common rejection module (CRM) consisting of 11 genes that were significantly overexpressed in acute rejection (AR) across all transplanted organs. The CRM genes could diagnose AR with high specificity and sensitivity in three additional independent cohorts (794 samples). In another two independent cohorts (151 renal transplant biopsies), the CRM genes correlated with the extent of graft injury and predicted future injury to a graft using protocol biopsies. Inferred drug mechanisms from the literature suggested that two FDA-approved drugs (atorvastatin and dasatinib), approved for nontransplant indications, could regulate specific CRM genes and reduce the number of graft-infiltrating cells during AR. We treated mice with HLA-mismatched mouse cardiac transplant with atorvastatin and dasatinib and showed reduction of the CRM genes, significant reduction of graft-infiltrating cells, and extended graft survival. We further validated the beneficial effect of atorvastatin on graft survival by retrospective analysis of electronic medical records of a single-center cohort of 2,515 renal transplant patients followed for up to 22 yr. In conclusion, we identified a CRM in transplantation that provides new opportunities for diagnosis, drug repositioning, and rational drug design.
  2. G20: http://europa.eu/rapid/press-release_STATEMENT-16-2967_en.htm EOSC: https://ec.europa.eu/research/openscience/pdf/realising_the_european_open_science_cloud_2016.pdf H2020: https://goo.gl/Strjua
  3. https://www.gov.uk/government/publications/g8-science-ministers-statement-london-12-june-2013
  4. The Bio2RDF project transforms silos of life science data into a globally distributed network of linked data for biological knowledge discovery.