SlideShare a Scribd company logo
THE RARE DISEASE DATA LINKAGE PLAN
BOOSTING RESEARCH BY MAKING DATA
RESOURCES FINDABLE, ACCESSIBLE,
INTEROPERABLE AND REUSABLE TOGETHER
I RDiRC 2017, Paris, Februar y 9, 2017
Marco Roos, David van Enckevort
Acknowledging patient representatives, ELIXIR(-EXCELERATE), RD-CONNECT, BBMRI(-NL), ODEX4ALL,
FAIRDict, the rare disease linked data and ontology task force, Mark Thompson, Robert Reihs, Rajaram
Kaliyaperumal, Pedro Sernadela, Marc Hanauer, Mark Wilkinson, Claudio Carta, Rachel Thompson, Estrella Lopes, Lorena
Casareto, Frederique Ehrhart, Roxana Merino, Luiz Bonino & team, Ronald Cornet, Peter Robinson, Mathias Brochhausen, Simon Jupp, Sira
Sarntivijai, Helen Parkinson, Ana Rath, Heimo Muller, Lucia Monaco, Domenica Taruscio, Manuel Posada, Luca Sangiorgi, Morris Swertz, José
Oliveira, Peter-Bram ‘t Hoen, Hanns Lochmuller, Larry Hunter, participants and organisers of rare disease Bring Your Own Data workshops
In this presentation, I will give
you a light-weight introduction
to the rare disease data linkage
plan to boost research on rare
diseases.
2
ELIXIR
16 February
2017
A distributed
infrastructure
for life science
information
The plan is supported by several projects and
infrastructures. Here, I highlight ELIXIR, the European
infrastructure for life science information. Ivo Gut, Sergi
Beltran and Marco Roos co-lead the case for rare diseases in
this infrastructure.
3
To boost rare disease research
Objective
16 February
2017
Our goal is to boost rare
disease research
4
> 6000 rare diseases
16 February
2017
But then we have to take
into account that there are
over 6000 rare diseases
5
> 6000 rare diseases
biobanks, registries, sequencing, OMICS, …
16 February
2017
Collecting multiple types of
materials and data, such as
biobanks, registries,
sequence data, omics,
etcetera
6
> 6000 rare diseases
biobanks, registries, sequencing, OMICS, …
across countries and institutes
16 February
2017
And across many countries
and institutes
7
What we want to achieve
16 February
2017
7
Rare disease researcher
What we aim to achieve is
the following; here is rare
disease researcher Claudio
8
Rare disease researcher asks
Which treatments for symptoms
of other diseases may mitigate
the same symptoms of
my rare disease?
Claudio has questions like
these
9
A rare disease researcher asks…
Where can I obtain biosamples of donors
with an abnormality in head or neck?
In which biobanks can I
find these samples?
Or these…
We have built a
demonstration web tool
to show how we can
answer these questions
with the technologies that
we advocate.
16 February
2017
10
Here, Claudio can select his
question.
16 February
2017
11
Select the symptom that he
is interested in
And obtain this table with
the answers.
Note that all items are blue; that means you can click on
them to get more information about them.
Note that the phenotypes are all subtypes of abnormality
of head or neck.
Also note that there are multiple diseases, multiple
biobanks, and multiple registries in the list. This shows
that the information came from different sources.
13
The role of computational analysis in
boosting health care and life science will
 Decrease
 Increase
 Stay the same
Questions to you
16 February
2017
Before I continue, think
about this question
14
The role of computational analysis in
boosting health care and life science will
 Decrease
 Increase
 Stay the same
Questions to you
16 February
2017
Most people tend to say it
will increase.
15
to boost rare disease research
16 February
2017
If that is the case, then…
16
enable computational analysts
to boost rare disease research
16 February
2017
We need to…
17
The substrate for computational analysts
DATA
And then we have to look at
the substrate for
computational analysts: data
18
How to boost rare disease research?
PATIENT
DATA
In the rare disease domain,
we have many different
types of data, such as
patient data
19
How to boost rare disease research?
OMICS
DATA
Omics data
20
How to boost rare disease research?
SAMPLE
DATA
Or biological samples
21
How to boost rare disease research?
SAMPLE
DATA
And in all shapes and sizes,
different languages,
different formats
22
How to boost rare disease research?
PATIENT
DATA
> 6000 rare diseases
across countries and institutes
Remember that we have
thousands of data sources in
our domain.
23
How to boost rare disease research?
PATIENT
DATA
Data incompatibilities are
an enormous bottleneck for
data analysts: they spend
months per data source to
resolve them.
24
How to boost rare disease research?
DA
TA
DATA
You Them
A way to address this is by letting
others make your data compatible:
‘they’ transform the data to be more
compatible.
25
How to boost rare disease research?
DA
TA
You Them
DATA
They can do that with
multiple data sources, and
integrate them.
26
DATA
How to boost rare disease research?
DA
TA
You Them
However, there is a big risk. When for whatever
reason, ‘they’ cannot maintain this anymore, for
instance because the funding stops…
27
How to boost rare disease research?
DA
TA
You
There is nothing left.
28
How to boost rare disease research?
DA
TA
You
We are back to square one:
incompatible data.
This is not good enough for data infrastructure.
International leading data experts have defined an
approach for this that I cannot explain better than is done
in the following video.
29
Inspiration: personal Health Train
16 February 2017
https://www.youtube.com/watch?v=mktAtHmy-FM
30
G20 Endorsement of FAIR principles
Next to ELIXIR, EOSC, NIH Commons
We, the leaders of the G20…
facilitate appropriate access to publicly
funded research results on
findable, accessible,
interoperable and reusable
(FAIR) principles …
The FAIR principles are highly endorsed, by ELIXIR, the Open
European Science Cloud, NIH via its ‘commons’ program,
and since 2016 also the G20.
31
FAIR principles applied to rare disease data
RD
DATA
FAIR
Linkable
RD
DATA
You
How do we apply them for
rare diseases?
At a high level, the steps are more-or-less the
same: your data, a transformation, but now
we have FAIR, linkable data on the right.
32
FAIR principles applied to rare disease data
RD
DATA
FAIR
Linkable
RD
DATA
You YouYou and them
together
Knowledge exchange
But there are major differences: instead of ‘you’
and ‘them’ data experts and FAIR data experts do
the transformation together.
This involves substantial knowledge exchange. Another major difference is that
instead of ‘them’ there is ‘you’ on the right: data owners stay in control.
33
FAIR principles applied to rare disease data
Data can be more easily combined. Each resource is an
independently FAIR resource. This is a much more
robust infrastructure.
34
Creating substrate to boost rare disease research
In our rare
disease data
linkage plan
we go
through this
process, one
at a time.
Each time we
improve our
methods,
each time we
do this faster.
Rare disease data linkage plan - 2017
• > 7 biobanks/registries FAIR at the source
• Study FAIR pathways, Orphanet, mutation data
• Support: RD-Connect, ELIXIR, BBMRI (-NL), FAIRDict,
ODEX4All, patient organisations
David van Enckevort
Technical leader
Our aims for 2017
36
 Light-weight introduction to the rare
disease data linkage plan
 Account for scale and sparsity of data in
rare disease domain
 Federated infrastructure of local FAIR data
Summary
16-Feb-17
36
37
Invitation
 Contact us about making
rare disease data FAIR
 Turn plan into long-
running service
Long term plans
16 February
2017
Contact us about making rare disease data FAIR. Let us know
if you would like to help turning the plan for 2017 into a long
running service for the rare disease community. We envision
a role for patient organisations in that.
• Mascha Jansen: FAIR data projects
and Bring Your Own Data workshops
(mascha.jansen@dtls.nl)
• David van Enckevort, Marco Roos:
Rare disease data linkage plan &
FAIR RD data projects
• Erik Schultes: FAIR data (awareness)
training; for Elixir/RD: Brane
Leskosek (DTL: Celia van Gelder)
Thank you
Acknowledging patient
representatives, ELIXIR(-
EXCELERATE), RD-CONNECT,
BBMRI(-NL), ODEX4ALL,
FAIRDict, the rare disease
linked data and ontology
task force, David van Enckevort,
Mark Thompson, Robert Reihs,
Rajaram Kaliyaperumal, Pedro
Sernadela, Marc Hanauer, Mark
Wilkinson, Claudio Carta, Rachel
Thompson, Estrella Lopes, Lorena
Casareto, Frederique Ehrhart,
Roxana Merino, Luiz Bonino & team,
Ronald Cornet, Peter Robinson,
Mathias Brochhausen, Simon Jupp,
Sira Sarntivijai, Helen Parkinson, Ana
Rath, Heimo Muller, Lucia Monaco,
Domenica Taruscio, Manuel Posada,
Luca Sangiorgi, Morris Swertz, José
Oliveira, Peter-Bram ‘t Hoen, Hanns
Lochmuller, Larry Hunter,
participants and organisers of rare
disease Bring Your Own Data
workshops
Thank you for your attention.

More Related Content

Similar to Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation

IMS Health Workshop World Orphan Drug Congress
IMS Health Workshop World Orphan Drug CongressIMS Health Workshop World Orphan Drug Congress
IMS Health Workshop World Orphan Drug Congress
IMSHealthRWES
 
Data Literacy -- Necessity and challenges
Data Literacy -- Necessity and challengesData Literacy -- Necessity and challenges
Data Literacy -- Necessity and challenges
Srdjan Verbić
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
Pistoia Alliance
 
2016 iHT2 San Diego Health IT Summit
2016 iHT2 San Diego Health IT Summit2016 iHT2 San Diego Health IT Summit
2016 iHT2 San Diego Health IT Summit
Health IT Conference – iHT2
 
Presentation of the PICASO Project at WHINN Conference, October 2016
Presentation of the PICASO Project at WHINN Conference, October 2016Presentation of the PICASO Project at WHINN Conference, October 2016
Presentation of the PICASO Project at WHINN Conference, October 2016
PicasoProject
 
Mental Health Informatics - What we can learn from the past and where we can be
Mental Health Informatics - What we can learn from the past and where we can beMental Health Informatics - What we can learn from the past and where we can be
Mental Health Informatics - What we can learn from the past and where we can be
Himanshu Tyagi
 
Standards as Infrastructure for Innovation
Standards as Infrastructure for Innovation Standards as Infrastructure for Innovation
Trusted! Quest for data-driven and fair health solutions
Trusted! Quest for data-driven and fair health solutions Trusted! Quest for data-driven and fair health solutions
Trusted! Quest for data-driven and fair health solutions
Sitra / Hyvinvointi
 
20191203 DOE Data Driven Healthcare- Expert Event
20191203 DOE Data Driven Healthcare- Expert Event20191203 DOE Data Driven Healthcare- Expert Event
20191203 DOE Data Driven Healthcare- Expert Event
DayOne
 
Latest Project to Cure PKD
Latest Project to Cure PKDLatest Project to Cure PKD
Latest Project to Cure PKD
Sean Flaherty
 
Charles Alessi
Charles AlessiCharles Alessi
Charles Alessi
Lucia Garcia
 
The IMI EHDEN project: large-scale analysis of observation data in Europe - C...
The IMI EHDEN project: large-scale analysis of observation data in Europe - C...The IMI EHDEN project: large-scale analysis of observation data in Europe - C...
The IMI EHDEN project: large-scale analysis of observation data in Europe - C...
Maxim Moinat
 
ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016
Fiona Nielsen
 
New Health Data Deluges Require Secure Information Flow Enablement Via Standa...
New Health Data Deluges Require Secure Information Flow Enablement Via Standa...New Health Data Deluges Require Secure Information Flow Enablement Via Standa...
New Health Data Deluges Require Secure Information Flow Enablement Via Standa...
Dana Gardner
 
Natalie banner
Natalie bannerNatalie banner
Natalie banner
HIQAHIS
 
Rock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_Health
Rock Health
 
Portugal-patientsummaries
Portugal-patientsummariesPortugal-patientsummaries
Portugal-patientsummaries
AnnaSeebergHansen
 
Digitization in the emergency Department: the role of patient summaries
Digitization in the emergency Department: the role of patient summariesDigitization in the emergency Department: the role of patient summaries
Digitization in the emergency Department: the role of patient summaries
Trillium Bridge: Reinforcing the Bridges and Scaling up EU/US Cooperation on Patient Summary
 
Analysis of EU – Brazil cooperation opportunities on teleHealth
Analysis of EU – Brazil cooperation opportunities on teleHealthAnalysis of EU – Brazil cooperation opportunities on teleHealth
Analysis of EU – Brazil cooperation opportunities on teleHealth
Universitat Politècnica de València
 
OHDSI OMOP - i2b2 conference Tübingen 2019
OHDSI OMOP - i2b2 conference Tübingen 2019OHDSI OMOP - i2b2 conference Tübingen 2019
OHDSI OMOP - i2b2 conference Tübingen 2019
Maxim Moinat
 

Similar to Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation (20)

IMS Health Workshop World Orphan Drug Congress
IMS Health Workshop World Orphan Drug CongressIMS Health Workshop World Orphan Drug Congress
IMS Health Workshop World Orphan Drug Congress
 
Data Literacy -- Necessity and challenges
Data Literacy -- Necessity and challengesData Literacy -- Necessity and challenges
Data Literacy -- Necessity and challenges
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
 
2016 iHT2 San Diego Health IT Summit
2016 iHT2 San Diego Health IT Summit2016 iHT2 San Diego Health IT Summit
2016 iHT2 San Diego Health IT Summit
 
Presentation of the PICASO Project at WHINN Conference, October 2016
Presentation of the PICASO Project at WHINN Conference, October 2016Presentation of the PICASO Project at WHINN Conference, October 2016
Presentation of the PICASO Project at WHINN Conference, October 2016
 
Mental Health Informatics - What we can learn from the past and where we can be
Mental Health Informatics - What we can learn from the past and where we can beMental Health Informatics - What we can learn from the past and where we can be
Mental Health Informatics - What we can learn from the past and where we can be
 
Standards as Infrastructure for Innovation
Standards as Infrastructure for Innovation Standards as Infrastructure for Innovation
Standards as Infrastructure for Innovation
 
Trusted! Quest for data-driven and fair health solutions
Trusted! Quest for data-driven and fair health solutions Trusted! Quest for data-driven and fair health solutions
Trusted! Quest for data-driven and fair health solutions
 
20191203 DOE Data Driven Healthcare- Expert Event
20191203 DOE Data Driven Healthcare- Expert Event20191203 DOE Data Driven Healthcare- Expert Event
20191203 DOE Data Driven Healthcare- Expert Event
 
Latest Project to Cure PKD
Latest Project to Cure PKDLatest Project to Cure PKD
Latest Project to Cure PKD
 
Charles Alessi
Charles AlessiCharles Alessi
Charles Alessi
 
The IMI EHDEN project: large-scale analysis of observation data in Europe - C...
The IMI EHDEN project: large-scale analysis of observation data in Europe - C...The IMI EHDEN project: large-scale analysis of observation data in Europe - C...
The IMI EHDEN project: large-scale analysis of observation data in Europe - C...
 
ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016ICG-11 - genomic data projects around the world - nov 5 2016
ICG-11 - genomic data projects around the world - nov 5 2016
 
New Health Data Deluges Require Secure Information Flow Enablement Via Standa...
New Health Data Deluges Require Secure Information Flow Enablement Via Standa...New Health Data Deluges Require Secure Information Flow Enablement Via Standa...
New Health Data Deluges Require Secure Information Flow Enablement Via Standa...
 
Natalie banner
Natalie bannerNatalie banner
Natalie banner
 
Rock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_HealthRock Report: Big Data by @Rock_Health
Rock Report: Big Data by @Rock_Health
 
Portugal-patientsummaries
Portugal-patientsummariesPortugal-patientsummaries
Portugal-patientsummaries
 
Digitization in the emergency Department: the role of patient summaries
Digitization in the emergency Department: the role of patient summariesDigitization in the emergency Department: the role of patient summaries
Digitization in the emergency Department: the role of patient summaries
 
Analysis of EU – Brazil cooperation opportunities on teleHealth
Analysis of EU – Brazil cooperation opportunities on teleHealthAnalysis of EU – Brazil cooperation opportunities on teleHealth
Analysis of EU – Brazil cooperation opportunities on teleHealth
 
OHDSI OMOP - i2b2 conference Tübingen 2019
OHDSI OMOP - i2b2 conference Tübingen 2019OHDSI OMOP - i2b2 conference Tübingen 2019
OHDSI OMOP - i2b2 conference Tübingen 2019
 

More from Leiden University Medical Center

Nanopubs strong to_weak_semantics_vs_machine_readability
Nanopubs strong to_weak_semantics_vs_machine_readabilityNanopubs strong to_weak_semantics_vs_machine_readability
Nanopubs strong to_weak_semantics_vs_machine_readability
Leiden University Medical Center
 
Data models for preserving and publishing digital research material beyond th...
Data models for preserving and publishing digital research material beyond th...Data models for preserving and publishing digital research material beyond th...
Data models for preserving and publishing digital research material beyond th...
Leiden University Medical Center
 
Feasting onbrainswithworkflows
Feasting onbrainswithworkflowsFeasting onbrainswithworkflows
Feasting onbrainswithworkflows
Leiden University Medical Center
 
Enabling Collaborative Biobank Research with feedback from audience
Enabling Collaborative Biobank Research with feedback from audienceEnabling Collaborative Biobank Research with feedback from audience
Enabling Collaborative Biobank Research with feedback from audience
Leiden University Medical Center
 
From Laboratory to e-Laboratory
From Laboratory to e-LaboratoryFrom Laboratory to e-Laboratory
From Laboratory to e-Laboratory
Leiden University Medical Center
 
CWA & SWAT4LS Pitch at DILS2009
CWA & SWAT4LS Pitch at DILS2009CWA & SWAT4LS Pitch at DILS2009
CWA & SWAT4LS Pitch at DILS2009
Leiden University Medical Center
 
Demo Presentation Wageningen Text Mining Workshop 2007
Demo Presentation Wageningen Text Mining Workshop 2007Demo Presentation Wageningen Text Mining Workshop 2007
Demo Presentation Wageningen Text Mining Workshop 2007
Leiden University Medical Center
 
Feasting On Brains With Taverna Public
Feasting On Brains With Taverna PublicFeasting On Brains With Taverna Public
Feasting On Brains With Taverna Public
Leiden University Medical Center
 
Demo Presentation ISMB/ECCB 2007
Demo Presentation ISMB/ECCB 2007Demo Presentation ISMB/ECCB 2007
Demo Presentation ISMB/ECCB 2007
Leiden University Medical Center
 
'A PAL's Life' for OMII-UK Board, May 2008
'A PAL's Life' for OMII-UK Board, May 2008'A PAL's Life' for OMII-UK Board, May 2008
'A PAL's Life' for OMII-UK Board, May 2008
Leiden University Medical Center
 
A biologist in e-Science
A biologist in e-ScienceA biologist in e-Science
A biologist in e-Science
Leiden University Medical Center
 
E Science4 Chromatin Research
E Science4 Chromatin ResearchE Science4 Chromatin Research
E Science4 Chromatin Research
Leiden University Medical Center
 

More from Leiden University Medical Center (12)

Nanopubs strong to_weak_semantics_vs_machine_readability
Nanopubs strong to_weak_semantics_vs_machine_readabilityNanopubs strong to_weak_semantics_vs_machine_readability
Nanopubs strong to_weak_semantics_vs_machine_readability
 
Data models for preserving and publishing digital research material beyond th...
Data models for preserving and publishing digital research material beyond th...Data models for preserving and publishing digital research material beyond th...
Data models for preserving and publishing digital research material beyond th...
 
Feasting onbrainswithworkflows
Feasting onbrainswithworkflowsFeasting onbrainswithworkflows
Feasting onbrainswithworkflows
 
Enabling Collaborative Biobank Research with feedback from audience
Enabling Collaborative Biobank Research with feedback from audienceEnabling Collaborative Biobank Research with feedback from audience
Enabling Collaborative Biobank Research with feedback from audience
 
From Laboratory to e-Laboratory
From Laboratory to e-LaboratoryFrom Laboratory to e-Laboratory
From Laboratory to e-Laboratory
 
CWA & SWAT4LS Pitch at DILS2009
CWA & SWAT4LS Pitch at DILS2009CWA & SWAT4LS Pitch at DILS2009
CWA & SWAT4LS Pitch at DILS2009
 
Demo Presentation Wageningen Text Mining Workshop 2007
Demo Presentation Wageningen Text Mining Workshop 2007Demo Presentation Wageningen Text Mining Workshop 2007
Demo Presentation Wageningen Text Mining Workshop 2007
 
Feasting On Brains With Taverna Public
Feasting On Brains With Taverna PublicFeasting On Brains With Taverna Public
Feasting On Brains With Taverna Public
 
Demo Presentation ISMB/ECCB 2007
Demo Presentation ISMB/ECCB 2007Demo Presentation ISMB/ECCB 2007
Demo Presentation ISMB/ECCB 2007
 
'A PAL's Life' for OMII-UK Board, May 2008
'A PAL's Life' for OMII-UK Board, May 2008'A PAL's Life' for OMII-UK Board, May 2008
'A PAL's Life' for OMII-UK Board, May 2008
 
A biologist in e-Science
A biologist in e-ScienceA biologist in e-Science
A biologist in e-Science
 
E Science4 Chromatin Research
E Science4 Chromatin ResearchE Science4 Chromatin Research
E Science4 Chromatin Research
 

Recently uploaded

Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
pablovgd
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Texas Alliance of Groundwater Districts
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
Aditi Bajpai
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
hozt8xgk
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
by6843629
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
LengamoLAppostilic
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
Texas Alliance of Groundwater Districts
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
PRIYANKA PATEL
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
European Sustainable Phosphorus Platform
 

Recently uploaded (20)

Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
NuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyerNuGOweek 2024 Ghent programme overview flyer
NuGOweek 2024 Ghent programme overview flyer
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.Micronuclei test.M.sc.zoology.fisheries.
Micronuclei test.M.sc.zoology.fisheries.
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
快速办理(UAM毕业证书)马德里自治大学毕业证学位证一模一样
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf8.Isolation of pure cultures and preservation of cultures.pdf
8.Isolation of pure cultures and preservation of cultures.pdf
 
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdfwaterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
waterlessdyeingtechnolgyusing carbon dioxide chemicalspdf
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
 
ESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptxESR spectroscopy in liquid food and beverages.pptx
ESR spectroscopy in liquid food and beverages.pptx
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
 

Rare Disease Data Linkage plan 2017 - IRDiRC 2017 presentation

  • 1. THE RARE DISEASE DATA LINKAGE PLAN BOOSTING RESEARCH BY MAKING DATA RESOURCES FINDABLE, ACCESSIBLE, INTEROPERABLE AND REUSABLE TOGETHER I RDiRC 2017, Paris, Februar y 9, 2017 Marco Roos, David van Enckevort Acknowledging patient representatives, ELIXIR(-EXCELERATE), RD-CONNECT, BBMRI(-NL), ODEX4ALL, FAIRDict, the rare disease linked data and ontology task force, Mark Thompson, Robert Reihs, Rajaram Kaliyaperumal, Pedro Sernadela, Marc Hanauer, Mark Wilkinson, Claudio Carta, Rachel Thompson, Estrella Lopes, Lorena Casareto, Frederique Ehrhart, Roxana Merino, Luiz Bonino & team, Ronald Cornet, Peter Robinson, Mathias Brochhausen, Simon Jupp, Sira Sarntivijai, Helen Parkinson, Ana Rath, Heimo Muller, Lucia Monaco, Domenica Taruscio, Manuel Posada, Luca Sangiorgi, Morris Swertz, José Oliveira, Peter-Bram ‘t Hoen, Hanns Lochmuller, Larry Hunter, participants and organisers of rare disease Bring Your Own Data workshops In this presentation, I will give you a light-weight introduction to the rare disease data linkage plan to boost research on rare diseases.
  • 2. 2 ELIXIR 16 February 2017 A distributed infrastructure for life science information The plan is supported by several projects and infrastructures. Here, I highlight ELIXIR, the European infrastructure for life science information. Ivo Gut, Sergi Beltran and Marco Roos co-lead the case for rare diseases in this infrastructure.
  • 3. 3 To boost rare disease research Objective 16 February 2017 Our goal is to boost rare disease research
  • 4. 4 > 6000 rare diseases 16 February 2017 But then we have to take into account that there are over 6000 rare diseases
  • 5. 5 > 6000 rare diseases biobanks, registries, sequencing, OMICS, … 16 February 2017 Collecting multiple types of materials and data, such as biobanks, registries, sequence data, omics, etcetera
  • 6. 6 > 6000 rare diseases biobanks, registries, sequencing, OMICS, … across countries and institutes 16 February 2017 And across many countries and institutes
  • 7. 7 What we want to achieve 16 February 2017 7 Rare disease researcher What we aim to achieve is the following; here is rare disease researcher Claudio
  • 8. 8 Rare disease researcher asks Which treatments for symptoms of other diseases may mitigate the same symptoms of my rare disease? Claudio has questions like these
  • 9. 9 A rare disease researcher asks… Where can I obtain biosamples of donors with an abnormality in head or neck? In which biobanks can I find these samples? Or these… We have built a demonstration web tool to show how we can answer these questions with the technologies that we advocate.
  • 10. 16 February 2017 10 Here, Claudio can select his question.
  • 11. 16 February 2017 11 Select the symptom that he is interested in
  • 12. And obtain this table with the answers. Note that all items are blue; that means you can click on them to get more information about them. Note that the phenotypes are all subtypes of abnormality of head or neck. Also note that there are multiple diseases, multiple biobanks, and multiple registries in the list. This shows that the information came from different sources.
  • 13. 13 The role of computational analysis in boosting health care and life science will  Decrease  Increase  Stay the same Questions to you 16 February 2017 Before I continue, think about this question
  • 14. 14 The role of computational analysis in boosting health care and life science will  Decrease  Increase  Stay the same Questions to you 16 February 2017 Most people tend to say it will increase.
  • 15. 15 to boost rare disease research 16 February 2017 If that is the case, then…
  • 16. 16 enable computational analysts to boost rare disease research 16 February 2017 We need to…
  • 17. 17 The substrate for computational analysts DATA And then we have to look at the substrate for computational analysts: data
  • 18. 18 How to boost rare disease research? PATIENT DATA In the rare disease domain, we have many different types of data, such as patient data
  • 19. 19 How to boost rare disease research? OMICS DATA Omics data
  • 20. 20 How to boost rare disease research? SAMPLE DATA Or biological samples
  • 21. 21 How to boost rare disease research? SAMPLE DATA And in all shapes and sizes, different languages, different formats
  • 22. 22 How to boost rare disease research? PATIENT DATA > 6000 rare diseases across countries and institutes Remember that we have thousands of data sources in our domain.
  • 23. 23 How to boost rare disease research? PATIENT DATA Data incompatibilities are an enormous bottleneck for data analysts: they spend months per data source to resolve them.
  • 24. 24 How to boost rare disease research? DA TA DATA You Them A way to address this is by letting others make your data compatible: ‘they’ transform the data to be more compatible.
  • 25. 25 How to boost rare disease research? DA TA You Them DATA They can do that with multiple data sources, and integrate them.
  • 26. 26 DATA How to boost rare disease research? DA TA You Them However, there is a big risk. When for whatever reason, ‘they’ cannot maintain this anymore, for instance because the funding stops…
  • 27. 27 How to boost rare disease research? DA TA You There is nothing left.
  • 28. 28 How to boost rare disease research? DA TA You We are back to square one: incompatible data. This is not good enough for data infrastructure. International leading data experts have defined an approach for this that I cannot explain better than is done in the following video.
  • 29. 29 Inspiration: personal Health Train 16 February 2017 https://www.youtube.com/watch?v=mktAtHmy-FM
  • 30. 30 G20 Endorsement of FAIR principles Next to ELIXIR, EOSC, NIH Commons We, the leaders of the G20… facilitate appropriate access to publicly funded research results on findable, accessible, interoperable and reusable (FAIR) principles … The FAIR principles are highly endorsed, by ELIXIR, the Open European Science Cloud, NIH via its ‘commons’ program, and since 2016 also the G20.
  • 31. 31 FAIR principles applied to rare disease data RD DATA FAIR Linkable RD DATA You How do we apply them for rare diseases? At a high level, the steps are more-or-less the same: your data, a transformation, but now we have FAIR, linkable data on the right.
  • 32. 32 FAIR principles applied to rare disease data RD DATA FAIR Linkable RD DATA You YouYou and them together Knowledge exchange But there are major differences: instead of ‘you’ and ‘them’ data experts and FAIR data experts do the transformation together. This involves substantial knowledge exchange. Another major difference is that instead of ‘them’ there is ‘you’ on the right: data owners stay in control.
  • 33. 33 FAIR principles applied to rare disease data Data can be more easily combined. Each resource is an independently FAIR resource. This is a much more robust infrastructure.
  • 34. 34 Creating substrate to boost rare disease research In our rare disease data linkage plan we go through this process, one at a time. Each time we improve our methods, each time we do this faster.
  • 35. Rare disease data linkage plan - 2017 • > 7 biobanks/registries FAIR at the source • Study FAIR pathways, Orphanet, mutation data • Support: RD-Connect, ELIXIR, BBMRI (-NL), FAIRDict, ODEX4All, patient organisations David van Enckevort Technical leader Our aims for 2017
  • 36. 36  Light-weight introduction to the rare disease data linkage plan  Account for scale and sparsity of data in rare disease domain  Federated infrastructure of local FAIR data Summary 16-Feb-17 36
  • 37. 37 Invitation  Contact us about making rare disease data FAIR  Turn plan into long- running service Long term plans 16 February 2017 Contact us about making rare disease data FAIR. Let us know if you would like to help turning the plan for 2017 into a long running service for the rare disease community. We envision a role for patient organisations in that.
  • 38. • Mascha Jansen: FAIR data projects and Bring Your Own Data workshops (mascha.jansen@dtls.nl) • David van Enckevort, Marco Roos: Rare disease data linkage plan & FAIR RD data projects • Erik Schultes: FAIR data (awareness) training; for Elixir/RD: Brane Leskosek (DTL: Celia van Gelder) Thank you Acknowledging patient representatives, ELIXIR(- EXCELERATE), RD-CONNECT, BBMRI(-NL), ODEX4ALL, FAIRDict, the rare disease linked data and ontology task force, David van Enckevort, Mark Thompson, Robert Reihs, Rajaram Kaliyaperumal, Pedro Sernadela, Marc Hanauer, Mark Wilkinson, Claudio Carta, Rachel Thompson, Estrella Lopes, Lorena Casareto, Frederique Ehrhart, Roxana Merino, Luiz Bonino & team, Ronald Cornet, Peter Robinson, Mathias Brochhausen, Simon Jupp, Sira Sarntivijai, Helen Parkinson, Ana Rath, Heimo Muller, Lucia Monaco, Domenica Taruscio, Manuel Posada, Luca Sangiorgi, Morris Swertz, José Oliveira, Peter-Bram ‘t Hoen, Hanns Lochmuller, Larry Hunter, participants and organisers of rare disease Bring Your Own Data workshops Thank you for your attention.

Editor's Notes

  1. In this presentation, I will give you a light-weight introduction to the rare disease data linkage plan to boost research on rare diseases.
  2. The plan is supported by several projects and infrastructures. Here, I highlight ELIXIR, the European infrastructure for life science information. Ivo Gut and Sergi Beltran from Barcelona and myself from Leiden are carrying the flag for rare diseases in this infrastructure.
  3. Our goal is to boost rare disease research
  4. But then we have to take into account that there are over 6000 rare diseases
  5. For each disease we collect multiple types of materials and data, such as samples in biobanks, health information in registries, sequence data, omics data, etcetera
  6. And across many countries and institutes
  7. What we aim to achieve is the following; here is rare disease researcher Claudio
  8. Claudio has questions like these: which treatments for symptoms of other diseases may mitigate the same symptoms in my disease of interest?
  9. Or, where can I obtain biosamples of donors with an abnormality in head or neck? And in which biobanks can I find these samples? We have built a demonstration web tool to show how we can answer these questions with the technologies that we advocate.
  10. Here, Claudio can select his question.
  11. Select the symptom that he is interested in
  12. And obtain this table with the answers. Note that all items are blue; that means you can click on them to get more information about them. Note that the phenotypes are all subtypes of abnormality of head or neck. Also note that there are multiple diseases, multiple biobanks, and multiple registries in the list. This shows that the information came from different sources.
  13. Before I continue, think about this question: do you think that the role of computational analysis will increase in health care and life science in the future?
  14. Most people tend to say it will increase.
  15. If that is the case, then to boost rare disease research
  16. We need to enable computational analysts to boost rare disease research
  17. And then we have to look at the substrate for computational analysts: data
  18. In the rare disease domain, we have many different types of data, such as patient data
  19. Omics data
  20. Or biosamples
  21. And in all shapes and sizes, different languages, different formats
  22. Remember that we have thousands of data sources in our domain.
  23. Data incompatibilities are an enormous bottleneck for data analysts: they spend months per data source to resolve them.
  24. A way to address this is by letting others make your data compatible: ‘they’ transform the data to be more compatible.
  25. They can do that with multiple data sources, and integrate them.
  26. However, there is a big risk. When for whatever reason, ‘they’ cannot maintain this anymore, for instance because the funding stops.
  27. There is nothing left.
  28. We are back to square one: incompatible data. This is not good enough for data infrastructure. International leading data experts have defined an approach for this that I cannot explain better than is done in the following video.
  29. The FAIR principles are highly endorsed, such as by ELIXIR, a European initiative called the Open European Science Cloud, NIH via its ‘commons’ program, and since 2016 also the G20.
  30. How do we apply them for rare diseases? At a high level, the steps are more-or-less the same: your data, a transformation, but now we have FAIR, linkable data on the right.
  31. But there are major differences: instead of ‘you’ and ‘them’ data experts and FAIR data experts do the transformation together. This involves substantial knowledge exchange. Another major difference is that instead of ‘them’ there is ‘you’ on the right: data owners stay in control of their data.
  32. Data can be more easily combined. Each resource is an independently FAIR resource. This is a much more robust infrastructure.
  33. In our rare disease data linkage plan we go through this process, one at a time. Each time we improve our methods, each time we do this faster.
  34. In 2017 we aim at our first seven biobanks/registries, we will study pathways, orphaned and mutation data. We have support from multiple projects, including patient organisations who we ask to also invest into the collaboration.
  35. In summary: I have given a light-weight introduction to the rare disease data linkage pan. I have shown how we account for the scale and sparsity of data in the rare disease domain, By federated infrastructure of local FAIR data
  36. I invite you to contact us about making rare disease data FAIR And I invite you to let us know if you would like to help us turn this plan into a long running service. We envision a role for patient organisations in that.
  37. Thank you for your attention. Here are some contact points.