SlideShare a Scribd company logo
Are we FAIR yet? And will it be worth it?
@micheldumontier::NETTAB:2018-10-221
Michel Dumontier, Ph.D.
Distinguished Professor of Data Science
Director, Institute of Data Science
An increasing number of
discoveries are made using other
people’s data
@micheldumontier::NETTAB:2018-10-222
3
A common rejection module (CRM) for acute rejection across multiple organs identifies novel
therapeutics for organ transplantation
Khatri et al. JEM. 210 (11): 2205
DOI: 10.1084/jem.20122709
@micheldumontier::NETTAB:2018-10-22
Main Findings:
1. CRM genes correlated with the extent of graft injury and predicted future injury to a graft
2. Mice treated with drugs against the CRM genes extended graft survival
However, significant effort was
needed to find the right datasets,
make sense of them, and ultimately
use them for a new purpose
@micheldumontier::NETTAB:2018-10-224
@micheldumontier::NETTAB:2018-10-225
Poor quality (meta)data impairs (re)search
If we are ever to realize the full
potential of content we create
then we must find ways to reduce the
barrier to publish digital content in a
way that makes it vastly easier to
find, assess and reuse
@micheldumontier::NETTAB:2018-10-226
@micheldumontier::NETTAB:2018-10-227
Lambin et al. Radiother Oncol. 2013. 109(1):159-64. doi: 10.1016/j.radonc.2013.07.007
Why does this matter?
@micheldumontier::NETTAB:2018-10-228
9 @micheldumontier::NETTAB:2018-10-22
Most published research findings are false.
- John Ioannidis, Stanford University
Reproducibility of landmark studies is shockingly low:
39% (39/100) in psychology1
21% (14/67) in pharmacology2
11% (6/53) in cancer3
PLoS Med 2005;2(8): e124.
1doi:10.1038/nature.2015.17433 2doi:10.1038/nrd3439-c1 3doi:10.1038/483531a
@micheldumontier::NETTAB:2018-10-2210
Published online 28 September 2011 | Nature 477, 526-528 (2011) | doi:10.1038/477526a
@micheldumontier::NETTAB:2018-10-2211
we need new ways to think about
discovery science
We need to improve
our confidence in any result
by using more data
and with support
from multiple lines of evidence
Grand Challenge:
Automatically
uncover evidence
that supports and
disputes a
hypothesis using the
totality of available
data, tools and
scientific knowledge
@micheldumontier::NETTAB:2018-10-2212
We must build a social, ethical and
technological infrastructure that
facilitates the discovery and reuse
of digital resources
for people and machines
@micheldumontier::NETTAB:2018-10-2213
Why machines?
• Can gather and make sense of vast amounts of information to
better understand the world and make more effective
decisions
@micheldumontier::NETTAB:2018-10-2214
Big Data
for Medicine
@micheldumontier::NETTAB:2018-10-2215
Multiple sources of heterogeneous
data, including experimental evidence,
bioinformatics databases, lifestyle
measurements, electronic health
records, environmental influences, and
biobank findings, can be combined
using machine learning algorithms to
identify causal disease networks,
stratify patients, and predict more
efficacious therapies.
Why machines?
• Can make sense of vast amounts of information to make
personalized, evidence-based decisions to maximize desired
outcomes
• Can create detailed workflows to enable transparency and
reproducibility
• Will be able to identify and minimize bias in research and in
real world applications in a robust and systematic manner
@micheldumontier::NETTAB:2018-10-2216
@micheldumontier::NETTAB:2018-10-2217
An international, bottom-up paradigm for
the discovery and reuse of digital content
by and for people and machines
@micheldumontier::NETTAB:2018-10-2218
• DATA FAIRPORT workshop aimed
to define a minimal (yet
comprehensive) framework for
data discoverability, access,
annotation and authoring
• FAIR acronym was created and
guiding principles drafted
• for comment on FORCE11 website
• Principles were refined during the
2015 BioHackathon in Japan
@micheldumontier::NETTAB:2018-10-2219
FAIR: History
http://www.nature.com/articles/sdata201618
@micheldumontier::NETTAB:2018-10-2220
FAIR: Impact
@micheldumontier::NETTAB:2018-10-2221
4 Principles (F,A,I,R) and 15 sub-principles.
http://www.nature.com/articles/sdata201618
FAIR Principles - summarized
Findable
• Globally unique, resolvable, and persistent identifiers
• Machine-readable descriptions to support structured search and
filtering
Accessible
• Metadata is accessible beyond the lifetime of the digital resource
• Clearly defined access and security protocols (FAIR != Open)
@micheldumontier::NETTAB:2018-10-2222
@micheldumontier::NETTAB:2018-10-2223
FAIR Principles - summarized
Findable
• Globally unique, resolvable, and persistent identifiers
• Machine-readable descriptions to support structured search and filtering
Accessible
• Metadata is accessible beyond the lifetime of the digital resource
• Clearly defined access and security protocols (FAIR != Open)
Interoperable
• Extensible machine interpretable formats for data + metadata
• Use vocabularies and link to other resources
Reusable
• Provide licensing, provenance, and meet community-standards
@micheldumontier::NETTAB:2018-10-2224
Improving the FAIRness of digital
resources will increase their quality and
their potential and ease for reuse.
@micheldumontier::NETTAB:2018-10-2225
Communities
must make clear their expectations
@micheldumontier::NETTAB:2018-10-2226
@micheldumontier::NETTAB:2018-10-2227
http://www.nature.com/articles/sdata201618
Oct 15 2018
Communities ARE discussing
what FAIR means to them
Extent of FAIRness may affect what resources people select
@micheldumontier::NETTAB:2018-10-2228
Measuring FAIRness
• A metric is a standard of measurement.
• It must provide clear definition of what is being measured,
why one wants to measure it.
• It must describe what a valid result is and how one obtains
it, so that it can be reproduced by others.
@micheldumontier::NETTAB:2018-10-2229
Qualities of a Good Metric
• Clear: anyone can understand the purpose of the metric
• Realistic: compliance should not be unduly complicated
• Objective: the assessment can be made in a quantitative,
machine-interpretable, scalable and reproducible manner
• Discriminating: the measure can distinguish between those
resources that meet the criteria and those that do not
• Universal: The metric should be applicable to all digital
resources
@micheldumontier::NETTAB:2018-10-2230
• 14 universal metrics covering each of the FAIR sub-principles. The metrics demand
evidence from the community, some of which may require specific new actions.
• Digital resource providers must provide a web-accessible document with machine-
readable metadata (FM-F2, FM-F3), detail identifier management (FM-F1B), metadata
longevity (FM-A2), and any additional authorization procedures (FM-A1.2).
• They must ensure the public registration of their identifier schemes (FM-F1A), (secure)
access protocols (FM-A1.1), knowledge representation languages (FM-I1), licenses
(FM-R1.1), provenance specifications (FM-R1.2), and community standards (FM-R1.3).
• They must provide evidence of ability to find the digital resource in search results (FM-
F4), linking to other resources (FM-I3), FAIRness of linked resources (FM-I2), and
meeting community standards (FM-R1.3)
@micheldumontier::NETTAB:2018-10-2231
@micheldumontier::NETTAB:2018-10-2232
http://www.w3.org/TR/hcls-dataset/
Evidence:
standard is
registered in
FAIRsharing
Compliance to the standard can be automatically
assessed
@micheldumontier::NETTAB:2018-10-2233
• http://hw-swel.github.io/Validata/
RDF constraint validation tool that is
configurable to any profile
Declarative reusable schema description
Shape Expression (ShEx) constraints
A first assessment using the metrics
• Used a simple form to ask for the information needed as input
to the FAIR metrics
• Questions either require one or more URL or true/false
@micheldumontier::NETTAB:2018-10-2234
@micheldumontier::NETTAB:2018-10-2235
@micheldumontier::NETTAB:2018-10-2236
@micheldumontier::NETTAB:2018-10-2237
http://fairshake.cloud
@micheldumontier::NETTAB:2018-10-2238
Automated FAIRness assessments
@micheldumontier::NETTAB:2018-10-2239
Automated assessments
are rather unforgiving, but also correct mistakes
@micheldumontier::NETTAB:2018-10-2240
@micheldumontier::NETTAB:2018-10-2241
@micheldumontier::NETTAB:2018-10-2242
@micheldumontier::NETTAB:2018-10-2243
Celia van Gelder (DTL/ELIXIR-NL)
@micheldumontier::NETTAB:2018-10-2244
@micheldumontier::NETTAB:2018-10-2245
H2020 EG: Turning FAIR Data into Reality -
Report and Action Plan Consultation
(Draft) Recommendations include:
• Sustainable funding for FAIR components (#5)
• Strategic and evidence-based funding (#6)
• Cross-disciplinary FAIRness (#8)
• Encourage and incentivize data reuse (#19)
• Facilitate automated processing (#25)
• Data science and stewardship skills (#26)
• Skills transfer schemes and brokering roles (#27)
• Curriculum frameworks and training (#28)
@micheldumontier::NETTAB:2018-10-2246
Hodson, Simon; Jones, Sarah; Collins, Sandra; Genova, Françoise; Harrower, Natalie; Laaksonen, Leif; Mietchen, Daniel; Petrauskaité, Rūta; Wittenburg, Peter
Are we FAIR yet?
• Early claims (including press releases) of being fully FAIR were
vastly premature
• FAIRness assessments can demonstrate standing, and some
aspects of FAIR are much easier to address than others.
• Much more work still needs to be done
– Compatible data and metadata standards across all disciplines (no more
data and metadata silos)
– FAIR by design, using common frameworks
– The development of the FAIR Internet of Data and Services (FIDS) and a
FAIR knowledge graph of available resources
– Automated discovery and workflow execution using FIDS
@micheldumontier::NETTAB:2018-10-2247
Will it be worth it?
FAIR addresses, in a concise manner, the basic requirements
associated with publishing and reusing digital resources.
– Lack of high quality meta(data) reduces usability
– Lack of detailed provenance contributes to irreproducibility
– Lack of clear licensing terms hinders innovation
FAIR is set to accelerate research and discovery and will have
worldwide social and economic impact
@micheldumontier::NETTAB:2018-10-2248
@micheldumontier::NETTAB:2018-10-2249
* I’m an advisor to OntoForce
* I wish I was an advisor to transcriptic
Summary
• FAIR represents a grassroots and global initiative to enhance
the discovery and reuse of all kinds of digital resources
• The FAIR ecosystem is maturing quickly, and GO-FAIR offers
communities the means to actively participate.
• FAIR demands a new social, ethical and technological
infrastructure that currently doesn’t exist in whole, but has to
be built for and tested by various communities!
• Huge benefits to be had, particularly in augmenting existing
research programs and in automated machine processing, but
needs to be coupled with the proper training and ethics.
@micheldumontier::NETTAB:2018-10-2250
Acknowledgements
@micheldumontier::NETTAB:2018-10-2251
FAIR FAIR metrics
Dumontier Lab (Maastricht University, Stanford University, Carleton University)
MU: Seun Adekunle, Remzi Celebi, Dorina Claessens, Ricardo De Miranda Azevedo, Pedro Hernandez Serrano, Massimiliano Grassi, Andine Havelange,
Lianne Ippel, Alexander Malic, Kody Moodley, Stuti Nayak, Nadine Rouleaux, Claudia van open, Chang Sun, Amrapali Zaveri
SU: Sandeep Ayyar, Remzi Celebi, Shima Dastgheib, Maulik Kamdar, David Odgers, Maryam Panahiazar, Amrapali Zaveri
CU: Alison Callahan, Jose Toledo-Cruz, Natalia Villaneuva-Rosales
michel.dumontier@maastrichtuniversity.nl
Website: http://maastrichtuniversity.nl/ids
52 @micheldumontier::NETTAB:2018-10-22
The mission of the Institute of Data Science at Maastricht University is to foster a
collaborative environment for multi-disciplinary data science research,
interdisciplinary training, and data-driven innovation .
We tackle key scientific, technical, social, legal, ethical issues that advance our
understanding and strengthen our communities in the face of these developments.

More Related Content

What's hot

Neo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life SciencesNeo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life Sciences
Neo4j
 
Reveal Hidden Patterns in Healthcare Data: Graph Analytics and the Opioid Crisis
Reveal Hidden Patterns in Healthcare Data: Graph Analytics and the Opioid CrisisReveal Hidden Patterns in Healthcare Data: Graph Analytics and the Opioid Crisis
Reveal Hidden Patterns in Healthcare Data: Graph Analytics and the Opioid Crisis
Neo4j
 
Big Data Analytics government healthcare
Big Data Analytics government healthcareBig Data Analytics government healthcare
Big Data Analytics government healthcare
Data Science Thailand
 
Identifying Drug Interaction Candidates in Real-World Data
Identifying Drug Interaction Candidates in Real-World DataIdentifying Drug Interaction Candidates in Real-World Data
Identifying Drug Interaction Candidates in Real-World Data
Neo4j
 
Pistoia Alliance conference April 2016: Big Data: Eric Little
Pistoia Alliance conference April 2016: Big Data: Eric LittlePistoia Alliance conference April 2016: Big Data: Eric Little
Pistoia Alliance conference April 2016: Big Data: Eric Little
Pistoia Alliance
 
Innovative project1
Innovative project1Innovative project1
Innovative project1
LillySheebaS1
 
Pistoia Alliance Debates: Moving Research Informatics into the Cloud: 25th Ma...
Pistoia Alliance Debates: Moving Research Informatics into the Cloud: 25th Ma...Pistoia Alliance Debates: Moving Research Informatics into the Cloud: 25th Ma...
Pistoia Alliance Debates: Moving Research Informatics into the Cloud: 25th Ma...
Pistoia Alliance
 
apidays LIVE Australia 2021 - APIs enable global collaborations and accelerat...
apidays LIVE Australia 2021 - APIs enable global collaborations and accelerat...apidays LIVE Australia 2021 - APIs enable global collaborations and accelerat...
apidays LIVE Australia 2021 - APIs enable global collaborations and accelerat...
apidays
 
The life sciences industry in 2018
The life sciences industry in 2018The life sciences industry in 2018
The life sciences industry in 2018
pi
 
Blockchain and Patient-Centered Outcomes Measures - Goldwater
Blockchain and Patient-Centered Outcomes Measures - GoldwaterBlockchain and Patient-Centered Outcomes Measures - Goldwater
Blockchain and Patient-Centered Outcomes Measures - Goldwater
Sean Manion PhD
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...
Michel Dumontier
 
Mattaliano and Gilmartin "7 Things Modern Researchers Want in a Search Tool"
Mattaliano and Gilmartin "7 Things Modern Researchers Want in a Search Tool"Mattaliano and Gilmartin "7 Things Modern Researchers Want in a Search Tool"
Mattaliano and Gilmartin "7 Things Modern Researchers Want in a Search Tool"
National Information Standards Organization (NISO)
 
Kaindl "Managing Information Overload"
Kaindl "Managing Information Overload"Kaindl "Managing Information Overload"
Kaindl "Managing Information Overload"
National Information Standards Organization (NISO)
 
Supporting the community-owned open scholarly communications ecosystem
Supporting the community-owned open scholarly communications ecosystemSupporting the community-owned open scholarly communications ecosystem
Supporting the community-owned open scholarly communications ecosystem
Jisc
 
Blockchain Healthcare Situation Report (BC/HC SITREP) Volume 2 Issue 19, 07 -...
Blockchain Healthcare Situation Report (BC/HC SITREP) Volume 2 Issue 19, 07 -...Blockchain Healthcare Situation Report (BC/HC SITREP) Volume 2 Issue 19, 07 -...
Blockchain Healthcare Situation Report (BC/HC SITREP) Volume 2 Issue 19, 07 -...
Sean Manion PhD
 
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
US-Ignite
 
Big data for healthcare analytics final -v0.3 miz
Big data for healthcare analytics   final -v0.3 mizBig data for healthcare analytics   final -v0.3 miz
Big data for healthcare analytics final -v0.3 miz
Yusuf Brima
 
Automating Data Curation with AI and NLP for Biomedical Graph Applications
Automating Data Curation with AI and NLP for Biomedical Graph ApplicationsAutomating Data Curation with AI and NLP for Biomedical Graph Applications
Automating Data Curation with AI and NLP for Biomedical Graph Applications
Neo4j
 
Darwin ai covid-net mitre
Darwin ai   covid-net mitreDarwin ai   covid-net mitre
Darwin ai covid-net mitre
ianmitch
 
How much is that data in the window : Healthcare data valuation
How much is that data in the window : Healthcare data valuationHow much is that data in the window : Healthcare data valuation
How much is that data in the window : Healthcare data valuation
Sean Manion PhD
 

What's hot (20)

Neo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life SciencesNeo4j for Healthcare & Life Sciences
Neo4j for Healthcare & Life Sciences
 
Reveal Hidden Patterns in Healthcare Data: Graph Analytics and the Opioid Crisis
Reveal Hidden Patterns in Healthcare Data: Graph Analytics and the Opioid CrisisReveal Hidden Patterns in Healthcare Data: Graph Analytics and the Opioid Crisis
Reveal Hidden Patterns in Healthcare Data: Graph Analytics and the Opioid Crisis
 
Big Data Analytics government healthcare
Big Data Analytics government healthcareBig Data Analytics government healthcare
Big Data Analytics government healthcare
 
Identifying Drug Interaction Candidates in Real-World Data
Identifying Drug Interaction Candidates in Real-World DataIdentifying Drug Interaction Candidates in Real-World Data
Identifying Drug Interaction Candidates in Real-World Data
 
Pistoia Alliance conference April 2016: Big Data: Eric Little
Pistoia Alliance conference April 2016: Big Data: Eric LittlePistoia Alliance conference April 2016: Big Data: Eric Little
Pistoia Alliance conference April 2016: Big Data: Eric Little
 
Innovative project1
Innovative project1Innovative project1
Innovative project1
 
Pistoia Alliance Debates: Moving Research Informatics into the Cloud: 25th Ma...
Pistoia Alliance Debates: Moving Research Informatics into the Cloud: 25th Ma...Pistoia Alliance Debates: Moving Research Informatics into the Cloud: 25th Ma...
Pistoia Alliance Debates: Moving Research Informatics into the Cloud: 25th Ma...
 
apidays LIVE Australia 2021 - APIs enable global collaborations and accelerat...
apidays LIVE Australia 2021 - APIs enable global collaborations and accelerat...apidays LIVE Australia 2021 - APIs enable global collaborations and accelerat...
apidays LIVE Australia 2021 - APIs enable global collaborations and accelerat...
 
The life sciences industry in 2018
The life sciences industry in 2018The life sciences industry in 2018
The life sciences industry in 2018
 
Blockchain and Patient-Centered Outcomes Measures - Goldwater
Blockchain and Patient-Centered Outcomes Measures - GoldwaterBlockchain and Patient-Centered Outcomes Measures - Goldwater
Blockchain and Patient-Centered Outcomes Measures - Goldwater
 
Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...Acclerating biomedical discovery with an internet of FAIR data and services -...
Acclerating biomedical discovery with an internet of FAIR data and services -...
 
Mattaliano and Gilmartin "7 Things Modern Researchers Want in a Search Tool"
Mattaliano and Gilmartin "7 Things Modern Researchers Want in a Search Tool"Mattaliano and Gilmartin "7 Things Modern Researchers Want in a Search Tool"
Mattaliano and Gilmartin "7 Things Modern Researchers Want in a Search Tool"
 
Kaindl "Managing Information Overload"
Kaindl "Managing Information Overload"Kaindl "Managing Information Overload"
Kaindl "Managing Information Overload"
 
Supporting the community-owned open scholarly communications ecosystem
Supporting the community-owned open scholarly communications ecosystemSupporting the community-owned open scholarly communications ecosystem
Supporting the community-owned open scholarly communications ecosystem
 
Blockchain Healthcare Situation Report (BC/HC SITREP) Volume 2 Issue 19, 07 -...
Blockchain Healthcare Situation Report (BC/HC SITREP) Volume 2 Issue 19, 07 -...Blockchain Healthcare Situation Report (BC/HC SITREP) Volume 2 Issue 19, 07 -...
Blockchain Healthcare Situation Report (BC/HC SITREP) Volume 2 Issue 19, 07 -...
 
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
Prototype SDX Bioinformatics Exchange: Demonstrating an Essential Use-Case fo...
 
Big data for healthcare analytics final -v0.3 miz
Big data for healthcare analytics   final -v0.3 mizBig data for healthcare analytics   final -v0.3 miz
Big data for healthcare analytics final -v0.3 miz
 
Automating Data Curation with AI and NLP for Biomedical Graph Applications
Automating Data Curation with AI and NLP for Biomedical Graph ApplicationsAutomating Data Curation with AI and NLP for Biomedical Graph Applications
Automating Data Curation with AI and NLP for Biomedical Graph Applications
 
Darwin ai covid-net mitre
Darwin ai   covid-net mitreDarwin ai   covid-net mitre
Darwin ai covid-net mitre
 
How much is that data in the window : Healthcare data valuation
How much is that data in the window : Healthcare data valuationHow much is that data in the window : Healthcare data valuation
How much is that data in the window : Healthcare data valuation
 

Similar to Are we FAIR yet? And will it be worth it?

CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
Michel Dumontier
 
Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...
Platform Linked Data Netherlands (PLDN)
 
Analyzing Social media’s real data detection through Web content mining using...
Analyzing Social media’s real data detection through Web content mining using...Analyzing Social media’s real data detection through Web content mining using...
Analyzing Social media’s real data detection through Web content mining using...
IRJET Journal
 
Internet of Things (IoT) Expert Session Webinar
Internet of Things (IoT) Expert Session WebinarInternet of Things (IoT) Expert Session Webinar
Internet of Things (IoT) Expert Session Webinar
ibi
 
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty ComputationISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
UlfMattsson7
 
e-Marketing Research
e-Marketing Researche-Marketing Research
e-Marketing Research
Usman Tariq
 
Blockchain for industry 4.0 HMI 2018
Blockchain for industry 4.0 HMI 2018Blockchain for industry 4.0 HMI 2018
Blockchain for industry 4.0 HMI 2018
Mark Mueller-Eberstein
 
Data-Driven Innovation & Competitive Advantage
Data-Driven Innovation & Competitive AdvantageData-Driven Innovation & Competitive Advantage
Data-Driven Innovation & Competitive Advantage
Martin De Saulles
 
Blockchain for Marketing & Insights
Blockchain for Marketing & InsightsBlockchain for Marketing & Insights
Blockchain for Marketing & Insights
Rolfe William Swinton
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET Journal
 
New technologies for data protection
New technologies for data protectionNew technologies for data protection
New technologies for data protection
Ulf Mattsson
 
Community of practice on socio-economic data
Community of practice on socio-economic dataCommunity of practice on socio-economic data
Community of practice on socio-economic data
IFPRI-PIM
 
Community of practice on socio-economic data
Community of practice on socio-economic dataCommunity of practice on socio-economic data
Community of practice on socio-economic data
CGIAR
 
Technology Vision 2020: The Analytics Angle with SAS
Technology Vision 2020: The Analytics Angle with SASTechnology Vision 2020: The Analytics Angle with SAS
Technology Vision 2020: The Analytics Angle with SAS
accenture
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UK
Ulf Mattsson
 
How Big Data Shaping The Supply Chain
How Big Data Shaping The Supply ChainHow Big Data Shaping The Supply Chain
How Big Data Shaping The Supply Chain
Hafizullah Mohd Amin
 
-Enrichment - Unlocking the value of data for digital transformation - Big Da...
-Enrichment - Unlocking the value of data for digital transformation - Big Da...-Enrichment - Unlocking the value of data for digital transformation - Big Da...
-Enrichment - Unlocking the value of data for digital transformation - Big Da...
webwinkelvakdag
 
D2 d turning information into a competive asset - 23 jan 2014
D2 d   turning information into a competive asset - 23 jan 2014D2 d   turning information into a competive asset - 23 jan 2014
D2 d turning information into a competive asset - 23 jan 2014
Henk van Roekel
 
Connected barrels_IoT in Oil and Gas_deloitte
Connected barrels_IoT in Oil and Gas_deloitteConnected barrels_IoT in Oil and Gas_deloitte
Connected barrels_IoT in Oil and Gas_deloitte
Anshu Mittal
 
Connected barrels io t in og_deloitte
Connected barrels io t in og_deloitteConnected barrels io t in og_deloitte
Connected barrels io t in og_deloitte
Anshu Mittal
 

Similar to Are we FAIR yet? And will it be worth it? (20)

CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
CIKM2020 Keynote: Accelerating discovery science with an Internet of FAIR dat...
 
Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...Accelerating biomedical discovery with an Internet of FAIR data and services ...
Accelerating biomedical discovery with an Internet of FAIR data and services ...
 
Analyzing Social media’s real data detection through Web content mining using...
Analyzing Social media’s real data detection through Web content mining using...Analyzing Social media’s real data detection through Web content mining using...
Analyzing Social media’s real data detection through Web content mining using...
 
Internet of Things (IoT) Expert Session Webinar
Internet of Things (IoT) Expert Session WebinarInternet of Things (IoT) Expert Session Webinar
Internet of Things (IoT) Expert Session Webinar
 
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty ComputationISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
ISC2 Privacy-Preserving Analytics and Secure Multiparty Computation
 
e-Marketing Research
e-Marketing Researche-Marketing Research
e-Marketing Research
 
Blockchain for industry 4.0 HMI 2018
Blockchain for industry 4.0 HMI 2018Blockchain for industry 4.0 HMI 2018
Blockchain for industry 4.0 HMI 2018
 
Data-Driven Innovation & Competitive Advantage
Data-Driven Innovation & Competitive AdvantageData-Driven Innovation & Competitive Advantage
Data-Driven Innovation & Competitive Advantage
 
Blockchain for Marketing & Insights
Blockchain for Marketing & InsightsBlockchain for Marketing & Insights
Blockchain for Marketing & Insights
 
IRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial DomainIRJET- Scope of Big Data Analytics in Industrial Domain
IRJET- Scope of Big Data Analytics in Industrial Domain
 
New technologies for data protection
New technologies for data protectionNew technologies for data protection
New technologies for data protection
 
Community of practice on socio-economic data
Community of practice on socio-economic dataCommunity of practice on socio-economic data
Community of practice on socio-economic data
 
Community of practice on socio-economic data
Community of practice on socio-economic dataCommunity of practice on socio-economic data
Community of practice on socio-economic data
 
Technology Vision 2020: The Analytics Angle with SAS
Technology Vision 2020: The Analytics Angle with SASTechnology Vision 2020: The Analytics Angle with SAS
Technology Vision 2020: The Analytics Angle with SAS
 
Protecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UKProtecting data privacy in analytics and machine learning ISACA London UK
Protecting data privacy in analytics and machine learning ISACA London UK
 
How Big Data Shaping The Supply Chain
How Big Data Shaping The Supply ChainHow Big Data Shaping The Supply Chain
How Big Data Shaping The Supply Chain
 
-Enrichment - Unlocking the value of data for digital transformation - Big Da...
-Enrichment - Unlocking the value of data for digital transformation - Big Da...-Enrichment - Unlocking the value of data for digital transformation - Big Da...
-Enrichment - Unlocking the value of data for digital transformation - Big Da...
 
D2 d turning information into a competive asset - 23 jan 2014
D2 d   turning information into a competive asset - 23 jan 2014D2 d   turning information into a competive asset - 23 jan 2014
D2 d turning information into a competive asset - 23 jan 2014
 
Connected barrels_IoT in Oil and Gas_deloitte
Connected barrels_IoT in Oil and Gas_deloitteConnected barrels_IoT in Oil and Gas_deloitte
Connected barrels_IoT in Oil and Gas_deloitte
 
Connected barrels io t in og_deloitte
Connected barrels io t in og_deloitteConnected barrels io t in og_deloitte
Connected barrels io t in og_deloitte
 

More from Michel Dumontier

FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
Michel Dumontier
 
A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
Michel Dumontier
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
Michel Dumontier
 
Evaluating FAIRness
Evaluating FAIRnessEvaluating FAIRness
Evaluating FAIRness
Michel Dumontier
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resources
Michel Dumontier
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
Michel Dumontier
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
Michel Dumontier
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
Michel Dumontier
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
Michel Dumontier
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
Michel Dumontier
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
Michel Dumontier
 
Ontologies
OntologiesOntologies
Ontologies
Michel Dumontier
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
Michel Dumontier
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
Michel Dumontier
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
Michel Dumontier
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Michel Dumontier
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
Michel Dumontier
 
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMaking the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Michel Dumontier
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description Guidelines
Michel Dumontier
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Michel Dumontier
 

More from Michel Dumontier (20)

FAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable PredictionsFAIR & AI Ready KGs for Explainable Predictions
FAIR & AI Ready KGs for Explainable Predictions
 
A metadata standard for Knowledge Graphs
A metadata standard for Knowledge GraphsA metadata standard for Knowledge Graphs
A metadata standard for Knowledge Graphs
 
Data-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge GraphsData-Driven Discovery Science with FAIR Knowledge Graphs
Data-Driven Discovery Science with FAIR Knowledge Graphs
 
Evaluating FAIRness
Evaluating FAIRnessEvaluating FAIRness
Evaluating FAIRness
 
Developing and assessing FAIR digital resources
Developing and assessing FAIR digital resourcesDeveloping and assessing FAIR digital resources
Developing and assessing FAIR digital resources
 
Advancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIRAdvancing Biomedical Knowledge Reuse with FAIR
Advancing Biomedical Knowledge Reuse with FAIR
 
A Framework to develop the FAIR Metrics
A Framework to develop the FAIR MetricsA Framework to develop the FAIR Metrics
A Framework to develop the FAIR Metrics
 
FAIR principles and metrics for evaluation
FAIR principles and metrics for evaluationFAIR principles and metrics for evaluation
FAIR principles and metrics for evaluation
 
Towards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRnessTowards metrics to assess and encourage FAIRness
Towards metrics to assess and encourage FAIRness
 
Data Science for the Win
Data Science for the WinData Science for the Win
Data Science for the Win
 
2016 bmdid-mappings
2016 bmdid-mappings2016 bmdid-mappings
2016 bmdid-mappings
 
Ontologies
OntologiesOntologies
Ontologies
 
Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...Building a Network of Interoperable and Independently Produced Linked and Ope...
Building a Network of Interoperable and Independently Produced Linked and Ope...
 
Model Organism Linked Data
Model Organism Linked DataModel Organism Linked Data
Model Organism Linked Data
 
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
2016 ACS Semantic Approaches for Biochemical Knowledge Discovery
 
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental MetadataMaking it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
Making it Easier, Possibly Even Pleasant, to Author Rich Experimental Metadata
 
Link Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked DataLink Analysis of Life Sciences Linked Data
Link Analysis of Life Sciences Linked Data
 
Making the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discoveryMaking the most of phenotypes in ontology-based biomedical knowledge discovery
Making the most of phenotypes in ontology-based biomedical knowledge discovery
 
W3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description GuidelinesW3C HCLS Dataset Description Guidelines
W3C HCLS Dataset Description Guidelines
 
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
Semantic approaches for biomedical knowledge discovery - Discovery Science 20...
 

Recently uploaded

Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
RenuJangid3
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
anitaento25
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
muralinath2
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
AADYARAJPANDEY1
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
YOGESH DOGRA
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
kumarmathi863
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
IvanMallco1
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
Sérgio Sacani
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
Health Advances
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Sérgio Sacani
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
Areesha Ahmad
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
muralinath2
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
sonaliswain16
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
muralinath2
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
muralinath2
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
IqrimaNabilatulhusni
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
SAMIR PANDA
 

Recently uploaded (20)

Leaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdfLeaf Initiation, Growth and Differentiation.pdf
Leaf Initiation, Growth and Differentiation.pdf
 
insect taxonomy importance systematics and classification
insect taxonomy importance systematics and classificationinsect taxonomy importance systematics and classification
insect taxonomy importance systematics and classification
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
Hemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptxHemostasis_importance& clinical significance.pptx
Hemostasis_importance& clinical significance.pptx
 
Cancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate PathwayCancer cell metabolism: special Reference to Lactate Pathway
Cancer cell metabolism: special Reference to Lactate Pathway
 
Mammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also FunctionsMammalian Pineal Body Structure and Also Functions
Mammalian Pineal Body Structure and Also Functions
 
Structures and textures of metamorphic rocks
Structures and textures of metamorphic rocksStructures and textures of metamorphic rocks
Structures and textures of metamorphic rocks
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
filosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptxfilosofia boliviana introducción jsjdjd.pptx
filosofia boliviana introducción jsjdjd.pptx
 
Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...Multi-source connectivity as the driver of solar wind variability in the heli...
Multi-source connectivity as the driver of solar wind variability in the heli...
 
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...The ASGCT Annual Meeting was packed with exciting progress in the field advan...
The ASGCT Annual Meeting was packed with exciting progress in the field advan...
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
Observation of Io’s Resurfacing via Plume Deposition Using Ground-based Adapt...
 
GBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture MediaGBSN - Microbiology (Lab 4) Culture Media
GBSN - Microbiology (Lab 4) Culture Media
 
ESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptxESR_factors_affect-clinic significance-Pathysiology.pptx
ESR_factors_affect-clinic significance-Pathysiology.pptx
 
role of pramana in research.pptx in science
role of pramana in research.pptx in sciencerole of pramana in research.pptx in science
role of pramana in research.pptx in science
 
erythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptxerythropoiesis-I_mechanism& clinical significance.pptx
erythropoiesis-I_mechanism& clinical significance.pptx
 
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
Circulatory system_ Laplace law. Ohms law.reynaults law,baro-chemo-receptors-...
 
general properties of oerganologametal.ppt
general properties of oerganologametal.pptgeneral properties of oerganologametal.ppt
general properties of oerganologametal.ppt
 
Seminar of U.V. Spectroscopy by SAMIR PANDA
 Seminar of U.V. Spectroscopy by SAMIR PANDA Seminar of U.V. Spectroscopy by SAMIR PANDA
Seminar of U.V. Spectroscopy by SAMIR PANDA
 

Are we FAIR yet? And will it be worth it?

  • 1. Are we FAIR yet? And will it be worth it? @micheldumontier::NETTAB:2018-10-221 Michel Dumontier, Ph.D. Distinguished Professor of Data Science Director, Institute of Data Science
  • 2. An increasing number of discoveries are made using other people’s data @micheldumontier::NETTAB:2018-10-222
  • 3. 3 A common rejection module (CRM) for acute rejection across multiple organs identifies novel therapeutics for organ transplantation Khatri et al. JEM. 210 (11): 2205 DOI: 10.1084/jem.20122709 @micheldumontier::NETTAB:2018-10-22 Main Findings: 1. CRM genes correlated with the extent of graft injury and predicted future injury to a graft 2. Mice treated with drugs against the CRM genes extended graft survival
  • 4. However, significant effort was needed to find the right datasets, make sense of them, and ultimately use them for a new purpose @micheldumontier::NETTAB:2018-10-224
  • 6. If we are ever to realize the full potential of content we create then we must find ways to reduce the barrier to publish digital content in a way that makes it vastly easier to find, assess and reuse @micheldumontier::NETTAB:2018-10-226
  • 7. @micheldumontier::NETTAB:2018-10-227 Lambin et al. Radiother Oncol. 2013. 109(1):159-64. doi: 10.1016/j.radonc.2013.07.007
  • 8. Why does this matter? @micheldumontier::NETTAB:2018-10-228
  • 9. 9 @micheldumontier::NETTAB:2018-10-22 Most published research findings are false. - John Ioannidis, Stanford University Reproducibility of landmark studies is shockingly low: 39% (39/100) in psychology1 21% (14/67) in pharmacology2 11% (6/53) in cancer3 PLoS Med 2005;2(8): e124. 1doi:10.1038/nature.2015.17433 2doi:10.1038/nrd3439-c1 3doi:10.1038/483531a
  • 10. @micheldumontier::NETTAB:2018-10-2210 Published online 28 September 2011 | Nature 477, 526-528 (2011) | doi:10.1038/477526a
  • 11. @micheldumontier::NETTAB:2018-10-2211 we need new ways to think about discovery science We need to improve our confidence in any result by using more data and with support from multiple lines of evidence
  • 12. Grand Challenge: Automatically uncover evidence that supports and disputes a hypothesis using the totality of available data, tools and scientific knowledge @micheldumontier::NETTAB:2018-10-2212
  • 13. We must build a social, ethical and technological infrastructure that facilitates the discovery and reuse of digital resources for people and machines @micheldumontier::NETTAB:2018-10-2213
  • 14. Why machines? • Can gather and make sense of vast amounts of information to better understand the world and make more effective decisions @micheldumontier::NETTAB:2018-10-2214
  • 15. Big Data for Medicine @micheldumontier::NETTAB:2018-10-2215 Multiple sources of heterogeneous data, including experimental evidence, bioinformatics databases, lifestyle measurements, electronic health records, environmental influences, and biobank findings, can be combined using machine learning algorithms to identify causal disease networks, stratify patients, and predict more efficacious therapies.
  • 16. Why machines? • Can make sense of vast amounts of information to make personalized, evidence-based decisions to maximize desired outcomes • Can create detailed workflows to enable transparency and reproducibility • Will be able to identify and minimize bias in research and in real world applications in a robust and systematic manner @micheldumontier::NETTAB:2018-10-2216
  • 18. An international, bottom-up paradigm for the discovery and reuse of digital content by and for people and machines @micheldumontier::NETTAB:2018-10-2218
  • 19. • DATA FAIRPORT workshop aimed to define a minimal (yet comprehensive) framework for data discoverability, access, annotation and authoring • FAIR acronym was created and guiding principles drafted • for comment on FORCE11 website • Principles were refined during the 2015 BioHackathon in Japan @micheldumontier::NETTAB:2018-10-2219 FAIR: History http://www.nature.com/articles/sdata201618
  • 21. @micheldumontier::NETTAB:2018-10-2221 4 Principles (F,A,I,R) and 15 sub-principles. http://www.nature.com/articles/sdata201618
  • 22. FAIR Principles - summarized Findable • Globally unique, resolvable, and persistent identifiers • Machine-readable descriptions to support structured search and filtering Accessible • Metadata is accessible beyond the lifetime of the digital resource • Clearly defined access and security protocols (FAIR != Open) @micheldumontier::NETTAB:2018-10-2222
  • 24. FAIR Principles - summarized Findable • Globally unique, resolvable, and persistent identifiers • Machine-readable descriptions to support structured search and filtering Accessible • Metadata is accessible beyond the lifetime of the digital resource • Clearly defined access and security protocols (FAIR != Open) Interoperable • Extensible machine interpretable formats for data + metadata • Use vocabularies and link to other resources Reusable • Provide licensing, provenance, and meet community-standards @micheldumontier::NETTAB:2018-10-2224
  • 25. Improving the FAIRness of digital resources will increase their quality and their potential and ease for reuse. @micheldumontier::NETTAB:2018-10-2225
  • 26. Communities must make clear their expectations @micheldumontier::NETTAB:2018-10-2226
  • 28. Extent of FAIRness may affect what resources people select @micheldumontier::NETTAB:2018-10-2228
  • 29. Measuring FAIRness • A metric is a standard of measurement. • It must provide clear definition of what is being measured, why one wants to measure it. • It must describe what a valid result is and how one obtains it, so that it can be reproduced by others. @micheldumontier::NETTAB:2018-10-2229
  • 30. Qualities of a Good Metric • Clear: anyone can understand the purpose of the metric • Realistic: compliance should not be unduly complicated • Objective: the assessment can be made in a quantitative, machine-interpretable, scalable and reproducible manner • Discriminating: the measure can distinguish between those resources that meet the criteria and those that do not • Universal: The metric should be applicable to all digital resources @micheldumontier::NETTAB:2018-10-2230
  • 31. • 14 universal metrics covering each of the FAIR sub-principles. The metrics demand evidence from the community, some of which may require specific new actions. • Digital resource providers must provide a web-accessible document with machine- readable metadata (FM-F2, FM-F3), detail identifier management (FM-F1B), metadata longevity (FM-A2), and any additional authorization procedures (FM-A1.2). • They must ensure the public registration of their identifier schemes (FM-F1A), (secure) access protocols (FM-A1.1), knowledge representation languages (FM-I1), licenses (FM-R1.1), provenance specifications (FM-R1.2), and community standards (FM-R1.3). • They must provide evidence of ability to find the digital resource in search results (FM- F4), linking to other resources (FM-I3), FAIRness of linked resources (FM-I2), and meeting community standards (FM-R1.3) @micheldumontier::NETTAB:2018-10-2231
  • 33. Compliance to the standard can be automatically assessed @micheldumontier::NETTAB:2018-10-2233 • http://hw-swel.github.io/Validata/ RDF constraint validation tool that is configurable to any profile Declarative reusable schema description Shape Expression (ShEx) constraints
  • 34. A first assessment using the metrics • Used a simple form to ask for the information needed as input to the FAIR metrics • Questions either require one or more URL or true/false @micheldumontier::NETTAB:2018-10-2234
  • 40. Automated assessments are rather unforgiving, but also correct mistakes @micheldumontier::NETTAB:2018-10-2240
  • 46. H2020 EG: Turning FAIR Data into Reality - Report and Action Plan Consultation (Draft) Recommendations include: • Sustainable funding for FAIR components (#5) • Strategic and evidence-based funding (#6) • Cross-disciplinary FAIRness (#8) • Encourage and incentivize data reuse (#19) • Facilitate automated processing (#25) • Data science and stewardship skills (#26) • Skills transfer schemes and brokering roles (#27) • Curriculum frameworks and training (#28) @micheldumontier::NETTAB:2018-10-2246 Hodson, Simon; Jones, Sarah; Collins, Sandra; Genova, Françoise; Harrower, Natalie; Laaksonen, Leif; Mietchen, Daniel; Petrauskaité, Rūta; Wittenburg, Peter
  • 47. Are we FAIR yet? • Early claims (including press releases) of being fully FAIR were vastly premature • FAIRness assessments can demonstrate standing, and some aspects of FAIR are much easier to address than others. • Much more work still needs to be done – Compatible data and metadata standards across all disciplines (no more data and metadata silos) – FAIR by design, using common frameworks – The development of the FAIR Internet of Data and Services (FIDS) and a FAIR knowledge graph of available resources – Automated discovery and workflow execution using FIDS @micheldumontier::NETTAB:2018-10-2247
  • 48. Will it be worth it? FAIR addresses, in a concise manner, the basic requirements associated with publishing and reusing digital resources. – Lack of high quality meta(data) reduces usability – Lack of detailed provenance contributes to irreproducibility – Lack of clear licensing terms hinders innovation FAIR is set to accelerate research and discovery and will have worldwide social and economic impact @micheldumontier::NETTAB:2018-10-2248
  • 49. @micheldumontier::NETTAB:2018-10-2249 * I’m an advisor to OntoForce * I wish I was an advisor to transcriptic
  • 50. Summary • FAIR represents a grassroots and global initiative to enhance the discovery and reuse of all kinds of digital resources • The FAIR ecosystem is maturing quickly, and GO-FAIR offers communities the means to actively participate. • FAIR demands a new social, ethical and technological infrastructure that currently doesn’t exist in whole, but has to be built for and tested by various communities! • Huge benefits to be had, particularly in augmenting existing research programs and in automated machine processing, but needs to be coupled with the proper training and ethics. @micheldumontier::NETTAB:2018-10-2250
  • 51. Acknowledgements @micheldumontier::NETTAB:2018-10-2251 FAIR FAIR metrics Dumontier Lab (Maastricht University, Stanford University, Carleton University) MU: Seun Adekunle, Remzi Celebi, Dorina Claessens, Ricardo De Miranda Azevedo, Pedro Hernandez Serrano, Massimiliano Grassi, Andine Havelange, Lianne Ippel, Alexander Malic, Kody Moodley, Stuti Nayak, Nadine Rouleaux, Claudia van open, Chang Sun, Amrapali Zaveri SU: Sandeep Ayyar, Remzi Celebi, Shima Dastgheib, Maulik Kamdar, David Odgers, Maryam Panahiazar, Amrapali Zaveri CU: Alison Callahan, Jose Toledo-Cruz, Natalia Villaneuva-Rosales
  • 52. michel.dumontier@maastrichtuniversity.nl Website: http://maastrichtuniversity.nl/ids 52 @micheldumontier::NETTAB:2018-10-22 The mission of the Institute of Data Science at Maastricht University is to foster a collaborative environment for multi-disciplinary data science research, interdisciplinary training, and data-driven innovation . We tackle key scientific, technical, social, legal, ethical issues that advance our understanding and strengthen our communities in the face of these developments.

Editor's Notes

  1. Abstract Using meta-analysis of eight independent transplant datasets (236 graft biopsy samples) from four organs, we identified a common rejection module (CRM) consisting of 11 genes that were significantly overexpressed in acute rejection (AR) across all transplanted organs. The CRM genes could diagnose AR with high specificity and sensitivity in three additional independent cohorts (794 samples). In another two independent cohorts (151 renal transplant biopsies), the CRM genes correlated with the extent of graft injury and predicted future injury to a graft using protocol biopsies. Inferred drug mechanisms from the literature suggested that two FDA-approved drugs (atorvastatin and dasatinib), approved for nontransplant indications, could regulate specific CRM genes and reduce the number of graft-infiltrating cells during AR. We treated mice with HLA-mismatched mouse cardiac transplant with atorvastatin and dasatinib and showed reduction of the CRM genes, significant reduction of graft-infiltrating cells, and extended graft survival. We further validated the beneficial effect of atorvastatin on graft survival by retrospective analysis of electronic medical records of a single-center cohort of 2,515 renal transplant patients followed for up to 22 yr. In conclusion, we identified a CRM in transplantation that provides new opportunities for diagnosis, drug repositioning, and rational drug design.
  2. G20: http://europa.eu/rapid/press-release_STATEMENT-16-2967_en.htm EOSC: https://ec.europa.eu/research/openscience/pdf/realising_the_european_open_science_cloud_2016.pdf H2020: https://goo.gl/Strjua
  3. G20: http://europa.eu/rapid/press-release_STATEMENT-16-2967_en.htm EOSC: https://ec.europa.eu/research/openscience/pdf/realising_the_european_open_science_cloud_2016.pdf H2020: https://goo.gl/Strjua
  4. https://www.gov.uk/government/publications/g8-science-ministers-statement-london-12-june-2013