SlideShare a Scribd company logo
1 of 55
Download to read offline
November 29, 2018
Harmonizing FAIR Data Catalog Approaches
A Pistoia Alliance Debates Webinar
Moderated by Martin Romacker – Roche
Panelists Tom Plasterer - AstraZeneca
Eric Little - OSTHUS
Kees van Bochove The Hyve
Rafael Jimenez - Elixir
This webinar is being recorded
Poll Question 1: How familiar are you with FAIR principles and
metrics?
A. Unfamiliar with FAIR principles
B. Some familiarity with FAIR principles
C. Familiar with FAIR principles
D. Some expertise with FAIR principles and metrics
E. Highly expert with FAIR principles and metrics
Poll Question 2: What is the maturity level of your organization
with respect to implementation of FAIR?
A. Don’t know
B. No plans or interest in FAIR implementation
B. Thinking about FAIR implementation
C. Several projects have FAIR implementation
E. Systematic FAIR implementation across whole organization
©PistoiaAlliance
Phuse and the Regulated Cloud 5
Eric Little
Chiref Data Officer
OSTHUS
Martin Romacker
Principal Scientist
Roche
Tom Plasterer
US Cross-Science Director,
R&D Information
AstraZeneca
Rafael Jimenez
Chief Data Architect
ELIXIR
Kees van Bochove
CEO
The Hyve
Harmonizing FAIR Data Catalog Approaches –
Introduction
Martin Romacker
Data and Information Architect
Pharma Research and Early Development Informatics (pREDi)
Roche Innovation Center Basel
Pistoia Alliance Webinar, 29th November 2018
Data Catalogue & FAIR Principles
High Quality Data Driving Insights
• Digitial Transformation leading to a data-driven Industry
• Data are more and more perceived as an asset
Doing now what patients need next
Findability and FAIR Data Catalogs
Tom Plasterer, PhD
Science & Enabling Units IT (S&EUIT), Semantic Technology Lead
29 Nov 2018
Pistoia/Linked Data Community of Practice Webinar Series
Eric Little, PhD
Chief Data Officer, Osthus
10
Starting Point: Questions and Iterative Data Model
core:Study
core:Project
core:Target
core:Subject
core:Drug
core:Indication core:TherapeuticArea
core:BiologicalSample
core:Measurement core:Technologycore:Visit
bdm:Cohort
core:hasSubject
core:hasProject
core:hasDrug
core:hasIndication
bdm:hasArm
bdm:participatesIn
core:hasTA
core:hasTarget
core:hasMeasurement
core:hasSample
core:hasVisit
core:measuredBy
How do you find
datasets
described with
this model when
you have
hundreds or
thousands?
11
Dataset Catalog is a collection of Dataset Records
• Catalogs are needed to supporting FAIR (Findable) data
• Catalogs can and should support Enterprise MDM strategies
• Consumers can be internal or external
Dataset Catalogs are needed so data consumers can find Datasets
• Dataset records need sufficient metadata to support discoverability
• Dataset terms are NOT the data instance
Dataset Catalogs surface dataset provenance and enable data access
Dataset Catalogs can provide datasets for multiple consumption patters
• Analytics readiness and fit
• ‘Walking’ across information models
Dataset Catalogs: Findability Starts Here
12
Best Practices:
Data on the Web, Vocabulary of Interlinked Datasets
Dataset Descriptions for the Open Pharmacological Space
http://www.openphacts.org/specs/2012/WD-datadesc-20121019/
Data on the Web Best Practices
https://www.w3.org/TR/dwbp/
13
Dataset Catalogs: Find me Datasets about:
Projects
Study
Indication/
Disease
Technology
Targets
Cohort DatesAgent
Therapeutic
Area
Drugs
14
Dublin-Core-Type (DCT): Dataset
• A dataset is information encoded in a defined structure (for example,
lists, tables, and databases), intended to be useful for direct
machine processing
Data-Catalog (DCAT): Dataset
• A collection of data, published or curated by a single source, and
available for access or download in one or more formats
Vocabulary of Interlinked Dataset (VoID): Dataset
• A set of RDF triples that are published, maintained or aggregated by
a single provider.
Data and Datasets…
dct:Dataset
dcat:Dataset
void:Dataset
rdfs:subClassOf
rdfs:subClassOf
15
The Backbone: A DCAT conformant Data Catalog
https://www.w3.org/TR/hcls-dataset/
https://www.w3.org/TR/vocab-dcat/#vocabulary-overview
Semantic tagging of datasets with
concepts from taxonomies:
• provides context
• multi-dimensional & flexible
• effective for discoverability
• light-weight semantics
skos:Concept
dcat:Catalog skos:ConceptScheme
dctypes:Dataset (summary)
dct:title
dct:publisher <foaf:Agent>
foaf:page
void:sparqlEndpoint
dct:accrualPeriodicity
dcat:keyword
dcat:dataset
dcat:theme
dctypes:Dataset (version)
dcat:Distribution
(dctypes:Dataset)
void:vocabulary
dct:conformsTo
void:exampleResource
…other void properties
dcat:distribution
dcat:themeTaxonomy
dct:isVersionOf
pav:previousVersion
dct:hasPart
pav:hasCurrentVersion
dct:hasPart
dct:title
dct:publisher <foaf:Agent>
pav:version
dct:creator <foaf:Agent>
dct:created
dct:source
dct:creator <foaf:Agent>
dct:license
dct:format
pav:retrievedFrom
dct:created
pav:createdWith
dcat:accessURL
dcat:downloadURL
void:Dataset
dct:title
dctDescription
dct:publisher <foaf:Agent>
16
Metadata Model Stack for the AZ Data Catalog
DCAT
VoID
DCTerms
RDF/S, OWL, SKOS/SKOS-XL
AZ TaxonomiesPAV
AZ DataCatalog
ontology and instances for catalogs, datasets, distributions
(could be further modularized later)
DCMI
bdm-tech
core bdm
internal external
uniprot
umls
sio
chembl…
W3C and
Metadata
Standards
Reference
Master Data
Data Discoverability: Multi-phase Filtering
Data Catalog Filter
Phase 1
Experiment Metadata Filter
Phase 2
Ad hoc Analyses Filtering
Phase 3
Outbound
to Data Analytics
Data Science
Tools
Statistical
Filtering
e.g., clinical trial with > 50
participants
Dataset
Catalog
Descriptions
18
Creating a Dataset Record
19
Validation and SHACL
Az:ANYDataset
az:BDMDataset
dct:Dataset
rdfs:subClassOf
az:BDMDataset (Node Shape)
sh:targetClass az:BDMDataset
sh:and
dctypes:dataset (Node Shape)
sh:targetClass dctypes:Dataset
sh:property
az:BDMDatasetExtension (Shape)
sh:property
dct:title (Propery Shape)
sh:path dct:title
sh:datatype xsd:string
sh:minCount: 1
sh:maxCount: 1
…
az:theme (Shape)
sh:path dcat:theme
sh:class bdm:Technology
sh:minCount: 1
…
For BDM dataset: at least one technology MUST be specified
Use of SHACL for Data Catalogs & Dataset
Types
@Heiner Oberkampf;
<internal talk> April 18, 2018
20
DisQover Example
Slide 21
How FAIR is Your Data with a Catalog?
FAIR metrics: https://tinyurl.com/FAIRMetrics-ALL
I
Slide 22
Data Science Layer
machine learning, text analytics, NLP, clustering, matching, classification
Lightweight Semantic Integration Layer: FAIR Catalog
enablers: reference master data mgt., metadata mgt., semantic indexing, linking, governance, APIs
A Data-Centric World Allows to Utilize Data Effectively
Linked Open Data
& Open APIs
Semantic
Graph DB
Operational DBs
…
Unstructured
Documents
Analytics Tools
simulations
statistics
Visualization
dashboards
exploration
search
…
Semi-structured
Data
Instrument
Data
Reporting
regulatory
internal
external
R&D | RDI
DCTERMS, DCAT, VoID are nearly sufficient
• Extend for local needs
DCTERMS, DCAT, VoID are nearly sufficient
• Extend for local needs
Public Domain Ontologies should be reused
• Consensus is emerging around best practices and cross-mapping
Public Domain Ontologies should be reused
• Consensus is emerging around best practices and cross-mapping
Use Multi-Phase Filtering for Shallow & Deep Questions
• Balance to what belongs in a catalog record vs. instance data
Use Multi-Phase Filtering for Shallow & Deep Questions
• Balance to what belongs in a catalog record vs. instance data
Lots of Activity to Learn and Shape Best Practices
• Didn’t reinvent a wheel
Lots of Activity to Learn and Shape Best Practices
• Didn’t reinvent a wheel
Dataset Catalogs: Take-aways
R&D | RDI
Thanks
Key Influencers
David Wood
Tim Berners-Lee
Lee Harland
Jane Lomax
James Malone
Dean Allemang
Barend Mons
Carole Goble
Bernadette Hyland
Bob Stanley
Eric Little
Juan Sequeda
Michel Dumontier
John Wilbanks
Hans Constandt
Filip Pattyn
Dan Crowther
Tim Hoctor
Ian Harrow
Key Influencers
David Wood
Tim Berners-Lee
Lee Harland
Jane Lomax
James Malone
Dean Allemang
Barend Mons
Carole Goble
Bernadette Hyland
Bob Stanley
Eric Little
Juan Sequeda
Michel Dumontier
John Wilbanks
Hans Constandt
Filip Pattyn
Dan Crowther
Tim Hoctor
Ian Harrow
AZ/MedImmune Linked
Data Community
David Fenstermacher
Mathew Woodwark
Rajan Desai
Nic Sinibaldi
Chia-Chien Chiang
Kerstin Forsberg
Ola Engkvist
Ian Dix
Colin Wood
Ted Slater
Martin Romacker
Eric Neumann
Jeff Saltzman
Kathy Reinold
Nirmal Keshava
Bryan Takasaki
AZ/MedImmune Linked
Data Community
David Fenstermacher
Mathew Woodwark
Rajan Desai
Nic Sinibaldi
Chia-Chien Chiang
Kerstin Forsberg
Ola Engkvist
Ian Dix
Colin Wood
Ted Slater
Martin Romacker
Eric Neumann
Jeff Saltzman
Kathy Reinold
Nirmal Keshava
Bryan Takasaki
Clinical Data Catalogs
3 case studies
2018, NOVEMBER 29th
Kees van Bochove, CEO & Founder, The Hyve
@keesvanbochove
● Cohort study / registry / biobank: Netherlands Twin Registry
● Academic medical center: Institut Curie
● Pharma company: Cataloguing the Data Lake
Clinical data catalog case studies
27
Teams
Research Data Management
● FAIR / Data Governance consultancy
● Fairspace (meta)data management
Cancer Genomics
● Cancer data warehouse: cBioPortal
● Knowledge base: Open Targets
Data Warehousing
● Data warehouses: tranSMART, i2b2
● Cohort selection: Glowing Bear
● Request Portals: Podium
Real World Data
● Real world evidence: OMOP/OHDSI
● Wearables platform: RADAR-BASE
● Data catalogues: CKAN, DataVerse
28
Case 1: Netherlands Twin Register
29
Case 1: Netherlands Twin Register
1. Data Catalogue: Low barrier of entry, variable level
metadata
30
Case 1: Netherlands Twin Register
2. Podium Request Portal: Request access to data (or samples)
31
Case 1: Netherlands Twin Register
3. Glowing Bear: Data selection tool for data manager (or researcher)
● Provide foundations for a comprehensive knowledge system to
make data accessible
● Foster a strong extended research network within the organization
● Translated to 4 processes / propositions:
● Understand which data is available in the institute
● Collaborate easily on data analysis
● Comply to regulations (GDPR) and best practices (i.e. FAIR)
● Manage the full data lifecycle
Case 2: Institut Curie: project goals
Research
Team A
SEMANTIC LAYER
GOVERNANCE LAYER
VIRTUAL RESEARCH DESKTOP
HOSPITAL INFORMATION SYSTEM (HIS)
CURIE DATA RESOURCE
A C
Research
Team B
AD E
Research
Team C
FBAB
Clinical annotations,
Consent
metadata metadata metadata
Sequencing data (WGS, WES, targeted)
RNAseq
Images
Institut Curie – Data Office – Vision on FAIR Data
By: X. Fernandez and J.
Guerien
FAIRSPACE
- Apache Jena
- FS Protocol
- Kubernetes
- Apps a.o.
cBioPortal
FAIRSPACE
34
Add your
data to
Fairspace
Automate it!
Share and
publish your
data
Annotate your
data with FAIR
metadata
01
02
03
04
Collaborative science in 4 steps
Metadata
Governance
Audit Logs
● Strategic programme underway to make preclinical, clinical and
real world data assets available across R&D
● Most files and legacy data warehouses moved to AWS data lake
● Computational pipelines, data analysis solutions etc. available
● Need for indexing and searching data assets on study, indication,
compound, site etc.
Case 3: Unlocking the enterprise data lake
36
Unlock the data lake with cataloguing
Data Lake
Data Source
Adapters
S3, GCP buckets, FS
mounts, Azure blobs etc.
File & Collection
Metadata
DCAT, DATS etc.
Logging & Auditing
Lightweight Metadata Layer
(Kubernetes microservices)
Data Governance
Access control
Request flow
FAIRSPACE FederationData Sources
Dataverse, C/DKAN
etc. via OAI-PMH
Semantic search
(e.g. Disqover) via
SPARQL
Analytics (e.g. SAS,
Jupyter, Matlab etc.)
Cohort & Patient
Finders (e.g.
Glowing Bear)
● Understand the driving use case for the catalog
● Filtering, browse and search: which criteria would users search on?
● Result sets: do your users / customers want to locate datasets, files, patients, samples etc?
● Strategy to populate the catalogue
● Favour automation, but some manual work and terminology management will be needed
● Some further reading: https://peerj.com/preprints/27151/
● Testing the FAIR metrics on data catalogs, comparing CKAN, Dataverse, Invenio, by The Hyve team
● Let me know if you are interested in beta testing Fairspace: kees@thehyve.nl
Conclusions
www.elixir-europe.org
Bioschemas for dataset discoverability
on the inter/intranet
Harmonizing FAIR Data Catalog Approaches
Rafael C Jimenez
Chief Data Architect
November 29, 2018
The European Open Science Cloud for
Research pilot project is funded by the
European Commission, DG Research &
Innovation under contract no. 739563
ELIXIR-EXCELERATE is funded by the European
Commission within the Research Infrastructures
programme of Horizon 2020, grant agreement
number 676559.
Semantic markup
for web pages
<div itemscope itemtype="http://schema.org/Recipe">
<h1 itemprop="name">Classic potato salad</h1>
<div itemprop="nutrition” itemscope
itemtype="http://schema.org/NutritionInformation">
Nutrition facts:
<span itemprop="calories">144 kcal</span>,
</div>
Ingredients:
- <span itemprop="recipeIngredient">800g small new potato</span>
- <span itemprop="recipeIngredient">3 shallot</span>
. . .
RDFa
JSON-LD
Microdata
Semantic markup for web pages
http://schema.org/docs/full.html
More than 600 types
Tim Berners-Lee
Bioschemas
Schema.org for life sciences
Collection of specifications
Community initiative
Bioschemas
• New life sciences schema.org types (eg. protein)
• Life sciences profiles for existing types (eg. dataset)
Types
• Proposed to Schema.org
• More descriptive
• Guidelines applying constrains to
existing Schema.org types
• Managed by Bioschemas and
specific for life sciences
• Minimum properties and best
practices for finding and accessing
data
• Focused on few types and well
defined relationships
Profiles 2210
DataCatalog Datasets DataRecord
Bioschemas
• Use case driven
• Finding data
• Presenting search results
• Metadata exchange
• Minimum information guidelines
• Link to domain ontologies
• Examples and documentation
Tools
• Buzzbang
Open-source components for software to find, crawl and use
Bioschemas markup, and for humans to search it.
• GoCrawlIt
Minimal crawler and extractor of microdata and JSON-LD metadata.
• GoWeb
Application to help publishing Bioschemas profiles on the Bisochemas
website.
• Validata
A web application for validating Bioschemas markup against the
specifications.
• Markup Builder
A web application for prototyping markup against the Bioschemas
profiles
Data resources Tools Training
Bioschemas Bioschemas Bioschemas
Search
engines
Metadata
Registries
Data
Aggregators
BioschemasBioschemas
Providers
Consumers
MarRef -> BioSamples
MarRef <- BioSamples
Without an API
https://github.com/EBIBioSamples/bioschemas_marref_demo/blob/master/Summary.md
Use case: data exchange
Metadata exchange
Data producers
Bioschemas
Data
Aggregators
Bioschemas
Providers
Consumers
Organization “A” private data
Data producers
Bioschemas
Data
Aggregators
Bioschemas
Providers
Consumers
Organization “B” private data
Search
Compare
Link
Exchange
Metadata
Metadata
Metadata
Metadata
• Different data models and interfaces
• Minimum common metadata agreements on a selection of
data types
Remarks
• Bioschemas is focused on findability, not data modeling
• It can be useful for some basic metadata exchange (MarRef example)
• It is a complement, not a replacement for more tailored ways to share metadata
• It is mainly used embedded in HTML, but it can be exposed via APIs or JSON files
• For data providers it is a simple way to expose metadata
• It is not just for Google, it is also helping our metadata catalogues to automatically
ingest metadata
Thanks for your attention!
200+
People
8
Events
(2018)
10
Types
23Working
groups22Profiles
55Live
deploys
6M+Pages
Audience Q&A
Please use the Question function in GoToWebinar
Upcoming Webinars
Data Quality in support of AI/ML: Pistoia Alliance CoE for AI Webinar Series
Date/Time: 10 Dec 2018: 4pm - 5pm GMT, 11 am ET/5pm CET
Panel members include:
Terry Stouch, Science for Solutions, Isabella Feierberg, AstraZeneca,
Jamie Powers, Cambridge Semantics,
Sirarat Sarntivijai, ELIXIR EU, Jabe Wilson, Elsevier
Knowledge Graphs for Pharma: A perspective from the PhUSE Project
'Clinical Trials Data as RDF'
Date/Time: January 24th, 2019 11am ET/4pmGMT/5pm CET
Speaker: Tim Williams (UCB and PhUSE)

More Related Content

What's hot

LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?LIBER Europe
 
Trustworthy AI and Open Science
Trustworthy AI and Open ScienceTrustworthy AI and Open Science
Trustworthy AI and Open ScienceBeth Plale
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeTom Plasterer
 
BioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageBioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageTom Plasterer
 
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Tom Plasterer
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchersSarah Jones
 
Fair data principles for AOASG
Fair data principles for AOASGFair data principles for AOASG
Fair data principles for AOASGKeith Russell
 
RDA FAIR Data Maturity Model
RDA FAIR Data Maturity ModelRDA FAIR Data Maturity Model
RDA FAIR Data Maturity ModelOpenAIRE
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceCarole Goble
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsTom Plasterer
 
INSTRUCT - Integrated Structural Biology Infrastructure
INSTRUCT - Integrated Structural Biology InfrastructureINSTRUCT - Integrated Structural Biology Infrastructure
INSTRUCT - Integrated Structural Biology InfrastructureResearch Data Alliance
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Fiona Nielsen
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDMMarieke Guy
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordJisc
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
 

What's hot (20)

LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?LIBER Webinar: Are the FAIR Data Principles really fair?
LIBER Webinar: Are the FAIR Data Principles really fair?
 
Trustworthy AI and Open Science
Trustworthy AI and Open ScienceTrustworthy AI and Open Science
Trustworthy AI and Open Science
 
Mendeley Data FAIR hackathon
Mendeley Data FAIR hackathonMendeley Data FAIR hackathon
Mendeley Data FAIR hackathon
 
FAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to PracticeFAIR Data Knowledge Graphs–from Theory to Practice
FAIR Data Knowledge Graphs–from Theory to Practice
 
BioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative AdvantageBioPharma and FAIR Data, a Collaborative Advantage
BioPharma and FAIR Data, a Collaborative Advantage
 
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
Edge Informatics and FAIR (Findable, Accessible, Interoperable and Reusable) ...
 
DTL Partners Event - FAIR Data Tech overview - Day 1
DTL Partners Event - FAIR Data Tech overview - Day 1DTL Partners Event - FAIR Data Tech overview - Day 1
DTL Partners Event - FAIR Data Tech overview - Day 1
 
Data Management Planning for researchers
Data Management Planning for researchersData Management Planning for researchers
Data Management Planning for researchers
 
Fair data principles for AOASG
Fair data principles for AOASGFair data principles for AOASG
Fair data principles for AOASG
 
RDA FAIR Data Maturity Model
RDA FAIR Data Maturity ModelRDA FAIR Data Maturity Model
RDA FAIR Data Maturity Model
 
Software Sustainability: Better Software Better Science
Software Sustainability: Better Software Better ScienceSoftware Sustainability: Better Software Better Science
Software Sustainability: Better Software Better Science
 
DataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data SharingDataONE Education Module 02: Data Sharing
DataONE Education Module 02: Data Sharing
 
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
OpenAIRE webinar on Open Research Data in H2020 (OAW2016)
 
FAIR Data Knowledge Graphs
FAIR Data Knowledge GraphsFAIR Data Knowledge Graphs
FAIR Data Knowledge Graphs
 
INSTRUCT - Integrated Structural Biology Infrastructure
INSTRUCT - Integrated Structural Biology InfrastructureINSTRUCT - Integrated Structural Biology Infrastructure
INSTRUCT - Integrated Structural Biology Infrastructure
 
Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016Workshop - finding and accessing data - Cambridge August 22 2016
Workshop - finding and accessing data - Cambridge August 22 2016
 
Research Lifecycles and RDM
Research Lifecycles and RDMResearch Lifecycles and RDM
Research Lifecycles and RDM
 
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
Introduction to Research Data Management - 2015-02-09 - MPLS Division, Univer...
 
David Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published recordDavid Shotton - Research Integrity: Integrity of the published record
David Shotton - Research Integrity: Integrity of the published record
 
Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research Methods
 

Similar to Pistoia alliance harmonizing fair data catalog approaches webinar

No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciencesChris Dwan
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishingVarsha Khodiyar
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer SchoolCarole Goble
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsfBrad Houston
 
The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpVarsha Khodiyar
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation Research Data Alliance
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data ChallengesPhilip Bourne
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshopVarsha Khodiyar
 
Open, FAIR data and RDM
Open, FAIR data and RDMOpen, FAIR data and RDM
Open, FAIR data and RDMSarah Jones
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things dataARDC
 
Data management plans
Data management plansData management plans
Data management plansBrad Houston
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6ARDC
 
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdfKIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdfDr. Radhey Shyam
 
Introduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycleIntroduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycleDr. Radhey Shyam
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016 Rebecca Raworth, MLIS
 

Similar to Pistoia alliance harmonizing fair data catalog approaches webinar (20)

No Free Lunch: Metadata in the life sciences
No Free Lunch:  Metadata in the life sciencesNo Free Lunch:  Metadata in the life sciences
No Free Lunch: Metadata in the life sciences
 
Research-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhDResearch-Data-Management-and-your-PhD
Research-Data-Management-and-your-PhD
 
Preparing your data for sharing and publishing
Preparing your data for sharing and publishingPreparing your data for sharing and publishing
Preparing your data for sharing and publishing
 
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
Being FAIR:  FAIR data and model management SSBSS 2017 Summer SchoolBeing FAIR:  FAIR data and model management SSBSS 2017 Summer School
Being FAIR: FAIR data and model management SSBSS 2017 Summer School
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
Data management plans (dmp) for nsf
Data management plans (dmp) for nsfData management plans (dmp) for nsf
Data management plans (dmp) for nsf
 
The challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can helpThe challenge of sharing data well, how publishers can help
The challenge of sharing data well, how publishers can help
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation OpenAIRE and Eudat services and tools to support FAIR DMP implementation
OpenAIRE and Eudat services and tools to support FAIR DMP implementation
 
Human Genome and Big Data Challenges
Human Genome and Big Data ChallengesHuman Genome and Big Data Challenges
Human Genome and Big Data Challenges
 
Data peer review workshop
Data peer review workshopData peer review workshop
Data peer review workshop
 
Open, FAIR data and RDM
Open, FAIR data and RDMOpen, FAIR data and RDM
Open, FAIR data and RDM
 
FAIR for the future: embracing all things data
FAIR for the future: embracing all things dataFAIR for the future: embracing all things data
FAIR for the future: embracing all things data
 
Data management plans
Data management plansData management plans
Data management plans
 
Öppen data och forskningens genomslag
Öppen data och forskningens genomslagÖppen data och forskningens genomslag
Öppen data och forskningens genomslag
 
Open Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon HodsonOpen Science Governance and Regulation/Simon Hodson
Open Science Governance and Regulation/Simon Hodson
 
Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6Fsci 2018 thursday2_august_am6
Fsci 2018 thursday2_august_am6
 
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdfKIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
KIT-601-L-UNIT-1 (Revised) Introduction to Data Analytcs.pdf
 
Introduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycleIntroduction to Data Analytics and data analytics life cycle
Introduction to Data Analytics and data analytics life cycle
 
Research data management workshop april12 2016
Research data management workshop april12 2016 Research data management workshop april12 2016
Research data management workshop april12 2016
 

More from Pistoia Alliance

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesPistoia Alliance
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck finalPistoia Alliance
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiPistoia Alliance
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020Pistoia Alliance
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinarPistoia Alliance
 
Data market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRData market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRPistoia Alliance
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinarPistoia Alliance
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata managementPistoia Alliance
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIPistoia Alliance
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Pistoia Alliance
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesPistoia Alliance
 
Implementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcareImplementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcarePistoia Alliance
 
Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Pistoia Alliance
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the futurePistoia Alliance
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences Pistoia Alliance
 
AI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEAI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEPistoia Alliance
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Pistoia Alliance
 

More from Pistoia Alliance (20)

Fairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matricesFairification experience clarifying the semantics of data matrices
Fairification experience clarifying the semantics of data matrices
 
MPS webinar master deck
MPS webinar master deckMPS webinar master deck
MPS webinar master deck
 
Digital webinar master deck final
Digital webinar master deck finalDigital webinar master deck final
Digital webinar master deck final
 
Heartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirtiHeartificial intelligence - claudio-mirti
Heartificial intelligence - claudio-mirti
 
Fair by design
Fair by designFair by design
Fair by design
 
Knowledge graphs ilaria maresi the hyve 23apr2020
Knowledge graphs   ilaria maresi the hyve 23apr2020Knowledge graphs   ilaria maresi the hyve 23apr2020
Knowledge graphs ilaria maresi the hyve 23apr2020
 
2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar2020.04.07 automated molecular design and the bradshaw platform webinar
2020.04.07 automated molecular design and the bradshaw platform webinar
 
Data market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIRData market evolution, a future shaped by FAIR
Data market evolution, a future shaped by FAIR
 
AI in translational medicine webinar
AI in translational medicine webinarAI in translational medicine webinar
AI in translational medicine webinar
 
CEDAR work bench for metadata management
CEDAR work bench for metadata managementCEDAR work bench for metadata management
CEDAR work bench for metadata management
 
Open interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBIOpen interoperability standards, tools and services at EMBL-EBI
Open interoperability standards, tools and services at EMBL-EBI
 
Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...Fair webinar, Ted slater: progress towards commercial fair data products and ...
Fair webinar, Ted slater: progress towards commercial fair data products and ...
 
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data ResourcesApplication of recently developed FAIR metrics to the ELIXIR Core Data Resources
Application of recently developed FAIR metrics to the ELIXIR Core Data Resources
 
Implementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcareImplementing Blockchain applications in healthcare
Implementing Blockchain applications in healthcare
 
Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...Building trust and accountability - the role User Experience design can play ...
Building trust and accountability - the role User Experience design can play ...
 
Pistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier DatathonPistoia Alliance-Elsevier Datathon
Pistoia Alliance-Elsevier Datathon
 
Data for AI models, the past, the present, the future
Data for AI models, the past, the present, the futureData for AI models, the past, the present, the future
Data for AI models, the past, the present, the future
 
PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences PA webinar on benefits & costs of FAIR implementation in life sciences
PA webinar on benefits & costs of FAIR implementation in life sciences
 
AI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoEAI & ML in Drug Design: Pistoia Alliance CoE
AI & ML in Drug Design: Pistoia Alliance CoE
 
Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019Ai in drug design webinar 26 feb 2019
Ai in drug design webinar 26 feb 2019
 

Recently uploaded

Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxTanveerAhmed817946
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...dajasot375
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...Suhani Kapoor
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Delhi Call girls
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptxthyngster
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...Suhani Kapoor
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSAishani27
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz1
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Sapana Sha
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiSuhani Kapoor
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfLars Albertsson
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...shivangimorya083
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsappssapnasaifi408
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998YohFuh
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Callshivangimorya083
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystSamantha Rae Coolbeth
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfRachmat Ramadhan H
 

Recently uploaded (20)

Digi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptxDigi Khata Problem along complete plan.pptx
Digi Khata Problem along complete plan.pptx
 
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
Indian Call Girls in Abu Dhabi O5286O24O8 Call Girls in Abu Dhabi By Independ...
 
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
VIP High Profile Call Girls Amravati Aarushi 8250192130 Independent Escort Se...
 
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
Best VIP Call Girls Noida Sector 39 Call Me: 8448380779
 
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptxEMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM  TRACKING WITH GOOGLE ANALYTICS.pptx
EMERCE - 2024 - AMSTERDAM - CROSS-PLATFORM TRACKING WITH GOOGLE ANALYTICS.pptx
 
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
꧁❤ Aerocity Call Girls Service Aerocity Delhi ❤꧂ 9999965857 ☎️ Hard And Sexy ...
 
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
VIP High Class Call Girls Bikaner Anushka 8250192130 Independent Escort Servi...
 
Ukraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICSUkraine War presentation: KNOW THE BASICS
Ukraine War presentation: KNOW THE BASICS
 
Invezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signalsInvezz.com - Grow your wealth with trading signals
Invezz.com - Grow your wealth with trading signals
 
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
Saket, (-DELHI )+91-9654467111-(=)CHEAP Call Girls in Escorts Service Saket C...
 
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service BhilaiLow Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
Low Rate Call Girls Bhilai Anika 8250192130 Independent Escort Service Bhilai
 
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in  KishangarhDelhi 99530 vip 56974 Genuine Escort Service Call Girls in  Kishangarh
Delhi 99530 vip 56974 Genuine Escort Service Call Girls in Kishangarh
 
Schema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdfSchema on read is obsolete. Welcome metaprogramming..pdf
Schema on read is obsolete. Welcome metaprogramming..pdf
 
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
Full night 🥵 Call Girls Delhi New Friends Colony {9711199171} Sanya Reddy ✌️o...
 
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /WhatsappsBeautiful Sapna Vip  Call Girls Hauz Khas 9711199012 Call /Whatsapps
Beautiful Sapna Vip Call Girls Hauz Khas 9711199012 Call /Whatsapps
 
RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998RA-11058_IRR-COMPRESS Do 198 series of 1998
RA-11058_IRR-COMPRESS Do 198 series of 1998
 
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip CallDelhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
Delhi Call Girls Punjabi Bagh 9711199171 ☎✔👌✔ Whatsapp Hard And Sexy Vip Call
 
Unveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data AnalystUnveiling Insights: The Role of a Data Analyst
Unveiling Insights: The Role of a Data Analyst
 
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdfMarket Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
Market Analysis in the 5 Largest Economic Countries in Southeast Asia.pdf
 
Decoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in ActionDecoding Loan Approval: Predictive Modeling in Action
Decoding Loan Approval: Predictive Modeling in Action
 

Pistoia alliance harmonizing fair data catalog approaches webinar

  • 1. November 29, 2018 Harmonizing FAIR Data Catalog Approaches A Pistoia Alliance Debates Webinar Moderated by Martin Romacker – Roche Panelists Tom Plasterer - AstraZeneca Eric Little - OSTHUS Kees van Bochove The Hyve Rafael Jimenez - Elixir
  • 2. This webinar is being recorded
  • 3. Poll Question 1: How familiar are you with FAIR principles and metrics? A. Unfamiliar with FAIR principles B. Some familiarity with FAIR principles C. Familiar with FAIR principles D. Some expertise with FAIR principles and metrics E. Highly expert with FAIR principles and metrics
  • 4. Poll Question 2: What is the maturity level of your organization with respect to implementation of FAIR? A. Don’t know B. No plans or interest in FAIR implementation B. Thinking about FAIR implementation C. Several projects have FAIR implementation E. Systematic FAIR implementation across whole organization
  • 5. ©PistoiaAlliance Phuse and the Regulated Cloud 5 Eric Little Chiref Data Officer OSTHUS Martin Romacker Principal Scientist Roche Tom Plasterer US Cross-Science Director, R&D Information AstraZeneca Rafael Jimenez Chief Data Architect ELIXIR Kees van Bochove CEO The Hyve
  • 6. Harmonizing FAIR Data Catalog Approaches – Introduction Martin Romacker Data and Information Architect Pharma Research and Early Development Informatics (pREDi) Roche Innovation Center Basel Pistoia Alliance Webinar, 29th November 2018
  • 7. Data Catalogue & FAIR Principles High Quality Data Driving Insights • Digitial Transformation leading to a data-driven Industry • Data are more and more perceived as an asset
  • 8. Doing now what patients need next
  • 9. Findability and FAIR Data Catalogs Tom Plasterer, PhD Science & Enabling Units IT (S&EUIT), Semantic Technology Lead 29 Nov 2018 Pistoia/Linked Data Community of Practice Webinar Series Eric Little, PhD Chief Data Officer, Osthus
  • 10. 10 Starting Point: Questions and Iterative Data Model core:Study core:Project core:Target core:Subject core:Drug core:Indication core:TherapeuticArea core:BiologicalSample core:Measurement core:Technologycore:Visit bdm:Cohort core:hasSubject core:hasProject core:hasDrug core:hasIndication bdm:hasArm bdm:participatesIn core:hasTA core:hasTarget core:hasMeasurement core:hasSample core:hasVisit core:measuredBy How do you find datasets described with this model when you have hundreds or thousands?
  • 11. 11 Dataset Catalog is a collection of Dataset Records • Catalogs are needed to supporting FAIR (Findable) data • Catalogs can and should support Enterprise MDM strategies • Consumers can be internal or external Dataset Catalogs are needed so data consumers can find Datasets • Dataset records need sufficient metadata to support discoverability • Dataset terms are NOT the data instance Dataset Catalogs surface dataset provenance and enable data access Dataset Catalogs can provide datasets for multiple consumption patters • Analytics readiness and fit • ‘Walking’ across information models Dataset Catalogs: Findability Starts Here
  • 12. 12 Best Practices: Data on the Web, Vocabulary of Interlinked Datasets Dataset Descriptions for the Open Pharmacological Space http://www.openphacts.org/specs/2012/WD-datadesc-20121019/ Data on the Web Best Practices https://www.w3.org/TR/dwbp/
  • 13. 13 Dataset Catalogs: Find me Datasets about: Projects Study Indication/ Disease Technology Targets Cohort DatesAgent Therapeutic Area Drugs
  • 14. 14 Dublin-Core-Type (DCT): Dataset • A dataset is information encoded in a defined structure (for example, lists, tables, and databases), intended to be useful for direct machine processing Data-Catalog (DCAT): Dataset • A collection of data, published or curated by a single source, and available for access or download in one or more formats Vocabulary of Interlinked Dataset (VoID): Dataset • A set of RDF triples that are published, maintained or aggregated by a single provider. Data and Datasets… dct:Dataset dcat:Dataset void:Dataset rdfs:subClassOf rdfs:subClassOf
  • 15. 15 The Backbone: A DCAT conformant Data Catalog https://www.w3.org/TR/hcls-dataset/ https://www.w3.org/TR/vocab-dcat/#vocabulary-overview Semantic tagging of datasets with concepts from taxonomies: • provides context • multi-dimensional & flexible • effective for discoverability • light-weight semantics skos:Concept dcat:Catalog skos:ConceptScheme dctypes:Dataset (summary) dct:title dct:publisher <foaf:Agent> foaf:page void:sparqlEndpoint dct:accrualPeriodicity dcat:keyword dcat:dataset dcat:theme dctypes:Dataset (version) dcat:Distribution (dctypes:Dataset) void:vocabulary dct:conformsTo void:exampleResource …other void properties dcat:distribution dcat:themeTaxonomy dct:isVersionOf pav:previousVersion dct:hasPart pav:hasCurrentVersion dct:hasPart dct:title dct:publisher <foaf:Agent> pav:version dct:creator <foaf:Agent> dct:created dct:source dct:creator <foaf:Agent> dct:license dct:format pav:retrievedFrom dct:created pav:createdWith dcat:accessURL dcat:downloadURL void:Dataset dct:title dctDescription dct:publisher <foaf:Agent>
  • 16. 16 Metadata Model Stack for the AZ Data Catalog DCAT VoID DCTerms RDF/S, OWL, SKOS/SKOS-XL AZ TaxonomiesPAV AZ DataCatalog ontology and instances for catalogs, datasets, distributions (could be further modularized later) DCMI bdm-tech core bdm internal external uniprot umls sio chembl… W3C and Metadata Standards Reference Master Data
  • 17. Data Discoverability: Multi-phase Filtering Data Catalog Filter Phase 1 Experiment Metadata Filter Phase 2 Ad hoc Analyses Filtering Phase 3 Outbound to Data Analytics Data Science Tools Statistical Filtering e.g., clinical trial with > 50 participants Dataset Catalog Descriptions
  • 19. 19 Validation and SHACL Az:ANYDataset az:BDMDataset dct:Dataset rdfs:subClassOf az:BDMDataset (Node Shape) sh:targetClass az:BDMDataset sh:and dctypes:dataset (Node Shape) sh:targetClass dctypes:Dataset sh:property az:BDMDatasetExtension (Shape) sh:property dct:title (Propery Shape) sh:path dct:title sh:datatype xsd:string sh:minCount: 1 sh:maxCount: 1 … az:theme (Shape) sh:path dcat:theme sh:class bdm:Technology sh:minCount: 1 … For BDM dataset: at least one technology MUST be specified Use of SHACL for Data Catalogs & Dataset Types @Heiner Oberkampf; <internal talk> April 18, 2018
  • 21. Slide 21 How FAIR is Your Data with a Catalog? FAIR metrics: https://tinyurl.com/FAIRMetrics-ALL I
  • 22. Slide 22 Data Science Layer machine learning, text analytics, NLP, clustering, matching, classification Lightweight Semantic Integration Layer: FAIR Catalog enablers: reference master data mgt., metadata mgt., semantic indexing, linking, governance, APIs A Data-Centric World Allows to Utilize Data Effectively Linked Open Data & Open APIs Semantic Graph DB Operational DBs … Unstructured Documents Analytics Tools simulations statistics Visualization dashboards exploration search … Semi-structured Data Instrument Data Reporting regulatory internal external
  • 23. R&D | RDI DCTERMS, DCAT, VoID are nearly sufficient • Extend for local needs DCTERMS, DCAT, VoID are nearly sufficient • Extend for local needs Public Domain Ontologies should be reused • Consensus is emerging around best practices and cross-mapping Public Domain Ontologies should be reused • Consensus is emerging around best practices and cross-mapping Use Multi-Phase Filtering for Shallow & Deep Questions • Balance to what belongs in a catalog record vs. instance data Use Multi-Phase Filtering for Shallow & Deep Questions • Balance to what belongs in a catalog record vs. instance data Lots of Activity to Learn and Shape Best Practices • Didn’t reinvent a wheel Lots of Activity to Learn and Shape Best Practices • Didn’t reinvent a wheel Dataset Catalogs: Take-aways
  • 24. R&D | RDI Thanks Key Influencers David Wood Tim Berners-Lee Lee Harland Jane Lomax James Malone Dean Allemang Barend Mons Carole Goble Bernadette Hyland Bob Stanley Eric Little Juan Sequeda Michel Dumontier John Wilbanks Hans Constandt Filip Pattyn Dan Crowther Tim Hoctor Ian Harrow Key Influencers David Wood Tim Berners-Lee Lee Harland Jane Lomax James Malone Dean Allemang Barend Mons Carole Goble Bernadette Hyland Bob Stanley Eric Little Juan Sequeda Michel Dumontier John Wilbanks Hans Constandt Filip Pattyn Dan Crowther Tim Hoctor Ian Harrow AZ/MedImmune Linked Data Community David Fenstermacher Mathew Woodwark Rajan Desai Nic Sinibaldi Chia-Chien Chiang Kerstin Forsberg Ola Engkvist Ian Dix Colin Wood Ted Slater Martin Romacker Eric Neumann Jeff Saltzman Kathy Reinold Nirmal Keshava Bryan Takasaki AZ/MedImmune Linked Data Community David Fenstermacher Mathew Woodwark Rajan Desai Nic Sinibaldi Chia-Chien Chiang Kerstin Forsberg Ola Engkvist Ian Dix Colin Wood Ted Slater Martin Romacker Eric Neumann Jeff Saltzman Kathy Reinold Nirmal Keshava Bryan Takasaki
  • 25. Clinical Data Catalogs 3 case studies 2018, NOVEMBER 29th Kees van Bochove, CEO & Founder, The Hyve @keesvanbochove
  • 26. ● Cohort study / registry / biobank: Netherlands Twin Registry ● Academic medical center: Institut Curie ● Pharma company: Cataloguing the Data Lake Clinical data catalog case studies
  • 27. 27 Teams Research Data Management ● FAIR / Data Governance consultancy ● Fairspace (meta)data management Cancer Genomics ● Cancer data warehouse: cBioPortal ● Knowledge base: Open Targets Data Warehousing ● Data warehouses: tranSMART, i2b2 ● Cohort selection: Glowing Bear ● Request Portals: Podium Real World Data ● Real world evidence: OMOP/OHDSI ● Wearables platform: RADAR-BASE ● Data catalogues: CKAN, DataVerse
  • 28. 28 Case 1: Netherlands Twin Register
  • 29. 29 Case 1: Netherlands Twin Register 1. Data Catalogue: Low barrier of entry, variable level metadata
  • 30. 30 Case 1: Netherlands Twin Register 2. Podium Request Portal: Request access to data (or samples)
  • 31. 31 Case 1: Netherlands Twin Register 3. Glowing Bear: Data selection tool for data manager (or researcher)
  • 32. ● Provide foundations for a comprehensive knowledge system to make data accessible ● Foster a strong extended research network within the organization ● Translated to 4 processes / propositions: ● Understand which data is available in the institute ● Collaborate easily on data analysis ● Comply to regulations (GDPR) and best practices (i.e. FAIR) ● Manage the full data lifecycle Case 2: Institut Curie: project goals
  • 33. Research Team A SEMANTIC LAYER GOVERNANCE LAYER VIRTUAL RESEARCH DESKTOP HOSPITAL INFORMATION SYSTEM (HIS) CURIE DATA RESOURCE A C Research Team B AD E Research Team C FBAB Clinical annotations, Consent metadata metadata metadata Sequencing data (WGS, WES, targeted) RNAseq Images Institut Curie – Data Office – Vision on FAIR Data By: X. Fernandez and J. Guerien FAIRSPACE - Apache Jena - FS Protocol - Kubernetes - Apps a.o. cBioPortal
  • 34. FAIRSPACE 34 Add your data to Fairspace Automate it! Share and publish your data Annotate your data with FAIR metadata 01 02 03 04 Collaborative science in 4 steps Metadata Governance Audit Logs
  • 35. ● Strategic programme underway to make preclinical, clinical and real world data assets available across R&D ● Most files and legacy data warehouses moved to AWS data lake ● Computational pipelines, data analysis solutions etc. available ● Need for indexing and searching data assets on study, indication, compound, site etc. Case 3: Unlocking the enterprise data lake
  • 36. 36 Unlock the data lake with cataloguing Data Lake Data Source Adapters S3, GCP buckets, FS mounts, Azure blobs etc. File & Collection Metadata DCAT, DATS etc. Logging & Auditing Lightweight Metadata Layer (Kubernetes microservices) Data Governance Access control Request flow FAIRSPACE FederationData Sources Dataverse, C/DKAN etc. via OAI-PMH Semantic search (e.g. Disqover) via SPARQL Analytics (e.g. SAS, Jupyter, Matlab etc.) Cohort & Patient Finders (e.g. Glowing Bear)
  • 37. ● Understand the driving use case for the catalog ● Filtering, browse and search: which criteria would users search on? ● Result sets: do your users / customers want to locate datasets, files, patients, samples etc? ● Strategy to populate the catalogue ● Favour automation, but some manual work and terminology management will be needed ● Some further reading: https://peerj.com/preprints/27151/ ● Testing the FAIR metrics on data catalogs, comparing CKAN, Dataverse, Invenio, by The Hyve team ● Let me know if you are interested in beta testing Fairspace: kees@thehyve.nl Conclusions
  • 38. www.elixir-europe.org Bioschemas for dataset discoverability on the inter/intranet Harmonizing FAIR Data Catalog Approaches Rafael C Jimenez Chief Data Architect November 29, 2018 The European Open Science Cloud for Research pilot project is funded by the European Commission, DG Research & Innovation under contract no. 739563 ELIXIR-EXCELERATE is funded by the European Commission within the Research Infrastructures programme of Horizon 2020, grant agreement number 676559.
  • 40. <div itemscope itemtype="http://schema.org/Recipe"> <h1 itemprop="name">Classic potato salad</h1> <div itemprop="nutrition” itemscope itemtype="http://schema.org/NutritionInformation"> Nutrition facts: <span itemprop="calories">144 kcal</span>, </div> Ingredients: - <span itemprop="recipeIngredient">800g small new potato</span> - <span itemprop="recipeIngredient">3 shallot</span> . . . RDFa JSON-LD Microdata
  • 41. Semantic markup for web pages
  • 44.
  • 45. Bioschemas Schema.org for life sciences Collection of specifications Community initiative
  • 46. Bioschemas • New life sciences schema.org types (eg. protein) • Life sciences profiles for existing types (eg. dataset) Types • Proposed to Schema.org • More descriptive • Guidelines applying constrains to existing Schema.org types • Managed by Bioschemas and specific for life sciences • Minimum properties and best practices for finding and accessing data • Focused on few types and well defined relationships Profiles 2210 DataCatalog Datasets DataRecord
  • 47. Bioschemas • Use case driven • Finding data • Presenting search results • Metadata exchange • Minimum information guidelines • Link to domain ontologies • Examples and documentation
  • 48. Tools • Buzzbang Open-source components for software to find, crawl and use Bioschemas markup, and for humans to search it. • GoCrawlIt Minimal crawler and extractor of microdata and JSON-LD metadata. • GoWeb Application to help publishing Bioschemas profiles on the Bisochemas website. • Validata A web application for validating Bioschemas markup against the specifications. • Markup Builder A web application for prototyping markup against the Bioschemas profiles
  • 49. Data resources Tools Training Bioschemas Bioschemas Bioschemas Search engines Metadata Registries Data Aggregators BioschemasBioschemas Providers Consumers
  • 50. MarRef -> BioSamples MarRef <- BioSamples Without an API https://github.com/EBIBioSamples/bioschemas_marref_demo/blob/master/Summary.md Use case: data exchange Metadata exchange
  • 51. Data producers Bioschemas Data Aggregators Bioschemas Providers Consumers Organization “A” private data Data producers Bioschemas Data Aggregators Bioschemas Providers Consumers Organization “B” private data Search Compare Link Exchange Metadata Metadata Metadata Metadata • Different data models and interfaces • Minimum common metadata agreements on a selection of data types
  • 52. Remarks • Bioschemas is focused on findability, not data modeling • It can be useful for some basic metadata exchange (MarRef example) • It is a complement, not a replacement for more tailored ways to share metadata • It is mainly used embedded in HTML, but it can be exposed via APIs or JSON files • For data providers it is a simple way to expose metadata • It is not just for Google, it is also helping our metadata catalogues to automatically ingest metadata
  • 53. Thanks for your attention! 200+ People 8 Events (2018) 10 Types 23Working groups22Profiles 55Live deploys 6M+Pages
  • 54. Audience Q&A Please use the Question function in GoToWebinar
  • 55. Upcoming Webinars Data Quality in support of AI/ML: Pistoia Alliance CoE for AI Webinar Series Date/Time: 10 Dec 2018: 4pm - 5pm GMT, 11 am ET/5pm CET Panel members include: Terry Stouch, Science for Solutions, Isabella Feierberg, AstraZeneca, Jamie Powers, Cambridge Semantics, Sirarat Sarntivijai, ELIXIR EU, Jabe Wilson, Elsevier Knowledge Graphs for Pharma: A perspective from the PhUSE Project 'Clinical Trials Data as RDF' Date/Time: January 24th, 2019 11am ET/4pmGMT/5pm CET Speaker: Tim Williams (UCB and PhUSE)