SlideShare a Scribd company logo
1 of 31
Download to read offline
This project has received funding from the European Union’s Horizon 2020 research and
Innovation programme under grant agreement No. 825775
Making Cohort data FAIR
Presenter: William Hsiao (Simon Fraser University)
Host: Marta Lloret Llinares (EMBL-EBI)
This webinar is being recorded
Audience Q&A Session
Please write your
questions in the
questions
window of the
GoToWebinar
application
The challenges:
Stay
informed
@CinecaProject
www.cineca-project.eu
Common Infrastructure for National Cohorts
in Europe, Canada and Africa
This project has received funding from the European Union’s Horizon 2020 research and
Innovation programme under grant agreement No. 825775
Accelerating disease research and
improving health by facilitating
transcontinental human data exchange
The vision:
This project has received funding from the Canadian Institute of Health
Research under grant agreement #404896
Context for the webinar
• CINECA “How FAIR are you?” webinar series and hackathon:
• https://www.cineca-project.eu/news-events-all/how-fair-are-you-
webinar-series-and-hackathon
• Webinar series Jan-April
• Making cohort data FAIR
• FAIR software tools
• Practically FAIR
• How to make training FAIR
• Ethics/ELSI considerations
• Hackathon 28-29th April 4 hours per day
• 3 streams: cohort data, software, training materials
Today’s presenter
Dr. William Hsiao joined the Faculty of Health Sciences at Simon Fraser University as an
associate professor in September 2020. He is also an affiliated scientist at the BC Centre for
Disease Control Public Health Laboratory (BCCDC PHL) and at Canada’s Michael Smith Genome
Sciences Centre.
Prior to joining FHS, Dr. Hsiao was the chief bioinformatician and a senior scientist at the
BCCDC PHL for 8 years and a clinical associate professor at the University of British Columbia.
Dr. Hsiao obtained his PhD in the Department of Molecular Biology and Biochemistry at Simon
Fraser University followed by a postdoctoral fellowship at the Institute for Genome Sciences at
University of Maryland School of Medicine. His research focused on microbial genomics and
metagenomics.
At the BCCDC PHL, he incorporated data science and knowledge engineering into his research and developed expertise in
public health data sharing, integration, and harmonization. With experience conducting and applying genomics and data
science research both in academia and in government laboratories, Dr. Hsiao has developed a special perspective on
integrating basic and applied research to improve our public health system.
This project has received funding from the European Union’s Horizon 2020 research and
Innovation programme under grant agreement No. 825775
Making Cohort data FAIR
William Hsiao, PhD
Associate Professor, Faculty of Health Sciences
Simon Fraser University
Cohort Studies
• A cohort is a group of people who share a common characteristics or
experience within a defined period
• A cohort study typically is observational in nature and follows a large
number of participants over a long period of time (longitudinal)
• Useful when intervention (e.g. RCT) is not possible or not ethical
• Exposure to risk factors are measured or collected
• Multiple outcomes can be observed (e.g. from large population-based
cohort)
• Valuable for scientific understanding of disease causation
• In CINECA, ~10 cohorts are part of the consortium
• A goal of CINECA is to enable federated queries and analyses of the varying
and wide-ranging datasets from the 10 CINECA cohorts.
Construction of Cohorts
• Define study question(s)/scope and identify and consent the study
subjects (cohort)
• Obtain baseline data on the exposure(s) to the risk factor(s) of
interest (e.g. obese vs. lean, genetic allelic variation)
• Subclassify the cohorts into groups with or without (or less) of the
risk factor(s)
• Follow-up with the participants to measure the outcomes using data
collection instruments (questionnaire, physical measures, cognitive
measures, biosamples, etc.)
• Analyze the data to see if outcome correlated to risk factor(s); infer
causality
Examples of Cohorts in CINECA
• CHILD: The CHILD Cohort Study is a prospective longitudinal birth
cohort study in Canada; ~3000 participating families (trios);
multidisciplinary; multi-modal (questionnaires, biological samples,
home assessments, clinical assessments)
• UK-Biobank: population based biomedical database; 500,000
participants from UK (national cohort); able to mobilize for COVID-19
related research by linking host genetics to disease status
• H3Africa: a pan-African consortium to build research program and
infrastructure in genomic medicine on the African continent;
biospecimens and data collection for research
Challenges with Cohort Studies
• Due to the (usually) prospective, large sample-size, and longitudinal
nature, cohort studies can be expensive and labour-intensive
• Data collection instruments and variables are usually specific to a
study, and data may not be interoperable and machine readable (e.g.
free text, custom pick-list, non-standardized units)
• Most valuable data is often the most sensitive and access controlled
• Consent process may restrict the re-use of cohort data
• Broad consents are encouraged as long as they are informed consents and
allow withdrawal of the consents at any time
Discovery of Cohort Database Content (F/A)
• Cohort Profiles are often in descriptive text that are human but not
machine readable
• Similarly, data fields and study protocols are accessible openly but
only in human readable formats (free text or tables)
Cohort profile – CHILD Cohort Study (childstudy.ca)
Access to Data is controlled to protect
privacy
Data access – CHILD Cohort Study (childstudy.ca)
Minimal Metadata for Cohorts
• One solution is to make cohort metadata and aggregated data
researchable by machines by encoding the data in common formats
• E.g. GO FAIR’s Framework for FAIRification has a Metadata for
Machines (M4M) component
• Assess community specific metadata practices
• Using the FAIR principles to define metadata elements
• Formulate these decisions as machine-actionable templates
• Register these templates so they are FAIR and accessible by
machine API
• Result in domain-specific, community built, FAIR metadata schema
Metadata for Machines - GO FAIR (go-fair.org)
A Minimal Metadata Model for
CINECA
• A minimal metadata model is being developed for CINECA
• Based on surveys of common data fields in 10 cohort studies (no assessment of
actual data, but consulted data dictionaries to capture acceptable data values)
• Supplemented with common data fields from additional data catalogues
• Supplemented with “use cases”
• Each variable grouped into a broad category (e.g. socio-demographic and economic
characteristics, diseases, and lifestyle and behaviours)
• Maelstrom Research data standards were used as the basis for the categorization –
promote compatibility with existing data catalogues
• Each variables is mapped to an ontology term (work in progress)
• Metadata model encoded as a new ”application ontology” called GEKCO (Genomics
Cohorts Knowledge Ontology) - http://www.obofoundry.org/ontology/gecko.html
Example of Structured Variables in the
CINECA Minimal Metadata Model
Harmonization of metadata (-IR)
• Data collection instruments are usually developed in silos (only
considering the study scope and current research needs)
• This leads to incompatibilities when trying to aggregate datasets
across studies at both the cohort level (study attributes) and at the
individual data (participant attributes)
• As a result, data are:
• Not Interoperable
• Not (easily) reusable
Similar Epi/Clinical data needs across agencies
Questions re: diseases, symptoms, clinical events, dates of onset
CDC Atlanta,
BC CDC, WHO
18
Examples of Similar Fields / Similar
Names
COVID-19 Case Report Forms in
Canada
COVID-19 Mutation Reports
• RG203KR
• insG28262GAACA
• 3675-3677 SGF
• L242_244L deletion; L242del,A243del,L244del
• SGF 3675-3677 deletion
• Q27stop
• HV 69-70 deletion
• Y144 deletion
• Common: locations and amino acid substitutions
• Variable: nucleotide vs. amino acid locations; abbreviations; gene names
Maelstrom Data Harmonization Guideline
Fortier, Isabel, et al. "Maelstrom Research guidelines for rigorous retrospective data harmonization." International journal of epidemiology (2017) 46 (1): 103-105.
Semantic Web and Ontology
• Semantic Web term refers to set of standards and best practice for
sharing data and the semantic (Meanings) of that data over the web by
using application. Semantic web can be processed directly or indirectly
from machines, so basically human and computers can work in
cooperation.
• Ontology is a way to structure knowledge in a machine readable way with
defined terms and relationships.
• In ontology, one can provide a set of vocabularies that can be used to
model a domain, which is types of objects and concepts that are exist in
specific area and the relationship between them (Gruber TR, 1993).
Ontology
● Ontologies provide standard and linkable IDs for
○ names for form and tabular data fields
○ terms (choices) for categorical variables
● Modelling processes into data structures may be subjective: Who is the
model for, and to what granularity?
● Upper level ontologies (e.g. BFO) bring some harmony but do not
guarantee full interoperability
Ontology Creation and Re-use
● No silo = build the ontology as a suite of interoperable modules
structured to provide berths for future ontologies as needed
● No short half-life = start not with data (which changes rapidly) but
with the entities (things, processes, attributes, ...) the data are used
to describe
● No reinvent the wheel = follow a method tested in over 300
ontology-building initiatives and documented in ISO 21838,
(https://www.iso.org/standard/71954.html) leveraging existing
resources wherever possible
● E.g. GECKO ontology reuse terms whenever possible
• Formal representation of objects, concepts and relationships among
them
• Shared understanding [language] for communicating cohort information
• Overcome the semantic heterogeneity
• Ontologies are interpretable by humans and by computer programs.
• Information integration - Multiple information resources are combined
using ontologies to match concepts with similar meaning.
• Ontology embedded into software - Object-oriented implementation
(e.g. Java classes) generated from classes in the ontology…
Strength of Ontology and Semantic web
26
27
GEEM: platform for building ontology-based data
specifications
http://genepio.org/geem/form.html#GENEPIO:0002083
Challenges with Harmonization
• balance between accepting only precisely uniform variables that
render pooling straightforward (e.g. exact question or standard
operating procedures) but limit the potential to integrate multiple
studies; and accepting a certain level of heterogeneity across
participating studies providing similar but not necessarily identical
data
• E.g. broader terms used to aggregate more detailed terms
• E.g. same term used for different scales
https://www.maelstrom-research.org/about-harmonization/guidelines/data-processing-methods
Benefit Sharing and Privacy Concerns
• Making Cohort Data FAIR can mean broader and easier access to data
• However, usually the HICs (more research capacity) can take better
advantages of such broader access than LMICs
• Food for Thought (discussion): how to we protect the interests of
study participants and enable benefit sharing?
Questions?
Title: Making Cohort data FAIR
Presenter: William Hsiao
Please write your questions in the
questions window of the GoToWebinar
application
Next CINECA webinars
Title: FAIR Software tools
Presenter: Carlos Martinez
Date: Wed 24th February 2021
Time: 3:00 PM GMT / 4:00 PM CET
Registration and details:
https://www.cineca-project.eu/news-
events-all/fair-software-tools
Title: Practically Fair
Presenter: Andrew Stubbs
Date: Thurs 4th March 2021
Time: 3:00 PM GMT / 4:00 PM CET
Registration and details:
https://www.cineca-project.eu/news-
events-all/practically-fair

More Related Content

What's hot

Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsRebecca Grant
 
Do Open data badges influence author behaviour? A case study at Springer Nature
Do Open data badges influence author behaviour? A case study at Springer NatureDo Open data badges influence author behaviour? A case study at Springer Nature
Do Open data badges influence author behaviour? A case study at Springer NatureRebecca Grant
 
The Vision for Data @ the NIH
The Vision for Data @ the NIHThe Vision for Data @ the NIH
The Vision for Data @ the NIHPhilip Bourne
 
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamPlatforma Otwartej Nauki
 
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...Trilateral Research
 
Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADAARDC
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisCatherine Canevet
 
Addressing the wicked problem of learning data privacy though principle and p...
Addressing the wicked problem of learning data privacy though principle and p...Addressing the wicked problem of learning data privacy though principle and p...
Addressing the wicked problem of learning data privacy though principle and p...Jisc
 
dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliLEARN Project
 
Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...Carolyn Ten Holter
 
cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)Pistoia Alliance
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015Fiona Nielsen
 
Navigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenNavigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenDigital Science
 
Finding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsFinding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsManuel Corpas
 

What's hot (20)

Managing Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research MethodsManaging Ireland's Research Data - 3 Research Methods
Managing Ireland's Research Data - 3 Research Methods
 
Do Open data badges influence author behaviour? A case study at Springer Nature
Do Open data badges influence author behaviour? A case study at Springer NatureDo Open data badges influence author behaviour? A case study at Springer Nature
Do Open data badges influence author behaviour? A case study at Springer Nature
 
The Vision for Data @ the NIH
The Vision for Data @ the NIHThe Vision for Data @ the NIH
The Vision for Data @ the NIH
 
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
dkNET Webinar: Creating and Sustaining a FAIR Biomedical Data Ecosystem 10/09...
 
Open science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, PotsdamOpen science, open data - FOSTER training, Potsdam
Open science, open data - FOSTER training, Potsdam
 
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
Methodologies for Addressing Privacy and Social Issues in Health Data: A Case...
 
Identification of emerging technologies via a systematic search strategy deve...
Identification of emerging technologies via a systematic search strategy deve...Identification of emerging technologies via a systematic search strategy deve...
Identification of emerging technologies via a systematic search strategy deve...
 
Introduction to ADA
Introduction to ADAIntroduction to ADA
Introduction to ADA
 
Investigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysisInvestigating plant systems using data integration and network analysis
Investigating plant systems using data integration and network analysis
 
Addressing the wicked problem of learning data privacy though principle and p...
Addressing the wicked problem of learning data privacy though principle and p...Addressing the wicked problem of learning data privacy though principle and p...
Addressing the wicked problem of learning data privacy though principle and p...
 
dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019dkNET Poster Experimental Biology 2019
dkNET Poster Experimental Biology 2019
 
Open Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality DataOpen Access as a Means to Produce High Quality Data
Open Access as a Means to Produce High Quality Data
 
The Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina LeonelliThe Challenges of Making Data Travel, by Sabina Leonelli
The Challenges of Making Data Travel, by Sabina Leonelli
 
Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...Data Governance in two different data archives: When is a federal data reposi...
Data Governance in two different data archives: When is a federal data reposi...
 
Data Analytics
Data AnalyticsData Analytics
Data Analytics
 
cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)cBioPortal Webinar Slides (3/3)
cBioPortal Webinar Slides (3/3)
 
Genome sharing projects around the world nijmegen oct 29 - 2015
Genome sharing projects around the world   nijmegen oct 29 - 2015Genome sharing projects around the world   nijmegen oct 29 - 2015
Genome sharing projects around the world nijmegen oct 29 - 2015
 
Navigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan ValenNavigating the data management ecosystem - Dan Valen
Navigating the data management ecosystem - Dan Valen
 
Markham2009
Markham2009Markham2009
Markham2009
 
Finding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics DatasetsFinding and Accessing Human Genomics Datasets
Finding and Accessing Human Genomics Datasets
 

Similar to CINECA webinar slides: Making cohort data FAIR

The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...African Open Science Platform
 
CINECA webinar slides: Ethics/ELSI considerations - From FAIR to fair data sh...
CINECA webinar slides: Ethics/ELSI considerations - From FAIR to fair data sh...CINECA webinar slides: Ethics/ELSI considerations - From FAIR to fair data sh...
CINECA webinar slides: Ethics/ELSI considerations - From FAIR to fair data sh...CINECAProject
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonAfrican Open Science Platform
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Tom Plasterer
 
Digital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data scienceDigital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data scienceVarsha Khodiyar
 
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"eventSusanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"eventGigaScience, BGI Hong Kong
 
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...Academy of Science of South Africa (ASSAf)
 
Clinical Research Informatics Year-in-Review 2024
Clinical Research Informatics Year-in-Review 2024Clinical Research Informatics Year-in-Review 2024
Clinical Research Informatics Year-in-Review 2024Peter Embi
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataARDC
 
David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access DataSciSIG
 
NIH Data Science Special Interest Group
NIH Data Science Special Interest GroupNIH Data Science Special Interest Group
NIH Data Science Special Interest GroupYaffa Rubinstien
 
Fair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortFair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortData Science NIH
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...African Open Science Platform
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Susanna-Assunta Sansone
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...William Gunn
 
CINECA webinar slides: H3ABioNet Experiences in Phenotype Data Harmonisation ...
CINECA webinar slides: H3ABioNet Experiences in Phenotype Data Harmonisation ...CINECA webinar slides: H3ABioNet Experiences in Phenotype Data Harmonisation ...
CINECA webinar slides: H3ABioNet Experiences in Phenotype Data Harmonisation ...CINECAProject
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersIncisive_Events
 

Similar to CINECA webinar slides: Making cohort data FAIR (20)

The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
The African Open Science Platform: Policy, Infrastructure, Skills and Incenti...
 
CINECA webinar slides: Ethics/ELSI considerations - From FAIR to fair data sh...
CINECA webinar slides: Ethics/ELSI considerations - From FAIR to fair data sh...CINECA webinar slides: Ethics/ELSI considerations - From FAIR to fair data sh...
CINECA webinar slides: Ethics/ELSI considerations - From FAIR to fair data sh...
 
Open Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon HodsonOpen Science Globally: Some Developments/Dr Simon Hodson
Open Science Globally: Some Developments/Dr Simon Hodson
 
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
Making Data FAIR (Findable, Accessible, Interoperable, Reusable)
 
FAIR overview - MAQC Society, Feb 2018
FAIR overview - MAQC Society, Feb 2018FAIR overview - MAQC Society, Feb 2018
FAIR overview - MAQC Society, Feb 2018
 
Digital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data scienceDigital transformation to enable a FAIR approach for health data science
Digital transformation to enable a FAIR approach for health data science
 
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"eventSusanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
Susanna Sansone at the Knowledge Dialogues/ODHK "Beyond Open"event
 
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
Open FAIR Data and Open Science: Developing Partnerships, Strategies, Policie...
 
Clinical Research Informatics Year-in-Review 2024
Clinical Research Informatics Year-in-Review 2024Clinical Research Informatics Year-in-Review 2024
Clinical Research Informatics Year-in-Review 2024
 
International perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research dataInternational perspective for sharing publicly funded medical research data
International perspective for sharing publicly funded medical research data
 
David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access David Van Enckevort - FAIR sample and data access
David Van Enckevort - FAIR sample and data access
 
NIH Data Science Special Interest Group
NIH Data Science Special Interest GroupNIH Data Science Special Interest Group
NIH Data Science Special Interest Group
 
Fair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevortFair sample and data access -David Van enckevort
Fair sample and data access -David Van enckevort
 
Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...Data as a research output and a research asset: the case for Open Science/Sim...
Data as a research output and a research asset: the case for Open Science/Sim...
 
2015 04-18-wilson cg
2015 04-18-wilson cg2015 04-18-wilson cg
2015 04-18-wilson cg
 
Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014Open Access Week - Oxford, 20-24 Oct 2014
Open Access Week - Oxford, 20-24 Oct 2014
 
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
Sci Know Mine 2013: What can we learn from topic modeling on 350M academic do...
 
CINECA webinar slides: H3ABioNet Experiences in Phenotype Data Harmonisation ...
CINECA webinar slides: H3ABioNet Experiences in Phenotype Data Harmonisation ...CINECA webinar slides: H3ABioNet Experiences in Phenotype Data Harmonisation ...
CINECA webinar slides: H3ABioNet Experiences in Phenotype Data Harmonisation ...
 
Öppen data och forskningens genomslag
Öppen data och forskningens genomslagÖppen data och forskningens genomslag
Öppen data och forskningens genomslag
 
Alain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producersAlain Frey Research Data for universities and information producers
Alain Frey Research Data for universities and information producers
 

More from CINECAProject

CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...CINECAProject
 
Beacon v2 Reference Implementation: An Overview
Beacon v2 Reference Implementation: An OverviewBeacon v2 Reference Implementation: An Overview
Beacon v2 Reference Implementation: An OverviewCINECAProject
 
Lighting a Beacon: training for (future) implementers
Lighting a Beacon: training for (future) implementersLighting a Beacon: training for (future) implementers
Lighting a Beacon: training for (future) implementersCINECAProject
 
CINECA webinar slides: How to make training FAIR
CINECA webinar slides: How to make training FAIRCINECA webinar slides: How to make training FAIR
CINECA webinar slides: How to make training FAIRCINECAProject
 
CINECA webinar slides: Practically FAIR
CINECA webinar slides: Practically FAIRCINECA webinar slides: Practically FAIR
CINECA webinar slides: Practically FAIRCINECAProject
 
CINECA webinar slides: FAIR software tools
CINECA webinar slides: FAIR software toolsCINECA webinar slides: FAIR software tools
CINECA webinar slides: FAIR software toolsCINECAProject
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECAProject
 
CINECA webinar slides: Status Update Code of Conduct: Teaming up & Talking ab...
CINECA webinar slides: Status Update Code of Conduct: Teaming up & Talking ab...CINECA webinar slides: Status Update Code of Conduct: Teaming up & Talking ab...
CINECA webinar slides: Status Update Code of Conduct: Teaming up & Talking ab...CINECAProject
 
CINECA webinar slides: Ethical, legal and societal issues in international da...
CINECA webinar slides: Ethical, legal and societal issues in international da...CINECA webinar slides: Ethical, legal and societal issues in international da...
CINECA webinar slides: Ethical, legal and societal issues in international da...CINECAProject
 

More from CINECAProject (9)

CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...CINECA webinar slides: Modular and reproducible workflows for federated molec...
CINECA webinar slides: Modular and reproducible workflows for federated molec...
 
Beacon v2 Reference Implementation: An Overview
Beacon v2 Reference Implementation: An OverviewBeacon v2 Reference Implementation: An Overview
Beacon v2 Reference Implementation: An Overview
 
Lighting a Beacon: training for (future) implementers
Lighting a Beacon: training for (future) implementersLighting a Beacon: training for (future) implementers
Lighting a Beacon: training for (future) implementers
 
CINECA webinar slides: How to make training FAIR
CINECA webinar slides: How to make training FAIRCINECA webinar slides: How to make training FAIR
CINECA webinar slides: How to make training FAIR
 
CINECA webinar slides: Practically FAIR
CINECA webinar slides: Practically FAIRCINECA webinar slides: Practically FAIR
CINECA webinar slides: Practically FAIR
 
CINECA webinar slides: FAIR software tools
CINECA webinar slides: FAIR software toolsCINECA webinar slides: FAIR software tools
CINECA webinar slides: FAIR software tools
 
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
CINECA webinar slides: Data Gravity in the Life Sciences: Lessons learned fro...
 
CINECA webinar slides: Status Update Code of Conduct: Teaming up & Talking ab...
CINECA webinar slides: Status Update Code of Conduct: Teaming up & Talking ab...CINECA webinar slides: Status Update Code of Conduct: Teaming up & Talking ab...
CINECA webinar slides: Status Update Code of Conduct: Teaming up & Talking ab...
 
CINECA webinar slides: Ethical, legal and societal issues in international da...
CINECA webinar slides: Ethical, legal and societal issues in international da...CINECA webinar slides: Ethical, legal and societal issues in international da...
CINECA webinar slides: Ethical, legal and societal issues in international da...
 

Recently uploaded

Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...aartirawatdelhi
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Dipal Arora
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...astropune
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...Arohi Goyal
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...chandars293
 
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Genuine Call Girls
 
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Dipal Arora
 
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Servicevidya singh
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Call Girls in Nagpur High Profile
 
Call Girls Haridwar Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Haridwar Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Haridwar Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Haridwar Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...narwatsonia7
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...parulsinha
 
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...perfect solution
 

Recently uploaded (20)

Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Nagpur Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
Best Rate (Patna ) Call Girls Patna ⟟ 8617370543 ⟟ High Class Call Girl In 5 ...
 
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
Best Rate (Hyderabad) Call Girls Jahanuma ⟟ 8250192130 ⟟ High Class Call Girl...
 
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
All Time Service Available Call Girls Marine Drive 📳 9820252231 For 18+ VIP C...
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
 
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Faridabad Just Call 9907093804 Top Class Call Girl Service Available
 
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
Pondicherry Call Girls Book Now 9630942363 Top Class Pondicherry Escort Servi...
 
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
 
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
Best Rate (Guwahati ) Call Girls Guwahati ⟟ 8617370543 ⟟ High Class Call Girl...
 
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Siliguri Just Call 9907093804 Top Class Call Girl Service Available
 
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort ServicePremium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
Premium Call Girls Cottonpet Whatsapp 7001035870 Independent Escort Service
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
 
Call Girls Haridwar Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Haridwar Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Haridwar Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Haridwar Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bangalore Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...Top Rated Bangalore Call Girls Richmond Circle ⟟  9332606886 ⟟ Call Me For Ge...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 9332606886 ⟟ Call Me For Ge...
 
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
(Low Rate RASHMI ) Rate Of Call Girls Jaipur ❣ 8445551418 ❣ Elite Models & Ce...
 
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
College Call Girls in Haridwar 9667172968 Short 4000 Night 10000 Best call gi...
 

CINECA webinar slides: Making cohort data FAIR

  • 1. This project has received funding from the European Union’s Horizon 2020 research and Innovation programme under grant agreement No. 825775 Making Cohort data FAIR Presenter: William Hsiao (Simon Fraser University) Host: Marta Lloret Llinares (EMBL-EBI)
  • 2. This webinar is being recorded
  • 3. Audience Q&A Session Please write your questions in the questions window of the GoToWebinar application
  • 4. The challenges: Stay informed @CinecaProject www.cineca-project.eu Common Infrastructure for National Cohorts in Europe, Canada and Africa This project has received funding from the European Union’s Horizon 2020 research and Innovation programme under grant agreement No. 825775 Accelerating disease research and improving health by facilitating transcontinental human data exchange The vision: This project has received funding from the Canadian Institute of Health Research under grant agreement #404896
  • 5. Context for the webinar • CINECA “How FAIR are you?” webinar series and hackathon: • https://www.cineca-project.eu/news-events-all/how-fair-are-you- webinar-series-and-hackathon • Webinar series Jan-April • Making cohort data FAIR • FAIR software tools • Practically FAIR • How to make training FAIR • Ethics/ELSI considerations • Hackathon 28-29th April 4 hours per day • 3 streams: cohort data, software, training materials
  • 6. Today’s presenter Dr. William Hsiao joined the Faculty of Health Sciences at Simon Fraser University as an associate professor in September 2020. He is also an affiliated scientist at the BC Centre for Disease Control Public Health Laboratory (BCCDC PHL) and at Canada’s Michael Smith Genome Sciences Centre. Prior to joining FHS, Dr. Hsiao was the chief bioinformatician and a senior scientist at the BCCDC PHL for 8 years and a clinical associate professor at the University of British Columbia. Dr. Hsiao obtained his PhD in the Department of Molecular Biology and Biochemistry at Simon Fraser University followed by a postdoctoral fellowship at the Institute for Genome Sciences at University of Maryland School of Medicine. His research focused on microbial genomics and metagenomics. At the BCCDC PHL, he incorporated data science and knowledge engineering into his research and developed expertise in public health data sharing, integration, and harmonization. With experience conducting and applying genomics and data science research both in academia and in government laboratories, Dr. Hsiao has developed a special perspective on integrating basic and applied research to improve our public health system.
  • 7. This project has received funding from the European Union’s Horizon 2020 research and Innovation programme under grant agreement No. 825775 Making Cohort data FAIR William Hsiao, PhD Associate Professor, Faculty of Health Sciences Simon Fraser University
  • 8. Cohort Studies • A cohort is a group of people who share a common characteristics or experience within a defined period • A cohort study typically is observational in nature and follows a large number of participants over a long period of time (longitudinal) • Useful when intervention (e.g. RCT) is not possible or not ethical • Exposure to risk factors are measured or collected • Multiple outcomes can be observed (e.g. from large population-based cohort) • Valuable for scientific understanding of disease causation • In CINECA, ~10 cohorts are part of the consortium • A goal of CINECA is to enable federated queries and analyses of the varying and wide-ranging datasets from the 10 CINECA cohorts.
  • 9. Construction of Cohorts • Define study question(s)/scope and identify and consent the study subjects (cohort) • Obtain baseline data on the exposure(s) to the risk factor(s) of interest (e.g. obese vs. lean, genetic allelic variation) • Subclassify the cohorts into groups with or without (or less) of the risk factor(s) • Follow-up with the participants to measure the outcomes using data collection instruments (questionnaire, physical measures, cognitive measures, biosamples, etc.) • Analyze the data to see if outcome correlated to risk factor(s); infer causality
  • 10. Examples of Cohorts in CINECA • CHILD: The CHILD Cohort Study is a prospective longitudinal birth cohort study in Canada; ~3000 participating families (trios); multidisciplinary; multi-modal (questionnaires, biological samples, home assessments, clinical assessments) • UK-Biobank: population based biomedical database; 500,000 participants from UK (national cohort); able to mobilize for COVID-19 related research by linking host genetics to disease status • H3Africa: a pan-African consortium to build research program and infrastructure in genomic medicine on the African continent; biospecimens and data collection for research
  • 11. Challenges with Cohort Studies • Due to the (usually) prospective, large sample-size, and longitudinal nature, cohort studies can be expensive and labour-intensive • Data collection instruments and variables are usually specific to a study, and data may not be interoperable and machine readable (e.g. free text, custom pick-list, non-standardized units) • Most valuable data is often the most sensitive and access controlled • Consent process may restrict the re-use of cohort data • Broad consents are encouraged as long as they are informed consents and allow withdrawal of the consents at any time
  • 12. Discovery of Cohort Database Content (F/A) • Cohort Profiles are often in descriptive text that are human but not machine readable • Similarly, data fields and study protocols are accessible openly but only in human readable formats (free text or tables) Cohort profile – CHILD Cohort Study (childstudy.ca)
  • 13. Access to Data is controlled to protect privacy Data access – CHILD Cohort Study (childstudy.ca)
  • 14. Minimal Metadata for Cohorts • One solution is to make cohort metadata and aggregated data researchable by machines by encoding the data in common formats • E.g. GO FAIR’s Framework for FAIRification has a Metadata for Machines (M4M) component • Assess community specific metadata practices • Using the FAIR principles to define metadata elements • Formulate these decisions as machine-actionable templates • Register these templates so they are FAIR and accessible by machine API • Result in domain-specific, community built, FAIR metadata schema Metadata for Machines - GO FAIR (go-fair.org)
  • 15. A Minimal Metadata Model for CINECA • A minimal metadata model is being developed for CINECA • Based on surveys of common data fields in 10 cohort studies (no assessment of actual data, but consulted data dictionaries to capture acceptable data values) • Supplemented with common data fields from additional data catalogues • Supplemented with “use cases” • Each variable grouped into a broad category (e.g. socio-demographic and economic characteristics, diseases, and lifestyle and behaviours) • Maelstrom Research data standards were used as the basis for the categorization – promote compatibility with existing data catalogues • Each variables is mapped to an ontology term (work in progress) • Metadata model encoded as a new ”application ontology” called GEKCO (Genomics Cohorts Knowledge Ontology) - http://www.obofoundry.org/ontology/gecko.html
  • 16. Example of Structured Variables in the CINECA Minimal Metadata Model
  • 17. Harmonization of metadata (-IR) • Data collection instruments are usually developed in silos (only considering the study scope and current research needs) • This leads to incompatibilities when trying to aggregate datasets across studies at both the cohort level (study attributes) and at the individual data (participant attributes) • As a result, data are: • Not Interoperable • Not (easily) reusable
  • 18. Similar Epi/Clinical data needs across agencies Questions re: diseases, symptoms, clinical events, dates of onset CDC Atlanta, BC CDC, WHO 18
  • 19. Examples of Similar Fields / Similar Names
  • 20. COVID-19 Case Report Forms in Canada
  • 21. COVID-19 Mutation Reports • RG203KR • insG28262GAACA • 3675-3677 SGF • L242_244L deletion; L242del,A243del,L244del • SGF 3675-3677 deletion • Q27stop • HV 69-70 deletion • Y144 deletion • Common: locations and amino acid substitutions • Variable: nucleotide vs. amino acid locations; abbreviations; gene names
  • 22. Maelstrom Data Harmonization Guideline Fortier, Isabel, et al. "Maelstrom Research guidelines for rigorous retrospective data harmonization." International journal of epidemiology (2017) 46 (1): 103-105.
  • 23. Semantic Web and Ontology • Semantic Web term refers to set of standards and best practice for sharing data and the semantic (Meanings) of that data over the web by using application. Semantic web can be processed directly or indirectly from machines, so basically human and computers can work in cooperation. • Ontology is a way to structure knowledge in a machine readable way with defined terms and relationships. • In ontology, one can provide a set of vocabularies that can be used to model a domain, which is types of objects and concepts that are exist in specific area and the relationship between them (Gruber TR, 1993).
  • 24. Ontology ● Ontologies provide standard and linkable IDs for ○ names for form and tabular data fields ○ terms (choices) for categorical variables ● Modelling processes into data structures may be subjective: Who is the model for, and to what granularity? ● Upper level ontologies (e.g. BFO) bring some harmony but do not guarantee full interoperability
  • 25. Ontology Creation and Re-use ● No silo = build the ontology as a suite of interoperable modules structured to provide berths for future ontologies as needed ● No short half-life = start not with data (which changes rapidly) but with the entities (things, processes, attributes, ...) the data are used to describe ● No reinvent the wheel = follow a method tested in over 300 ontology-building initiatives and documented in ISO 21838, (https://www.iso.org/standard/71954.html) leveraging existing resources wherever possible ● E.g. GECKO ontology reuse terms whenever possible
  • 26. • Formal representation of objects, concepts and relationships among them • Shared understanding [language] for communicating cohort information • Overcome the semantic heterogeneity • Ontologies are interpretable by humans and by computer programs. • Information integration - Multiple information resources are combined using ontologies to match concepts with similar meaning. • Ontology embedded into software - Object-oriented implementation (e.g. Java classes) generated from classes in the ontology… Strength of Ontology and Semantic web 26
  • 27. 27 GEEM: platform for building ontology-based data specifications http://genepio.org/geem/form.html#GENEPIO:0002083
  • 28. Challenges with Harmonization • balance between accepting only precisely uniform variables that render pooling straightforward (e.g. exact question or standard operating procedures) but limit the potential to integrate multiple studies; and accepting a certain level of heterogeneity across participating studies providing similar but not necessarily identical data • E.g. broader terms used to aggregate more detailed terms • E.g. same term used for different scales https://www.maelstrom-research.org/about-harmonization/guidelines/data-processing-methods
  • 29. Benefit Sharing and Privacy Concerns • Making Cohort Data FAIR can mean broader and easier access to data • However, usually the HICs (more research capacity) can take better advantages of such broader access than LMICs • Food for Thought (discussion): how to we protect the interests of study participants and enable benefit sharing?
  • 30. Questions? Title: Making Cohort data FAIR Presenter: William Hsiao Please write your questions in the questions window of the GoToWebinar application
  • 31. Next CINECA webinars Title: FAIR Software tools Presenter: Carlos Martinez Date: Wed 24th February 2021 Time: 3:00 PM GMT / 4:00 PM CET Registration and details: https://www.cineca-project.eu/news- events-all/fair-software-tools Title: Practically Fair Presenter: Andrew Stubbs Date: Thurs 4th March 2021 Time: 3:00 PM GMT / 4:00 PM CET Registration and details: https://www.cineca-project.eu/news- events-all/practically-fair