SlideShare a Scribd company logo
Open science and causality in the Exposome era
Lessons learned from the Exposome Data Challenge
Léa Maitre, PhD
Barcelona Institute for Global Health (ISGlobal)
June 23rd 2022 – One Health Conference
The Exposome concept
The totality of environmental exposures (meaning all non-genetic factors) that
a person experiences, from conception onwards.
Chris Wild, 2005.
Léa Maitre - ISGlobal - Exposome data challenge
Classical approach in environmental epidemiology:
single-exposure association
 Selective reporting of associations 
Publication bias
 No correction for multiple testing
(separate papers for each exposure)
 Cannot take into account
confounding by co-exposures
 Lack of consideration of “mixture
effect”
Exposome approach
calls for a holistic view of the
effects of environmental
exposures on human health
Léa Maitre - ISGlobal - Exposome data challenge
A collaborative, open-science approach to promote
and accelerate innovation in Exposome research
Léa Maitre - ISGlobal - Exposome data challenge
The parable of the blind men and the elephant
Data science/
AI expert
Epidemiologist
Statistician
System
biologist
(-omics)
Illustration: © Natasha Stolovitzky-Brunner
Data Challenges to
promote:
 Accelerated innovation
 Interdisciplinary
collaboration
 Participation from around
the world in COVID time
 Training material for
educational purposes
The Exposome data challenge
 Event created in the framework of the ISGlobal Exposome hub and H2020 ATHLETE
project
 Scientific publication pre-print https://arxiv.org/abs/2202.01680 (under review)
 Simulated data (based on the HELIX project) publicly available to challenge
researchers on statistical tools to study exposome-health associations
Léa Maitre - ISGlobal - Exposome data challenge
Léa Maitre - ISGlobal - Exposome data challenge
- 25 selected teams out of 39
abstracts sent
- 307 participants online:
• 101 North America
• 186 Europe
• 9 Asia
• 8 Latin America
• 2 Africa
• 1 Australia
- Awards attributed from the public
and from the comittee
The Human early-life exposome project (HELIX)
Study design
Six mother-child cohorts in Europe (n=1301)
Pregnant women enrolled at the beginning of their pregnancy
Follow-up of the children with a standardized clinical examination at 6-11 years old
Aim
To study the association between multiple exposures, molecular signatures, and
child health outcomes.
6-11 yo
Léa Maitre - ISGlobal - Exposome data challenge
Exposure assessment
>100 environmental factors
Assessed during pregnancy and childhood (at the time of the children follow-up, 6-11yo)
Outdoor exposures
(Geographic Information System)
Air pollution*
Noise†
Built environment†
Natural spaces†
Traffic
Meteorology*
Water DBP
Indoor air
Chemicals
(blood or urine biomarkers)
Organochlorines
PBDE
PFAS
Metals
Phthalates
Phenols
Organophosphate pesticides
Lifestyles
(questionnaires)
Smoking
Diet
Physical activity
Social and economic capital
Sleep
* Postnatal exposures available within different time window
† Postnatal exposures available at different location: home and school
Léa Maitre - ISGlobal - Exposome data challenge
Health outcomes
6 health outcomes
At birth or at the time of the children follow-up (6-11yo)
Continuous variables
Birth weight
Body mass index at 6-11yo
Categorical variables
Asthma at 6-11yo (binary)
Body mass index at 6-11yo
(4 categories)
Count variables
Intelligence quotient at 6-11yo
Total correct answers (RAVEN
test)
Neuro behavior at 6-11yo
Internalizing and externalizing
problems (CBCL scale)
Léa Maitre - ISGlobal - Exposome data challenge
Covariates, potential confounders
Maternal and child data
Maternal data
Cohort of inclusion
Age
Education
Pre-pregnancy body mass index
Weight gain during pregnancy
Parity
Child data at birth
Sex
Gestational age
Year of birth
Native origin
Child data at 6-11yo
Age
Weight
Height
Léa Maitre - ISGlobal - Exposome data challenge
Omics data
>400,000 features
Assessed during childhood only (at the time of the children follow-up, 6-11yo)
White blood cells
DNA methylation
(386,518 CpGs)
Transcription
Gene expression
(58,254 transcripts)
Plasma & serum
Proteins
(36 proteins)
Metabolites
(177 metabolites)
Urine
Metabolites
(44 metabolites)
Léa Maitre - ISGlobal - Exposome data challenge
Data summary
Naissance
lusion:1er trimestre
estionnaire
nes
Dossier maternité
+
Sang du cordon,
Fragments placenta,
Cheveux
Questionnaires
(mère, père,
enfant)
2 ans
Questionnaires
+ Suivi médical
(médecine
scolaire)
4 ans
Naissance
lusion:1er trimestre
estionnaire
nes
Dossier maternité
+
Sang du cordon,
Fragments placenta,
Cheveux
Questionnaires
(mère, père,
enfant)
2 ans
Questionnaires
+ Suivi médical
(médecine
scolaire)
4 ans
6-11 yo
Exposures ++ Exposures ++
Omics ++++
Health
Covariates
Health
Covariates
Léa Maitre - ISGlobal - Exposome data challenge
Particularity of the data
Correlation between
exposures
Causal relation
between
exposures/omic layers
High dimension
(exposure, omics)
Multi-center study (6
mother-child cohorts)
Repeated exposure
data (pregnancy and
childhood)
Small sample size
(n=1301 mother-child
pairs)
Missing data
Non-linear association
Continuous vs
categorical variables
Léa Maitre - ISGlobal - Exposome data challenge
Challenge examples
Challenge 1: Combined effects of exposures
Identify combination of exposures, high-order
interactions or exposure patterns that are particularly
harmful or beneficial for one or several health outcomes.
Challenge 2: Using omics data to improve inference
on the link between exposome and health.
Incorporate the different omics layers into the analysis
linking the exposome and one or more health outcomes.
Challenge 3: Multi-omics analysis
Incorporate different layers of omics data (including exposome as one of the layers) to find
patterns that can explain variations in one or more health outcomes.
Challenge 4: Causal structure in the exposome
Define hypothesized causal relationships between the
different exposures and one health outcome, and
incorporate this information into the analysis.
Challenge 5: Visualization techniques
Tools to visualize the complex relationships between the
different components of the analysis, with the main aim of
illustrating determinants of health effects.
/! Control for potential confounders and multicenter design.
Handle missing data.
Léa Maitre - ISGlobal - Exposome data challenge
Popular vote: winner
Using causal random forest to determine exposure environments with high
sexual dimorphisms
Alejandro Caceres, ISGlobal
Committee vote: winner 1
Quantifying Exposome-Health Associations with Bayesian Multiple Index
Models
Glen McGee, University of Waterloo
github.com/glenmcgee/bsmim2
glen.mcgee@uwaterloo.ca
Interactions between multipollutant index and individual covariates
Committee vote: winner 2
Decoding unknown links between the exposome and health outcomes by
multi-omics analysis
Xiaotao Shen, Stanford University
Ressources
Exposome data challenge
Dataset: https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/blob/main/README.md
Data description: https://docs.google.com/document/d/1ul3v-sIniLuTjFB1F1CrFQIX8mrEXVnvSzOF7BCOnpQ/edit
Scientific publication https://arxiv.org/abs/2202.01680 (under review)
Slides and videos of the presentations https://www.isglobal.org/-/exposome-data-analysis-challenge
https://www.youtube.com/channel/UC0F3hR04UzUeKkcfAyikltA/featured
Code used shared on GitHub: https://github.com/isglobal-
exposomeHub/ExposomeDataChallenge2021/tree/main/R_Code_Presentations
HELIX project
Data inventory: https://www.projecthelix.eu/index.php/es/data-inventory
Tamayo-Uria I, et al. The early-life exposome: Description and patterns in six European countries. Environ Int. 2019.
Maitre L, et al. Human Early Life Exposome (HELIX) study: a European population-based exposome cohort. BMJ Open. 2018.
Vrijheid M, et al. The human early-life exposome (HELIX): project rationale and design. Environ Health Perspect. 2014
Léa Maitre - ISGlobal - Exposome data challenge
 Publicize methods
for developers
 Code source for
analysists
Challenge challenges
• Making real-case, sensitive personal data, publicly available
 Partial imputation of the data
• Finding a balance between well-defined research questions and
generalizability of the results to a wide community
The first edition of the challenge was focused on general data analysis approach challenges,
rather than specific research questions, which means methods presented could not be
directly compared. The advantage was a broad catalogue of methods
• A short time frame, sometimes lack of adjustment for known biaises
• Appropriate recognition for the data-generating team
Léa Maitre - ISGlobal - Exposome data challenge
Challenge successes
• Knowledge transfer -> Training tools for students, already used by
universities such as French Centrale sup elec or PhD students
• New collaborations formed
• Accelerated the rate of scientific discovery, out-of-the-box ideas
created by multidisciplinary teams in a short time:
• 25 teams who worked over a month, will never equal the work even of a
consortium over 5 years (only 5 methods tested for the HELIX stat protocol,
run mainly by one experienced statistician)
Léa Maitre - ISGlobal - Exposome data challenge
Thank you
Léa Maitre - ISGlobal - Exposome data challenge
Contact me at
lea.maitre@isglobal.org
Extra slides
Léa Maitre - ISGlobal - Exposome data challenge
The early life period
 Vulnerable period of rapid
organ development
 Many chronic diseases have
part of their origin in early life
(Barker hypothesis)
 Starting point for a life-course
exposome
Léa Maitre - ISGlobal - Exposome data challenge
The Exposome data challenge
 Event created in the framework of the ISGlobal Exposome hub and H2020 ATHLETE
project
 Organized by ISGlobal, Barcelona
 Simulated data (based on the HELIX project) publicly available to challenge
researchers on statistical tools to study exposome-health associations
Léa Maitre - ISGlobal - Exposome data challenge
Organization of the datasets
Exposures
Omics
Health
Covariates
data.frame: exposome
data.frame: phenotype
data.frame: covariates
DNA methylation GenomicRatioSet: methyl
Gene expression ExpressionSet: genexpr
Metabolites (urine) ExpressionSet: metabol_urine
Metabolites (serum) ExpressionSet: metabol_serum
Proteines ExpressionSet: proteom
Data available here: https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/blob/main/README.md
Léa Maitre - ISGlobal - Exposome data challenge
Codebook
Exposures Health Covariates
https://github.com/isglobal-brge/brgedata/blob/master/data/ExposomeDataChallenge2021/codebook.xlsx
Léa Maitre - ISGlobal - Exposome data challenge

More Related Content

Similar to Exposome data challenge - ONE2022.pptx

Open Data and the Social Sciences - OpenCon Community Webcast
Open Data and the Social Sciences - OpenCon Community WebcastOpen Data and the Social Sciences - OpenCon Community Webcast
Open Data and the Social Sciences - OpenCon Community Webcast
Right to Research
 
Innovative research approaches to improve evidence in global health
Innovative research approaches to improve evidence in global healthInnovative research approaches to improve evidence in global health
Innovative research approaches to improve evidence in global health
Emilie Robert
 
Brescia Exposome climate LMaitre April 2023.pptx
Brescia Exposome climate LMaitre April 2023.pptxBrescia Exposome climate LMaitre April 2023.pptx
Brescia Exposome climate LMaitre April 2023.pptx
LeaMaitre1
 
Big Data and the Future by Sherri Rose
Big Data and the Future by Sherri RoseBig Data and the Future by Sherri Rose
Big Data and the Future by Sherri Rose
Lewis Lin 🦊
 
20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine
Brian Bot
 
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
Larry Smarr
 
Roche_open_science_NIOO_KNAW_workshop_NL
Roche_open_science_NIOO_KNAW_workshop_NLRoche_open_science_NIOO_KNAW_workshop_NL
Roche_open_science_NIOO_KNAW_workshop_NL
Dominique Roche
 
Beacon Network: A System for Global Genomic Data Sharing
Beacon Network: A System for Global Genomic Data SharingBeacon Network: A System for Global Genomic Data Sharing
Beacon Network: A System for Global Genomic Data Sharing
Miro Cupak
 
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
World Cancer Research Fund International
 
Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...
Fondazione Giannino Bassetti
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
Chirag Patel
 
Putting bioscience into context
Putting bioscience into contextPutting bioscience into context
Putting bioscience into context
Chris Willmott
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical Research
Jessica Minnier
 
The shared value of personal and population data
The shared value of personal and population dataThe shared value of personal and population data
The shared value of personal and population data
Wessel Kraaij
 
Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research  Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research
US Environmental Protection Agency (EPA), Center for Computational Toxicology and Exposure
 
XNN001 Introductory epidemiological concepts - Study design
XNN001 Introductory epidemiological concepts - Study designXNN001 Introductory epidemiological concepts - Study design
XNN001 Introductory epidemiological concepts - Study design
ramseyr
 
ISB Prosperity Partnership Presentation by John Aitchison
ISB Prosperity Partnership Presentation by John AitchisonISB Prosperity Partnership Presentation by John Aitchison
ISB Prosperity Partnership Presentation by John Aitchison
Institute for Systems Biology
 
The Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesThe Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across Scales
Philip Payne
 
Repurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in diseaseRepurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in disease
Chirag Patel
 
Sharing data from clinical and medical research
Sharing data from clinical and medical researchSharing data from clinical and medical research
Sharing data from clinical and medical research
Christoph Steinbeck
 

Similar to Exposome data challenge - ONE2022.pptx (20)

Open Data and the Social Sciences - OpenCon Community Webcast
Open Data and the Social Sciences - OpenCon Community WebcastOpen Data and the Social Sciences - OpenCon Community Webcast
Open Data and the Social Sciences - OpenCon Community Webcast
 
Innovative research approaches to improve evidence in global health
Innovative research approaches to improve evidence in global healthInnovative research approaches to improve evidence in global health
Innovative research approaches to improve evidence in global health
 
Brescia Exposome climate LMaitre April 2023.pptx
Brescia Exposome climate LMaitre April 2023.pptxBrescia Exposome climate LMaitre April 2023.pptx
Brescia Exposome climate LMaitre April 2023.pptx
 
Big Data and the Future by Sherri Rose
Big Data and the Future by Sherri RoseBig Data and the Future by Sherri Rose
Big Data and the Future by Sherri Rose
 
20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine
 
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
 
Roche_open_science_NIOO_KNAW_workshop_NL
Roche_open_science_NIOO_KNAW_workshop_NLRoche_open_science_NIOO_KNAW_workshop_NL
Roche_open_science_NIOO_KNAW_workshop_NL
 
Beacon Network: A System for Global Genomic Data Sharing
Beacon Network: A System for Global Genomic Data SharingBeacon Network: A System for Global Genomic Data Sharing
Beacon Network: A System for Global Genomic Data Sharing
 
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
 
Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
Putting bioscience into context
Putting bioscience into contextPutting bioscience into context
Putting bioscience into context
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical Research
 
The shared value of personal and population data
The shared value of personal and population dataThe shared value of personal and population data
The shared value of personal and population data
 
Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research  Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research
 
XNN001 Introductory epidemiological concepts - Study design
XNN001 Introductory epidemiological concepts - Study designXNN001 Introductory epidemiological concepts - Study design
XNN001 Introductory epidemiological concepts - Study design
 
ISB Prosperity Partnership Presentation by John Aitchison
ISB Prosperity Partnership Presentation by John AitchisonISB Prosperity Partnership Presentation by John Aitchison
ISB Prosperity Partnership Presentation by John Aitchison
 
The Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across ScalesThe Learning Health System: Thinking and Acting Across Scales
The Learning Health System: Thinking and Acting Across Scales
 
Repurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in diseaseRepurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in disease
 
Sharing data from clinical and medical research
Sharing data from clinical and medical researchSharing data from clinical and medical research
Sharing data from clinical and medical research
 

Recently uploaded

Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
terusbelajar5
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
Sérgio Sacani
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
IshaGoswami9
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Ana Luísa Pinho
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
Gokturk Mehmet Dilci
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
yqqaatn0
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
European Sustainable Phosphorus Platform
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
HongcNguyn6
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Texas Alliance of Groundwater Districts
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
Texas Alliance of Groundwater Districts
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
yqqaatn0
 

Recently uploaded (20)

Medical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptxMedical Orthopedic PowerPoint Templates.pptx
Medical Orthopedic PowerPoint Templates.pptx
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
The binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defectsThe binding of cosmological structures by massless topological defects
The binding of cosmological structures by massless topological defects
 
Phenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvementPhenomics assisted breeding in crop improvement
Phenomics assisted breeding in crop improvement
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
Deep Behavioral Phenotyping in Systems Neuroscience for Functional Atlasing a...
 
Shallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptxShallowest Oil Discovery of Turkiye.pptx
Shallowest Oil Discovery of Turkiye.pptx
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
原版制作(carleton毕业证书)卡尔顿大学毕业证硕士文凭原版一模一样
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
Thornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdfThornton ESPP slides UK WW Network 4_6_24.pdf
Thornton ESPP slides UK WW Network 4_6_24.pdf
 
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốtmô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
mô tả các thí nghiệm về đánh giá tác động dòng khí hóa sau đốt
 
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero WaterSharlene Leurig - Enabling Onsite Water Use with Net Zero Water
Sharlene Leurig - Enabling Onsite Water Use with Net Zero Water
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
如何办理(uvic毕业证书)维多利亚大学毕业证本科学位证书原版一模一样
 

Exposome data challenge - ONE2022.pptx

  • 1. Open science and causality in the Exposome era Lessons learned from the Exposome Data Challenge Léa Maitre, PhD Barcelona Institute for Global Health (ISGlobal) June 23rd 2022 – One Health Conference
  • 2. The Exposome concept The totality of environmental exposures (meaning all non-genetic factors) that a person experiences, from conception onwards. Chris Wild, 2005. Léa Maitre - ISGlobal - Exposome data challenge
  • 3. Classical approach in environmental epidemiology: single-exposure association  Selective reporting of associations  Publication bias  No correction for multiple testing (separate papers for each exposure)  Cannot take into account confounding by co-exposures  Lack of consideration of “mixture effect” Exposome approach calls for a holistic view of the effects of environmental exposures on human health Léa Maitre - ISGlobal - Exposome data challenge
  • 4. A collaborative, open-science approach to promote and accelerate innovation in Exposome research Léa Maitre - ISGlobal - Exposome data challenge The parable of the blind men and the elephant Data science/ AI expert Epidemiologist Statistician System biologist (-omics) Illustration: © Natasha Stolovitzky-Brunner Data Challenges to promote:  Accelerated innovation  Interdisciplinary collaboration  Participation from around the world in COVID time  Training material for educational purposes
  • 5. The Exposome data challenge  Event created in the framework of the ISGlobal Exposome hub and H2020 ATHLETE project  Scientific publication pre-print https://arxiv.org/abs/2202.01680 (under review)  Simulated data (based on the HELIX project) publicly available to challenge researchers on statistical tools to study exposome-health associations Léa Maitre - ISGlobal - Exposome data challenge
  • 6. Léa Maitre - ISGlobal - Exposome data challenge - 25 selected teams out of 39 abstracts sent - 307 participants online: • 101 North America • 186 Europe • 9 Asia • 8 Latin America • 2 Africa • 1 Australia - Awards attributed from the public and from the comittee
  • 7. The Human early-life exposome project (HELIX) Study design Six mother-child cohorts in Europe (n=1301) Pregnant women enrolled at the beginning of their pregnancy Follow-up of the children with a standardized clinical examination at 6-11 years old Aim To study the association between multiple exposures, molecular signatures, and child health outcomes. 6-11 yo Léa Maitre - ISGlobal - Exposome data challenge
  • 8. Exposure assessment >100 environmental factors Assessed during pregnancy and childhood (at the time of the children follow-up, 6-11yo) Outdoor exposures (Geographic Information System) Air pollution* Noise† Built environment† Natural spaces† Traffic Meteorology* Water DBP Indoor air Chemicals (blood or urine biomarkers) Organochlorines PBDE PFAS Metals Phthalates Phenols Organophosphate pesticides Lifestyles (questionnaires) Smoking Diet Physical activity Social and economic capital Sleep * Postnatal exposures available within different time window † Postnatal exposures available at different location: home and school Léa Maitre - ISGlobal - Exposome data challenge
  • 9. Health outcomes 6 health outcomes At birth or at the time of the children follow-up (6-11yo) Continuous variables Birth weight Body mass index at 6-11yo Categorical variables Asthma at 6-11yo (binary) Body mass index at 6-11yo (4 categories) Count variables Intelligence quotient at 6-11yo Total correct answers (RAVEN test) Neuro behavior at 6-11yo Internalizing and externalizing problems (CBCL scale) Léa Maitre - ISGlobal - Exposome data challenge
  • 10. Covariates, potential confounders Maternal and child data Maternal data Cohort of inclusion Age Education Pre-pregnancy body mass index Weight gain during pregnancy Parity Child data at birth Sex Gestational age Year of birth Native origin Child data at 6-11yo Age Weight Height Léa Maitre - ISGlobal - Exposome data challenge
  • 11. Omics data >400,000 features Assessed during childhood only (at the time of the children follow-up, 6-11yo) White blood cells DNA methylation (386,518 CpGs) Transcription Gene expression (58,254 transcripts) Plasma & serum Proteins (36 proteins) Metabolites (177 metabolites) Urine Metabolites (44 metabolites) Léa Maitre - ISGlobal - Exposome data challenge
  • 12. Data summary Naissance lusion:1er trimestre estionnaire nes Dossier maternité + Sang du cordon, Fragments placenta, Cheveux Questionnaires (mère, père, enfant) 2 ans Questionnaires + Suivi médical (médecine scolaire) 4 ans Naissance lusion:1er trimestre estionnaire nes Dossier maternité + Sang du cordon, Fragments placenta, Cheveux Questionnaires (mère, père, enfant) 2 ans Questionnaires + Suivi médical (médecine scolaire) 4 ans 6-11 yo Exposures ++ Exposures ++ Omics ++++ Health Covariates Health Covariates Léa Maitre - ISGlobal - Exposome data challenge
  • 13. Particularity of the data Correlation between exposures Causal relation between exposures/omic layers High dimension (exposure, omics) Multi-center study (6 mother-child cohorts) Repeated exposure data (pregnancy and childhood) Small sample size (n=1301 mother-child pairs) Missing data Non-linear association Continuous vs categorical variables Léa Maitre - ISGlobal - Exposome data challenge
  • 14. Challenge examples Challenge 1: Combined effects of exposures Identify combination of exposures, high-order interactions or exposure patterns that are particularly harmful or beneficial for one or several health outcomes. Challenge 2: Using omics data to improve inference on the link between exposome and health. Incorporate the different omics layers into the analysis linking the exposome and one or more health outcomes. Challenge 3: Multi-omics analysis Incorporate different layers of omics data (including exposome as one of the layers) to find patterns that can explain variations in one or more health outcomes. Challenge 4: Causal structure in the exposome Define hypothesized causal relationships between the different exposures and one health outcome, and incorporate this information into the analysis. Challenge 5: Visualization techniques Tools to visualize the complex relationships between the different components of the analysis, with the main aim of illustrating determinants of health effects. /! Control for potential confounders and multicenter design. Handle missing data. Léa Maitre - ISGlobal - Exposome data challenge
  • 15. Popular vote: winner Using causal random forest to determine exposure environments with high sexual dimorphisms Alejandro Caceres, ISGlobal
  • 16. Committee vote: winner 1 Quantifying Exposome-Health Associations with Bayesian Multiple Index Models Glen McGee, University of Waterloo github.com/glenmcgee/bsmim2 glen.mcgee@uwaterloo.ca Interactions between multipollutant index and individual covariates
  • 17. Committee vote: winner 2 Decoding unknown links between the exposome and health outcomes by multi-omics analysis Xiaotao Shen, Stanford University
  • 18. Ressources Exposome data challenge Dataset: https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/blob/main/README.md Data description: https://docs.google.com/document/d/1ul3v-sIniLuTjFB1F1CrFQIX8mrEXVnvSzOF7BCOnpQ/edit Scientific publication https://arxiv.org/abs/2202.01680 (under review) Slides and videos of the presentations https://www.isglobal.org/-/exposome-data-analysis-challenge https://www.youtube.com/channel/UC0F3hR04UzUeKkcfAyikltA/featured Code used shared on GitHub: https://github.com/isglobal- exposomeHub/ExposomeDataChallenge2021/tree/main/R_Code_Presentations HELIX project Data inventory: https://www.projecthelix.eu/index.php/es/data-inventory Tamayo-Uria I, et al. The early-life exposome: Description and patterns in six European countries. Environ Int. 2019. Maitre L, et al. Human Early Life Exposome (HELIX) study: a European population-based exposome cohort. BMJ Open. 2018. Vrijheid M, et al. The human early-life exposome (HELIX): project rationale and design. Environ Health Perspect. 2014 Léa Maitre - ISGlobal - Exposome data challenge  Publicize methods for developers  Code source for analysists
  • 19. Challenge challenges • Making real-case, sensitive personal data, publicly available  Partial imputation of the data • Finding a balance between well-defined research questions and generalizability of the results to a wide community The first edition of the challenge was focused on general data analysis approach challenges, rather than specific research questions, which means methods presented could not be directly compared. The advantage was a broad catalogue of methods • A short time frame, sometimes lack of adjustment for known biaises • Appropriate recognition for the data-generating team Léa Maitre - ISGlobal - Exposome data challenge
  • 20. Challenge successes • Knowledge transfer -> Training tools for students, already used by universities such as French Centrale sup elec or PhD students • New collaborations formed • Accelerated the rate of scientific discovery, out-of-the-box ideas created by multidisciplinary teams in a short time: • 25 teams who worked over a month, will never equal the work even of a consortium over 5 years (only 5 methods tested for the HELIX stat protocol, run mainly by one experienced statistician) Léa Maitre - ISGlobal - Exposome data challenge
  • 21. Thank you Léa Maitre - ISGlobal - Exposome data challenge Contact me at lea.maitre@isglobal.org
  • 22. Extra slides Léa Maitre - ISGlobal - Exposome data challenge
  • 23. The early life period  Vulnerable period of rapid organ development  Many chronic diseases have part of their origin in early life (Barker hypothesis)  Starting point for a life-course exposome Léa Maitre - ISGlobal - Exposome data challenge
  • 24. The Exposome data challenge  Event created in the framework of the ISGlobal Exposome hub and H2020 ATHLETE project  Organized by ISGlobal, Barcelona  Simulated data (based on the HELIX project) publicly available to challenge researchers on statistical tools to study exposome-health associations Léa Maitre - ISGlobal - Exposome data challenge
  • 25. Organization of the datasets Exposures Omics Health Covariates data.frame: exposome data.frame: phenotype data.frame: covariates DNA methylation GenomicRatioSet: methyl Gene expression ExpressionSet: genexpr Metabolites (urine) ExpressionSet: metabol_urine Metabolites (serum) ExpressionSet: metabol_serum Proteines ExpressionSet: proteom Data available here: https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/blob/main/README.md Léa Maitre - ISGlobal - Exposome data challenge

Editor's Notes

  1. The collaboration enabled by Challenges and open science can be well illustrated by the parable of the blind men and the elephant. In this parable six blind men who have never encountered an elephant learn what an elephant is by being exposed to different parts of the elephant. While individually these men cannot fully grasp what an elephant is, together they have a chance to have an integrated concept of an elephant that goes beyond each man’s limited experience. Illustration: © Natasha Stolovitzky-Brunner. Open competitions (also known as Challenges) have proved to be an efficient and unbiased way to find computational solutions to biomedical research questions, by exploiting the ‘wisdom of the crowd’