SlideShare a Scribd company logo
1 of 26
Open science and causality in the Exposome era
Lessons learned from the Exposome Data Challenge
Léa Maitre, PhD
Barcelona Institute for Global Health (ISGlobal)
June 23rd 2022 – One Health Conference
The Exposome concept
The totality of environmental exposures (meaning all non-genetic factors) that
a person experiences, from conception onwards.
Chris Wild, 2005.
Léa Maitre - ISGlobal - Exposome data challenge
Classical approach in environmental epidemiology:
single-exposure association
 Selective reporting of associations 
Publication bias
 No correction for multiple testing
(separate papers for each exposure)
 Cannot take into account
confounding by co-exposures
 Lack of consideration of “mixture
effect”
Exposome approach
calls for a holistic view of the
effects of environmental
exposures on human health
Léa Maitre - ISGlobal - Exposome data challenge
A collaborative, open-science approach to promote
and accelerate innovation in Exposome research
Léa Maitre - ISGlobal - Exposome data challenge
The parable of the blind men and the elephant
Data science/
AI expert
Epidemiologist
Statistician
System
biologist
(-omics)
Illustration: © Natasha Stolovitzky-Brunner
Data Challenges to
promote:
 Accelerated innovation
 Interdisciplinary
collaboration
 Participation from around
the world in COVID time
 Training material for
educational purposes
The Exposome data challenge
 Event created in the framework of the ISGlobal Exposome hub and H2020 ATHLETE
project
 Scientific publication pre-print https://arxiv.org/abs/2202.01680 (under review)
 Simulated data (based on the HELIX project) publicly available to challenge
researchers on statistical tools to study exposome-health associations
Léa Maitre - ISGlobal - Exposome data challenge
Léa Maitre - ISGlobal - Exposome data challenge
- 25 selected teams out of 39
abstracts sent
- 307 participants online:
• 101 North America
• 186 Europe
• 9 Asia
• 8 Latin America
• 2 Africa
• 1 Australia
- Awards attributed from the public
and from the comittee
The Human early-life exposome project (HELIX)
Study design
Six mother-child cohorts in Europe (n=1301)
Pregnant women enrolled at the beginning of their pregnancy
Follow-up of the children with a standardized clinical examination at 6-11 years old
Aim
To study the association between multiple exposures, molecular signatures, and
child health outcomes.
6-11 yo
Léa Maitre - ISGlobal - Exposome data challenge
Exposure assessment
>100 environmental factors
Assessed during pregnancy and childhood (at the time of the children follow-up, 6-11yo)
Outdoor exposures
(Geographic Information System)
Air pollution*
Noise†
Built environment†
Natural spaces†
Traffic
Meteorology*
Water DBP
Indoor air
Chemicals
(blood or urine biomarkers)
Organochlorines
PBDE
PFAS
Metals
Phthalates
Phenols
Organophosphate pesticides
Lifestyles
(questionnaires)
Smoking
Diet
Physical activity
Social and economic capital
Sleep
* Postnatal exposures available within different time window
† Postnatal exposures available at different location: home and school
Léa Maitre - ISGlobal - Exposome data challenge
Health outcomes
6 health outcomes
At birth or at the time of the children follow-up (6-11yo)
Continuous variables
Birth weight
Body mass index at 6-11yo
Categorical variables
Asthma at 6-11yo (binary)
Body mass index at 6-11yo
(4 categories)
Count variables
Intelligence quotient at 6-11yo
Total correct answers (RAVEN
test)
Neuro behavior at 6-11yo
Internalizing and externalizing
problems (CBCL scale)
Léa Maitre - ISGlobal - Exposome data challenge
Covariates, potential confounders
Maternal and child data
Maternal data
Cohort of inclusion
Age
Education
Pre-pregnancy body mass index
Weight gain during pregnancy
Parity
Child data at birth
Sex
Gestational age
Year of birth
Native origin
Child data at 6-11yo
Age
Weight
Height
Léa Maitre - ISGlobal - Exposome data challenge
Omics data
>400,000 features
Assessed during childhood only (at the time of the children follow-up, 6-11yo)
White blood cells
DNA methylation
(386,518 CpGs)
Transcription
Gene expression
(58,254 transcripts)
Plasma & serum
Proteins
(36 proteins)
Metabolites
(177 metabolites)
Urine
Metabolites
(44 metabolites)
Léa Maitre - ISGlobal - Exposome data challenge
Data summary
Naissance
lusion:1er trimestre
estionnaire
nes
Dossier maternité
+
Sang du cordon,
Fragments placenta,
Cheveux
Questionnaires
(mère, père,
enfant)
2 ans
Questionnaires
+ Suivi médical
(médecine
scolaire)
4 ans
Naissance
lusion:1er trimestre
estionnaire
nes
Dossier maternité
+
Sang du cordon,
Fragments placenta,
Cheveux
Questionnaires
(mère, père,
enfant)
2 ans
Questionnaires
+ Suivi médical
(médecine
scolaire)
4 ans
6-11 yo
Exposures ++ Exposures ++
Omics ++++
Health
Covariates
Health
Covariates
Léa Maitre - ISGlobal - Exposome data challenge
Particularity of the data
Correlation between
exposures
Causal relation
between
exposures/omic layers
High dimension
(exposure, omics)
Multi-center study (6
mother-child cohorts)
Repeated exposure
data (pregnancy and
childhood)
Small sample size
(n=1301 mother-child
pairs)
Missing data
Non-linear association
Continuous vs
categorical variables
Léa Maitre - ISGlobal - Exposome data challenge
Challenge examples
Challenge 1: Combined effects of exposures
Identify combination of exposures, high-order
interactions or exposure patterns that are particularly
harmful or beneficial for one or several health outcomes.
Challenge 2: Using omics data to improve inference
on the link between exposome and health.
Incorporate the different omics layers into the analysis
linking the exposome and one or more health outcomes.
Challenge 3: Multi-omics analysis
Incorporate different layers of omics data (including exposome as one of the layers) to find
patterns that can explain variations in one or more health outcomes.
Challenge 4: Causal structure in the exposome
Define hypothesized causal relationships between the
different exposures and one health outcome, and
incorporate this information into the analysis.
Challenge 5: Visualization techniques
Tools to visualize the complex relationships between the
different components of the analysis, with the main aim of
illustrating determinants of health effects.
/! Control for potential confounders and multicenter design.
Handle missing data.
Léa Maitre - ISGlobal - Exposome data challenge
Popular vote: winner
Using causal random forest to determine exposure environments with high
sexual dimorphisms
Alejandro Caceres, ISGlobal
Committee vote: winner 1
Quantifying Exposome-Health Associations with Bayesian Multiple Index
Models
Glen McGee, University of Waterloo
github.com/glenmcgee/bsmim2
glen.mcgee@uwaterloo.ca
Interactions between multipollutant index and individual covariates
Committee vote: winner 2
Decoding unknown links between the exposome and health outcomes by
multi-omics analysis
Xiaotao Shen, Stanford University
Ressources
Exposome data challenge
Dataset: https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/blob/main/README.md
Data description: https://docs.google.com/document/d/1ul3v-sIniLuTjFB1F1CrFQIX8mrEXVnvSzOF7BCOnpQ/edit
Scientific publication https://arxiv.org/abs/2202.01680 (under review)
Slides and videos of the presentations https://www.isglobal.org/-/exposome-data-analysis-challenge
https://www.youtube.com/channel/UC0F3hR04UzUeKkcfAyikltA/featured
Code used shared on GitHub: https://github.com/isglobal-
exposomeHub/ExposomeDataChallenge2021/tree/main/R_Code_Presentations
HELIX project
Data inventory: https://www.projecthelix.eu/index.php/es/data-inventory
Tamayo-Uria I, et al. The early-life exposome: Description and patterns in six European countries. Environ Int. 2019.
Maitre L, et al. Human Early Life Exposome (HELIX) study: a European population-based exposome cohort. BMJ Open. 2018.
Vrijheid M, et al. The human early-life exposome (HELIX): project rationale and design. Environ Health Perspect. 2014
Léa Maitre - ISGlobal - Exposome data challenge
 Publicize methods
for developers
 Code source for
analysists
Challenge challenges
• Making real-case, sensitive personal data, publicly available
 Partial imputation of the data
• Finding a balance between well-defined research questions and
generalizability of the results to a wide community
The first edition of the challenge was focused on general data analysis approach challenges,
rather than specific research questions, which means methods presented could not be
directly compared. The advantage was a broad catalogue of methods
• A short time frame, sometimes lack of adjustment for known biaises
• Appropriate recognition for the data-generating team
Léa Maitre - ISGlobal - Exposome data challenge
Challenge successes
• Knowledge transfer -> Training tools for students, already used by
universities such as French Centrale sup elec or PhD students
• New collaborations formed
• Accelerated the rate of scientific discovery, out-of-the-box ideas
created by multidisciplinary teams in a short time:
• 25 teams who worked over a month, will never equal the work even of a
consortium over 5 years (only 5 methods tested for the HELIX stat protocol,
run mainly by one experienced statistician)
Léa Maitre - ISGlobal - Exposome data challenge
Thank you
Léa Maitre - ISGlobal - Exposome data challenge
Contact me at
lea.maitre@isglobal.org
Extra slides
Léa Maitre - ISGlobal - Exposome data challenge
The early life period
 Vulnerable period of rapid
organ development
 Many chronic diseases have
part of their origin in early life
(Barker hypothesis)
 Starting point for a life-course
exposome
Léa Maitre - ISGlobal - Exposome data challenge
The Exposome data challenge
 Event created in the framework of the ISGlobal Exposome hub and H2020 ATHLETE
project
 Organized by ISGlobal, Barcelona
 Simulated data (based on the HELIX project) publicly available to challenge
researchers on statistical tools to study exposome-health associations
Léa Maitre - ISGlobal - Exposome data challenge
Organization of the datasets
Exposures
Omics
Health
Covariates
data.frame: exposome
data.frame: phenotype
data.frame: covariates
DNA methylation GenomicRatioSet: methyl
Gene expression ExpressionSet: genexpr
Metabolites (urine) ExpressionSet: metabol_urine
Metabolites (serum) ExpressionSet: metabol_serum
Proteines ExpressionSet: proteom
Data available here: https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/blob/main/README.md
Léa Maitre - ISGlobal - Exposome data challenge
Codebook
Exposures Health Covariates
https://github.com/isglobal-brge/brgedata/blob/master/data/ExposomeDataChallenge2021/codebook.xlsx
Léa Maitre - ISGlobal - Exposome data challenge

More Related Content

What's hot

Uso de Herbicidas na cultura do Algodão
Uso de Herbicidas na cultura do AlgodãoUso de Herbicidas na cultura do Algodão
Uso de Herbicidas na cultura do Algodão
Geagra UFG
 
Mip soja percevejo marrom
Mip soja   percevejo marromMip soja   percevejo marrom
Mip soja percevejo marrom
Ediney Dias
 
Avaliação de Cultivares para o Estado de Santa Catarina
Avaliação de Cultivares para o Estado de Santa CatarinaAvaliação de Cultivares para o Estado de Santa Catarina
Avaliação de Cultivares para o Estado de Santa Catarina
Rural Pecuária
 
Profissoinal repolho
Profissoinal repolhoProfissoinal repolho
Profissoinal repolho
Elvis Bruno
 

What's hot (20)

BRAC Agricultural Research and Development Centre and Its Vegetable Sector (B...
BRAC Agricultural Research and Development Centre and Its Vegetable Sector (B...BRAC Agricultural Research and Development Centre and Its Vegetable Sector (B...
BRAC Agricultural Research and Development Centre and Its Vegetable Sector (B...
 
Uso de Herbicidas na cultura do Algodão
Uso de Herbicidas na cultura do AlgodãoUso de Herbicidas na cultura do Algodão
Uso de Herbicidas na cultura do Algodão
 
Cacao polyphenols
Cacao polyphenolsCacao polyphenols
Cacao polyphenols
 
Phytosterols
PhytosterolsPhytosterols
Phytosterols
 
Mip soja percevejo marrom
Mip soja   percevejo marromMip soja   percevejo marrom
Mip soja percevejo marrom
 
Agroecologia, Saberes e Práticas - Um guia rápido para desenvolver a agroeco...
Agroecologia, Saberes e Práticas - Um guia rápido para desenvolver a agroeco...Agroecologia, Saberes e Práticas - Um guia rápido para desenvolver a agroeco...
Agroecologia, Saberes e Práticas - Um guia rápido para desenvolver a agroeco...
 
Bt brinjal good technology product
Bt brinjal good technology productBt brinjal good technology product
Bt brinjal good technology product
 
Manejo de plantas daninhas
Manejo de plantas daninhasManejo de plantas daninhas
Manejo de plantas daninhas
 
Smart Foods: Nutri-cereals for her - The creation of biofortified pearl millet
Smart Foods: Nutri-cereals for her - The creation of biofortified pearl milletSmart Foods: Nutri-cereals for her - The creation of biofortified pearl millet
Smart Foods: Nutri-cereals for her - The creation of biofortified pearl millet
 
pomato plant
pomato plantpomato plant
pomato plant
 
future of gmos
  future of gmos  future of gmos
future of gmos
 
Manejo de Pragas do Girassol
Manejo de Pragas do GirassolManejo de Pragas do Girassol
Manejo de Pragas do Girassol
 
Pre and Post-Harvest Management of Aflatoxin in Food Crops
Pre and Post-Harvest Management of Aflatoxin in Food CropsPre and Post-Harvest Management of Aflatoxin in Food Crops
Pre and Post-Harvest Management of Aflatoxin in Food Crops
 
Aula programa Genes
Aula programa GenesAula programa Genes
Aula programa Genes
 
11.social forestry
11.social forestry11.social forestry
11.social forestry
 
Avaliação de Cultivares para o Estado de Santa Catarina
Avaliação de Cultivares para o Estado de Santa CatarinaAvaliação de Cultivares para o Estado de Santa Catarina
Avaliação de Cultivares para o Estado de Santa Catarina
 
Profissoinal repolho
Profissoinal repolhoProfissoinal repolho
Profissoinal repolho
 
A cultura da banana minicurso
A cultura da banana   minicursoA cultura da banana   minicurso
A cultura da banana minicurso
 
Bt cotton in india
Bt cotton in indiaBt cotton in india
Bt cotton in india
 
Tecnologia de Pulverização - Good Spray
Tecnologia de Pulverização - Good Spray Tecnologia de Pulverização - Good Spray
Tecnologia de Pulverização - Good Spray
 

Similar to Exposome data challenge - ONE2022.pptx

XNN001 Introductory epidemiological concepts - Study design
XNN001 Introductory epidemiological concepts - Study designXNN001 Introductory epidemiological concepts - Study design
XNN001 Introductory epidemiological concepts - Study design
ramseyr
 

Similar to Exposome data challenge - ONE2022.pptx (20)

Exposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptxExposome data challenge - ISGlobal hub prez July 2022.pptx
Exposome data challenge - ISGlobal hub prez July 2022.pptx
 
11 am sim
11 am sim11 am sim
11 am sim
 
C&E news talk sept 16
C&E news talk sept 16C&E news talk sept 16
C&E news talk sept 16
 
Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
Open Data and the Social Sciences - OpenCon Community Webcast
Open Data and the Social Sciences - OpenCon Community WebcastOpen Data and the Social Sciences - OpenCon Community Webcast
Open Data and the Social Sciences - OpenCon Community Webcast
 
Innovative research approaches to improve evidence in global health
Innovative research approaches to improve evidence in global healthInnovative research approaches to improve evidence in global health
Innovative research approaches to improve evidence in global health
 
Brescia Exposome climate LMaitre April 2023.pptx
Brescia Exposome climate LMaitre April 2023.pptxBrescia Exposome climate LMaitre April 2023.pptx
Brescia Exposome climate LMaitre April 2023.pptx
 
Big Data and the Future by Sherri Rose
Big Data and the Future by Sherri RoseBig Data and the Future by Sherri Rose
Big Data and the Future by Sherri Rose
 
20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine20160811 Big Data for Health and Medicine
20160811 Big Data for Health and Medicine
 
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
Genomics in Society: Genomics, Cellular Networks, Preventive Medicine, and So...
 
Roche_open_science_NIOO_KNAW_workshop_NL
Roche_open_science_NIOO_KNAW_workshop_NLRoche_open_science_NIOO_KNAW_workshop_NL
Roche_open_science_NIOO_KNAW_workshop_NL
 
Beacon Network: A System for Global Genomic Data Sharing
Beacon Network: A System for Global Genomic Data SharingBeacon Network: A System for Global Genomic Data Sharing
Beacon Network: A System for Global Genomic Data Sharing
 
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
Novel method for reviewing mechanistic evidence on diet, nutrition, physical ...
 
Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...Big data and machine learning: opportunità per la medicina di precisione e i ...
Big data and machine learning: opportunità per la medicina di precisione e i ...
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
Putting bioscience into context
Putting bioscience into contextPutting bioscience into context
Putting bioscience into context
 
Math, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical ResearchMath, Stats and CS in Public Health and Medical Research
Math, Stats and CS in Public Health and Medical Research
 
The shared value of personal and population data
The shared value of personal and population dataThe shared value of personal and population data
The shared value of personal and population data
 
Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research  Reaching out to collaborators and crowdsourcing for pharmaceutical research
Reaching out to collaborators and crowdsourcing for pharmaceutical research
 
XNN001 Introductory epidemiological concepts - Study design
XNN001 Introductory epidemiological concepts - Study designXNN001 Introductory epidemiological concepts - Study design
XNN001 Introductory epidemiological concepts - Study design
 

Recently uploaded

HIV AND INFULENZA VIRUS PPT HIV PPT INFULENZA VIRUS PPT
HIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPTHIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPT
Warming the earth and the atmosphere.pptx
Warming the earth and the atmosphere.pptxWarming the earth and the atmosphere.pptx
Warming the earth and the atmosphere.pptx
GlendelCaroz
 
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptxNanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
ssusera4ec7b
 

Recently uploaded (20)

Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed RahimoonVital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
 
MSCII_ FCT UNIT 5 TOXICOLOGY.pdf
MSCII_              FCT UNIT 5 TOXICOLOGY.pdfMSCII_              FCT UNIT 5 TOXICOLOGY.pdf
MSCII_ FCT UNIT 5 TOXICOLOGY.pdf
 
An Overview of Active and Passive Targeting Strategies to Improve the Nano-Ca...
An Overview of Active and Passive Targeting Strategies to Improve the Nano-Ca...An Overview of Active and Passive Targeting Strategies to Improve the Nano-Ca...
An Overview of Active and Passive Targeting Strategies to Improve the Nano-Ca...
 
HIV AND INFULENZA VIRUS PPT HIV PPT INFULENZA VIRUS PPT
HIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPTHIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPT
HIV AND INFULENZA VIRUS PPT HIV PPT INFULENZA VIRUS PPT
 
PARENTAL CARE IN FISHES.pptx for 5th sem
PARENTAL CARE IN FISHES.pptx for 5th semPARENTAL CARE IN FISHES.pptx for 5th sem
PARENTAL CARE IN FISHES.pptx for 5th sem
 
Terpineol and it's characterization pptx
Terpineol and it's characterization pptxTerpineol and it's characterization pptx
Terpineol and it's characterization pptx
 
ANITINUTRITION FACTOR GYLCOSIDES SAPONINS CYANODENS
ANITINUTRITION FACTOR GYLCOSIDES SAPONINS CYANODENSANITINUTRITION FACTOR GYLCOSIDES SAPONINS CYANODENS
ANITINUTRITION FACTOR GYLCOSIDES SAPONINS CYANODENS
 
Warming the earth and the atmosphere.pptx
Warming the earth and the atmosphere.pptxWarming the earth and the atmosphere.pptx
Warming the earth and the atmosphere.pptx
 
Technical english Technical english.pptx
Technical english Technical english.pptxTechnical english Technical english.pptx
Technical english Technical english.pptx
 
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptxNanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
Nanoparticles for the Treatment of Alzheimer’s Disease_102718.pptx
 
Polyethylene and its polymerization.pptx
Polyethylene and its polymerization.pptxPolyethylene and its polymerization.pptx
Polyethylene and its polymerization.pptx
 
EU START PROJECT. START-Newsletter_Issue_4.pdf
EU START PROJECT. START-Newsletter_Issue_4.pdfEU START PROJECT. START-Newsletter_Issue_4.pdf
EU START PROJECT. START-Newsletter_Issue_4.pdf
 
Information science research with large language models: between science and ...
Information science research with large language models: between science and ...Information science research with large language models: between science and ...
Information science research with large language models: between science and ...
 
THE FUNDAMENTAL UNIT OF LIFE CLASS IX.ppt
THE FUNDAMENTAL UNIT OF LIFE CLASS IX.pptTHE FUNDAMENTAL UNIT OF LIFE CLASS IX.ppt
THE FUNDAMENTAL UNIT OF LIFE CLASS IX.ppt
 
Heads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdfHeads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdf
 
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
 
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
Harry Coumnas Thinks That Human Teleportation is Possible in Quantum Mechanic...
 
Adaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte CarloAdaptive Restore algorithm & importance Monte Carlo
Adaptive Restore algorithm & importance Monte Carlo
 
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
 
GBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismGBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) Metabolism
 

Exposome data challenge - ONE2022.pptx

  • 1. Open science and causality in the Exposome era Lessons learned from the Exposome Data Challenge Léa Maitre, PhD Barcelona Institute for Global Health (ISGlobal) June 23rd 2022 – One Health Conference
  • 2. The Exposome concept The totality of environmental exposures (meaning all non-genetic factors) that a person experiences, from conception onwards. Chris Wild, 2005. Léa Maitre - ISGlobal - Exposome data challenge
  • 3. Classical approach in environmental epidemiology: single-exposure association  Selective reporting of associations  Publication bias  No correction for multiple testing (separate papers for each exposure)  Cannot take into account confounding by co-exposures  Lack of consideration of “mixture effect” Exposome approach calls for a holistic view of the effects of environmental exposures on human health Léa Maitre - ISGlobal - Exposome data challenge
  • 4. A collaborative, open-science approach to promote and accelerate innovation in Exposome research Léa Maitre - ISGlobal - Exposome data challenge The parable of the blind men and the elephant Data science/ AI expert Epidemiologist Statistician System biologist (-omics) Illustration: © Natasha Stolovitzky-Brunner Data Challenges to promote:  Accelerated innovation  Interdisciplinary collaboration  Participation from around the world in COVID time  Training material for educational purposes
  • 5. The Exposome data challenge  Event created in the framework of the ISGlobal Exposome hub and H2020 ATHLETE project  Scientific publication pre-print https://arxiv.org/abs/2202.01680 (under review)  Simulated data (based on the HELIX project) publicly available to challenge researchers on statistical tools to study exposome-health associations Léa Maitre - ISGlobal - Exposome data challenge
  • 6. Léa Maitre - ISGlobal - Exposome data challenge - 25 selected teams out of 39 abstracts sent - 307 participants online: • 101 North America • 186 Europe • 9 Asia • 8 Latin America • 2 Africa • 1 Australia - Awards attributed from the public and from the comittee
  • 7. The Human early-life exposome project (HELIX) Study design Six mother-child cohorts in Europe (n=1301) Pregnant women enrolled at the beginning of their pregnancy Follow-up of the children with a standardized clinical examination at 6-11 years old Aim To study the association between multiple exposures, molecular signatures, and child health outcomes. 6-11 yo Léa Maitre - ISGlobal - Exposome data challenge
  • 8. Exposure assessment >100 environmental factors Assessed during pregnancy and childhood (at the time of the children follow-up, 6-11yo) Outdoor exposures (Geographic Information System) Air pollution* Noise† Built environment† Natural spaces† Traffic Meteorology* Water DBP Indoor air Chemicals (blood or urine biomarkers) Organochlorines PBDE PFAS Metals Phthalates Phenols Organophosphate pesticides Lifestyles (questionnaires) Smoking Diet Physical activity Social and economic capital Sleep * Postnatal exposures available within different time window † Postnatal exposures available at different location: home and school Léa Maitre - ISGlobal - Exposome data challenge
  • 9. Health outcomes 6 health outcomes At birth or at the time of the children follow-up (6-11yo) Continuous variables Birth weight Body mass index at 6-11yo Categorical variables Asthma at 6-11yo (binary) Body mass index at 6-11yo (4 categories) Count variables Intelligence quotient at 6-11yo Total correct answers (RAVEN test) Neuro behavior at 6-11yo Internalizing and externalizing problems (CBCL scale) Léa Maitre - ISGlobal - Exposome data challenge
  • 10. Covariates, potential confounders Maternal and child data Maternal data Cohort of inclusion Age Education Pre-pregnancy body mass index Weight gain during pregnancy Parity Child data at birth Sex Gestational age Year of birth Native origin Child data at 6-11yo Age Weight Height Léa Maitre - ISGlobal - Exposome data challenge
  • 11. Omics data >400,000 features Assessed during childhood only (at the time of the children follow-up, 6-11yo) White blood cells DNA methylation (386,518 CpGs) Transcription Gene expression (58,254 transcripts) Plasma & serum Proteins (36 proteins) Metabolites (177 metabolites) Urine Metabolites (44 metabolites) Léa Maitre - ISGlobal - Exposome data challenge
  • 12. Data summary Naissance lusion:1er trimestre estionnaire nes Dossier maternité + Sang du cordon, Fragments placenta, Cheveux Questionnaires (mère, père, enfant) 2 ans Questionnaires + Suivi médical (médecine scolaire) 4 ans Naissance lusion:1er trimestre estionnaire nes Dossier maternité + Sang du cordon, Fragments placenta, Cheveux Questionnaires (mère, père, enfant) 2 ans Questionnaires + Suivi médical (médecine scolaire) 4 ans 6-11 yo Exposures ++ Exposures ++ Omics ++++ Health Covariates Health Covariates Léa Maitre - ISGlobal - Exposome data challenge
  • 13. Particularity of the data Correlation between exposures Causal relation between exposures/omic layers High dimension (exposure, omics) Multi-center study (6 mother-child cohorts) Repeated exposure data (pregnancy and childhood) Small sample size (n=1301 mother-child pairs) Missing data Non-linear association Continuous vs categorical variables Léa Maitre - ISGlobal - Exposome data challenge
  • 14. Challenge examples Challenge 1: Combined effects of exposures Identify combination of exposures, high-order interactions or exposure patterns that are particularly harmful or beneficial for one or several health outcomes. Challenge 2: Using omics data to improve inference on the link between exposome and health. Incorporate the different omics layers into the analysis linking the exposome and one or more health outcomes. Challenge 3: Multi-omics analysis Incorporate different layers of omics data (including exposome as one of the layers) to find patterns that can explain variations in one or more health outcomes. Challenge 4: Causal structure in the exposome Define hypothesized causal relationships between the different exposures and one health outcome, and incorporate this information into the analysis. Challenge 5: Visualization techniques Tools to visualize the complex relationships between the different components of the analysis, with the main aim of illustrating determinants of health effects. /! Control for potential confounders and multicenter design. Handle missing data. Léa Maitre - ISGlobal - Exposome data challenge
  • 15. Popular vote: winner Using causal random forest to determine exposure environments with high sexual dimorphisms Alejandro Caceres, ISGlobal
  • 16. Committee vote: winner 1 Quantifying Exposome-Health Associations with Bayesian Multiple Index Models Glen McGee, University of Waterloo github.com/glenmcgee/bsmim2 glen.mcgee@uwaterloo.ca Interactions between multipollutant index and individual covariates
  • 17. Committee vote: winner 2 Decoding unknown links between the exposome and health outcomes by multi-omics analysis Xiaotao Shen, Stanford University
  • 18. Ressources Exposome data challenge Dataset: https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/blob/main/README.md Data description: https://docs.google.com/document/d/1ul3v-sIniLuTjFB1F1CrFQIX8mrEXVnvSzOF7BCOnpQ/edit Scientific publication https://arxiv.org/abs/2202.01680 (under review) Slides and videos of the presentations https://www.isglobal.org/-/exposome-data-analysis-challenge https://www.youtube.com/channel/UC0F3hR04UzUeKkcfAyikltA/featured Code used shared on GitHub: https://github.com/isglobal- exposomeHub/ExposomeDataChallenge2021/tree/main/R_Code_Presentations HELIX project Data inventory: https://www.projecthelix.eu/index.php/es/data-inventory Tamayo-Uria I, et al. The early-life exposome: Description and patterns in six European countries. Environ Int. 2019. Maitre L, et al. Human Early Life Exposome (HELIX) study: a European population-based exposome cohort. BMJ Open. 2018. Vrijheid M, et al. The human early-life exposome (HELIX): project rationale and design. Environ Health Perspect. 2014 Léa Maitre - ISGlobal - Exposome data challenge  Publicize methods for developers  Code source for analysists
  • 19. Challenge challenges • Making real-case, sensitive personal data, publicly available  Partial imputation of the data • Finding a balance between well-defined research questions and generalizability of the results to a wide community The first edition of the challenge was focused on general data analysis approach challenges, rather than specific research questions, which means methods presented could not be directly compared. The advantage was a broad catalogue of methods • A short time frame, sometimes lack of adjustment for known biaises • Appropriate recognition for the data-generating team Léa Maitre - ISGlobal - Exposome data challenge
  • 20. Challenge successes • Knowledge transfer -> Training tools for students, already used by universities such as French Centrale sup elec or PhD students • New collaborations formed • Accelerated the rate of scientific discovery, out-of-the-box ideas created by multidisciplinary teams in a short time: • 25 teams who worked over a month, will never equal the work even of a consortium over 5 years (only 5 methods tested for the HELIX stat protocol, run mainly by one experienced statistician) Léa Maitre - ISGlobal - Exposome data challenge
  • 21. Thank you Léa Maitre - ISGlobal - Exposome data challenge Contact me at lea.maitre@isglobal.org
  • 22. Extra slides Léa Maitre - ISGlobal - Exposome data challenge
  • 23. The early life period  Vulnerable period of rapid organ development  Many chronic diseases have part of their origin in early life (Barker hypothesis)  Starting point for a life-course exposome Léa Maitre - ISGlobal - Exposome data challenge
  • 24. The Exposome data challenge  Event created in the framework of the ISGlobal Exposome hub and H2020 ATHLETE project  Organized by ISGlobal, Barcelona  Simulated data (based on the HELIX project) publicly available to challenge researchers on statistical tools to study exposome-health associations Léa Maitre - ISGlobal - Exposome data challenge
  • 25. Organization of the datasets Exposures Omics Health Covariates data.frame: exposome data.frame: phenotype data.frame: covariates DNA methylation GenomicRatioSet: methyl Gene expression ExpressionSet: genexpr Metabolites (urine) ExpressionSet: metabol_urine Metabolites (serum) ExpressionSet: metabol_serum Proteines ExpressionSet: proteom Data available here: https://github.com/isglobal-exposomeHub/ExposomeDataChallenge2021/blob/main/README.md Léa Maitre - ISGlobal - Exposome data challenge

Editor's Notes

  1. The collaboration enabled by Challenges and open science can be well illustrated by the parable of the blind men and the elephant. In this parable six blind men who have never encountered an elephant learn what an elephant is by being exposed to different parts of the elephant. While individually these men cannot fully grasp what an elephant is, together they have a chance to have an integrated concept of an elephant that goes beyond each man’s limited experience. Illustration: © Natasha Stolovitzky-Brunner. Open competitions (also known as Challenges) have proved to be an efficient and unbiased way to find computational solutions to biomedical research questions, by exploiting the ‘wisdom of the crowd’