SlideShare a Scribd company logo
1 of 45
Download to read offline
Building a search engine to find and
robustly identify environmental factors
with phenotype and disease
Chirag J Patel
Unite for Sight
4/14/2018
chirag@hms.harvard.edu
@chiragjp
www.chiragjpgroup.org
P = G + EType 2 Diabetes
Cancer
Alzheimer’s
Gene expression
Phenotype Genome
Variants
Environment
Infectious agents
Diet + Nutrients
Pollutants
Drugs
We are great at G investigation!
>4000 (as of 1/1/18)
36,066 G-P associations
Genome-wide Association Studies (GWAS)
https://www.ebi.ac.uk/gwas/
G
Nothing comparable to elucidate E influence!
E: ???
We lack high-throughput methods
and data to discover new E in P…
A similar paradigm for discovery should exist
for E!
Why?
σ2
P = σ2
G + σ2
E
…
σ2
G
σ2
P
H2 =
Heritability (H2) is the range of phenotypic
variability attributed to genetic variability in a
population
Indicator of the proportion of phenotypic
differences attributed to G.
Height is an example of a heritable trait:
Francis Galton shows how its done (1887)
“mid-height of 205 parents
described 60% of variability of 928
offspring”
What else describes height?
Source: SNPedia.com
Heritability estimates for burdensome diseases are low and variable
Type 2 Diabetes (25%)
Heart Disease (25-30%)
cancer?
Source: SNPedia.com
G estimates for complex disease (P) are low and variable:
massive opportunity for high-throughput E discovery
σ2
E
What describes this variation NOT explained by
genetics?
Physical activity?
Is it coffee…?
HR: 0.9 in N=500K
HR: 0.9 in N=50K
Chemicals?
EPA Chemical Substances List (~80K)
via tylervigen.com
… we just don’t know
… we just don’t know
xkcd.com
We just don’t know:
Is everything we are exposed to associated with cancer?
Schoenfeld and Ioannidis, AJCN 2012
50 random ingredients from
Boston Cooking School
Cookbook
Any associated with cancer?
Of 50, 40 studied in cancer risk
Weak statistical evidence:
non-replicated
inconsistent effects
non-standardized
… we just don’t know
http://fivethirtyeight.com/features/you-cant-trust-what-you-read-about-nutrition/
The problem remains:
(1) what explains the missing variation in phenotype…
σ2
E
So the problem remains:
(2) and how do we find the stuff that matters?
E: ???
Diet
Infection
Pollution
Drugs
We are great at G investigation!
>4000 (as of 1/1/18)
36,066 G-P associations
Genome-wide Association Studies (GWAS)
https://www.ebi.ac.uk/gwas/
G
How did genetics-based investigations advance?
(And advance so quickly?)
A new paradigm of GWAS for discovery of G in P:
Human Genome Project to GWAS
Sequencing of the genome
2001
HapMap project:
http://hapmap.ncbi.nlm.nih.gov/
Characterize common variation
2001-current day
High-throughput variant
assay
< $99 for ~1M variants
Measurement tools
~2003 (ongoing)
Nature 2008
Comprehensive, high-throughput analyses
GWAS
How can we do better in both discovery and
translation?:
Leverage data-driven “exposomic” techniques!
• Data-driven discovery
• search through all the possibilities
• gauge the totality of the evidence
• New ways to measure the exposome (E)!
• scalable ways to measure diet, infection,
pollution
Explaining the missing variation:
A data-driven paradigm for robust discovery of E in disease
via systematic study of the “exposome”
what to measure? how to measure?
“A more comprehensive view of
environmental exposure is
needed ... to discover major
causes of diseases...”
how to analyze in relation to health?
Wild, 2005, 2012
Rappaport and Smith, 2010, 2011
Buck-Louis and Sundaram 2012
Miller and Jones, 2014
Patel CJ and Ioannidis JPAI, 2014
Possible to use existing technologies for E
Exposure (and P) Assessment…
CEBP 2017
… however, heterogeneous measures that require different
study designs and analytic approaches.
Promises and Challenges in creating a search engine for
identifying E in P
JAMA 2014
ARPH 2016
JECH 2014
Curr Epidemiol Rep 2017
Examples of data-driven discovery for E associations
Gold standard for breadth of human exposure information:
National Health and Nutrition Examination Survey1
since the 1960s
now biannual: 1999 onwards
10,000 participants per survey
1 http://www.cdc.gov/nchs/nhanes.htm
>250 exposures (serum + urine)
GWAS chip
>200 quantitative clinical traits
(e.g., serum glucose, lipids, body
mass index)
Death index linkage (cause of
death)
Gold standard for breadth of exposure & behavior data:
National Health and Nutrition Examination Survey
Nutrients and Vitamins
vitamin D, carotenes
Infectious Agents
hepatitis, HIV, Staph. aureus
Plastics and consumables
phthalates, bisphenol A
Physical Activity
e.g., stepsPesticides and pollutants
atrazine; cadmium; hydrocarbons
Drugs
statins; aspirin
What E are associated with aging:
all-cause mortality, heart disease, and
telomere length?
Int J Epidem 2013
Int J Epidem 2016
Identifying E associated with all-cause mortality:
Data-driven searching through 253 associations
age (10 years)
income (quintile 2)
income (quintile 1)
male
black income (quintile 3)
any one smoke in home?
Multivariate cox (age, sex, income, education, race/ethnicity, occupation [in red])
serum and urine cadmium
[1 SD]
past smoker?
current smoker?serum lycopene
[1SD]
physical activity
[low, moderate, high activity]*
*derived from METs per activity and categorized by Health.gov guidelines
R2 ~ 14%
(2%)
R2 < 10-20%
Required
What about other factors related to aging?:
452 associations in Telomere Length!
Int J Epidem 2016
PCBs
FDR<5%
Trunk Fat
Alk. PhosCRP
Cadmium
Cadmium (urine)cigs per day
retinyl stearate
R2 ~ 1%
VO2 Maxpulse rate
shorter telomeres longer telomeres
adjusted by age, age2, race, poverty, education, occupation
median N=3000; N range: 300-7000
2-8 years
Interdependencies of the exposome:
Correlation globes paint a complex view of exposure
Red: positive ρ
Blue: negative ρ
thickness: |ρ|
for each pair of E:
Spearman ρ
(575 factors: 81,937 correlations)
permuted data to produce
“null ρ”
sought replication in > 1
cohort
Pac Symp Biocomput. 2015
JECH. 2015
Red: positive ρ
Blue: negative ρ
thickness: |ρ|
for each pair of E:
Spearman ρ
(575 factors: 81,937 correlations)
Interdependencies of the exposome:
Correlation globes paint a complex view of exposure:
average correlation of < 0.3
permuted data to produce
“null ρ”
sought replication in > 1
cohort
Pac Symp Biocomput. 2015
JECH. 2015
Effective number of
variables:
500 (10% decrease)
How can we do better in both discovery and translation?:
Leverage data-driven “exposomic” techniques!
• Data-driven discovery
• search through all the possibilities
• gauge the totality of the evidence
• New ways to measure the exposome (E)!
• scalable ways to measure diet, infection,
pollution
Data-driven discovery to identifying factors that matter!
1.) Find elusive E in P and
explain variation of disease risk
2.) Consideration of totality of
evidence: Does my correlation
matter?
3.) Machine learning methods to
detecting signals in observational and
large data
Data-driven discovery to identifying factors that matter!
1.) Find elusive E in P and explain
variation of disease risk
2.) Consideration of totality of
evidence: Does my correlation
matter?
3.) Machine learning methods to detecting
signals in observational and large data
ARPH 2016
JAMA 2014
JECH 2015
Data-driven discovery to identifying factors that matter!
1.) Find elusive E in P and explain
variation of disease risk
2.) Consideration of totality of
evidence: Does my correlation
matter?
3.) Machine learning methods to
detecting signals in observational and
large data ARPH 2016
JAMA 2014
JECH 2015
How can we do better in both discovery and translation?:
Leverage data-driven “exposomic” techniques!
• Data-driven discovery
• search through all the possibilities
• gauge the totality of the evidence
• New ways to measure the exposome (E)!
• scalable ways to measure diet, infection,
pollution
Explaining the missing variation:
A data-driven paradigm for robust discovery of E in disease
via systematic study of the “exposome”
what to measure? how to measure?
“A more comprehensive view of
environmental exposure is
needed ... to discover major
causes of diseases...”
how to analyze in relation to health?
Wild, 2005, 2012
Rappaport and Smith, 2010, 2011
Buck-Louis and Sundaram 2012
Miller and Jones, 2014
Patel CJ and Ioannidis JPAI, 2014
Need to assess the exposome globally:
(e.g., India and China)
c/o Getty Images c/o AFP
… and Sub-Saharan Africa!
Can we predict HIV as a function of the exposome?
AIDS 2018
Harvard DBMI
Susanne Churchill
Nathan Palmer
Sophia Mamousette
Sunny Alvear
Chirag J Patel
chirag@hms.harvard.edu
@chiragjp
www.chiragjpgroup.org
NIH Common Fund
Big Data to Knowledge
Acknowledgements
RagGroup
Arjun Manrai
Nam Pho
Jake Chung
Kajal Claypool
Chirag Lakhani
Danielle Rasooly
Alan LeGoallec
Sivateja Tangirala
Mentioned Collaborators
Isaac Kohane
John Ioannidis
Dennis Bier
Hugo Aschard

More Related Content

What's hot

Biomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresBiomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresChirag Patel
 
Informatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discoveryInformatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discoveryChirag Patel
 
AACR 041616 digital exposomes
AACR 041616 digital exposomesAACR 041616 digital exposomes
AACR 041616 digital exposomesChirag Patel
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Chirag Patel
 
Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Chirag Patel
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slidesChirag Patel
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataChirag Patel
 
Japanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EJapanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EChirag Patel
 
Searching for predictors of male fecundity
Searching for predictors of male fecunditySearching for predictors of male fecundity
Searching for predictors of male fecundityChirag Patel
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation Sean Ekins
 
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...ExternalEvents
 
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET
 
Identification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesIdentification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesYoann Pageaud
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesSean Ekins
 
Protein-Protein Interaction Presentation
Protein-Protein Interaction PresentationProtein-Protein Interaction Presentation
Protein-Protein Interaction PresentationUsman (Ali) Ahmed
 
Data Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeData Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeNils Gehlenborg
 
MathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaperMathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaperMathias Hibbard
 
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...Andrew Su
 
MathiasHibbard_655PaperFinal
MathiasHibbard_655PaperFinalMathiasHibbard_655PaperFinal
MathiasHibbard_655PaperFinalMathias Hibbard
 
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...Andrew Su
 

What's hot (20)

Biomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresBiomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposures
 
Informatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discoveryInformatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discovery
 
AACR 041616 digital exposomes
AACR 041616 digital exposomesAACR 041616 digital exposomes
AACR 041616 digital exposomes
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701
 
Correlation globes of the exposome 2016
Correlation globes of the exposome 2016Correlation globes of the exposome 2016
Correlation globes of the exposome 2016
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slides
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
Japanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EJapanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven E
 
Searching for predictors of male fecundity
Searching for predictors of male fecunditySearching for predictors of male fecundity
Searching for predictors of male fecundity
 
Open zika presentation
Open zika presentation Open zika presentation
Open zika presentation
 
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
 
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
dkNET Webinar: Population-Based Approaches to Investigate Endocrine Communica...
 
Identification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databasesIdentification of PFOA linked metabolic diseases by crossing databases
Identification of PFOA linked metabolic diseases by crossing databases
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
 
Protein-Protein Interaction Presentation
Protein-Protein Interaction PresentationProtein-Protein Interaction Presentation
Protein-Protein Interaction Presentation
 
Data Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the EyeData Visualization in Biomedical Sciences: More than Meets the Eye
Data Visualization in Biomedical Sciences: More than Meets the Eye
 
MathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaperMathiasHibbard_604FinalPaper
MathiasHibbard_604FinalPaper
 
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...
BOSC2017: Using Wikidata as an open, community-maintained database of biomedi...
 
MathiasHibbard_655PaperFinal
MathiasHibbard_655PaperFinalMathiasHibbard_655PaperFinal
MathiasHibbard_655PaperFinal
 
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
Open data, compound repurposing, and rare diseases -- Point Loma Nazarene Uni...
 

Similar to Chirag patel unite for sight 041418

Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Chirag Patel
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가? Hyung Jin Choi
 
Studying the elusive in larger scale
Studying the elusive in larger scaleStudying the elusive in larger scale
Studying the elusive in larger scaleChirag Patel
 
(서울의대 공유용) 빅데이터 분석 유전체 정보와 개인라이프로그 정보 활용-2015_11_24
(서울의대 공유용) 빅데이터 분석  유전체 정보와 개인라이프로그 정보 활용-2015_11_24(서울의대 공유용) 빅데이터 분석  유전체 정보와 개인라이프로그 정보 활용-2015_11_24
(서울의대 공유용) 빅데이터 분석 유전체 정보와 개인라이프로그 정보 활용-2015_11_24Hyung Jin Choi
 
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationMeg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationTHL
 
Comparison of autocorrelation between CV-RISK independent variables in groups...
Comparison of autocorrelation between CV-RISK independent variables in groups...Comparison of autocorrelation between CV-RISK independent variables in groups...
Comparison of autocorrelation between CV-RISK independent variables in groups...YohanesFirmansyah1
 
Sexual Risk Behaviors Research
Sexual Risk Behaviors Research Sexual Risk Behaviors Research
Sexual Risk Behaviors Research Brittney Johns
 
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryMel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryJean-Claude Bradley
 
Alzheimer
AlzheimerAlzheimer
AlzheimerHarold
 
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v3
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v320160119 디지털 헬스케어 의사모임 1월 전체 파일 v3
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v3Chiweon Kim
 
의료 빅데이터와 인공지능의 현재와 미래
의료 빅데이터와 인공지능의 현재와 미래의료 빅데이터와 인공지능의 현재와 미래
의료 빅데이터와 인공지능의 현재와 미래Hyung Jin Choi
 
Data Science in Drug Discovery
Data Science in Drug DiscoveryData Science in Drug Discovery
Data Science in Drug DiscoveryWrangleConf
 
Introduction to Epidemiology and Surveillance
Introduction to Epidemiology and SurveillanceIntroduction to Epidemiology and Surveillance
Introduction to Epidemiology and SurveillanceGeorge Moulton
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014muink
 
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...Cochrane.Collaboration
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm CrawfordSean Paul
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...Meningitis Research Foundation
 
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docxSheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docxbjohn46
 

Similar to Chirag patel unite for sight 041418 (20)

Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?
 
Studying the elusive in larger scale
Studying the elusive in larger scaleStudying the elusive in larger scale
Studying the elusive in larger scale
 
(서울의대 공유용) 빅데이터 분석 유전체 정보와 개인라이프로그 정보 활용-2015_11_24
(서울의대 공유용) 빅데이터 분석  유전체 정보와 개인라이프로그 정보 활용-2015_11_24(서울의대 공유용) 빅데이터 분석  유전체 정보와 개인라이프로그 정보 활용-2015_11_24
(서울의대 공유용) 빅데이터 분석 유전체 정보와 개인라이프로그 정보 활용-2015_11_24
 
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery OrganizationMeg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
Meg Ehm: Fueling a Genetics-Driven Drug Discovery Organization
 
Comparison of autocorrelation between CV-RISK independent variables in groups...
Comparison of autocorrelation between CV-RISK independent variables in groups...Comparison of autocorrelation between CV-RISK independent variables in groups...
Comparison of autocorrelation between CV-RISK independent variables in groups...
 
Sexual Risk Behaviors Research
Sexual Risk Behaviors Research Sexual Risk Behaviors Research
Sexual Risk Behaviors Research
 
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug DiscoveryMel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
Mel Reichman on Pool Shark’s Cues for More Efficient Drug Discovery
 
Green
GreenGreen
Green
 
Alzheimer
AlzheimerAlzheimer
Alzheimer
 
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v3
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v320160119 디지털 헬스케어 의사모임 1월 전체 파일 v3
20160119 디지털 헬스케어 의사모임 1월 전체 파일 v3
 
의료 빅데이터와 인공지능의 현재와 미래
의료 빅데이터와 인공지능의 현재와 미래의료 빅데이터와 인공지능의 현재와 미래
의료 빅데이터와 인공지능의 현재와 미래
 
Data Science in Drug Discovery
Data Science in Drug DiscoveryData Science in Drug Discovery
Data Science in Drug Discovery
 
Introduction to Epidemiology and Surveillance
Introduction to Epidemiology and SurveillanceIntroduction to Epidemiology and Surveillance
Introduction to Epidemiology and Surveillance
 
Khoury ashg2014
Khoury ashg2014Khoury ashg2014
Khoury ashg2014
 
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...
The Cochrane Collaboration Colloquium: The Human Genome Epidemiology Network:...
 
Module5_Study_Design.pptx
Module5_Study_Design.pptxModule5_Study_Design.pptx
Module5_Study_Design.pptx
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...
 
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docxSheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
Sheet1idsbpgestage139272242334025458335533564327739318442895229104.docx
 

Recently uploaded

Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeCall Girls Delhi
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escortsaditipandeya
 
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsBangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsGfnyt
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...narwatsonia7
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiAlinaDevecerski
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Call Girls in Nagpur High Profile
 
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...narwatsonia7
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...aartirawatdelhi
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service AvailableDipal Arora
 
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...chandars293
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...Taniya Sharma
 
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls JaipurRussian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipurparulsinha
 
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...Dipal Arora
 
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 

Recently uploaded (20)

Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any TimeTop Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
Top Quality Call Girl Service Kalyanpur 6378878445 Available Call Girls Any Time
 
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Aurangabad Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Bareilly Just Call 9907093804 Top Class Call Girl Service Available
 
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore EscortsVIP Call Girls Indore Kirti 💚😋  9256729539 🚀 Indore Escorts
VIP Call Girls Indore Kirti 💚😋 9256729539 🚀 Indore Escorts
 
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual NeedsBangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
Bangalore Call Girl Whatsapp Number 100% Complete Your Sexual Needs
 
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Cuttack Just Call 9907093804 Top Class Call Girl Service Available
 
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
Top Rated Bangalore Call Girls Richmond Circle ⟟ 8250192130 ⟟ Call Me For Gen...
 
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls DelhiRussian Escorts Girls  Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
Russian Escorts Girls Nehru Place ZINATHI 🔝9711199012 ☪ 24/7 Call Girls Delhi
 
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
Book Paid Powai Call Girls Mumbai 𖠋 9930245274 𖠋Low Budget Full Independent H...
 
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...Bangalore Call Girls Nelamangala Number 7001035870  Meetin With Bangalore Esc...
Bangalore Call Girls Nelamangala Number 7001035870 Meetin With Bangalore Esc...
 
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
Night 7k to 12k Navi Mumbai Call Girl Photo 👉 BOOK NOW 9833363713 👈 ♀️ night ...
 
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Ooty Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Tirupati Just Call 9907093804 Top Class Call Girl Service Available
 
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service AvailableCall Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
Call Girls Gwalior Just Call 8617370543 Top Class Call Girl Service Available
 
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Varanasi Just Call 9907093804 Top Class Call Girl Service Available
 
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
The Most Attractive Hyderabad Call Girls Kothapet 𖠋 6297143586 𖠋 Will You Mis...
 
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
(👑VVIP ISHAAN ) Russian Call Girls Service Navi Mumbai🖕9920874524🖕Independent...
 
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls JaipurRussian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
Russian Call Girls in Jaipur Riya WhatsApp ❤8445551418 VIP Call Girls Jaipur
 
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...
Call Girls Visakhapatnam Just Call 9907093804 Top Class Call Girl Service Ava...
 
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Dehradun Just Call 9907093804 Top Class Call Girl Service Available
 

Chirag patel unite for sight 041418

  • 1. Building a search engine to find and robustly identify environmental factors with phenotype and disease Chirag J Patel Unite for Sight 4/14/2018 chirag@hms.harvard.edu @chiragjp www.chiragjpgroup.org
  • 2. P = G + EType 2 Diabetes Cancer Alzheimer’s Gene expression Phenotype Genome Variants Environment Infectious agents Diet + Nutrients Pollutants Drugs
  • 3. We are great at G investigation! >4000 (as of 1/1/18) 36,066 G-P associations Genome-wide Association Studies (GWAS) https://www.ebi.ac.uk/gwas/ G
  • 4. Nothing comparable to elucidate E influence! E: ??? We lack high-throughput methods and data to discover new E in P…
  • 5. A similar paradigm for discovery should exist for E! Why?
  • 6. σ2 P = σ2 G + σ2 E …
  • 7. σ2 G σ2 P H2 = Heritability (H2) is the range of phenotypic variability attributed to genetic variability in a population Indicator of the proportion of phenotypic differences attributed to G.
  • 8. Height is an example of a heritable trait: Francis Galton shows how its done (1887) “mid-height of 205 parents described 60% of variability of 928 offspring” What else describes height?
  • 9. Source: SNPedia.com Heritability estimates for burdensome diseases are low and variable Type 2 Diabetes (25%) Heart Disease (25-30%) cancer?
  • 10. Source: SNPedia.com G estimates for complex disease (P) are low and variable: massive opportunity for high-throughput E discovery σ2 E
  • 11. What describes this variation NOT explained by genetics?
  • 13. Is it coffee…? HR: 0.9 in N=500K
  • 14. HR: 0.9 in N=50K
  • 16. via tylervigen.com … we just don’t know
  • 17. … we just don’t know xkcd.com
  • 18. We just don’t know: Is everything we are exposed to associated with cancer? Schoenfeld and Ioannidis, AJCN 2012 50 random ingredients from Boston Cooking School Cookbook Any associated with cancer? Of 50, 40 studied in cancer risk Weak statistical evidence: non-replicated inconsistent effects non-standardized
  • 19. … we just don’t know http://fivethirtyeight.com/features/you-cant-trust-what-you-read-about-nutrition/
  • 20. The problem remains: (1) what explains the missing variation in phenotype… σ2 E
  • 21. So the problem remains: (2) and how do we find the stuff that matters? E: ??? Diet Infection Pollution Drugs
  • 22. We are great at G investigation! >4000 (as of 1/1/18) 36,066 G-P associations Genome-wide Association Studies (GWAS) https://www.ebi.ac.uk/gwas/ G How did genetics-based investigations advance? (And advance so quickly?)
  • 23. A new paradigm of GWAS for discovery of G in P: Human Genome Project to GWAS Sequencing of the genome 2001 HapMap project: http://hapmap.ncbi.nlm.nih.gov/ Characterize common variation 2001-current day High-throughput variant assay < $99 for ~1M variants Measurement tools ~2003 (ongoing) Nature 2008 Comprehensive, high-throughput analyses GWAS
  • 24. How can we do better in both discovery and translation?: Leverage data-driven “exposomic” techniques! • Data-driven discovery • search through all the possibilities • gauge the totality of the evidence • New ways to measure the exposome (E)! • scalable ways to measure diet, infection, pollution
  • 25. Explaining the missing variation: A data-driven paradigm for robust discovery of E in disease via systematic study of the “exposome” what to measure? how to measure? “A more comprehensive view of environmental exposure is needed ... to discover major causes of diseases...” how to analyze in relation to health? Wild, 2005, 2012 Rappaport and Smith, 2010, 2011 Buck-Louis and Sundaram 2012 Miller and Jones, 2014 Patel CJ and Ioannidis JPAI, 2014
  • 26. Possible to use existing technologies for E Exposure (and P) Assessment… CEBP 2017 … however, heterogeneous measures that require different study designs and analytic approaches.
  • 27. Promises and Challenges in creating a search engine for identifying E in P JAMA 2014 ARPH 2016 JECH 2014 Curr Epidemiol Rep 2017
  • 28. Examples of data-driven discovery for E associations
  • 29. Gold standard for breadth of human exposure information: National Health and Nutrition Examination Survey1 since the 1960s now biannual: 1999 onwards 10,000 participants per survey 1 http://www.cdc.gov/nchs/nhanes.htm >250 exposures (serum + urine) GWAS chip >200 quantitative clinical traits (e.g., serum glucose, lipids, body mass index) Death index linkage (cause of death)
  • 30. Gold standard for breadth of exposure & behavior data: National Health and Nutrition Examination Survey Nutrients and Vitamins vitamin D, carotenes Infectious Agents hepatitis, HIV, Staph. aureus Plastics and consumables phthalates, bisphenol A Physical Activity e.g., stepsPesticides and pollutants atrazine; cadmium; hydrocarbons Drugs statins; aspirin
  • 31. What E are associated with aging: all-cause mortality, heart disease, and telomere length? Int J Epidem 2013 Int J Epidem 2016
  • 32. Identifying E associated with all-cause mortality: Data-driven searching through 253 associations age (10 years) income (quintile 2) income (quintile 1) male black income (quintile 3) any one smoke in home? Multivariate cox (age, sex, income, education, race/ethnicity, occupation [in red]) serum and urine cadmium [1 SD] past smoker? current smoker?serum lycopene [1SD] physical activity [low, moderate, high activity]* *derived from METs per activity and categorized by Health.gov guidelines R2 ~ 14% (2%)
  • 34. What about other factors related to aging?: 452 associations in Telomere Length! Int J Epidem 2016 PCBs FDR<5% Trunk Fat Alk. PhosCRP Cadmium Cadmium (urine)cigs per day retinyl stearate R2 ~ 1% VO2 Maxpulse rate shorter telomeres longer telomeres adjusted by age, age2, race, poverty, education, occupation median N=3000; N range: 300-7000 2-8 years
  • 35. Interdependencies of the exposome: Correlation globes paint a complex view of exposure Red: positive ρ Blue: negative ρ thickness: |ρ| for each pair of E: Spearman ρ (575 factors: 81,937 correlations) permuted data to produce “null ρ” sought replication in > 1 cohort Pac Symp Biocomput. 2015 JECH. 2015
  • 36. Red: positive ρ Blue: negative ρ thickness: |ρ| for each pair of E: Spearman ρ (575 factors: 81,937 correlations) Interdependencies of the exposome: Correlation globes paint a complex view of exposure: average correlation of < 0.3 permuted data to produce “null ρ” sought replication in > 1 cohort Pac Symp Biocomput. 2015 JECH. 2015 Effective number of variables: 500 (10% decrease)
  • 37. How can we do better in both discovery and translation?: Leverage data-driven “exposomic” techniques! • Data-driven discovery • search through all the possibilities • gauge the totality of the evidence • New ways to measure the exposome (E)! • scalable ways to measure diet, infection, pollution
  • 38. Data-driven discovery to identifying factors that matter! 1.) Find elusive E in P and explain variation of disease risk 2.) Consideration of totality of evidence: Does my correlation matter? 3.) Machine learning methods to detecting signals in observational and large data
  • 39. Data-driven discovery to identifying factors that matter! 1.) Find elusive E in P and explain variation of disease risk 2.) Consideration of totality of evidence: Does my correlation matter? 3.) Machine learning methods to detecting signals in observational and large data ARPH 2016 JAMA 2014 JECH 2015
  • 40. Data-driven discovery to identifying factors that matter! 1.) Find elusive E in P and explain variation of disease risk 2.) Consideration of totality of evidence: Does my correlation matter? 3.) Machine learning methods to detecting signals in observational and large data ARPH 2016 JAMA 2014 JECH 2015
  • 41. How can we do better in both discovery and translation?: Leverage data-driven “exposomic” techniques! • Data-driven discovery • search through all the possibilities • gauge the totality of the evidence • New ways to measure the exposome (E)! • scalable ways to measure diet, infection, pollution
  • 42. Explaining the missing variation: A data-driven paradigm for robust discovery of E in disease via systematic study of the “exposome” what to measure? how to measure? “A more comprehensive view of environmental exposure is needed ... to discover major causes of diseases...” how to analyze in relation to health? Wild, 2005, 2012 Rappaport and Smith, 2010, 2011 Buck-Louis and Sundaram 2012 Miller and Jones, 2014 Patel CJ and Ioannidis JPAI, 2014
  • 43. Need to assess the exposome globally: (e.g., India and China) c/o Getty Images c/o AFP
  • 44. … and Sub-Saharan Africa! Can we predict HIV as a function of the exposome? AIDS 2018
  • 45. Harvard DBMI Susanne Churchill Nathan Palmer Sophia Mamousette Sunny Alvear Chirag J Patel chirag@hms.harvard.edu @chiragjp www.chiragjpgroup.org NIH Common Fund Big Data to Knowledge Acknowledgements RagGroup Arjun Manrai Nam Pho Jake Chung Kajal Claypool Chirag Lakhani Danielle Rasooly Alan LeGoallec Sivateja Tangirala Mentioned Collaborators Isaac Kohane John Ioannidis Dennis Bier Hugo Aschard