SlideShare a Scribd company logo
1 of 28
Download to read offline
Rampant Purifying Selection Drives Singleton
Variants To Be Major Source Of Heritability For
Human Gene Expression
Ryan D. Hernandez
ryan.hernandez@me.com
@rdhernand
Postdoc
positions
available!!!
Singleton Variants Dominate The Genetic
Architecture Of Human Gene Expression
Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen
Selection And Explosive Growth Alter
Genetic Architecture And Hamper The
Detection Of Causal Rare Variants
Uricchio, Zaitlen, Ye, Witte, Hernandez. Genome Research (2016).
Ryan D. Hernandez
ryan.hernandez@me.com
@rdhernand
Heritability and Human Height
http://i.ytimg.com/vi/E0Aeks_id6c/maxresdefault.jpg
heritability: how
much of the variation
in height among
people can be
explained by genetic
factors?
This is not the same
as asking how much
genetic factors
influence height in
any one person.
https://en.wikipedia.org/wiki/Heritability
GWAS have the potential to explain
60% of the variation in height
€
hg
2
: The narrow-sense heritability
explained by all genotyped SNPs.
250,000 subjects
Wood et al, 2014 Nat. Genet.
i.ytimg.com/vi/E0Aeks_id6c/maxresdefault.jpg
Challenges For Studying Complex Diseases
Thecaseofthemissingheritability
NATURE|Vol 456|6 November 2008NEWS FEATURE PERSONAL GENOMES
Maher, Nature (2008).
World Population Growth
“World population” - Wikipedia
The Effect of Negative Selection
Deleterious
mutations will
arise in the next
generation
Chromosomes in
a population with
standing variation
Negative selection:
the action of
natural selection
purging deleterious
mutations.
Majority of human genetic
variation is rare
Class Fraction of variants < 1%
Missense 92.6%
Synonymous 88.5%
Non-coding 82.3%
Variants with frequency <1%
1 2 5 10 20 50 100 500 2000 5000
0.00.10.20.30.40.50.6
Number of non−reference alleles (log scale)
Fractionofvariants
missense
synonymous
noncoding
neutral model
0
0.2
0.4
0.6
0.8
1.0
1
10,000
2
10,000
0.03–0.1%
0.1–0.5%
0.5–1%
1–2%
2–5%
5–10%
10–100%
Synonymous
Neutral: –10–5 < s < 0
Nearly Neut.: –10–4 < s < –10–5
Weak: –10–3 < s < –10–4
Moderate: –10–2 < s < –10–3
Strong: s < –10–2
Colorversionavailableonlin
Derived allele frequency
0
0.2
0.4
0.6
0.8
1.0
1
10,000
2
10,000
0.03–0.1%
0.1–0.5%
0.5–1%
1–2%
2–5%
5–10%
10–100%
Synonymous
Neutral: –10–5 < s < 0
Nearly Neut.: –10–4 < s < –10–5
Weak: –10–3 < s < –10–4
Moderate: –10–2 < s < –10–3
Strong: s < –10–2
Colorversionavailableonlin
0
0.2
0.4
0.6
0.8
1.0
1
10,000
2
10,000
0.03–0.1%
0.1–0.5%
0.5–1%
1–2%
2–5%
5–10%
10–100%
Synonymous
Neutral: –10–5 < s < 0
Nearly Neut.: –10–4 < s < –10–5
Weak: –10–3 < s < –10–4
Moderate: –10–2 < s < –10–3
Strong: s < –10–2
Colorversionavailableonlin
Maher, et al. Hum Hered 74, 118-128 (2012).
Evolutionary Models Of
Complex Disease
Other
Phenotype
SNP
Disease
propensity
Disease
Fitness
Pleiotropy: SNP impacts multiple phenotypes
ρ:
correlation(effect size, fitness)
(Simons et al, 2014)
τ:
transforms fitness effect to
phenotype (Eyre-Walker, 2010)
Uricchio et al., Genome Research (2016)
Growth model: Gravel et al (2011)
Explosive growth: Tennessen et al (2012)
AFRICA EUROPE
0
0.2
0.4
0.6
0.8
1.0
1
10,000
2
10,000
0.03–0.1%
0.1–0.5%
0.5–1%
1–2%
2–5%
5–10%
10–100%
Synonymous
Neutral: –10–5 < s < 0
Nearly Neut.: –10–4 < s < –10–5
Weak: –10–3 < s < –10–4
Moderate: –10–2 < s < –10–3
Strong: s < –10–2
Fig. 3. The distribution of selection coefficients for variants in each
allele frequency bin (starting with singletons, doubletons, and then
a disjoint partition of rare and common variants). Common vari-
ants (>5%) are expected to be primarily neutral or nearly neutral.
Most strongly deleterious mutations have frequency <0.1%.
0.011–0.999
1
Fitness effects in non-coding DNA:
Torgerson et al (2009)
effect size = f(demography, natural selection)
Human-specific demography and Selection
Uricchio, et al. Genome Res 26, 863-873 (2016).
Neutral model: most variance
explained by common alleles
Proportionofh2explained
byalleleswithfreq≤ω
5e-04 1e-03 5e-03 1e-02 5e-02 1e-01 5e-01 1e+00
0.00.20.40.60.81.0
Standard Neutral Model
derived allele frequency, ω
Proportionofvarianceexplained,Vω/V1
Ultrarare
Intermediaterare
Common
5e-04 1e-03 5e-03 1e-02 5e-02 1e-01 5e-01 1e+00
0.00.20.40.60.81.0
Standard Neutral Model
derived allele frequency, ω
Proportionofvarianceexplained,Vω/V1
Ultrarare
Intermediaterare
Common
5e-04 1e-03 5e-03 1e-02 5e-02 1e-01 5e-01 1e+00
0.00.20.40.60.81.0
Standard Neutral Model
derived allele frequency, ω
Proportionofvarianceexplained,Vω/V1
Ultrarare
Intermediaterare
Common
Uricchio, et al. Genome Res 26, 863-873 (2016).
5e-04 5e-03 5e-02 5e-01
0.00.20.40.60.81.0
AFR, Growth
derived allele frequency, ω
VωV1
Proportionofh2explained
byalleleswithfreq≤ω
Evolutionary Model: Most Variance explained
by Ultra Rare Alleles or common alleles…
Uricchio, et al. Genome Res 26, 863-873 (2016).
Evolutionary Model: Most Variance explained
by Ultra Rare Alleles or common alleles…
Uricchio et al.
Coldon June 17, 2016 - Published bygenome.cshlp.orgDownloaded from
Uricchio et al.
Coldon June 17, 2016 - Published bygenome.cshlp.orgDownloaded from
Proportionofvarianceexplained
byalleleswithfreq≤x
Proportionofvarianceexplained
byalleleswithfreq≤x
model captures the
ill have a tight corre-
In Figure 1B, we plot the proporti
variance that is explained by singletons, V
of the genetic archi
the variance explai
frequency 1, and h
genetic variance in
Vc/V1 is strongly im
events and the rela
tion and effect siz
the role of rare vari
contractions have
tained exponentia
drastic increase in
Note that this tim
for exponential ex
different from the
pansion event at t
ts under a European growth model (Gravel et al.
singletons (c). (B) The proportion of the genetic
model captures the
ill have a tight corre-
In Figure 1B, we plot the proporti
variance that is explained by singletons, V
of the genetic archi
the variance explai
frequency 1, and h
genetic variance in
Vc/V1 is strongly im
events and the rela
tion and effect siz
the role of rare vari
contractions have
tained exponentia
drastic increase in
Note that this tim
for exponential ex
different from the
pansion event at t
ts under a European growth model (Gravel et al.
singletons (c). (B) The proportion of the genetic
Demography and selection shape the
contribution of ultra-rare variants
model captures the
ill have a tight corre-
In Figure 1B, we plot the proporti
variance that is explained by singletons, V
of the genetic archi
the variance explai
frequency 1, and h
genetic variance in
Vc/V1 is strongly im
events and the rela
tion and effect siz
the role of rare vari
contractions have
tained exponentia
drastic increase in
Note that this tim
for exponential ex
different from the
pansion event at t
ts under a European growth model (Gravel et al.
singletons (c). (B) The proportion of the genetic
Proportionofvariance
explainedbysingletons
Uricchio, et al. Genome Res 26, 863-873 (2016).
•European demography
(Gravel et al., 2011)
•n=500 chromosomes
• Large-scale RNA sequencing + WGS
• 4 European populations
• 360 individuals
• low coverage WGS + high coverage
exome: Phase 3.
• RNA-seq: median depth 58.3M reads
• Gene expression: 

log2 transformed, median centered,
and quantile normalized.
• 10,077 unique genes.
FIN
GBR
TSI
CEU
YRI
Noah
Zaitlen
• Our sample size is
small, but can we
learn anything about
the genetic basis of
complex traits from
these 10k genes?
• Let’s analyze
heritability of gene
expression due to cis
variation (within 1Mb
of gene)
• Large-scale RNA sequencing + WGS
• 4 European populations
• 360 individuals
• low coverage WGS + high coverage
exome: Phase 3.
• RNA-seq: median depth 58.3M reads
• Gene expression: 

log2 transformed, median centered,
and quantile normalized.
• 10,077 unique genes.
Heritability
• Several pioneers have developed ways to infer heritability
from unrelated individuals
• Linear Mixed Models (GCTA/LDAK)
• Requires complex optimization procedures that often fail
with very small sample sizes.
• Can be biased.
• Haseman-Elston (HE) regression
• Much faster than LMM.
• Unbiased.
• No sample size issues!
Major Problem
• We can apply a method to data and get an answer, but we have
no idea whether it would actually be a good one!
• There is no complex trait in which we know:
• The number of causal variants
• The frequencies of the causal variants
• The effect sizes of the causal variants
• The fitness effect of the causal variants
• We need a thorough simulation study where we can vary all of
these parameters and see how they effect our answer!
1 2 5 36
−1.0−0.50.00.51.01.52.0
RelativeBias
K=1
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
1
−1.0−0.50.00.51.01.52.0
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
1 2 5 36
−1.0−0.
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
1
−1.0−0.
1 2 5 36
−1.0−0.50.00.51.01.52.0
min MAC
RelativeBias
K=10
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
● ●
●
1
−1.0−0.50.00.51.01.52.0
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
2 5 36
●
●
●
●
●
●
●
●
1 2 5 36
−1.0
●
●
●
●
●
●
●
●
2 5 36
min MAC
K=20
●
●
●
●
●
●
● ●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
● ●
●
●
●
●
●
●
●
●
●
●
● ●
●
1 2 5 36
−1.0−0.50.00.51.01.52.0
min MAC
K=50
●
●
●
●
●
●
●
●
●
● ●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ●
●
●
fracRare=1
fracRare=0.75
fracRare=0.5
fracRare=0.1
Nc=20
Nc=50
Nc=100
Simulations:
Effect Size ∝ log(freq)
1
0.00.51.0
SNP bin
Biasofh2
bin
K=1
1 2
0.00.51.0
SNP bin
K=2
1 3 5
0.00.51.0
SNP bin
K=5
1 5 10
0.00.51.0 SNP bin
K=10
1 10 20
0.00.51.0
SNP bin
K=20
Our Approach:
Partitioning SNPs by frequency
0
0.2
0.4
0.6
0.8
1.0
1
10,000
2
10,000
0.03–0.1%
0.1–0.5%
0.5–1%
1–2%
2–5%
5–10%
10–100%
Synonymous
Neutral: –10–5 < s < 0
Nearly Neut.: –10–4 < s < –10–5
Weak: –10–3 < s < –10–4
Moderate: –10–2 < s < –10–3
Strong: s < –10–2
Derived allele frequency
Singletons
emerge as
their own
category!
1
0.00.51.0
SNP bin
Biasofh2
bin
K=1
1 2
0.00.51.0
SNP bin
K=2
1 3 5
0.00.51.0
SNP bin
K=5
1 5 10
0.00.51.0 SNP bin
K=10
1 10 20
0.00.51.0
SNP bin
K=20
1
0.00.51.0
SNP bin
Biasofh2
bin
K=1
1 2
0.00.51.0
SNP bin
K=2
1 3 5
0.00.51.0
SNP bin
K=5
1 5 10
0.00.51.0 SNP bin
K=10
1 10 20
0.00.51.0
SNP bin
K=20
Our Approach:
Partitioning SNPs by frequency
Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
1
0.00.51.0
SNP bin
Biasofh2
bin
K=1
1 2
0.00.51.0
SNP bin
K=2
1 3 5
0.00.51.0
SNP bin
K=5
1 5 10
0.00.51.0 SNP bin
K=10
1 10 20
0.00.51.0
SNP bin
K=20
1
0.00.51.0
SNP bin
Biasofh2
bin
K=1
1 2
0.00.51.0
SNP bin
K=2
1 3 5
0.00.51.0
SNP bin
K=5
1 5 10
0.00.51.0 SNP bin
K=10
1 10 20
0.00.51.0
SNP bin
K=20
Our Approach:
Partitioning SNPs by frequency
Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
Finding Missing
heritability
●
●
●
●
MAC ≥ 1
MAC ≥ 2
MAC ≥ 5
MAF ≥ 5%
●●●●●●
●
●●
●●
●
●
●●
●
●
●
●
●
● ● ● ● ●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
● ● ● ● ● ● ●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
● ● ● ● ● ● ●
● ●
●
●
●
●
●
●
●
●
●
●
* 1e−4 1e−3 0.01 0.1 0.5
0.000.020.040.060.08
Minor Allele Frequency
Cumulativeh2
Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
Fractionofheritability
●
●
●
●
●
● ●
●
●
●
●
● ●
●
●
●●
●
●
●
1e−3 0.01 0.1 0.5
0.000.050.100.150.200.25
Minor Allele Frequency
Prop.ofh2
total
●
●
● ● ● ●
●
●
● ● ● ● ● ●●●●●●●
* 1e−4 0.01 0.5
0.00.40.8
Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
Singletons!
•Partition
singletons by
gnomAD MAF
(n>15,000)
•>90% of
singleton
heritability is
due to
EXTREMELY
rare variants.
Finding Missing
heritability
●
●
●
●
●
gnomAD
MAC ≥ 1
MAC ≥ 2
MAC ≥ 5
MAF ≥ 5%
●●●●●●
●
●●
●●
●
●
●●
●
●
●
●
●
● ● ● ● ●
●
●●●
●
●
●
●
●
●
●
●
●
●
●
● ● ● ● ● ● ●
●
● ●
●
●
●
●
●
●
●
●
●
●
●
● ● ● ● ● ● ●
● ●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
●
● ● ● ● ● ●
●
●
●
●
●
●
●
●
●
* 1e−4 1e−3 0.01 0.1 0.5
0.000.020.040.060.08
Minor Allele Frequency
Cumulativeh2
Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
Summary
•We still know little about the genetic basis of most complex
traits
•Our evolutionary modeling gives insights into what the
patterns of heritability should look like
•Analyzing heritability patterns across ~10k gene expression
patterns demonstrates a major impact of rare variants across
most genes.
•But our power to detect causal loci using existing tests is
very poor.
Thanks!
Noah
Zaitlen
Jimmie
Ye
John
Witte
Lawrence
Uricchio
Ryan D. Hernandez
ryan.hernandez@me.com
@rdhernand
Aria
Hernandez
Postdoc
positions
available!!!

More Related Content

What's hot

Population Genetics
Population GeneticsPopulation Genetics
Population Genetics
Jolie Yu
 
Evolution Lectures WK8
Evolution Lectures WK8Evolution Lectures WK8
Evolution Lectures WK8
Andrea Hatlen
 
Population genetics with qs
Population genetics with qsPopulation genetics with qs
Population genetics with qs
tas11244
 
Microevolution Changing Allele Frequencies
Microevolution Changing Allele FrequenciesMicroevolution Changing Allele Frequencies
Microevolution Changing Allele Frequencies
Tauqeer Ahmad
 
Ch. 13 Population genetics
Ch. 13 Population geneticsCh. 13 Population genetics
Ch. 13 Population genetics
Martin Jellinek
 

What's hot (20)

Population Genetics
Population GeneticsPopulation Genetics
Population Genetics
 
Population and evolutionary genetics 1
Population and evolutionary genetics 1Population and evolutionary genetics 1
Population and evolutionary genetics 1
 
B.sc. agri i pog unit 4 population genetics
B.sc. agri i pog unit 4 population geneticsB.sc. agri i pog unit 4 population genetics
B.sc. agri i pog unit 4 population genetics
 
application of population genetics
 application of population genetics application of population genetics
application of population genetics
 
Population genetics
Population geneticsPopulation genetics
Population genetics
 
Genetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacologyGenetic variation and its role in health pharmacology
Genetic variation and its role in health pharmacology
 
Evolution Lectures WK8
Evolution Lectures WK8Evolution Lectures WK8
Evolution Lectures WK8
 
Population genetics with qs
Population genetics with qsPopulation genetics with qs
Population genetics with qs
 
Microevolution Changing Allele Frequencies
Microevolution Changing Allele FrequenciesMicroevolution Changing Allele Frequencies
Microevolution Changing Allele Frequencies
 
Changes in gene frequency
Changes  in gene frequencyChanges  in gene frequency
Changes in gene frequency
 
microevolution
microevolutionmicroevolution
microevolution
 
Evolution lecture 2
Evolution lecture 2Evolution lecture 2
Evolution lecture 2
 
Hardy – weinberg law
 Hardy – weinberg law Hardy – weinberg law
Hardy – weinberg law
 
Mechanisms of evolution-I
Mechanisms of evolution-IMechanisms of evolution-I
Mechanisms of evolution-I
 
Unit 12 Mechanisms For Evolution
Unit 12 Mechanisms For EvolutionUnit 12 Mechanisms For Evolution
Unit 12 Mechanisms For Evolution
 
Evidence of Evolution by Natural Selection - how basic evolutionary principal...
Evidence of Evolution by Natural Selection - how basic evolutionary principal...Evidence of Evolution by Natural Selection - how basic evolutionary principal...
Evidence of Evolution by Natural Selection - how basic evolutionary principal...
 
Evolution and gene frequencies my report
Evolution and gene frequencies my reportEvolution and gene frequencies my report
Evolution and gene frequencies my report
 
Ch. 13 Population genetics
Ch. 13 Population geneticsCh. 13 Population genetics
Ch. 13 Population genetics
 
Rates and trends of evolution
Rates and trends of evolutionRates and trends of evolution
Rates and trends of evolution
 
Why need to study population genetics & applications of population genetics
Why need to study population genetics & applications of population geneticsWhy need to study population genetics & applications of population genetics
Why need to study population genetics & applications of population genetics
 

Similar to Hernandez ESEB 2018

Evolution lectures WK6
Evolution lectures WK6Evolution lectures WK6
Evolution lectures WK6
Andrea Hatlen
 
Microevolution hardy weinberg
Microevolution hardy weinbergMicroevolution hardy weinberg
Microevolution hardy weinberg
Tauqeer Ahmad
 
Population genetics with qs
Population genetics with qsPopulation genetics with qs
Population genetics with qs
tas11244
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docx
write4
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docx
write5
 
Using the space provided compose an ESSAY concerning the following qu.pdf
Using the space provided compose an ESSAY concerning the following qu.pdfUsing the space provided compose an ESSAY concerning the following qu.pdf
Using the space provided compose an ESSAY concerning the following qu.pdf
feelinggift
 
Honors ~ Evolution 1011
Honors ~ Evolution 1011Honors ~ Evolution 1011
Honors ~ Evolution 1011
Michael Edgar
 
Population Variation.pptx jjjjmkokokomkokk
Population Variation.pptx jjjjmkokokomkokkPopulation Variation.pptx jjjjmkokokomkokk
Population Variation.pptx jjjjmkokokomkokk
alizain9604
 
Principles of genetic epidemiology student version.pptx
Principles of genetic epidemiology student version.pptxPrinciples of genetic epidemiology student version.pptx
Principles of genetic epidemiology student version.pptx
ssuser0d70fd
 

Similar to Hernandez ESEB 2018 (20)

Week 5 genetic basis of evolution
Week 5   genetic basis of evolutionWeek 5   genetic basis of evolution
Week 5 genetic basis of evolution
 
2014 sbc174-evolution lectureswk5
2014 sbc174-evolution lectureswk52014 sbc174-evolution lectureswk5
2014 sbc174-evolution lectureswk5
 
Gene Flow.pptx
Gene Flow.pptxGene Flow.pptx
Gene Flow.pptx
 
Evolution lectures WK6
Evolution lectures WK6Evolution lectures WK6
Evolution lectures WK6
 
Microevolution hardy weinberg
Microevolution hardy weinbergMicroevolution hardy weinberg
Microevolution hardy weinberg
 
Population genetics with qs
Population genetics with qsPopulation genetics with qs
Population genetics with qs
 
06 random drift
06 random drift06 random drift
06 random drift
 
Mating systems (population genetics)
Mating systems (population genetics)Mating systems (population genetics)
Mating systems (population genetics)
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docx
 
Learning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docxLearning Objectives Define inheritance Define allele and trait Describe.docx
Learning Objectives Define inheritance Define allele and trait Describe.docx
 
W-4-MECHANISM.pptx
W-4-MECHANISM.pptxW-4-MECHANISM.pptx
W-4-MECHANISM.pptx
 
Using the space provided compose an ESSAY concerning the following qu.pdf
Using the space provided compose an ESSAY concerning the following qu.pdfUsing the space provided compose an ESSAY concerning the following qu.pdf
Using the space provided compose an ESSAY concerning the following qu.pdf
 
Allele frequency
Allele frequencyAllele frequency
Allele frequency
 
Honors ~ Evolution 1011
Honors ~ Evolution 1011Honors ~ Evolution 1011
Honors ~ Evolution 1011
 
Population genetics
Population geneticsPopulation genetics
Population genetics
 
مشروووووع الوراثه
مشروووووع الوراثهمشروووووع الوراثه
مشروووووع الوراثه
 
Evolutionary mechanisms population size, genetic drift, gene flow
Evolutionary mechanisms population size, genetic drift, gene flowEvolutionary mechanisms population size, genetic drift, gene flow
Evolutionary mechanisms population size, genetic drift, gene flow
 
Population Variation.pptx jjjjmkokokomkokk
Population Variation.pptx jjjjmkokokomkokkPopulation Variation.pptx jjjjmkokokomkokk
Population Variation.pptx jjjjmkokokomkokk
 
Microevolution
MicroevolutionMicroevolution
Microevolution
 
Principles of genetic epidemiology student version.pptx
Principles of genetic epidemiology student version.pptxPrinciples of genetic epidemiology student version.pptx
Principles of genetic epidemiology student version.pptx
 

More from rdhernand (6)

Bayes NIPT
Bayes NIPTBayes NIPT
Bayes NIPT
 
Hernandez smbe17
Hernandez smbe17Hernandez smbe17
Hernandez smbe17
 
Hernandez ashg 2016_share
Hernandez ashg 2016_shareHernandez ashg 2016_share
Hernandez ashg 2016_share
 
Hernandez ASHG 2015
Hernandez ASHG 2015Hernandez ASHG 2015
Hernandez ASHG 2015
 
Hernandez smbe 2015
Hernandez smbe 2015Hernandez smbe 2015
Hernandez smbe 2015
 
Evolutionary Forces Shaping Human Genetic Variation #1000GB
Evolutionary Forces Shaping Human Genetic Variation #1000GBEvolutionary Forces Shaping Human Genetic Variation #1000GB
Evolutionary Forces Shaping Human Genetic Variation #1000GB
 

Recently uploaded

Heat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree daysHeat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree days
Brahmesh Reddy B R
 
HIV AND INFULENZA VIRUS PPT HIV PPT INFULENZA VIRUS PPT
HIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPTHIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPT

Recently uploaded (20)

TEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdfTEST BANK for Organic Chemistry 6th Edition.pdf
TEST BANK for Organic Chemistry 6th Edition.pdf
 
Heads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdfHeads-Up Multitasker: CHI 2024 Presentation.pdf
Heads-Up Multitasker: CHI 2024 Presentation.pdf
 
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed RahimoonVital Signs of Animals Presentation By Aftab Ahmed Rahimoon
Vital Signs of Animals Presentation By Aftab Ahmed Rahimoon
 
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY  // USES OF ANTIOBIOTICS TYPES OF ANTIB...
ABHISHEK ANTIBIOTICS PPT MICROBIOLOGY // USES OF ANTIOBIOTICS TYPES OF ANTIB...
 
Polyethylene and its polymerization.pptx
Polyethylene and its polymerization.pptxPolyethylene and its polymerization.pptx
Polyethylene and its polymerization.pptx
 
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRingsTransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
TransientOffsetin14CAftertheCarringtonEventRecordedbyPolarTreeRings
 
Taphonomy and Quality of the Fossil Record
Taphonomy and Quality of the  Fossil RecordTaphonomy and Quality of the  Fossil Record
Taphonomy and Quality of the Fossil Record
 
GBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of AsepsisGBSN - Microbiology (Unit 4) Concept of Asepsis
GBSN - Microbiology (Unit 4) Concept of Asepsis
 
Micropropagation of Madagascar periwinkle (Catharanthus roseus)
Micropropagation of Madagascar periwinkle (Catharanthus roseus)Micropropagation of Madagascar periwinkle (Catharanthus roseus)
Micropropagation of Madagascar periwinkle (Catharanthus roseus)
 
NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.NUMERICAL Proof Of TIme Electron Theory.
NUMERICAL Proof Of TIme Electron Theory.
 
Heat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree daysHeat Units in plant physiology and the importance of Growing Degree days
Heat Units in plant physiology and the importance of Growing Degree days
 
GBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) MetabolismGBSN - Biochemistry (Unit 3) Metabolism
GBSN - Biochemistry (Unit 3) Metabolism
 
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptxSaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
SaffronCrocusGenomicsThessalonikiOnlineMay2024TalkOnline.pptx
 
GBSN - Biochemistry (Unit 8) Enzymology
GBSN - Biochemistry (Unit 8) EnzymologyGBSN - Biochemistry (Unit 8) Enzymology
GBSN - Biochemistry (Unit 8) Enzymology
 
HIV AND INFULENZA VIRUS PPT HIV PPT INFULENZA VIRUS PPT
HIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPTHIV AND INFULENZA VIRUS PPT HIV PPT  INFULENZA VIRUS PPT
HIV AND INFULENZA VIRUS PPT HIV PPT INFULENZA VIRUS PPT
 
Classification of Kerogen, Perspective on palynofacies in depositional envi...
Classification of Kerogen,  Perspective on palynofacies in depositional  envi...Classification of Kerogen,  Perspective on palynofacies in depositional  envi...
Classification of Kerogen, Perspective on palynofacies in depositional envi...
 
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdfFORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
FORENSIC CHEMISTRY ARSON INVESTIGATION.pdf
 
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
Soil and Water Conservation Engineering (SWCE) is a specialized field of stud...
 
THE FUNDAMENTAL UNIT OF LIFE CLASS IX.ppt
THE FUNDAMENTAL UNIT OF LIFE CLASS IX.pptTHE FUNDAMENTAL UNIT OF LIFE CLASS IX.ppt
THE FUNDAMENTAL UNIT OF LIFE CLASS IX.ppt
 
Precision Farming in Fruit Crops presentation
Precision Farming in Fruit Crops presentationPrecision Farming in Fruit Crops presentation
Precision Farming in Fruit Crops presentation
 

Hernandez ESEB 2018

  • 1. Rampant Purifying Selection Drives Singleton Variants To Be Major Source Of Heritability For Human Gene Expression Ryan D. Hernandez ryan.hernandez@me.com @rdhernand Postdoc positions available!!!
  • 2. Singleton Variants Dominate The Genetic Architecture Of Human Gene Expression Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen Selection And Explosive Growth Alter Genetic Architecture And Hamper The Detection Of Causal Rare Variants Uricchio, Zaitlen, Ye, Witte, Hernandez. Genome Research (2016). Ryan D. Hernandez ryan.hernandez@me.com @rdhernand
  • 3. Heritability and Human Height http://i.ytimg.com/vi/E0Aeks_id6c/maxresdefault.jpg heritability: how much of the variation in height among people can be explained by genetic factors? This is not the same as asking how much genetic factors influence height in any one person. https://en.wikipedia.org/wiki/Heritability
  • 4. GWAS have the potential to explain 60% of the variation in height € hg 2 : The narrow-sense heritability explained by all genotyped SNPs. 250,000 subjects Wood et al, 2014 Nat. Genet. i.ytimg.com/vi/E0Aeks_id6c/maxresdefault.jpg
  • 5. Challenges For Studying Complex Diseases Thecaseofthemissingheritability NATURE|Vol 456|6 November 2008NEWS FEATURE PERSONAL GENOMES Maher, Nature (2008).
  • 6. World Population Growth “World population” - Wikipedia
  • 7. The Effect of Negative Selection Deleterious mutations will arise in the next generation Chromosomes in a population with standing variation Negative selection: the action of natural selection purging deleterious mutations.
  • 8. Majority of human genetic variation is rare Class Fraction of variants < 1% Missense 92.6% Synonymous 88.5% Non-coding 82.3% Variants with frequency <1% 1 2 5 10 20 50 100 500 2000 5000 0.00.10.20.30.40.50.6 Number of non−reference alleles (log scale) Fractionofvariants missense synonymous noncoding neutral model
  • 9. 0 0.2 0.4 0.6 0.8 1.0 1 10,000 2 10,000 0.03–0.1% 0.1–0.5% 0.5–1% 1–2% 2–5% 5–10% 10–100% Synonymous Neutral: –10–5 < s < 0 Nearly Neut.: –10–4 < s < –10–5 Weak: –10–3 < s < –10–4 Moderate: –10–2 < s < –10–3 Strong: s < –10–2 Colorversionavailableonlin Derived allele frequency 0 0.2 0.4 0.6 0.8 1.0 1 10,000 2 10,000 0.03–0.1% 0.1–0.5% 0.5–1% 1–2% 2–5% 5–10% 10–100% Synonymous Neutral: –10–5 < s < 0 Nearly Neut.: –10–4 < s < –10–5 Weak: –10–3 < s < –10–4 Moderate: –10–2 < s < –10–3 Strong: s < –10–2 Colorversionavailableonlin 0 0.2 0.4 0.6 0.8 1.0 1 10,000 2 10,000 0.03–0.1% 0.1–0.5% 0.5–1% 1–2% 2–5% 5–10% 10–100% Synonymous Neutral: –10–5 < s < 0 Nearly Neut.: –10–4 < s < –10–5 Weak: –10–3 < s < –10–4 Moderate: –10–2 < s < –10–3 Strong: s < –10–2 Colorversionavailableonlin Maher, et al. Hum Hered 74, 118-128 (2012).
  • 10. Evolutionary Models Of Complex Disease Other Phenotype SNP Disease propensity Disease Fitness Pleiotropy: SNP impacts multiple phenotypes ρ: correlation(effect size, fitness) (Simons et al, 2014) τ: transforms fitness effect to phenotype (Eyre-Walker, 2010) Uricchio et al., Genome Research (2016)
  • 11. Growth model: Gravel et al (2011) Explosive growth: Tennessen et al (2012) AFRICA EUROPE 0 0.2 0.4 0.6 0.8 1.0 1 10,000 2 10,000 0.03–0.1% 0.1–0.5% 0.5–1% 1–2% 2–5% 5–10% 10–100% Synonymous Neutral: –10–5 < s < 0 Nearly Neut.: –10–4 < s < –10–5 Weak: –10–3 < s < –10–4 Moderate: –10–2 < s < –10–3 Strong: s < –10–2 Fig. 3. The distribution of selection coefficients for variants in each allele frequency bin (starting with singletons, doubletons, and then a disjoint partition of rare and common variants). Common vari- ants (>5%) are expected to be primarily neutral or nearly neutral. Most strongly deleterious mutations have frequency <0.1%. 0.011–0.999 1 Fitness effects in non-coding DNA: Torgerson et al (2009) effect size = f(demography, natural selection) Human-specific demography and Selection Uricchio, et al. Genome Res 26, 863-873 (2016).
  • 12. Neutral model: most variance explained by common alleles Proportionofh2explained byalleleswithfreq≤ω 5e-04 1e-03 5e-03 1e-02 5e-02 1e-01 5e-01 1e+00 0.00.20.40.60.81.0 Standard Neutral Model derived allele frequency, ω Proportionofvarianceexplained,Vω/V1 Ultrarare Intermediaterare Common 5e-04 1e-03 5e-03 1e-02 5e-02 1e-01 5e-01 1e+00 0.00.20.40.60.81.0 Standard Neutral Model derived allele frequency, ω Proportionofvarianceexplained,Vω/V1 Ultrarare Intermediaterare Common 5e-04 1e-03 5e-03 1e-02 5e-02 1e-01 5e-01 1e+00 0.00.20.40.60.81.0 Standard Neutral Model derived allele frequency, ω Proportionofvarianceexplained,Vω/V1 Ultrarare Intermediaterare Common Uricchio, et al. Genome Res 26, 863-873 (2016).
  • 13. 5e-04 5e-03 5e-02 5e-01 0.00.20.40.60.81.0 AFR, Growth derived allele frequency, ω VωV1 Proportionofh2explained byalleleswithfreq≤ω Evolutionary Model: Most Variance explained by Ultra Rare Alleles or common alleles… Uricchio, et al. Genome Res 26, 863-873 (2016).
  • 14. Evolutionary Model: Most Variance explained by Ultra Rare Alleles or common alleles… Uricchio et al. Coldon June 17, 2016 - Published bygenome.cshlp.orgDownloaded from Uricchio et al. Coldon June 17, 2016 - Published bygenome.cshlp.orgDownloaded from Proportionofvarianceexplained byalleleswithfreq≤x Proportionofvarianceexplained byalleleswithfreq≤x
  • 15. model captures the ill have a tight corre- In Figure 1B, we plot the proporti variance that is explained by singletons, V of the genetic archi the variance explai frequency 1, and h genetic variance in Vc/V1 is strongly im events and the rela tion and effect siz the role of rare vari contractions have tained exponentia drastic increase in Note that this tim for exponential ex different from the pansion event at t ts under a European growth model (Gravel et al. singletons (c). (B) The proportion of the genetic model captures the ill have a tight corre- In Figure 1B, we plot the proporti variance that is explained by singletons, V of the genetic archi the variance explai frequency 1, and h genetic variance in Vc/V1 is strongly im events and the rela tion and effect siz the role of rare vari contractions have tained exponentia drastic increase in Note that this tim for exponential ex different from the pansion event at t ts under a European growth model (Gravel et al. singletons (c). (B) The proportion of the genetic Demography and selection shape the contribution of ultra-rare variants model captures the ill have a tight corre- In Figure 1B, we plot the proporti variance that is explained by singletons, V of the genetic archi the variance explai frequency 1, and h genetic variance in Vc/V1 is strongly im events and the rela tion and effect siz the role of rare vari contractions have tained exponentia drastic increase in Note that this tim for exponential ex different from the pansion event at t ts under a European growth model (Gravel et al. singletons (c). (B) The proportion of the genetic Proportionofvariance explainedbysingletons Uricchio, et al. Genome Res 26, 863-873 (2016). •European demography (Gravel et al., 2011) •n=500 chromosomes
  • 16. • Large-scale RNA sequencing + WGS • 4 European populations • 360 individuals • low coverage WGS + high coverage exome: Phase 3. • RNA-seq: median depth 58.3M reads • Gene expression: 
 log2 transformed, median centered, and quantile normalized. • 10,077 unique genes. FIN GBR TSI CEU YRI Noah Zaitlen
  • 17. • Our sample size is small, but can we learn anything about the genetic basis of complex traits from these 10k genes? • Let’s analyze heritability of gene expression due to cis variation (within 1Mb of gene) • Large-scale RNA sequencing + WGS • 4 European populations • 360 individuals • low coverage WGS + high coverage exome: Phase 3. • RNA-seq: median depth 58.3M reads • Gene expression: 
 log2 transformed, median centered, and quantile normalized. • 10,077 unique genes.
  • 18. Heritability • Several pioneers have developed ways to infer heritability from unrelated individuals • Linear Mixed Models (GCTA/LDAK) • Requires complex optimization procedures that often fail with very small sample sizes. • Can be biased. • Haseman-Elston (HE) regression • Much faster than LMM. • Unbiased. • No sample size issues!
  • 19. Major Problem • We can apply a method to data and get an answer, but we have no idea whether it would actually be a good one! • There is no complex trait in which we know: • The number of causal variants • The frequencies of the causal variants • The effect sizes of the causal variants • The fitness effect of the causal variants • We need a thorough simulation study where we can vary all of these parameters and see how they effect our answer!
  • 20. 1 2 5 36 −1.0−0.50.00.51.01.52.0 RelativeBias K=1 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 1 −1.0−0.50.00.51.01.52.0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 1 2 5 36 −1.0−0. ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 1 −1.0−0. 1 2 5 36 −1.0−0.50.00.51.01.52.0 min MAC RelativeBias K=10 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 1 −1.0−0.50.00.51.01.52.0 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 2 5 36 ● ● ● ● ● ● ● ● 1 2 5 36 −1.0 ● ● ● ● ● ● ● ● 2 5 36 min MAC K=20 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● 1 2 5 36 −1.0−0.50.00.51.01.52.0 min MAC K=50 ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● fracRare=1 fracRare=0.75 fracRare=0.5 fracRare=0.1 Nc=20 Nc=50 Nc=100 Simulations: Effect Size ∝ log(freq)
  • 21. 1 0.00.51.0 SNP bin Biasofh2 bin K=1 1 2 0.00.51.0 SNP bin K=2 1 3 5 0.00.51.0 SNP bin K=5 1 5 10 0.00.51.0 SNP bin K=10 1 10 20 0.00.51.0 SNP bin K=20 Our Approach: Partitioning SNPs by frequency 0 0.2 0.4 0.6 0.8 1.0 1 10,000 2 10,000 0.03–0.1% 0.1–0.5% 0.5–1% 1–2% 2–5% 5–10% 10–100% Synonymous Neutral: –10–5 < s < 0 Nearly Neut.: –10–4 < s < –10–5 Weak: –10–3 < s < –10–4 Moderate: –10–2 < s < –10–3 Strong: s < –10–2 Derived allele frequency
  • 22. Singletons emerge as their own category! 1 0.00.51.0 SNP bin Biasofh2 bin K=1 1 2 0.00.51.0 SNP bin K=2 1 3 5 0.00.51.0 SNP bin K=5 1 5 10 0.00.51.0 SNP bin K=10 1 10 20 0.00.51.0 SNP bin K=20 1 0.00.51.0 SNP bin Biasofh2 bin K=1 1 2 0.00.51.0 SNP bin K=2 1 3 5 0.00.51.0 SNP bin K=5 1 5 10 0.00.51.0 SNP bin K=10 1 10 20 0.00.51.0 SNP bin K=20 Our Approach: Partitioning SNPs by frequency Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
  • 23. 1 0.00.51.0 SNP bin Biasofh2 bin K=1 1 2 0.00.51.0 SNP bin K=2 1 3 5 0.00.51.0 SNP bin K=5 1 5 10 0.00.51.0 SNP bin K=10 1 10 20 0.00.51.0 SNP bin K=20 1 0.00.51.0 SNP bin Biasofh2 bin K=1 1 2 0.00.51.0 SNP bin K=2 1 3 5 0.00.51.0 SNP bin K=5 1 5 10 0.00.51.0 SNP bin K=10 1 10 20 0.00.51.0 SNP bin K=20 Our Approach: Partitioning SNPs by frequency Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
  • 24. Finding Missing heritability ● ● ● ● MAC ≥ 1 MAC ≥ 2 MAC ≥ 5 MAF ≥ 5% ●●●●●● ● ●● ●● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● * 1e−4 1e−3 0.01 0.1 0.5 0.000.020.040.060.08 Minor Allele Frequency Cumulativeh2 Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
  • 25. Fractionofheritability ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ●● ● ● ● 1e−3 0.01 0.1 0.5 0.000.050.100.150.200.25 Minor Allele Frequency Prop.ofh2 total ● ● ● ● ● ● ● ● ● ● ● ● ● ●●●●●●● * 1e−4 0.01 0.5 0.00.40.8 Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017) Singletons! •Partition singletons by gnomAD MAF (n>15,000) •>90% of singleton heritability is due to EXTREMELY rare variants.
  • 26. Finding Missing heritability ● ● ● ● ● gnomAD MAC ≥ 1 MAC ≥ 2 MAC ≥ 5 MAF ≥ 5% ●●●●●● ● ●● ●● ● ● ●● ● ● ● ● ● ● ● ● ● ● ● ●●● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● ● * 1e−4 1e−3 0.01 0.1 0.5 0.000.020.040.060.08 Minor Allele Frequency Cumulativeh2 Hernandez, Uricchio, Hartman, Ye, Dahl, Zaitlen. bioRxiv (2017)
  • 27. Summary •We still know little about the genetic basis of most complex traits •Our evolutionary modeling gives insights into what the patterns of heritability should look like •Analyzing heritability patterns across ~10k gene expression patterns demonstrates a major impact of rare variants across most genes. •But our power to detect causal loci using existing tests is very poor.