SlideShare a Scribd company logo
1 of 60
Genetics of gene expression
Stephen Montgomery
smontgom@stanford.edu
montgomerylab.stanford.edu
Stanford University School of Medicine
Chromosome map of disease-associated regions
“GWAS have so far identified only a small fraction of the
heritability of common diseases, so the ability to make
meaningful predictions is still quite limited”
Francis Collins, Director of the NIH, Nature, April 2010
Trait Heritability Individuals studied Heritability explained
Coronary artery
disease
40% 86995 10%
Type 2 Diabetes 40% 47117 10%
BMI 50% 249796 3%
Blood pressure 50% 34433 1%
Circulating lipids 50% 100000 25%
Height 80% 183727 12.5%
Disease starts at a cellular level
Individual
Organs and tissues
Cells
DNA
Disease
Understanding the influence of genetics on cells will
improve our ability to predict disease risk
Genetic studies of
gene expression
A
SNP
Expression of nearby genes
Expression of distant genes
Cellular processes
Disease risk
Explore impact of genetic variation on transcriptome diversity
Gene splicing/isoforms
Canonical model
We can identify genetic variants impacting
gene expression (eQTLs)
Genetic association can pinpoint regulatory haplotypes
C
G
CC CG GG
Expression
Population Sample
The landscape of regulatory variation
Chr1 Chr2 Chr3...
Chr1
Chr2
Chr3
...
trans-effects
Location of genetic variants by the gene’s whose expression they impact
Transcription
factor
Can rapidly evaluate 1000s of quantitative traits
Can identify genetic regulatory networks
Can easily transform or perturb the system.
Variants are directly connected to cellular mechanism.
Advantages to studying the genetics
of gene expression
Montgomery et al, Nat Rev Genetics, 2011
Sharing of association implicates genes and type of effect
eQTL can aid in identifying
candidate genes for GWAS variants
Class activity: What are my asthma
variants doing?
In the subset of individuals for whom expression data are available,
the T nucleotide allele at rs7216389 (the marker most strongly associated with
disease in the combined GWA analysis) has a frequency of 62% amongst asthmatics
compared to 52% in non-asthmatics (P = 0.005 in this sample).
Moffatt, Nature, 2007
How are eQTL detected and reported?
Reported as the number of genes with significant
heritability, linkage or association compared to an FDR
Example 1:
“Of the total set of genes, 2,340 were found to be expressed, of which 31% had significant heritability
when a false-discovery rate of 0.05 was used.”
- Monks, AJHG, 75(6): 1094–1105. 2004
Example 2:
“Applying this genome-wide threshold to 3,554 scans we would expect only 3.5 genome scans to show
any linkage evidence with a P-value this extreme by chance. Instead we found 142 expression
phenotypes with evidence for linkage beyond the P-value threshold, and in some cases far beyond, so
we conclude that false-positive linkage findings are at most a small fraction of the significant results.”
- Morley, Nature, 430(7001): 743–747. 2004
Example 3:
“We detected 293, 274, 326 and 363 cis associations for CEU, CHB, JPT and YRI,
respectively, corresponding to 783 distinct genes and an FDR of 4–5%.”
- Stranger, Nat Genetics, 39, 1217–1224. 2007
eQTL definition depends on
false discovery reported
Permutation threshold
IMPORTANT: Understand the relationship
between false positive rate and eQTL
reported!
Discovery of eQTL depends on:
(A)Biological factors
(B) Technological factors
Biological factors influencing eQTL discovery
Trait biology
Ancestry
Environment
Dimas et al. Science, 2009
Stranger, PLoS Genetics, 2012
Biological factor: Cell or tissue type
How ubiquitous are eQTLs (and potential disease
mechanisms) in different tissues.
i.e. if I find an eQTL in fat will it be informative of
mechanism underlying disease risk for a disease based in
muscle.
Probably not
268 271 262
73 85 82
86 86 86
Cell type-specific and cell type-shared gene associations
(0.001 permutation threshold)
cell type
No.
of
cell
types
with
gene
association
69-80% of cis associations are
cell type-specific
Dimas et al Science 2009
50% specific (adipose and blood)
Emilsson et al Nature 2008
>50% specific (cortical tissue and
peripheral blood)
Heinzen et al PloS Biology 2008
However, all estimates depend on eQTL discovery FDR and method for assessing sharing
Class activity: What are my migraine
variants doing in different tissues?
We identified the minor allele of rs1835740 on chromosome 8q22.1 to be
associated with migraine (P = 5.38 × 10−9, odds ratio = 1.23, 95% CI 1.150–1.324)
in a genome-wide association study of 2,731 migraine cases ascertained from
three European headache clinics and 10,747 population-matched controls. In an
expression quantitative trait study in lymphoblastoid cell lines, transcript levels of
the MTDH were found to have a significant correlation to rs1835740 (P = 3.96 ×
10−5, permuted threshold for genome-wide significance 7.7 × 10−5).
Anttila, Nature Genetics, 2011
Many existing hypotheses could be in
potentially unrelated tissues
L: 377 eQTL genes T: 299 eQTL genes
p ≤ 0.01
F: 306 eQTL genes
Example of tissue-specific GWAS-eQTL sharing
Biological factor: Development and aging
Determining how genetic variation and genes interact over time
Less eQTLs in older worms
Recombinant inbred C.elegans
Viñuela A, Genome Research 20(7):929-37. 2010
Any ideas why?
Biological factor: Studied population
How ubiquitous are eQTLs (and potential disease
mechanism) in different populations.
i.e. if I find an eQTL in Europeans will it be informative
of mechanism underlying disease risk for a disease
found in Chinese.
“We have reported that many genes showing
cis associations at the 0.001 permutation
threshold are shared (about 37%) in at least
two populations … In 95–97% of the shared
associations, the direction of the allelic effect
was the same across populations, and the
discordant 3–5% was of the same order as the
FDR.”
Stranger et al, Nat Genetics, 2007
Not all eQTL shared across populations
If we know the etiology of a disease
can we predict its population frequency
from cellular models of that disease?
Stranger et al, PLoS Genetics, 2012
Class activity: What are my BMI variants
doing in different populations?
rs713586 explained
0.06% of BMI variance
Speliotes, Nature Genetics,
2010
Multiple population study designs
Zaitlen, AJHG, ; 86(1): 23–33. 2010
Multiple populations do well at mapping causal variants; however their design results in
a reduction of power
Admixed populations
• Challenges: Loss of power if local ancestry not
known or inflation in significance if frequency
differences are large and effect is trans-acting.
Eur: mean 3.0 Afr: mean 4.0
If mean expression invariant to genotype then
allele frequency differences will create false association
Solution: Add local ancestry as a covariate
Biological factor: Environment studies
Determining how eQTLs behave under stimulus
i.e. if I find an eQTL in resting state will it be
informative of mechanism underlying an
responsive state.
“We carried out large-scale induction experiments
using primary human bone cells derived from
unrelated donors of Swedish origin treated with 18
different stimuli (7 treatments and 2 controls, each
assessed at 2 time points). … We found that 93% of
cis-eQTLs at 1% FDR were observed in at least one
additional treatment, and in fact, on average, only
1.4% of the cis-eQTLs were considered as
treatment-specific at high confidence. “
- Grundberg PloS Genetics 7(1). 2011
Answer: GxE discoveries have been study dependent
LPS response eQTLs
Orozco et al, Cell, 2012
LPS, influenza, and interferon-β (IFN-β)
response-eQTLs
Approach reveals common alleles that explain interindividual variation in pathogen sensing and
provides functional annotation for genetic variants that alter susceptibility to inflammatory diseases.
Lee, Science, 2014
Gene expression technology
PCR-based, array-based, sequencing-based
Genotyping technology
array-based, sequencing-based
Sample size
More individuals and/or families yields more power to detect
association with particular effect sizes. (Lowers FDR). Early studies used 18-30
families or 45-60 unrelated individuals.
Discovery of eQTL depends on
technological factors
The biases we don’t know about:
Hidden factors can cause false associations
• Hidden technical and biological variables. i.e.
population, sex, date of processing
• However, correcting these factors can remove
true signals (i.e. master regulators)
Methods to correct hidden factors
• Factor analysis on 40 global factors has tripled eQTL
discovery.
• Surrogate variable analysis, has increased by 20% eQTL
discovery
- Stegle, PLoS Computational Biology, 2010
- Leek, PLoS Genetics, 2007
eQTL data can open up new biology
through reverse genetic approaches
• Without traits and disease we can find
variants influencing expression level.
• We can speculate and investigate what these
effects might do.
Class activity: What are my TCF3
variants doing
RNA-Seq
ChIP-Seq
McDaniell, Science 2010
Montgomery, Nature 2010
Pickrell, Nature 2010
Next generation sequencing has increased
our ability to survey the transcriptome.
What is RNA-seq
High-throughput sequencing of cDNA to
understand/quantify a sample’s gene expression profile
Output: millions of short, single or paired-end sequences (reads)
Genetics of gene expression
using RNA-Seq
Sequencing read
Quantification Alternative splicing
Hybrid transcripts
Fusion genes Transcript termination
Unannotated structure
Allele-specific expression,
Escape from X-inactivation
RNA Editing
..GGGUAGGA..
..GGGCAGGA..
..GGGCAGGA..
..GGGTAGGA..
..GGGU.. x50
..GGGC.. x25
.GAG
.TAG GTC..
GTG..
.UAG... x25
.GAG... x50
.GAG
.TAG GTC..
GTG..
.UAG... x25
.GAG... x50
..GGGCAGGA..
..GGGTAGGA..
..GGGCAGGA..
..GGGU
Exons
Transcripts*
Genes*
Increased resolution of transcriptome through RNA- sequencing
RNA-sequencing in 60 Europeans (HapMap genotypes; LCLs)
Found 2x more expression Quantitative Trait Loci (eQTLs) and...
Exon-eQTLs UTR Length-QTLs Splicing eQTLs
Rare eQTLs with allele specific expression-based approaches
RNA-seq provides resolution of more QTLs
Class activity: rs10954213 creates a
functional polyadenylation site
Cunninghame Graham et al., HMG, 2007
The A allele of rs10954213 creates a
functional polyadenylation site
and the A genotype correlates with
increased expression of a
transcript variant containing a shorter 3′-
UTR. Expression levels of
transcript variants with the shorter or
longer 3′-UTRs are inversely correlated.
Our data support a new mechanism by
which an IRF5 polymorphism controls
the expression of alternate transcript
variants which may have
different effects on interferon signaling
Splicing eQTL
Can investigate relative transcript ratios or reads across junctions.
• Splicing also affected for many genes
sQTLs
cis eQTLs
6738
10914
1158
2851
Number
eQTLs
200 400 600 800 1000
Number individuals
Battle et al, Genome Research, 2014
Katz et al, Nature Methods, 2010
Advantages of ASE
• Test within an individual allelic imbalance,
given one has sufficient reads.
Using ASE to detect GWAS signals driven by
multiple causal variants
Lucia Conde et al, AJHG, 2013
ABUNDANT ASE
FOR HETS
ASE
LACK OF ASE
FOR HOMS
LACK OF ASE
FOR HOMS
GWAS variant genotype
Tests functional differences between alleles in population
POOL OF INDIVIDUALS
ASE
ASE
NO ASE
NO ASE
NO ASE
NO ASE
ASE can be used to map causal variants
/ -
/ -
/
- / -
- / -
- / -
Putative regulatory SNP
Montgomery et al, PLoS Genetics, 2011
Putative regulatory SNPs
are enriched around TSS
Distance to TSS
%
predictions
prSNPs that are also eQTL are enriched in
functional annotations
Tuuli Lappalainen et al., Nature, 2013
Intersection of ASE-QTL and eQTL is more likely to localize a causal variant
18.2% (1502 of 8233) Dimas, 2008
46.2% nonsynonymous sites where ASE can be detected are
significant in 1 indiv.
Abundant epistasis between regulatory
and protein coding variation
Montgomery et al., PLoS Genetics, 2011
Lappalainen et al., AJHG, 2011
Compound inheritance of regulatory and
coding polymorphism causes disease
The exon-junction complex (EJC) performs essential RNA processing tasks1–5.
Here, we describe the first human disorder, thrombocytopenia with absent radii
(TAR)6, caused by deficiency in one of the four EJC subunits.
The thrombocytopenia with absent radii (TAR) syndrome is characterized by a
reduction in the number of platelets (the cells that make blood clot)
Albers, Nature Genetics, 2012
Ten human tissues were collected post-
mortem from a healthy 25-year-old Chinese
male. RNA-Seq was performed on the ten
tissues to quantify gene expression. Exome-
Seq was performed on two tissues (bolded) to
ascertain the heterozygous sites in the
genome.
1/3 of all target variants have significant ASE
Allelic
heterogeneity
of rare
deleterious
protein-
coding
variation
Kukurba, PLoS Genetics, May 2014
Challenges with calculating allele-specific
expression from RNA-Seq data
Mapping Depth
Jacob Degner et al., Bioinformatics, 2009
52.2% of expressed sites ≤ 30 reads
Increasing the resolution of ASE effects in an
individual and across tissues (mmPCR-seq)
Allelic expression for 48 samples for 960 loci per chip in ~2-3 days
Collab with Billy Li - Rui Zhang et al, Nat Methods, 2014
Application of mmPCR-Seq to rare
deleterious and
loss-of-function alleles
• Selected all complete stop-gain sites (50 sites)
• Selected all private and predicted deleterious
and damaging nsSNPs (74 sites).
• Control sites (160)
Identification of tissue-specific ASE in
genes with rare, deleterious nsSNPs
~33% of genes show significant patterns of ASE
How will gene expression influence
decisions in the clinic?
Build cellular models of disease
Survey diagnostic responses to treatments
Identify diverse disease mechanisms; move us beyond protein coding
mutations alone
Identify pathological tissues
Allow us to identify effects (or transferability) in different populations
Classify undiagnosed conditions
Cost-effective
“The field will transition from doing primarily association
work to figuring out what implicated variants do
biologically.”
David Goldstein, Director of the Center for Human Genome
Variation, Duke University, Nature, Feb 2012
Further recommended reading:
1) Genome, epigenome and RNA sequences of monozygotic twins discordant for multiple
sclerosis (2010, Nature)
2) 9p21 DNA variants associated with coronary artery disease impair interferon-γ
signalling response (2011, Nature)
montgomerylab.stanford.edu

More Related Content

What's hot

Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Chirag Patel
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Chirag Patel
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slidesChirag Patel
 
Transcriptional signaling pathways inversely regulated in alzheimer's disease...
Transcriptional signaling pathways inversely regulated in alzheimer's disease...Transcriptional signaling pathways inversely regulated in alzheimer's disease...
Transcriptional signaling pathways inversely regulated in alzheimer's disease...Elsa von Licy
 
Big data exposome and pediatric outcomes
Big data exposome and pediatric outcomesBig data exposome and pediatric outcomes
Big data exposome and pediatric outcomesChirag Patel
 
EWAS and the exposome: Mt Sinai in Brescia 052119
EWAS and the exposome: Mt Sinai in Brescia 052119EWAS and the exposome: Mt Sinai in Brescia 052119
EWAS and the exposome: Mt Sinai in Brescia 052119Chirag Patel
 
10.1164@rccm.201701 0053 ed
10.1164@rccm.201701 0053 ed10.1164@rccm.201701 0053 ed
10.1164@rccm.201701 0053 edJulio A. Diaz M.
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataChirag Patel
 
A common rejection module (CRM) for acute rejection across multiple organs
A common rejection module (CRM) for acute rejection across multiple organsA common rejection module (CRM) for acute rejection across multiple organs
A common rejection module (CRM) for acute rejection across multiple organsKevin Jaglinski
 
Analisis de la expresion de genes en la depresion
Analisis de la expresion de genes en la depresionAnalisis de la expresion de genes en la depresion
Analisis de la expresion de genes en la depresionCinthya Yessenia
 
Building a search engine for exposures in disease
Building a search engine for exposures in disease Building a search engine for exposures in disease
Building a search engine for exposures in disease Chirag Patel
 
Opinion_FOXP3_Frontiers in Genetics
Opinion_FOXP3_Frontiers in GeneticsOpinion_FOXP3_Frontiers in Genetics
Opinion_FOXP3_Frontiers in GeneticsSurekha Tippisetty
 
BRN Seminar 12/06/14 Introduction to Network Medicine
BRN Seminar 12/06/14 Introduction to Network Medicine BRN Seminar 12/06/14 Introduction to Network Medicine
BRN Seminar 12/06/14 Introduction to Network Medicine brnmomentum
 
Introduction to Network Medicine
Introduction to Network MedicineIntroduction to Network Medicine
Introduction to Network Medicinebrnbarcelona
 

What's hot (20)

Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416Bioinformatics Strategies for Exposome 100416
Bioinformatics Strategies for Exposome 100416
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slides
 
Transcriptional signaling pathways inversely regulated in alzheimer's disease...
Transcriptional signaling pathways inversely regulated in alzheimer's disease...Transcriptional signaling pathways inversely regulated in alzheimer's disease...
Transcriptional signaling pathways inversely regulated in alzheimer's disease...
 
Big data exposome and pediatric outcomes
Big data exposome and pediatric outcomesBig data exposome and pediatric outcomes
Big data exposome and pediatric outcomes
 
EWAS and the exposome: Mt Sinai in Brescia 052119
EWAS and the exposome: Mt Sinai in Brescia 052119EWAS and the exposome: Mt Sinai in Brescia 052119
EWAS and the exposome: Mt Sinai in Brescia 052119
 
Osmf rnk
Osmf rnkOsmf rnk
Osmf rnk
 
10.1164@rccm.201701 0053 ed
10.1164@rccm.201701 0053 ed10.1164@rccm.201701 0053 ed
10.1164@rccm.201701 0053 ed
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
A common rejection module (CRM) for acute rejection across multiple organs
A common rejection module (CRM) for acute rejection across multiple organsA common rejection module (CRM) for acute rejection across multiple organs
A common rejection module (CRM) for acute rejection across multiple organs
 
APOE Poster
APOE PosterAPOE Poster
APOE Poster
 
2710801
27108012710801
2710801
 
PMED Opening Workshop - Regrowth Rates of Tumors after Radiation Vary Dependi...
PMED Opening Workshop - Regrowth Rates of Tumors after Radiation Vary Dependi...PMED Opening Workshop - Regrowth Rates of Tumors after Radiation Vary Dependi...
PMED Opening Workshop - Regrowth Rates of Tumors after Radiation Vary Dependi...
 
Analisis de la expresion de genes en la depresion
Analisis de la expresion de genes en la depresionAnalisis de la expresion de genes en la depresion
Analisis de la expresion de genes en la depresion
 
Genomica Yquimiot
Genomica YquimiotGenomica Yquimiot
Genomica Yquimiot
 
Building a search engine for exposures in disease
Building a search engine for exposures in disease Building a search engine for exposures in disease
Building a search engine for exposures in disease
 
Opinion_FOXP3_Frontiers in Genetics
Opinion_FOXP3_Frontiers in GeneticsOpinion_FOXP3_Frontiers in Genetics
Opinion_FOXP3_Frontiers in Genetics
 
BRN Seminar 12/06/14 Introduction to Network Medicine
BRN Seminar 12/06/14 Introduction to Network Medicine BRN Seminar 12/06/14 Introduction to Network Medicine
BRN Seminar 12/06/14 Introduction to Network Medicine
 
Introduction to Network Medicine
Introduction to Network MedicineIntroduction to Network Medicine
Introduction to Network Medicine
 
MSc dissertation
MSc dissertationMSc dissertation
MSc dissertation
 

Similar to Montgomery expression

Current Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxCurrent Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxannettsparrow
 
Mark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disordersMark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disorderswef
 
Dr. Ángel Carracedo - Simposio Internacional 'La enfermedad de la duda: el TOC'
Dr. Ángel Carracedo - Simposio Internacional 'La enfermedad de la duda: el TOC'Dr. Ángel Carracedo - Simposio Internacional 'La enfermedad de la duda: el TOC'
Dr. Ángel Carracedo - Simposio Internacional 'La enfermedad de la duda: el TOC'Fundación Ramón Areces
 
Simulating Genes in Genome-wide Association Studies
Simulating Genes in Genome-wide Association StudiesSimulating Genes in Genome-wide Association Studies
Simulating Genes in Genome-wide Association StudiesKevin Thornton
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationJoaquin Dopazo
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingJoaquin Dopazo
 
Differential connectivity in neoplastic coexpression networks (BITS2014, Rome)
Differential connectivity in neoplastic coexpression networks (BITS2014, Rome)Differential connectivity in neoplastic coexpression networks (BITS2014, Rome)
Differential connectivity in neoplastic coexpression networks (BITS2014, Rome)Roberto Anglani
 
Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Chirag Patel
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm CrawfordSean Paul
 
Genetic regulation of human brain aging
Genetic regulation of human brain agingGenetic regulation of human brain aging
Genetic regulation of human brain agingAlzforum
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가? Hyung Jin Choi
 
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...OSUCCC - James
 
5. identificacion de genes
5. identificacion de genes5. identificacion de genes
5. identificacion de genesRafael Ospina
 
20100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_020100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_0Computer Science Club
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicJoaquin Dopazo
 
Hertweck AB3ACBS presentation
Hertweck AB3ACBS presentationHertweck AB3ACBS presentation
Hertweck AB3ACBS presentationKate Hertweck
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...Meningitis Research Foundation
 

Similar to Montgomery expression (20)

Schizophrenia
SchizophreniaSchizophrenia
Schizophrenia
 
Current Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxCurrent Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docx
 
Mark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disordersMark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disorders
 
Dr. Ángel Carracedo - Simposio Internacional 'La enfermedad de la duda: el TOC'
Dr. Ángel Carracedo - Simposio Internacional 'La enfermedad de la duda: el TOC'Dr. Ángel Carracedo - Simposio Internacional 'La enfermedad de la duda: el TOC'
Dr. Ángel Carracedo - Simposio Internacional 'La enfermedad de la duda: el TOC'
 
Simulating Genes in Genome-wide Association Studies
Simulating Genes in Genome-wide Association StudiesSimulating Genes in Genome-wide Association Studies
Simulating Genes in Genome-wide Association Studies
 
How to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical informationHow to transform genomic big data into valuable clinical information
How to transform genomic big data into valuable clinical information
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene finding
 
Differential connectivity in neoplastic coexpression networks (BITS2014, Rome)
Differential connectivity in neoplastic coexpression networks (BITS2014, Rome)Differential connectivity in neoplastic coexpression networks (BITS2014, Rome)
Differential connectivity in neoplastic coexpression networks (BITS2014, Rome)
 
Search engine for E NEU network science 080817
Search engine for E NEU network science 080817Search engine for E NEU network science 080817
Search engine for E NEU network science 080817
 
Isaac Kohane, "A Data Perspective on Autonomy, Human Rights, and the End of N...
Isaac Kohane, "A Data Perspective on Autonomy, Human Rights, and the End of N...Isaac Kohane, "A Data Perspective on Autonomy, Human Rights, and the End of N...
Isaac Kohane, "A Data Perspective on Autonomy, Human Rights, and the End of N...
 
Day2 145pm Crawford
Day2 145pm CrawfordDay2 145pm Crawford
Day2 145pm Crawford
 
Genetic regulation of human brain aging
Genetic regulation of human brain agingGenetic regulation of human brain aging
Genetic regulation of human brain aging
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?
 
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
 
5. identificacion de genes
5. identificacion de genes5. identificacion de genes
5. identificacion de genes
 
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
Pharmacology Powered by Computational Analysis: Predicting Cardiotoxicity of ...
 
20100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_020100509 bioinformatics kapushesky_lecture05_0
20100509 bioinformatics kapushesky_lecture05_0
 
A New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The ClinicA New Generation Of Mechanism-Based Biomarkers For The Clinic
A New Generation Of Mechanism-Based Biomarkers For The Clinic
 
Hertweck AB3ACBS presentation
Hertweck AB3ACBS presentationHertweck AB3ACBS presentation
Hertweck AB3ACBS presentation
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...
 

Recently uploaded

Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceSamikshaHamane
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPCeline George
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxthorishapillay1
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxAnupkumar Sharma
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomnelietumpap1
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationAadityaSharma884161
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Jisc
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxNirmalaLoungPoorunde1
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxLigayaBacuel1
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPCeline George
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designMIPLM
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxiammrhaywood
 

Recently uploaded (20)

Roles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in PharmacovigilanceRoles & Responsibilities in Pharmacovigilance
Roles & Responsibilities in Pharmacovigilance
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
How to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERPHow to do quick user assign in kanban in Odoo 17 ERP
How to do quick user assign in kanban in Odoo 17 ERP
 
Proudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptxProudly South Africa powerpoint Thorisha.pptx
Proudly South Africa powerpoint Thorisha.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptxMULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
MULTIDISCIPLINRY NATURE OF THE ENVIRONMENTAL STUDIES.pptx
 
ENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choomENGLISH6-Q4-W3.pptxqurter our high choom
ENGLISH6-Q4-W3.pptxqurter our high choom
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
ROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint PresentationROOT CAUSE ANALYSIS PowerPoint Presentation
ROOT CAUSE ANALYSIS PowerPoint Presentation
 
Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...Procuring digital preservation CAN be quick and painless with our new dynamic...
Procuring digital preservation CAN be quick and painless with our new dynamic...
 
Employee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptxEmployee wellbeing at the workplace.pptx
Employee wellbeing at the workplace.pptx
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Planning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptxPlanning a health career 4th Quarter.pptx
Planning a health career 4th Quarter.pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...OS-operating systems- ch04 (Threads) ...
OS-operating systems- ch04 (Threads) ...
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
What is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERPWhat is Model Inheritance in Odoo 17 ERP
What is Model Inheritance in Odoo 17 ERP
 
Keynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-designKeynote by Prof. Wurzer at Nordex about IP-design
Keynote by Prof. Wurzer at Nordex about IP-design
 
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdfTataKelola dan KamSiber Kecerdasan Buatan v022.pdf
TataKelola dan KamSiber Kecerdasan Buatan v022.pdf
 
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptxECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
ECONOMIC CONTEXT - PAPER 1 Q3: NEWSPAPERS.pptx
 

Montgomery expression

  • 1. Genetics of gene expression Stephen Montgomery smontgom@stanford.edu montgomerylab.stanford.edu Stanford University School of Medicine
  • 2. Chromosome map of disease-associated regions
  • 3. “GWAS have so far identified only a small fraction of the heritability of common diseases, so the ability to make meaningful predictions is still quite limited” Francis Collins, Director of the NIH, Nature, April 2010 Trait Heritability Individuals studied Heritability explained Coronary artery disease 40% 86995 10% Type 2 Diabetes 40% 47117 10% BMI 50% 249796 3% Blood pressure 50% 34433 1% Circulating lipids 50% 100000 25% Height 80% 183727 12.5%
  • 4. Disease starts at a cellular level Individual Organs and tissues Cells DNA Disease Understanding the influence of genetics on cells will improve our ability to predict disease risk
  • 5. Genetic studies of gene expression A SNP Expression of nearby genes Expression of distant genes Cellular processes Disease risk Explore impact of genetic variation on transcriptome diversity Gene splicing/isoforms
  • 7.
  • 8. We can identify genetic variants impacting gene expression (eQTLs) Genetic association can pinpoint regulatory haplotypes C G CC CG GG Expression Population Sample
  • 9. The landscape of regulatory variation Chr1 Chr2 Chr3... Chr1 Chr2 Chr3 ... trans-effects Location of genetic variants by the gene’s whose expression they impact Transcription factor
  • 10. Can rapidly evaluate 1000s of quantitative traits Can identify genetic regulatory networks Can easily transform or perturb the system. Variants are directly connected to cellular mechanism. Advantages to studying the genetics of gene expression
  • 11. Montgomery et al, Nat Rev Genetics, 2011 Sharing of association implicates genes and type of effect eQTL can aid in identifying candidate genes for GWAS variants
  • 12. Class activity: What are my asthma variants doing? In the subset of individuals for whom expression data are available, the T nucleotide allele at rs7216389 (the marker most strongly associated with disease in the combined GWA analysis) has a frequency of 62% amongst asthmatics compared to 52% in non-asthmatics (P = 0.005 in this sample). Moffatt, Nature, 2007
  • 13. How are eQTL detected and reported? Reported as the number of genes with significant heritability, linkage or association compared to an FDR Example 1: “Of the total set of genes, 2,340 were found to be expressed, of which 31% had significant heritability when a false-discovery rate of 0.05 was used.” - Monks, AJHG, 75(6): 1094–1105. 2004 Example 2: “Applying this genome-wide threshold to 3,554 scans we would expect only 3.5 genome scans to show any linkage evidence with a P-value this extreme by chance. Instead we found 142 expression phenotypes with evidence for linkage beyond the P-value threshold, and in some cases far beyond, so we conclude that false-positive linkage findings are at most a small fraction of the significant results.” - Morley, Nature, 430(7001): 743–747. 2004 Example 3: “We detected 293, 274, 326 and 363 cis associations for CEU, CHB, JPT and YRI, respectively, corresponding to 783 distinct genes and an FDR of 4–5%.” - Stranger, Nat Genetics, 39, 1217–1224. 2007
  • 14. eQTL definition depends on false discovery reported Permutation threshold IMPORTANT: Understand the relationship between false positive rate and eQTL reported!
  • 15. Discovery of eQTL depends on: (A)Biological factors (B) Technological factors
  • 16. Biological factors influencing eQTL discovery Trait biology Ancestry Environment Dimas et al. Science, 2009 Stranger, PLoS Genetics, 2012
  • 17. Biological factor: Cell or tissue type How ubiquitous are eQTLs (and potential disease mechanisms) in different tissues. i.e. if I find an eQTL in fat will it be informative of mechanism underlying disease risk for a disease based in muscle.
  • 18. Probably not 268 271 262 73 85 82 86 86 86 Cell type-specific and cell type-shared gene associations (0.001 permutation threshold) cell type No. of cell types with gene association 69-80% of cis associations are cell type-specific Dimas et al Science 2009 50% specific (adipose and blood) Emilsson et al Nature 2008 >50% specific (cortical tissue and peripheral blood) Heinzen et al PloS Biology 2008 However, all estimates depend on eQTL discovery FDR and method for assessing sharing
  • 19. Class activity: What are my migraine variants doing in different tissues? We identified the minor allele of rs1835740 on chromosome 8q22.1 to be associated with migraine (P = 5.38 × 10−9, odds ratio = 1.23, 95% CI 1.150–1.324) in a genome-wide association study of 2,731 migraine cases ascertained from three European headache clinics and 10,747 population-matched controls. In an expression quantitative trait study in lymphoblastoid cell lines, transcript levels of the MTDH were found to have a significant correlation to rs1835740 (P = 3.96 × 10−5, permuted threshold for genome-wide significance 7.7 × 10−5). Anttila, Nature Genetics, 2011
  • 20. Many existing hypotheses could be in potentially unrelated tissues
  • 21. L: 377 eQTL genes T: 299 eQTL genes p ≤ 0.01 F: 306 eQTL genes Example of tissue-specific GWAS-eQTL sharing
  • 22. Biological factor: Development and aging Determining how genetic variation and genes interact over time
  • 23.
  • 24. Less eQTLs in older worms Recombinant inbred C.elegans Viñuela A, Genome Research 20(7):929-37. 2010 Any ideas why?
  • 25. Biological factor: Studied population How ubiquitous are eQTLs (and potential disease mechanism) in different populations. i.e. if I find an eQTL in Europeans will it be informative of mechanism underlying disease risk for a disease found in Chinese.
  • 26. “We have reported that many genes showing cis associations at the 0.001 permutation threshold are shared (about 37%) in at least two populations … In 95–97% of the shared associations, the direction of the allelic effect was the same across populations, and the discordant 3–5% was of the same order as the FDR.” Stranger et al, Nat Genetics, 2007 Not all eQTL shared across populations If we know the etiology of a disease can we predict its population frequency from cellular models of that disease? Stranger et al, PLoS Genetics, 2012
  • 27. Class activity: What are my BMI variants doing in different populations? rs713586 explained 0.06% of BMI variance Speliotes, Nature Genetics, 2010
  • 28. Multiple population study designs Zaitlen, AJHG, ; 86(1): 23–33. 2010 Multiple populations do well at mapping causal variants; however their design results in a reduction of power
  • 29. Admixed populations • Challenges: Loss of power if local ancestry not known or inflation in significance if frequency differences are large and effect is trans-acting. Eur: mean 3.0 Afr: mean 4.0 If mean expression invariant to genotype then allele frequency differences will create false association Solution: Add local ancestry as a covariate
  • 30. Biological factor: Environment studies Determining how eQTLs behave under stimulus i.e. if I find an eQTL in resting state will it be informative of mechanism underlying an responsive state.
  • 31. “We carried out large-scale induction experiments using primary human bone cells derived from unrelated donors of Swedish origin treated with 18 different stimuli (7 treatments and 2 controls, each assessed at 2 time points). … We found that 93% of cis-eQTLs at 1% FDR were observed in at least one additional treatment, and in fact, on average, only 1.4% of the cis-eQTLs were considered as treatment-specific at high confidence. “ - Grundberg PloS Genetics 7(1). 2011 Answer: GxE discoveries have been study dependent
  • 32. LPS response eQTLs Orozco et al, Cell, 2012
  • 33. LPS, influenza, and interferon-β (IFN-β) response-eQTLs Approach reveals common alleles that explain interindividual variation in pathogen sensing and provides functional annotation for genetic variants that alter susceptibility to inflammatory diseases. Lee, Science, 2014
  • 34. Gene expression technology PCR-based, array-based, sequencing-based Genotyping technology array-based, sequencing-based Sample size More individuals and/or families yields more power to detect association with particular effect sizes. (Lowers FDR). Early studies used 18-30 families or 45-60 unrelated individuals. Discovery of eQTL depends on technological factors
  • 35. The biases we don’t know about: Hidden factors can cause false associations • Hidden technical and biological variables. i.e. population, sex, date of processing • However, correcting these factors can remove true signals (i.e. master regulators)
  • 36. Methods to correct hidden factors • Factor analysis on 40 global factors has tripled eQTL discovery. • Surrogate variable analysis, has increased by 20% eQTL discovery - Stegle, PLoS Computational Biology, 2010 - Leek, PLoS Genetics, 2007
  • 37. eQTL data can open up new biology through reverse genetic approaches • Without traits and disease we can find variants influencing expression level. • We can speculate and investigate what these effects might do.
  • 38. Class activity: What are my TCF3 variants doing
  • 39. RNA-Seq ChIP-Seq McDaniell, Science 2010 Montgomery, Nature 2010 Pickrell, Nature 2010 Next generation sequencing has increased our ability to survey the transcriptome.
  • 40. What is RNA-seq High-throughput sequencing of cDNA to understand/quantify a sample’s gene expression profile Output: millions of short, single or paired-end sequences (reads)
  • 41. Genetics of gene expression using RNA-Seq
  • 42. Sequencing read Quantification Alternative splicing Hybrid transcripts Fusion genes Transcript termination Unannotated structure Allele-specific expression, Escape from X-inactivation RNA Editing ..GGGUAGGA.. ..GGGCAGGA.. ..GGGCAGGA.. ..GGGTAGGA.. ..GGGU.. x50 ..GGGC.. x25 .GAG .TAG GTC.. GTG.. .UAG... x25 .GAG... x50 .GAG .TAG GTC.. GTG.. .UAG... x25 .GAG... x50 ..GGGCAGGA.. ..GGGTAGGA.. ..GGGCAGGA.. ..GGGU Exons Transcripts* Genes* Increased resolution of transcriptome through RNA- sequencing
  • 43. RNA-sequencing in 60 Europeans (HapMap genotypes; LCLs) Found 2x more expression Quantitative Trait Loci (eQTLs) and... Exon-eQTLs UTR Length-QTLs Splicing eQTLs Rare eQTLs with allele specific expression-based approaches RNA-seq provides resolution of more QTLs
  • 44. Class activity: rs10954213 creates a functional polyadenylation site Cunninghame Graham et al., HMG, 2007 The A allele of rs10954213 creates a functional polyadenylation site and the A genotype correlates with increased expression of a transcript variant containing a shorter 3′- UTR. Expression levels of transcript variants with the shorter or longer 3′-UTRs are inversely correlated. Our data support a new mechanism by which an IRF5 polymorphism controls the expression of alternate transcript variants which may have different effects on interferon signaling
  • 45. Splicing eQTL Can investigate relative transcript ratios or reads across junctions. • Splicing also affected for many genes sQTLs cis eQTLs 6738 10914 1158 2851 Number eQTLs 200 400 600 800 1000 Number individuals Battle et al, Genome Research, 2014 Katz et al, Nature Methods, 2010
  • 46. Advantages of ASE • Test within an individual allelic imbalance, given one has sufficient reads.
  • 47. Using ASE to detect GWAS signals driven by multiple causal variants Lucia Conde et al, AJHG, 2013 ABUNDANT ASE FOR HETS ASE LACK OF ASE FOR HOMS LACK OF ASE FOR HOMS GWAS variant genotype Tests functional differences between alleles in population
  • 48. POOL OF INDIVIDUALS ASE ASE NO ASE NO ASE NO ASE NO ASE ASE can be used to map causal variants / - / - / - / - - / - - / - Putative regulatory SNP Montgomery et al, PLoS Genetics, 2011
  • 49. Putative regulatory SNPs are enriched around TSS Distance to TSS % predictions
  • 50. prSNPs that are also eQTL are enriched in functional annotations Tuuli Lappalainen et al., Nature, 2013 Intersection of ASE-QTL and eQTL is more likely to localize a causal variant
  • 51. 18.2% (1502 of 8233) Dimas, 2008 46.2% nonsynonymous sites where ASE can be detected are significant in 1 indiv. Abundant epistasis between regulatory and protein coding variation Montgomery et al., PLoS Genetics, 2011 Lappalainen et al., AJHG, 2011
  • 52. Compound inheritance of regulatory and coding polymorphism causes disease The exon-junction complex (EJC) performs essential RNA processing tasks1–5. Here, we describe the first human disorder, thrombocytopenia with absent radii (TAR)6, caused by deficiency in one of the four EJC subunits. The thrombocytopenia with absent radii (TAR) syndrome is characterized by a reduction in the number of platelets (the cells that make blood clot) Albers, Nature Genetics, 2012
  • 53. Ten human tissues were collected post- mortem from a healthy 25-year-old Chinese male. RNA-Seq was performed on the ten tissues to quantify gene expression. Exome- Seq was performed on two tissues (bolded) to ascertain the heterozygous sites in the genome. 1/3 of all target variants have significant ASE Allelic heterogeneity of rare deleterious protein- coding variation Kukurba, PLoS Genetics, May 2014
  • 54. Challenges with calculating allele-specific expression from RNA-Seq data Mapping Depth Jacob Degner et al., Bioinformatics, 2009 52.2% of expressed sites ≤ 30 reads
  • 55. Increasing the resolution of ASE effects in an individual and across tissues (mmPCR-seq) Allelic expression for 48 samples for 960 loci per chip in ~2-3 days Collab with Billy Li - Rui Zhang et al, Nat Methods, 2014
  • 56. Application of mmPCR-Seq to rare deleterious and loss-of-function alleles • Selected all complete stop-gain sites (50 sites) • Selected all private and predicted deleterious and damaging nsSNPs (74 sites). • Control sites (160)
  • 57. Identification of tissue-specific ASE in genes with rare, deleterious nsSNPs ~33% of genes show significant patterns of ASE
  • 58. How will gene expression influence decisions in the clinic? Build cellular models of disease Survey diagnostic responses to treatments Identify diverse disease mechanisms; move us beyond protein coding mutations alone Identify pathological tissues Allow us to identify effects (or transferability) in different populations Classify undiagnosed conditions Cost-effective
  • 59. “The field will transition from doing primarily association work to figuring out what implicated variants do biologically.” David Goldstein, Director of the Center for Human Genome Variation, Duke University, Nature, Feb 2012
  • 60. Further recommended reading: 1) Genome, epigenome and RNA sequences of monozygotic twins discordant for multiple sclerosis (2010, Nature) 2) 9p21 DNA variants associated with coronary artery disease impair interferon-γ signalling response (2011, Nature) montgomerylab.stanford.edu