SlideShare a Scribd company logo
1 of 32
Back to Basics:
Genome-Wide
Association Studies
December 11, 2013
Bryce Christensen
Director of Services
Use the Questions pane in
your GoToWebinar window
Questions during
the presentation
Golden Helix
Leaders in Genetic Analytics
 Founded in 1998
 Multi-disciplinary: computer science,
bioinformatics, statistics, genetics
 Software and analytic services
About Golden Helix
Core
Features
Packages
Core Features
 Powerful Data Management
 Rich Visualizations
 Robust Statistics
 Flexible
 Easy-to-use
Applications
 Genotype Analysis
 DNA sequence analysis
 CNV Analysis
 RNA-seq differential
expression
 Family Based Association
SNP & Variation Suite (SVS)
Approximate Agenda
Association Testing
Q&A
2
3
4
What about Imputation?
GWAS QC Considerations1
Previous GWAS Webcasts Online
[Demonstration]
Genetic Epidemiology 34:591-602 (2010)
[Demonstration]
Autosomal MAF distribution on Affy 500k chip
 Most GWAS chips are designed to capture global variation
 Homogeneous cohorts will only be polymorphic for some subset of SNPs.
Autosomal MAF distribution for Illumina Omni-2.5
 Higher-density chips have more rare content, so smaller relative
proportion of SNPs will be polymorphic
 Diminishing returns with increased density
X Heterozygosity Variability by Chip
 Chip design can affect
distribution of many
statistics, including X
heterozygosity
 Targeted chips may
have minimal
polymorphic content
on X
 Adjust workflows
accordingly
- Ex: Filter on MAF before
running gender inference
Why Care about Chip Design and Content?
 Many sample statistics are based on allele frequencies, and behave
differently from chip to chip
- IBD testing
- Principal Components
- Autosomal heterozygosity rates
- Runs of homozygosity
 Many of those statistics also assume that you are using a “GWAS” chip
with uniform coverage, and may be confounded when using chips with
targeted or non-uniform coverage content
- Exome chips
- ImmunoChip
- Cardio-MetaboChip
 Adjust workflows accordingly!
- Use different MAF thresholds with targeted chips
- Filter to polymorphic SNPs and prune for LD before running IBD or PCA
[Demonstration]
Original Slides by
Greta Linse Peterson, Senior
Statistician
Mixed Models: How
to Effectively
Account for
Inbreeding and
Population Structure
in GWAS
A brief background of GWAS
 First the naïve approaches:
Trend Tests, Contingency
Tables, Linear/Logistic
Regression
 Batch Effects, Population
Structure and sharing of controls
may violate assumptions of the
naïve approaches and result in
confounding of results.
 Stratification effects are more
pronounced with larger sample
sizes.
 Non-independence of samples is
especially problematic in
agrigenomic applications.
The Real Problem
 Vilhjalmsson and Nordborg (2013)
argue that “population structure”
itself is not a problem for GWAS.
 The real problems are the
environment and the genetic
background of a population.
- PCA can serve as a proxy for both, but
doesn’t entirely explain either.
 The solution is to account for the
relatedness between all pairs of
samples in a mixed linear model.
Mixed Model Method Overview
 Calculate kinship matrix
defining pairwise relationships
between all sample pairs.
 Include kinship matrix as
random effect in MLM
regression.
 May also include PCs and
other factors as fixed effects.
 Allows for population-based
and family-based cohorts to be
analyzed together.
Applied Example: WTCCC-2 Multiple Sclerosis
 About 10k cases and 17k controls
from world-wide Caucasian
populations
 Naïve GWAS: λ=2.48
 PCA adjusted: λ=1.21
 Stratified analysis in ancestry-
matched subgroups, with results
combined in meta analysis: λ=1.44
 MLM approach: λ=1.04!
Methods Implemented in SVS
 Regression with PCA Correction
- Accounts for the relationship between samples with Principal Components
- Need to know how many components to correct for
 EMMAX
- Adjusts for the pair-wise relationship between all samples using a kinship matrix
- Approximates the variance components and uses the same variance for all probes
- Tests a single locus at a time
 MLMM
- Adjusts for the pair-wise relationship between all samples using a kinship matrix
- Approximates the variance components and uses the same variance for all probes, but re-
computes at every step
- Stepwise EMMAX, assumes multiple loci are associated with the phenotype
 GBLUP
- Adjusts for the pair-wise relationship between all samples using a kinship matrix
- Computes allele substitution effects to determine best genomic predictors of the phenotype
[Demonstration]
Standard GWAS is based on tag-SNPs
 We typically test for the relationship between A and C, assuming that B
probably won’t be on the array.
 BUT: Correlation is not transitive.
- If A is correlated with B, and B with C, A is not necessarily correlated with C.
 Is that a problem?
 What does it mean for imputation?
A. Observed
GWAS SNP
B. Untyped
Causal SNP
C. Disease
Outcome
Imputation Implications
A. Observed
GWAS SNP
B. Untyped
Causal SNP
D. Additional GWAS
SNPs in LD with B.
 Imputation accuracy is usually improved when several GWAS SNPs
contribute to the imputed genotype of a given variant.
 Testing disease association with accurately imputed variants is the best
available alternative to sequencing, and much cheaper.
 As always: Carefully follow up on any significant results!
Imputation: What to Watch For
 Accuracy metrics from imputation software
 Always look for inter-cohort differences
 Example: Beagle’s Allelic R2 stat. Look at the allelic dosage histograms:
Imputation: What to Watch For
 Imputed allele frequencies different from reference panel frequencies
- Especially when common alleles are imputed with 0 frequency.
 Watch for inter-cohort differences.
rs429358 Significant But Not Assayed By 500k
Not on
500k array
Would 1000 Genomes imputation have found it?
Alzheimer’s 1kG Imputation Results
Page  32
Genotype Imputation: Why?
 Fill in the blanks—improve SNP call rate in GWAS
- This is where imputation started
 Increase density of genotype calls
- Define and/or refine the search space for identifying candidate causal variants around
GWAS signals
 Harmonize different array platforms for mega-analysis or meta-analysis
- Additional power to be gained from increased sample sizes
- Very common in disease consortia
 Identify new associations not observed in GWAS?
- Rare, but possible to identify a new locus
- Remember: our reference panels are usually made up of healthy people…
SVS and Imputation
 SVS does not have an imputation algorithm
 Add-on functions available to read and write file formats used by Beagle,
MACH/Minimac and Impute2.
 SVS supports analysis of imputed genotypes, including allelic dosage
formats
Review
 GWAS is not dead
 Golden Helix SVS is a powerful platform for GWAS analysis
- Data management
- Quality Assurance
- Visualization
- Association Testing
- LD & Haplotype Analysis
 New analysis methods like mixed model regression continue to improve
GWAS quality
 Imputation is very powerful, but has limitations
 Look for new GWAS features in SVS 8.1!
Questions or
more info:
 Email
info@goldenhelix.com
 Request an evaluation of
the software at
www.goldenhelix.com
 Check out our abstract
competition!
Questions?
Use the Questions pane in
your GoToWebinar window

More Related Content

What's hot

Association mapping
Association mapping Association mapping
Association mapping Preeti Kapoor
 
Association mapping
Association mappingAssociation mapping
Association mappingNivethitha T
 
Genotyping by Sequencing
Genotyping by SequencingGenotyping by Sequencing
Genotyping by SequencingSenthil Natesan
 
Genomic selection, prediction models, GEBV values, genomic selection in plant...
Genomic selection, prediction models, GEBV values, genomic selection in plant...Genomic selection, prediction models, GEBV values, genomic selection in plant...
Genomic selection, prediction models, GEBV values, genomic selection in plant...Mahesh Biradar
 
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...Mapping and Applications of Linkage Disequilibrium and Association Mapping in...
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...FAO
 
Genomic Selection & Precision Phenotyping
Genomic Selection & Precision PhenotypingGenomic Selection & Precision Phenotyping
Genomic Selection & Precision PhenotypingCIMMYT
 
Association mapping approaches for tagging quality traits in maize
Association mapping approaches for tagging quality traits in maizeAssociation mapping approaches for tagging quality traits in maize
Association mapping approaches for tagging quality traits in maizeSenthil Natesan
 
Fine QTL Mapping- A step towards Marker Assisted Selection (II)
Fine QTL Mapping- A step towards Marker Assisted Selection  (II)Fine QTL Mapping- A step towards Marker Assisted Selection  (II)
Fine QTL Mapping- A step towards Marker Assisted Selection (II)Mahesh Hampannavar
 
Quantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breedingQuantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breedingPGS
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomicsAmol Kunde
 
COMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.pptCOMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.pptSilpa87
 
Genomic selection for crop improvement
Genomic selection for crop improvementGenomic selection for crop improvement
Genomic selection for crop improvementnagamani gorantla
 
Genotyping by sequencing
Genotyping by sequencingGenotyping by sequencing
Genotyping by sequencingBhavya Sree
 

What's hot (20)

Association mapping
Association mappingAssociation mapping
Association mapping
 
Association mapping
Association mapping Association mapping
Association mapping
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Genotyping by Sequencing
Genotyping by SequencingGenotyping by Sequencing
Genotyping by Sequencing
 
Lecture 7 gwas full
Lecture 7 gwas fullLecture 7 gwas full
Lecture 7 gwas full
 
Genomic selection
Genomic  selectionGenomic  selection
Genomic selection
 
Genomic selection, prediction models, GEBV values, genomic selection in plant...
Genomic selection, prediction models, GEBV values, genomic selection in plant...Genomic selection, prediction models, GEBV values, genomic selection in plant...
Genomic selection, prediction models, GEBV values, genomic selection in plant...
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...Mapping and Applications of Linkage Disequilibrium and Association Mapping in...
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...
 
Genomic Selection & Precision Phenotyping
Genomic Selection & Precision PhenotypingGenomic Selection & Precision Phenotyping
Genomic Selection & Precision Phenotyping
 
Association mapping approaches for tagging quality traits in maize
Association mapping approaches for tagging quality traits in maizeAssociation mapping approaches for tagging quality traits in maize
Association mapping approaches for tagging quality traits in maize
 
SNp mining in crops
SNp mining in cropsSNp mining in crops
SNp mining in crops
 
Fine QTL Mapping- A step towards Marker Assisted Selection (II)
Fine QTL Mapping- A step towards Marker Assisted Selection  (II)Fine QTL Mapping- A step towards Marker Assisted Selection  (II)
Fine QTL Mapping- A step towards Marker Assisted Selection (II)
 
Quantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breedingQuantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breeding
 
Comparative genomics
Comparative genomicsComparative genomics
Comparative genomics
 
COMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.pptCOMPARATIVE GENOMICS.ppt
COMPARATIVE GENOMICS.ppt
 
Genomic selection for crop improvement
Genomic selection for crop improvementGenomic selection for crop improvement
Genomic selection for crop improvement
 
NGS - QC & Dataformat
NGS - QC & Dataformat NGS - QC & Dataformat
NGS - QC & Dataformat
 
Qtl mapping
 Qtl mapping  Qtl mapping
Qtl mapping
 
Genotyping by sequencing
Genotyping by sequencingGenotyping by sequencing
Genotyping by sequencing
 

Similar to Back to Basics: Using GWAS to Drive Discovery for Complex Diseases

Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...Golden Helix Inc
 
Genome wide Association studies.pptx
Genome wide Association studies.pptxGenome wide Association studies.pptx
Genome wide Association studies.pptxAkshitaAwasthi3
 
A Walk Through GWAS
A Walk Through GWASA Walk Through GWAS
A Walk Through GWASGolden Helix
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysismikaelhuss
 
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Golden Helix Inc
 
Use of SNP-HapMaps in plant breeding
Use of SNP-HapMaps in plant breeding Use of SNP-HapMaps in plant breeding
Use of SNP-HapMaps in plant breeding Anilkumar C
 
Integrating phylogenetic inference and metadata visualization for NGS data
Integrating phylogenetic inference and metadata visualization for NGS dataIntegrating phylogenetic inference and metadata visualization for NGS data
Integrating phylogenetic inference and metadata visualization for NGS dataJoão André Carriço
 
Getting More from GWAS
Getting More from GWASGetting More from GWAS
Getting More from GWASGolden Helix
 
Gene Selection for Patient Clustering by Gaussian Mixture Model
Gene Selection for Patient Clustering by Gaussian Mixture ModelGene Selection for Patient Clustering by Gaussian Mixture Model
Gene Selection for Patient Clustering by Gaussian Mixture ModelCSCJournals
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingJoaquin Dopazo
 
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...Superior Animal Genetics (SAG)
 
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.Varsha Gayatonde
 
Population-Based DNA Variant Analysis
Population-Based DNA Variant AnalysisPopulation-Based DNA Variant Analysis
Population-Based DNA Variant AnalysisGolden Helix
 
Softwares For Phylogentic Analysis
Softwares For Phylogentic AnalysisSoftwares For Phylogentic Analysis
Softwares For Phylogentic AnalysisPrasanthperceptron
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsDelaina Hawkins
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsGolden Helix Inc
 
High throughput Data Analysis
High throughput Data AnalysisHigh throughput Data Analysis
High throughput Data AnalysisSetia Pramana
 

Similar to Back to Basics: Using GWAS to Drive Discovery for Complex Diseases (20)

Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
 
presentation
presentationpresentation
presentation
 
Genome wide Association studies.pptx
Genome wide Association studies.pptxGenome wide Association studies.pptx
Genome wide Association studies.pptx
 
A Walk Through GWAS
A Walk Through GWASA Walk Through GWAS
A Walk Through GWAS
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysis
 
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
Big Data at Golden Helix: Scaling to Meet the Demand of Clinical and Research...
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Use of SNP-HapMaps in plant breeding
Use of SNP-HapMaps in plant breeding Use of SNP-HapMaps in plant breeding
Use of SNP-HapMaps in plant breeding
 
Integrating phylogenetic inference and metadata visualization for NGS data
Integrating phylogenetic inference and metadata visualization for NGS dataIntegrating phylogenetic inference and metadata visualization for NGS data
Integrating phylogenetic inference and metadata visualization for NGS data
 
Getting More from GWAS
Getting More from GWASGetting More from GWAS
Getting More from GWAS
 
Gene Selection for Patient Clustering by Gaussian Mixture Model
Gene Selection for Patient Clustering by Gaussian Mixture ModelGene Selection for Patient Clustering by Gaussian Mixture Model
Gene Selection for Patient Clustering by Gaussian Mixture Model
 
From reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene findingFrom reads to pathways for efficient disease gene finding
From reads to pathways for efficient disease gene finding
 
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
Potential for Genomic Selection in indigenous breeds and results of GWAS in G...
 
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
Genome wide association studies seminar Prepared by Ms Varsha Gaitonde.
 
Population-Based DNA Variant Analysis
Population-Based DNA Variant AnalysisPopulation-Based DNA Variant Analysis
Population-Based DNA Variant Analysis
 
EiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.DEiB Seminar from Antoni Miñarro, Ph.D
EiB Seminar from Antoni Miñarro, Ph.D
 
Softwares For Phylogentic Analysis
Softwares For Phylogentic AnalysisSoftwares For Phylogentic Analysis
Softwares For Phylogentic Analysis
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
 
Using VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research WorkflowsUsing VarSeq to Improve Variant Analysis Research Workflows
Using VarSeq to Improve Variant Analysis Research Workflows
 
High throughput Data Analysis
High throughput Data AnalysisHigh throughput Data Analysis
High throughput Data Analysis
 

More from Golden Helix Inc

Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Golden Helix Inc
 
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Golden Helix Inc
 
Authoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqAuthoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqGolden Helix Inc
 
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...Golden Helix Inc
 
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportTwo Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportGolden Helix Inc
 
The Molecular Sciences Made Personal
The Molecular Sciences Made PersonalThe Molecular Sciences Made Personal
The Molecular Sciences Made PersonalGolden Helix Inc
 
Prediction and Meta-Analysis
Prediction and Meta-AnalysisPrediction and Meta-Analysis
Prediction and Meta-AnalysisGolden Helix Inc
 
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqIntroducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqGolden Helix Inc
 
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...Golden Helix Inc
 
Cancer Workflows in VarSeq
Cancer Workflows in VarSeqCancer Workflows in VarSeq
Cancer Workflows in VarSeqGolden Helix Inc
 
Getting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGetting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGolden Helix Inc
 
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityPharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityGolden Helix Inc
 
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesUsing WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesGolden Helix Inc
 
Using Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineUsing Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineGolden Helix Inc
 
Investigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceInvestigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceGolden Helix Inc
 
Personalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingPersonalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingGolden Helix Inc
 

More from Golden Helix Inc (20)

Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
Pharmacological Induction of FoxO3 is a Potential Treatment for Sickle Cell D...
 
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
Uncovering novel candidate genes for pyridoxine-responsive epilepsy in a cons...
 
Authoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeqAuthoring Clinical Reports in VarSeq
Authoring Clinical Reports in VarSeq
 
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
SETBP1 as a novel candidate gene for neurodevelopmental disorders of speech a...
 
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical ReportTwo Clinical Workflows - From Unfiltered Variants to a Clinical Report
Two Clinical Workflows - From Unfiltered Variants to a Clinical Report
 
The Molecular Sciences Made Personal
The Molecular Sciences Made PersonalThe Molecular Sciences Made Personal
The Molecular Sciences Made Personal
 
Prediction and Meta-Analysis
Prediction and Meta-AnalysisPrediction and Meta-Analysis
Prediction and Meta-Analysis
 
A Walk Through GWAS
A Walk Through GWASA Walk Through GWAS
A Walk Through GWAS
 
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeqIntroducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
Introducing VSWarehouse - A Scalable Genetic Data Warehouse for VarSeq
 
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
MM-KBAC – Using Mixed Models to Adjust for Population Structure in a Rare-var...
 
Cancer Workflows in VarSeq
Cancer Workflows in VarSeqCancer Workflows in VarSeq
Cancer Workflows in VarSeq
 
Getting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User ExperienceGetting Started with VSWarehouse - The User Experience
Getting Started with VSWarehouse - The User Experience
 
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced CardiotoxicityPharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
Pharmacogenomic Prediction of Antracycline-induced Cardiotoxicity
 
Custom Family Workflows
Custom Family WorkflowsCustom Family Workflows
Custom Family Workflows
 
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy GenesUsing WES in Distant Relationships to Identify Cardiomyopathy Genes
Using WES in Distant Relationships to Identify Cardiomyopathy Genes
 
Using Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel PipelineUsing Clinical Reports as a part of a Gene Panel Pipeline
Using Clinical Reports as a part of a Gene Panel Pipeline
 
Investigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol DependenceInvestigating Shared Additive Genetic Variation for Alcohol Dependence
Investigating Shared Additive Genetic Variation for Alcohol Dependence
 
Personalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor SequencingPersonalized Medicine through Tumor Sequencing
Personalized Medicine through Tumor Sequencing
 
CNV Analysis in VarSeq
CNV Analysis in VarSeqCNV Analysis in VarSeq
CNV Analysis in VarSeq
 
Beagle Imputation in SVS
Beagle Imputation in SVSBeagle Imputation in SVS
Beagle Imputation in SVS
 

Recently uploaded

Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.aasikanpl
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRlizamodels9
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trssuser06f238
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |aasikanpl
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxyaramohamed343013
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzohaibmir069
 
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxBREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxPABOLU TEJASREE
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Nistarini College, Purulia (W.B) India
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPirithiRaju
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naJASISJULIANOELYNV
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPirithiRaju
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real timeSatoshi NAKAHIRA
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024AyushiRastogi48
 
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxTwin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxEran Akiva Sinbar
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptArshadWarsi13
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsssuserddc89b
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxmalonesandreagweneth
 

Recently uploaded (20)

Engler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomyEngler and Prantl system of classification in plant taxonomy
Engler and Prantl system of classification in plant taxonomy
 
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Munirka Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
Call Girls in Mayapuri Delhi 💯Call Us 🔝9953322196🔝 💯Escort.
 
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCRCall Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
Call Girls In Nihal Vihar Delhi ❤️8860477959 Looking Escorts In 24/7 Delhi NCR
 
Neurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 trNeurodevelopmental disorders according to the dsm 5 tr
Neurodevelopmental disorders according to the dsm 5 tr
 
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
Call Us ≽ 9953322196 ≼ Call Girls In Lajpat Nagar (Delhi) |
 
Scheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docxScheme-of-Work-Science-Stage-4 cambridge science.docx
Scheme-of-Work-Science-Stage-4 cambridge science.docx
 
zoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistanzoogeography of pakistan.pptx fauna of Pakistan
zoogeography of pakistan.pptx fauna of Pakistan
 
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptxBREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
BREEDING FOR RESISTANCE TO BIOTIC STRESS.pptx
 
Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...Bentham & Hooker's Classification. along with the merits and demerits of the ...
Bentham & Hooker's Classification. along with the merits and demerits of the ...
 
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdfPests of jatropha_Bionomics_identification_Dr.UPR.pdf
Pests of jatropha_Bionomics_identification_Dr.UPR.pdf
 
FREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by naFREE NURSING BUNDLE FOR NURSES.PDF by na
FREE NURSING BUNDLE FOR NURSES.PDF by na
 
Pests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdfPests of castor_Binomics_Identification_Dr.UPR.pdf
Pests of castor_Binomics_Identification_Dr.UPR.pdf
 
Grafana in space: Monitoring Japan's SLIM moon lander in real time
Grafana in space: Monitoring Japan's SLIM moon lander  in real timeGrafana in space: Monitoring Japan's SLIM moon lander  in real time
Grafana in space: Monitoring Japan's SLIM moon lander in real time
 
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort ServiceHot Sexy call girls in  Moti Nagar,🔝 9953056974 🔝 escort Service
Hot Sexy call girls in Moti Nagar,🔝 9953056974 🔝 escort Service
 
Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024Vision and reflection on Mining Software Repositories research in 2024
Vision and reflection on Mining Software Repositories research in 2024
 
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptxTwin's paradox experiment is a meassurement of the extra dimensions.pptx
Twin's paradox experiment is a meassurement of the extra dimensions.pptx
 
Transposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.pptTransposable elements in prokaryotes.ppt
Transposable elements in prokaryotes.ppt
 
TOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physicsTOPIC 8 Temperature and Heat.pdf physics
TOPIC 8 Temperature and Heat.pdf physics
 
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptxLIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
LIGHT-PHENOMENA-BY-CABUALDIONALDOPANOGANCADIENTE-CONDEZA (1).pptx
 

Back to Basics: Using GWAS to Drive Discovery for Complex Diseases

  • 1. Back to Basics: Genome-Wide Association Studies December 11, 2013 Bryce Christensen Director of Services
  • 2. Use the Questions pane in your GoToWebinar window Questions during the presentation
  • 3. Golden Helix Leaders in Genetic Analytics  Founded in 1998  Multi-disciplinary: computer science, bioinformatics, statistics, genetics  Software and analytic services About Golden Helix
  • 4. Core Features Packages Core Features  Powerful Data Management  Rich Visualizations  Robust Statistics  Flexible  Easy-to-use Applications  Genotype Analysis  DNA sequence analysis  CNV Analysis  RNA-seq differential expression  Family Based Association SNP & Variation Suite (SVS)
  • 5. Approximate Agenda Association Testing Q&A 2 3 4 What about Imputation? GWAS QC Considerations1
  • 10. Autosomal MAF distribution on Affy 500k chip  Most GWAS chips are designed to capture global variation  Homogeneous cohorts will only be polymorphic for some subset of SNPs.
  • 11. Autosomal MAF distribution for Illumina Omni-2.5  Higher-density chips have more rare content, so smaller relative proportion of SNPs will be polymorphic  Diminishing returns with increased density
  • 12. X Heterozygosity Variability by Chip  Chip design can affect distribution of many statistics, including X heterozygosity  Targeted chips may have minimal polymorphic content on X  Adjust workflows accordingly - Ex: Filter on MAF before running gender inference
  • 13. Why Care about Chip Design and Content?  Many sample statistics are based on allele frequencies, and behave differently from chip to chip - IBD testing - Principal Components - Autosomal heterozygosity rates - Runs of homozygosity  Many of those statistics also assume that you are using a “GWAS” chip with uniform coverage, and may be confounded when using chips with targeted or non-uniform coverage content - Exome chips - ImmunoChip - Cardio-MetaboChip  Adjust workflows accordingly! - Use different MAF thresholds with targeted chips - Filter to polymorphic SNPs and prune for LD before running IBD or PCA
  • 15. Original Slides by Greta Linse Peterson, Senior Statistician Mixed Models: How to Effectively Account for Inbreeding and Population Structure in GWAS
  • 16. A brief background of GWAS  First the naïve approaches: Trend Tests, Contingency Tables, Linear/Logistic Regression  Batch Effects, Population Structure and sharing of controls may violate assumptions of the naïve approaches and result in confounding of results.  Stratification effects are more pronounced with larger sample sizes.  Non-independence of samples is especially problematic in agrigenomic applications.
  • 17. The Real Problem  Vilhjalmsson and Nordborg (2013) argue that “population structure” itself is not a problem for GWAS.  The real problems are the environment and the genetic background of a population. - PCA can serve as a proxy for both, but doesn’t entirely explain either.  The solution is to account for the relatedness between all pairs of samples in a mixed linear model.
  • 18. Mixed Model Method Overview  Calculate kinship matrix defining pairwise relationships between all sample pairs.  Include kinship matrix as random effect in MLM regression.  May also include PCs and other factors as fixed effects.  Allows for population-based and family-based cohorts to be analyzed together.
  • 19. Applied Example: WTCCC-2 Multiple Sclerosis  About 10k cases and 17k controls from world-wide Caucasian populations  Naïve GWAS: λ=2.48  PCA adjusted: λ=1.21  Stratified analysis in ancestry- matched subgroups, with results combined in meta analysis: λ=1.44  MLM approach: λ=1.04!
  • 20. Methods Implemented in SVS  Regression with PCA Correction - Accounts for the relationship between samples with Principal Components - Need to know how many components to correct for  EMMAX - Adjusts for the pair-wise relationship between all samples using a kinship matrix - Approximates the variance components and uses the same variance for all probes - Tests a single locus at a time  MLMM - Adjusts for the pair-wise relationship between all samples using a kinship matrix - Approximates the variance components and uses the same variance for all probes, but re- computes at every step - Stepwise EMMAX, assumes multiple loci are associated with the phenotype  GBLUP - Adjusts for the pair-wise relationship between all samples using a kinship matrix - Computes allele substitution effects to determine best genomic predictors of the phenotype
  • 22. Standard GWAS is based on tag-SNPs  We typically test for the relationship between A and C, assuming that B probably won’t be on the array.  BUT: Correlation is not transitive. - If A is correlated with B, and B with C, A is not necessarily correlated with C.  Is that a problem?  What does it mean for imputation? A. Observed GWAS SNP B. Untyped Causal SNP C. Disease Outcome
  • 23. Imputation Implications A. Observed GWAS SNP B. Untyped Causal SNP D. Additional GWAS SNPs in LD with B.  Imputation accuracy is usually improved when several GWAS SNPs contribute to the imputed genotype of a given variant.  Testing disease association with accurately imputed variants is the best available alternative to sequencing, and much cheaper.  As always: Carefully follow up on any significant results!
  • 24. Imputation: What to Watch For  Accuracy metrics from imputation software  Always look for inter-cohort differences  Example: Beagle’s Allelic R2 stat. Look at the allelic dosage histograms:
  • 25. Imputation: What to Watch For  Imputed allele frequencies different from reference panel frequencies - Especially when common alleles are imputed with 0 frequency.  Watch for inter-cohort differences.
  • 26. rs429358 Significant But Not Assayed By 500k Not on 500k array Would 1000 Genomes imputation have found it?
  • 27. Alzheimer’s 1kG Imputation Results Page  32
  • 28. Genotype Imputation: Why?  Fill in the blanks—improve SNP call rate in GWAS - This is where imputation started  Increase density of genotype calls - Define and/or refine the search space for identifying candidate causal variants around GWAS signals  Harmonize different array platforms for mega-analysis or meta-analysis - Additional power to be gained from increased sample sizes - Very common in disease consortia  Identify new associations not observed in GWAS? - Rare, but possible to identify a new locus - Remember: our reference panels are usually made up of healthy people…
  • 29. SVS and Imputation  SVS does not have an imputation algorithm  Add-on functions available to read and write file formats used by Beagle, MACH/Minimac and Impute2.  SVS supports analysis of imputed genotypes, including allelic dosage formats
  • 30. Review  GWAS is not dead  Golden Helix SVS is a powerful platform for GWAS analysis - Data management - Quality Assurance - Visualization - Association Testing - LD & Haplotype Analysis  New analysis methods like mixed model regression continue to improve GWAS quality  Imputation is very powerful, but has limitations  Look for new GWAS features in SVS 8.1!
  • 31. Questions or more info:  Email info@goldenhelix.com  Request an evaluation of the software at www.goldenhelix.com  Check out our abstract competition!
  • 32. Questions? Use the Questions pane in your GoToWebinar window