SlideShare a Scribd company logo
1 of 74
Association Analysis University of Louisville Center for Genetics and Molecular Medicine January 11, 2008 Dana Crawford, PhD Vanderbilt University Center for Human Genetics Research
Association Analysis Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Study Design Does your trait or phenotype have a genetic component? ,[object Object],[object Object],[object Object],[object Object],[object Object]
Classic Segregation Analysis ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Note:  More complex methods needed for complex traits
Recurrence Risks The chance that a disease present in the family will recur in that family “ Lightning striking twice” If recurrence risk is greater in the family compared with unrelated individuals,  the disease has a “genetic” component Suggests familial aggregation
Recurrence Risks Measured using the risk ratio ( λ ) Sibling risk ratio =  λ s Cystic fibrosis  λ s   = (0.25/0.0004) = 500 Huntington disease  λ s = (0.50/0.0001) = 5000 λ s =  sibling recurrence risk  population prevalence
Recurrence Risks:  Complex traits λ  h ere is for first degree relative Merikangas and Risch (2003) Science 302:599-601.
Heritability Think “twin studies” The proportion of phenotypic variation in a  population attributable to genetic variation Quantitative traits Heritability measured as h 2 (Can also be family studies)
Heritability and Quantitative Traits Determined by genes and environment Boys Girls Mexican  Americans Blacks Whites Mexican  Americans Blacks Whites Example: Height NHANES 1971-1974 versus NHANES 1999-2002 Freedman et al (2006) Obesity 14:301-308
Heritability and Quantitative Traits Trait variation = genetic + environment Genetic variation = additive + dominant Environmental variation =  familial/household + random/individual σ T 2  =  σ G 2  +  σ E 2 σ G 2  =  σ a 2  +  σ d 2 σ E 2  =  σ f 2  +  σ e 2 h B 2 =  σ G 2  /  σ T 2 Broad Sense heritability Narrow Sense heritability h N 2 =  σ a 2  /  σ T 2
Heritability and Twins Studies h 2  = 2(r MZ  – r DZ ), where r is the correlation coefficient Monozygotic = same genetic material = r ~ 100%  Dizygotic = half genetic material = r ~ 50%
Heritability and Twins Studies Trait r(MZ) r(DZ) Reference Cholesterol 0.76 0.39 Fenger et al SBP 0.60 0.32 Evans et al BMI 0.67 0.32 Schousboe et al Perceived pitch 0.67 0.44 Drayna et al
Heritability: Is everything genetic? Trait r(MZ) r(DZ) Reference Vote choice 0.81 0.69 Hatemi et al Religiousness 0.62 0.42 Koenig et al
Other Evidence For A  Genetic Component Monogenic disorders Example: Phenotype of interest is sensitivity to warfarin dosing, but there are no heritability estimates Solution: Rare, familial disorder of warfarin resistance
Other Evidence For A  Genetic Component Case Reports Example: Phenotype of interest is susceptibility to Neisseria meningitidis  (prevalence: 1/100,000) Solution: Case report of recurrent  N. meningitidis  in patient
Other Evidence For A  Genetic Component ,[object Object],[object Object],[object Object],[object Object],Other good arguments…
Study Design How well can you diagnose the disease or measure the trait? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Arguably most important step in study design
Target Phenotypes Disease or Quantitative trait? Carlson et al. (2004)  Nature  429:446-452 MI Note:  SNPs associated with quantitative traits  may not be associated with clinical endpoint CRP LDL-C IL6 LDLR Acute  Illness Diet
Study Design How many cases and controls will you need to detect  an association? ,[object Object],[object Object],[object Object],[object Object]
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Study Design What are the thresholds/variables in a general power calculation? Note:  Significance threshold for 1 SNP tested
Study Design ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Study Design How can you maximize power for your study? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Study Design Power calculation example: Cases:  Adverse reaction (wheezing) to flu vaccination Controls:  Vaccinated children with no adverse reactions Calculated using Quanto 1.1.1 MAF
Study Design Power calculation example: Immunogenicity to influenza A (H5N1) vaccine Calculated using Quanto 1.1.1
Study Design Why are you considering an association study instead of linkage? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Common variant/common disease hypothesis ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Study Design
Study Design Should you design a candidate gene or whole genome study? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Candidate gene association  studies ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Direct Candidate Gene Association  Study Genotype “functional” SNPs Example:  Nonsynonymous SNPs Collins et al (1997) Science 278:1580-1581
Direct Candidate Gene Association  Study Problem:  We don’t know what is functional and what is not functional Botstein and Risch (2003) Nat Genet 33 Suppl:228-37.
Direct Candidate Gene Association  Study What would we miss? Functional synonymous SNPs in  MDR1  alter P-glycoprotein activity Komar (2007) Science 315:466-467
Direct Candidate Gene Association  Study What would we miss? ,[object Object],[object Object],[object Object],[object Object]
Indirect Candidate Gene  Association Study ,[object Object],[object Object],[object Object],Kruglyak (2005) Nat Genet 37:1299-1300
Indirect Candidate Gene  Association Study Linkage disequilibrium (LD) r 2  = 0 SNPs are independent r 2  = 1 SNPs are perfectly correlated AND have the same minor allele frequency Measured by r 2 r 2  =  [f(A 1 B 1 ) – f(A 1 )f(B 1 )] 2   f(A 1 )f(A 2 )f(B 1 )f(B 2 )
Indirect Candidate Gene  Association Study Using LD to pick “tagSNPs” CRP European-descent 10 SNPs >5% MAF CRP European-descent 4 tagSNPs r 2 >0.80
Indirect Candidate Gene  Association Study “ tagSNPs” are population specific CRP European-descent 4 tagSNPs CRP African-descent 10 tagSNPs
Indirect Candidate Gene  Association Study ,[object Object],[object Object],[object Object],[object Object],http://gvs.gs.washington.edu/GVS/
Indirect Candidate Gene  Association Study Multiple testing ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Indirect Candidate Gene  Association Study:  Pros and Cons ,[object Object],[object Object],[object Object],[object Object],[object Object]
Whole Genome Association Study ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Illumina Infinium assay Affymetrix GeneChips
Whole Genome Association Study ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Manolio  et al.   Nature Reviews Genetics   7 , 812 – 820 (October 2006) Case/Control Study Designs For either candidate gene or whole genome
Study  Pros   Cons Case/Control   Easier to collect   Subject to bias   Less expensive   No risk estimates Case/Control Study Designs:  Pros and Cons Prospective   Risk estimates   Harder to collect   More expensive   Subject to bias For rare outcomes, case/control design  may be only option
Case/Control Study Designs:  Pros and Cons Types of bias ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Manolio  et al.   Nature Reviews Genetics   7 , 812–820 (October 2006) Often ignored in genetic association studies
Analysis Methods Genotype QC ,[object Object],[object Object],[object Object],[object Object],[object Object]
Analysis Methods What statistical methods do you use to analyze your data? ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Analysis Methods Odds ratio (OR) = ratio of odds of minor allele in  Cases (A/C) and Controls (B/D) OR (A*D)/(B*C) The Case/Control Study Case   Control Minor allele   A   B Major allele   C   D
For genotypes, set homozygous for major allele (A) as  “ referent” genotype, and calculate 2 odds ratios: Analysis Methods Case   Control Aa   A   B AA   C   D Case   Control aa   A   B AA   C   D
Analysis Methods Case/control: Interpretation of Odds Ratio 1.0 – Referent >1.0 – Greater odds of disease compared with controls <1.0 – Lesser odds of disease compared with controls Confidence Intervals:  probably contain true OR OR does  not  measure risk*
Prospective cohort ,[object Object],[object Object],[object Object],[object Object],Analysis Methods
Analysis Methods Prospective cohort Risk Ratio (RR) = Incidence of disease in Exposed  A/(A+B)  or  Unexposed C/(C+D) Case   Control Total Exposed   A   B (A+B) Unexposed   C   D (C+D)
Prospective Study: Interpretation of Risk Ratio 1.0 – Referent >1.0 – Risk for disease increases <1.0 – Risk for disease decreases Confidence Intervals:  probably contain true RR *For rare diseases, OR  ~ RR Analysis Methods
Case/control:  Matching Age Gender Race Warning:  Can “over match” and miss describing an interesting factor Bad Example: Cases: Adults with heart disease Controls: Newborns without heart disease Analysis Methods
Case/control:  Stratifying Age Gender Race Warning: Need sufficient sample size to  stratify or split the data into males and females Ex. Cases with heart disease Aged-matched controls without heart disease (Exposure: smoking status) Stratify for Gender Specific Risks  Analysis Methods
Problems in Case/Control genetic association studies – ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Cardon and Palmer (2003) Lancet 361:598-604 Analysis Methods
[object Object],[object Object],[object Object],[object Object],[object Object],Analysis Methods Regression
Analysis Methods ,[object Object],[object Object],[object Object],[object Object],[object Object],Regression
[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Analysis Methods Regression
Analysis Methods Coding Genotypes 0 0 0 GG 0 1 1 AG 1 2 1 AA Recessive Additive Dominant Genotype Genotype can be re-coded in any number  of ways for regression analysis
 
 
Example of gene-environment Interaction and traditional regression
Analysis Methods Statistical Packages for Genetic Association Studies ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Analysis Methods Whole genome in PLINK (pngu.mgh.harvard.edu/~purcell/plink/) Can adjust for population stratification Can add covariates P<5x10 -8 Genome-wide significance P=5x10 -8 Plenge et al 2007 NEJM MHC removed P<1x10 -100 P<2x10 -11
SNPs versus Haplotypes ,[object Object],[object Object],[object Object]
SNPs versus Haplotypes ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Statistical Packages for Genetic Association Studies with haplotypes
Analysis Methods ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
Statistical Replication CRP SNPs and CRP  levels in NHANES III Crawford et al  Circulation  2006; 114:2458-2465 Carlson et al.  AJHG  2005 ; 77:64-77 Results Consistent  with CARDIA
[object Object],[object Object],[object Object],[object Object],[object Object],Functional Replication
Functional Replication CRP Evolutionary Conservation ,[object Object],[object Object],[object Object]
Functional Replication Low CRP Levels Associated with H1-4 ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]
High CRP Levels Associated with H6 ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],[object Object],Functional Replication
CRP Promoter Luciferase Assay Carlson et al, AJHG v77 p64 Functional Replication
Association Analysis Outline ,[object Object],[object Object],[object Object],[object Object],[object Object],[object Object]

More Related Content

What's hot

Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.Varsha Gayatonde
 
Repurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in diseaseRepurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in diseaseChirag Patel
 
NSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopNSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopChirag Patel
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataChirag Patel
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slidesChirag Patel
 
Back to Basics: Using GWAS to Drive Discovery for Complex Diseases
Back to Basics: Using GWAS to Drive Discovery for Complex DiseasesBack to Basics: Using GWAS to Drive Discovery for Complex Diseases
Back to Basics: Using GWAS to Drive Discovery for Complex DiseasesGolden Helix Inc
 
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...John Blue
 
Large-scale biomedical data and text integration
Large-scale biomedical data and text integrationLarge-scale biomedical data and text integration
Large-scale biomedical data and text integrationLars Juhl Jensen
 
Death prompts a review of gene therapy vector
Death prompts a review of gene therapy vectorDeath prompts a review of gene therapy vector
Death prompts a review of gene therapy vectorLindsay Meyer
 
NCI systems epidemiology 03012019
NCI systems epidemiology 03012019NCI systems epidemiology 03012019
NCI systems epidemiology 03012019Chirag Patel
 
Japanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EJapanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EChirag Patel
 
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...ExternalEvents
 
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...Golden Helix Inc
 
Chirag patel unite for sight 041418
Chirag patel unite for sight 041418Chirag patel unite for sight 041418
Chirag patel unite for sight 041418Chirag Patel
 
Rare diseases in children and genetic diagnosis - part 1 [Today's paper]
Rare diseases in children and genetic diagnosis - part 1 [Today's paper]Rare diseases in children and genetic diagnosis - part 1 [Today's paper]
Rare diseases in children and genetic diagnosis - part 1 [Today's paper]HeonjongHan
 
Status and prospects of association mapping in crop plants
Status and prospects of association mapping in crop plantsStatus and prospects of association mapping in crop plants
Status and prospects of association mapping in crop plantsJyoti Prakash Sahoo
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesSean Ekins
 
Sensitivity Analysis of Population Genetic Metrics
Sensitivity Analysis of Population Genetic MetricsSensitivity Analysis of Population Genetic Metrics
Sensitivity Analysis of Population Genetic MetricsThiviya Nair
 

What's hot (20)

Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.
 
Repurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in diseaseRepurposing large datasets for exposomic discovery in disease
Repurposing large datasets for exposomic discovery in disease
 
NSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data WorkshopNSF Northeast Hub Big Data Workshop
NSF Northeast Hub Big Data Workshop
 
Methods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big dataMethods to enhance the validity of precision guidelines emerging from big data
Methods to enhance the validity of precision guidelines emerging from big data
 
Data analytics to support exposome research course slides
Data analytics to support exposome research course slidesData analytics to support exposome research course slides
Data analytics to support exposome research course slides
 
Back to Basics: Using GWAS to Drive Discovery for Complex Diseases
Back to Basics: Using GWAS to Drive Discovery for Complex DiseasesBack to Basics: Using GWAS to Drive Discovery for Complex Diseases
Back to Basics: Using GWAS to Drive Discovery for Complex Diseases
 
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
Dr. Andrea Wilson - New PRRS disease phenotypes as vaccine and genetic improv...
 
Large-scale biomedical data and text integration
Large-scale biomedical data and text integrationLarge-scale biomedical data and text integration
Large-scale biomedical data and text integration
 
Death prompts a review of gene therapy vector
Death prompts a review of gene therapy vectorDeath prompts a review of gene therapy vector
Death prompts a review of gene therapy vector
 
NCI systems epidemiology 03012019
NCI systems epidemiology 03012019NCI systems epidemiology 03012019
NCI systems epidemiology 03012019
 
Japanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven EJapanese Environmental Children's Study and Data-driven E
Japanese Environmental Children's Study and Data-driven E
 
Dr. Trivedi resume 2016
Dr. Trivedi  resume  2016Dr. Trivedi  resume  2016
Dr. Trivedi resume 2016
 
A Walk Through GWAS
A Walk Through GWASA Walk Through GWAS
A Walk Through GWAS
 
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
Real-Time Genome Sequencing of Resistant Bacteria Provides Precision Infectio...
 
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
Mixed Models: How to Effectively Account for Inbreeding and Population Struct...
 
Chirag patel unite for sight 041418
Chirag patel unite for sight 041418Chirag patel unite for sight 041418
Chirag patel unite for sight 041418
 
Rare diseases in children and genetic diagnosis - part 1 [Today's paper]
Rare diseases in children and genetic diagnosis - part 1 [Today's paper]Rare diseases in children and genetic diagnosis - part 1 [Today's paper]
Rare diseases in children and genetic diagnosis - part 1 [Today's paper]
 
Status and prospects of association mapping in crop plants
Status and prospects of association mapping in crop plantsStatus and prospects of association mapping in crop plants
Status and prospects of association mapping in crop plants
 
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan DiseasesUsing In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
Using In Silico Tools in Repurposing Drugs for Neglected and Orphan Diseases
 
Sensitivity Analysis of Population Genetic Metrics
Sensitivity Analysis of Population Genetic MetricsSensitivity Analysis of Population Genetic Metrics
Sensitivity Analysis of Population Genetic Metrics
 

Viewers also liked

Itelecommute Telecommuting or telework and My Lead Company
Itelecommute Telecommuting or telework and My Lead CompanyItelecommute Telecommuting or telework and My Lead Company
Itelecommute Telecommuting or telework and My Lead CompanyScorpion Power PC
 
How To Analyze A New Gene
How To Analyze A New GeneHow To Analyze A New Gene
How To Analyze A New GeneSean Paul
 

Viewers also liked (7)

2009 7 17
2009 7 172009 7 17
2009 7 17
 
6 55 E
6 55 E6 55 E
6 55 E
 
Reference 1
Reference 1Reference 1
Reference 1
 
The Why
The WhyThe Why
The Why
 
Itelecommute Telecommuting or telework and My Lead Company
Itelecommute Telecommuting or telework and My Lead CompanyItelecommute Telecommuting or telework and My Lead Company
Itelecommute Telecommuting or telework and My Lead Company
 
Snp
SnpSnp
Snp
 
How To Analyze A New Gene
How To Analyze A New GeneHow To Analyze A New Gene
How To Analyze A New Gene
 

Similar to Day2 145pm Crawford

Informatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discoveryInformatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discoveryChirag Patel
 
Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategiesAshfaq Ahmad
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Chirag Patel
 
Mark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disordersMark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disorderswef
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...Meningitis Research Foundation
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가? Hyung Jin Choi
 
Genetics In Psychiatry
Genetics In PsychiatryGenetics In Psychiatry
Genetics In PsychiatryFrank Meissner
 
Montgomery expression
Montgomery expressionMontgomery expression
Montgomery expressionmorenorossi
 
Manteia non confidential-presentation 2003-09
Manteia non confidential-presentation 2003-09Manteia non confidential-presentation 2003-09
Manteia non confidential-presentation 2003-09Pascal Mayer
 
Studying the elusive in larger scale
Studying the elusive in larger scaleStudying the elusive in larger scale
Studying the elusive in larger scaleChirag Patel
 
GENETICS IN PSYCHIATRY.pdf
GENETICS IN PSYCHIATRY.pdfGENETICS IN PSYCHIATRY.pdf
GENETICS IN PSYCHIATRY.pdfShanuSoni7
 
Multi-trait modeling in polygenic scores
Multi-trait modeling in polygenic scoresMulti-trait modeling in polygenic scores
Multi-trait modeling in polygenic scoresYosuke Tanigawa
 
Biomarkers for psychological phenotypes?
Biomarkers for psychological phenotypes?Biomarkers for psychological phenotypes?
Biomarkers for psychological phenotypes?Dorothy Bishop
 
Biomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresBiomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresChirag Patel
 
Phenotype-based Matching Using PhenoDB Terms in BHCMG PhenoDB to Maximize Who...
Phenotype-based Matching Using PhenoDB Terms in BHCMG PhenoDB to Maximize Who...Phenotype-based Matching Using PhenoDB Terms in BHCMG PhenoDB to Maximize Who...
Phenotype-based Matching Using PhenoDB Terms in BHCMG PhenoDB to Maximize Who...Human Variome Project
 
Current Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxCurrent Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxannettsparrow
 
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...OSUCCC - James
 

Similar to Day2 145pm Crawford (20)

Schizophrenia
SchizophreniaSchizophrenia
Schizophrenia
 
Informatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discoveryInformatics and data analytics to support for exposome-based discovery
Informatics and data analytics to support for exposome-based discovery
 
Gene hunting strategies
Gene hunting strategiesGene hunting strategies
Gene hunting strategies
 
Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701 Intro to Biomedical Informatics 701
Intro to Biomedical Informatics 701
 
Mark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disordersMark Daly - Finding risk genes in psychiatric disorders
Mark Daly - Finding risk genes in psychiatric disorders
 
The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...The emerging picture of host genetic control of susceptibility and outcome in...
The emerging picture of host genetic control of susceptibility and outcome in...
 
헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?헬스케어 빅데이터로 무엇을 할 수 있는가?
헬스케어 빅데이터로 무엇을 할 수 있는가?
 
Genetics in Psychiatry
Genetics in PsychiatryGenetics in Psychiatry
Genetics in Psychiatry
 
Genetics In Psychiatry
Genetics In PsychiatryGenetics In Psychiatry
Genetics In Psychiatry
 
Montgomery expression
Montgomery expressionMontgomery expression
Montgomery expression
 
Manteia non confidential-presentation 2003-09
Manteia non confidential-presentation 2003-09Manteia non confidential-presentation 2003-09
Manteia non confidential-presentation 2003-09
 
Studying the elusive in larger scale
Studying the elusive in larger scaleStudying the elusive in larger scale
Studying the elusive in larger scale
 
GENETICS IN PSYCHIATRY.pdf
GENETICS IN PSYCHIATRY.pdfGENETICS IN PSYCHIATRY.pdf
GENETICS IN PSYCHIATRY.pdf
 
Multi-trait modeling in polygenic scores
Multi-trait modeling in polygenic scoresMulti-trait modeling in polygenic scores
Multi-trait modeling in polygenic scores
 
Biomarkers for psychological phenotypes?
Biomarkers for psychological phenotypes?Biomarkers for psychological phenotypes?
Biomarkers for psychological phenotypes?
 
10 Liu, Dajiang
10 Liu, Dajiang10 Liu, Dajiang
10 Liu, Dajiang
 
Biomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposuresBiomedical Informatics 706: Precision Medicine with exposures
Biomedical Informatics 706: Precision Medicine with exposures
 
Phenotype-based Matching Using PhenoDB Terms in BHCMG PhenoDB to Maximize Who...
Phenotype-based Matching Using PhenoDB Terms in BHCMG PhenoDB to Maximize Who...Phenotype-based Matching Using PhenoDB Terms in BHCMG PhenoDB to Maximize Who...
Phenotype-based Matching Using PhenoDB Terms in BHCMG PhenoDB to Maximize Who...
 
Current Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docxCurrent Directions in PsychologicalScience2015, Vol. 24(4).docx
Current Directions in PsychologicalScience2015, Vol. 24(4).docx
 
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
Genetic predisposition to papillary thyroid cancer by Albert de la Chapelle, ...
 

Day2 145pm Crawford

  • 1. Association Analysis University of Louisville Center for Genetics and Molecular Medicine January 11, 2008 Dana Crawford, PhD Vanderbilt University Center for Human Genetics Research
  • 2.
  • 3.
  • 4.
  • 5. Recurrence Risks The chance that a disease present in the family will recur in that family “ Lightning striking twice” If recurrence risk is greater in the family compared with unrelated individuals, the disease has a “genetic” component Suggests familial aggregation
  • 6. Recurrence Risks Measured using the risk ratio ( λ ) Sibling risk ratio = λ s Cystic fibrosis λ s = (0.25/0.0004) = 500 Huntington disease λ s = (0.50/0.0001) = 5000 λ s = sibling recurrence risk population prevalence
  • 7. Recurrence Risks: Complex traits λ h ere is for first degree relative Merikangas and Risch (2003) Science 302:599-601.
  • 8. Heritability Think “twin studies” The proportion of phenotypic variation in a population attributable to genetic variation Quantitative traits Heritability measured as h 2 (Can also be family studies)
  • 9. Heritability and Quantitative Traits Determined by genes and environment Boys Girls Mexican Americans Blacks Whites Mexican Americans Blacks Whites Example: Height NHANES 1971-1974 versus NHANES 1999-2002 Freedman et al (2006) Obesity 14:301-308
  • 10. Heritability and Quantitative Traits Trait variation = genetic + environment Genetic variation = additive + dominant Environmental variation = familial/household + random/individual σ T 2 = σ G 2 + σ E 2 σ G 2 = σ a 2 + σ d 2 σ E 2 = σ f 2 + σ e 2 h B 2 = σ G 2 / σ T 2 Broad Sense heritability Narrow Sense heritability h N 2 = σ a 2 / σ T 2
  • 11. Heritability and Twins Studies h 2 = 2(r MZ – r DZ ), where r is the correlation coefficient Monozygotic = same genetic material = r ~ 100% Dizygotic = half genetic material = r ~ 50%
  • 12. Heritability and Twins Studies Trait r(MZ) r(DZ) Reference Cholesterol 0.76 0.39 Fenger et al SBP 0.60 0.32 Evans et al BMI 0.67 0.32 Schousboe et al Perceived pitch 0.67 0.44 Drayna et al
  • 13. Heritability: Is everything genetic? Trait r(MZ) r(DZ) Reference Vote choice 0.81 0.69 Hatemi et al Religiousness 0.62 0.42 Koenig et al
  • 14. Other Evidence For A Genetic Component Monogenic disorders Example: Phenotype of interest is sensitivity to warfarin dosing, but there are no heritability estimates Solution: Rare, familial disorder of warfarin resistance
  • 15. Other Evidence For A Genetic Component Case Reports Example: Phenotype of interest is susceptibility to Neisseria meningitidis (prevalence: 1/100,000) Solution: Case report of recurrent N. meningitidis in patient
  • 16.
  • 17.
  • 18. Target Phenotypes Disease or Quantitative trait? Carlson et al. (2004) Nature 429:446-452 MI Note: SNPs associated with quantitative traits may not be associated with clinical endpoint CRP LDL-C IL6 LDLR Acute Illness Diet
  • 19.
  • 20.
  • 21.
  • 22.
  • 23. Study Design Power calculation example: Cases: Adverse reaction (wheezing) to flu vaccination Controls: Vaccinated children with no adverse reactions Calculated using Quanto 1.1.1 MAF
  • 24. Study Design Power calculation example: Immunogenicity to influenza A (H5N1) vaccine Calculated using Quanto 1.1.1
  • 25.
  • 26.
  • 27.
  • 28.
  • 29. Direct Candidate Gene Association Study Genotype “functional” SNPs Example: Nonsynonymous SNPs Collins et al (1997) Science 278:1580-1581
  • 30. Direct Candidate Gene Association Study Problem: We don’t know what is functional and what is not functional Botstein and Risch (2003) Nat Genet 33 Suppl:228-37.
  • 31. Direct Candidate Gene Association Study What would we miss? Functional synonymous SNPs in MDR1 alter P-glycoprotein activity Komar (2007) Science 315:466-467
  • 32.
  • 33.
  • 34. Indirect Candidate Gene Association Study Linkage disequilibrium (LD) r 2 = 0 SNPs are independent r 2 = 1 SNPs are perfectly correlated AND have the same minor allele frequency Measured by r 2 r 2 = [f(A 1 B 1 ) – f(A 1 )f(B 1 )] 2 f(A 1 )f(A 2 )f(B 1 )f(B 2 )
  • 35. Indirect Candidate Gene Association Study Using LD to pick “tagSNPs” CRP European-descent 10 SNPs >5% MAF CRP European-descent 4 tagSNPs r 2 >0.80
  • 36. Indirect Candidate Gene Association Study “ tagSNPs” are population specific CRP European-descent 4 tagSNPs CRP African-descent 10 tagSNPs
  • 37.
  • 38.
  • 39.
  • 40.
  • 41.
  • 42. Manolio et al. Nature Reviews Genetics 7 , 812 – 820 (October 2006) Case/Control Study Designs For either candidate gene or whole genome
  • 43. Study Pros Cons Case/Control Easier to collect Subject to bias Less expensive No risk estimates Case/Control Study Designs: Pros and Cons Prospective Risk estimates Harder to collect More expensive Subject to bias For rare outcomes, case/control design may be only option
  • 44.
  • 45.
  • 46.
  • 47. Analysis Methods Odds ratio (OR) = ratio of odds of minor allele in Cases (A/C) and Controls (B/D) OR (A*D)/(B*C) The Case/Control Study Case Control Minor allele A B Major allele C D
  • 48. For genotypes, set homozygous for major allele (A) as “ referent” genotype, and calculate 2 odds ratios: Analysis Methods Case Control Aa A B AA C D Case Control aa A B AA C D
  • 49. Analysis Methods Case/control: Interpretation of Odds Ratio 1.0 – Referent >1.0 – Greater odds of disease compared with controls <1.0 – Lesser odds of disease compared with controls Confidence Intervals: probably contain true OR OR does not measure risk*
  • 50.
  • 51. Analysis Methods Prospective cohort Risk Ratio (RR) = Incidence of disease in Exposed A/(A+B) or Unexposed C/(C+D) Case Control Total Exposed A B (A+B) Unexposed C D (C+D)
  • 52. Prospective Study: Interpretation of Risk Ratio 1.0 – Referent >1.0 – Risk for disease increases <1.0 – Risk for disease decreases Confidence Intervals: probably contain true RR *For rare diseases, OR ~ RR Analysis Methods
  • 53. Case/control: Matching Age Gender Race Warning: Can “over match” and miss describing an interesting factor Bad Example: Cases: Adults with heart disease Controls: Newborns without heart disease Analysis Methods
  • 54. Case/control: Stratifying Age Gender Race Warning: Need sufficient sample size to stratify or split the data into males and females Ex. Cases with heart disease Aged-matched controls without heart disease (Exposure: smoking status) Stratify for Gender Specific Risks Analysis Methods
  • 55.
  • 56.
  • 57.
  • 58.
  • 59. Analysis Methods Coding Genotypes 0 0 0 GG 0 1 1 AG 1 2 1 AA Recessive Additive Dominant Genotype Genotype can be re-coded in any number of ways for regression analysis
  • 60.  
  • 61.  
  • 62. Example of gene-environment Interaction and traditional regression
  • 63.
  • 64. Analysis Methods Whole genome in PLINK (pngu.mgh.harvard.edu/~purcell/plink/) Can adjust for population stratification Can add covariates P<5x10 -8 Genome-wide significance P=5x10 -8 Plenge et al 2007 NEJM MHC removed P<1x10 -100 P<2x10 -11
  • 65.
  • 66.
  • 67.
  • 68. Statistical Replication CRP SNPs and CRP levels in NHANES III Crawford et al Circulation 2006; 114:2458-2465 Carlson et al. AJHG 2005 ; 77:64-77 Results Consistent with CARDIA
  • 69.
  • 70.
  • 71.
  • 72.
  • 73. CRP Promoter Luciferase Assay Carlson et al, AJHG v77 p64 Functional Replication
  • 74.