SlideShare a Scribd company logo
1 of 18
Introduction to different measures of linkage disequilibrium (LD) and their calculation Awais Khan, University of Illinois, Urbana-Champaign By Dr. M. Awais Khan University of Illinois, Urbana-Champaign
Awais Khan, University of Illinois, Urbana-Champaign To understand the calculation of linkage disequilibrium consider following example Suppose there are two genes on Chromosome 5 of apple, each with two alleles ACT G GTAT………….GATCA A CCAG ACT C GTAT………….GATCA A CCAG ACT C GTAT………….GATCA T CCAG SNP1 SNP2 Showing only alleles for both SNPs  Calculation of linkage disequilibrium Alleles SNP1 SNP2 Allele 1 G A Allele 2 C T
Awais Khan, University of Illinois, Urbana-Champaign For better understanding of LD calculation, it is divided into five steps If p1 and p2 =frequency of the alleles at SNP1 and q1 and q2 =frequency of the alleles at SNP2 then in tabular form it could be written as follows Steps in LD calculation Step 1) Calculate allele frequencies SNP1 SNP2 Allele Frequency Allele Frequency G p1 A q1 C p2 T q2
Awais Khan, University of Illinois, Urbana-Champaign Suppose haplotype frequencies are as follows From our example of two SNPs each with two alleles all possible haplotypes are Step 2) Calculate haplotype frequencies SNP2 Allele A T SNP1 G GA GT C CA CT Haplotype Frequency Haplotype Frequency GA p11 GT p12 CA p21 CT q22
Awais Khan, University of Illinois, Urbana-Champaign When haplotype frequencies are equal to the product of their corresponding allele frequencies, it means the loci are in linkage equilibrium  Step 3) Linkage equilibrium Haplotype frequency Product of allelic frequency p11 = p1q1 p12 = p1q2 p21 = p2q1 p22 = p2q2
Awais Khan, University of Illinois, Urbana-Champaign D11=p11-p1q1 D12=p12-p1q2 D21=p21-p2q1 D22=p22-p2q2 We can deduce linkage disequilibrium for each haplotype as the deviation of observed haplotype frequency from its corresponding allelic frequencies expected under equilibrium Step 4) Linkage disequilibrium After solving above for D, we get as follows: SNP2 1 2 SNP1 1 p1q1+D p1q2-D p1 2 p2q1-D p2q2+D p2 q1 q2 1
Awais Khan, University of Illinois, Urbana-Champaign p11 p22  =(p1 q1 + D )(p2 q2 + D ) =  p1q1p2q2+ p1q1D + p2q2D + D 2   p12 p21  = (p1 q2 − D )(p2 q1 − D )  =  p1q1p2q2− p2q1D − p1q2D + D 2 Subtracting  (p1q1p2q2+ p1q1D + p2q2D + D 2 )  –   (p1q1p2q2− p2q1D − p1q2D + D 2 ) p11 p22 − p12 p21 = D (p1 q1 + p2 q1 + p2 q2 + p1 q2 )  = D × (1) = D Step 5) Calculation of Linkage disequilibrium measure D a) b) c)
Awais Khan, University of Illinois, Urbana-Champaign P11 = p1q1= 0.5  x  0.5 =  0.25  P22 = p2q2= 0.5  x  0.5 =  0.25 P12 = p1q2= 0.5  x  0.5 =  0.25 P21 = p2q1= 0.5  x  0.5 =  0.25  D = (P11)(P22) -(P12)(P21) D  =  (0.25) (0.25)  - (0.25) (0.25)  =  0  Estimate of D in case of Linkage Equilibrium If allele frequencies of p1 and q1 are both 0.5 and equilibrium occurs (only Ab and aB exist in the population)
Awais Khan, University of Illinois, Urbana-Champaign P11  =  p1q1+ D  =  0.25 + D  =  0.5 P22  =  p2q2+ D  =  0.25 + D  =  0.5 P12  =  p1q2-D  =  0.25 -D  =  0 P21  =  p2q1-D  =  0.25 -D  =  0 D = (P11)(P22) -(P12)(P21) D  =  (0.5) (0.5)  - (0) (0)  =  0.25 Estimate of D in case of Linkage Disequilibrium If allele frequency of p1 and  q1 are  both  0.5  and there is complete non-random association (only AB and ab exist in the population) with equal allele frequencies at all loci
Awais Khan, University of Illinois, Urbana-Champaign Sometimes, depending on allele frequency of two loci, the value of D can be negative, but actual gametic frequencies cannot be negative To overcome this issue, standardization methods have been proposed Standardization of D
Awais Khan, University of Illinois, Urbana-Champaign In a common standardization method, a relative measure of  disequilibrium (D) compared to its maximum is used: D’ = D / Dmax When D is positive Dmax=  min  [ (p1q2)  or  (p2q1) ] When  D  is  negative  Dmax=  min  [ (p1q1)  or  (p2q2) ] This standardization makes D-values range between 0 and 1 Standardization of D
Awais Khan, University of Illinois, Urbana-Champaign Another commonly used measure to calculate LD between loci is Pearson coefficient of correlation (r) r = D/ (p1p2q1q2) 1/2 However, squared coefficient of correlation (r 2 ) is often used to remove the arbitrary sign introduced Correlation coefficient as a measure of LD
Awais Khan, University of Illinois, Urbana-Champaign However, r can be conveniently used for chi-test, as χ 2  = r  2  N where N is the number of chromosomes in the sample To test if LD is statistically significant we can do a χ 2  test χ 2 =Σ(obs-exp) 2 / exp  expected is random associations  between alleles Testing significance of LD
Awais Khan, University of Illinois, Urbana-Champaign Genotypic data GA = 474  GT = 611  CA = 142  CT = 773  Total  =  2000 Haplotype Frequencies GA  =  474 / 2000  =  .2370 GT  =  611 / 2000  =  .3055 CA  =  142 / 2000  =  .0710 CT  =  773 / 2000  =  .3865  Example Let’s assume that we have genotypic data for the two SNPs with two alleles each (same example used to deduce the equations for different LD measures) Calculation of haplotype and allele frequencies Allele  frequencies G =  0.542 C =  0.457 A =  0.308 T =  0.692
Awais Khan, University of Illinois, Urbana-Champaign D  =  (P11 P22)  - (P12 P21) D  =  (0.2370  x  0.3865)  - (0.3055 x 0.0710)  =  0.0699 Input values in the equation for D to calculate linkage disequilibrium To estimate Dmax input allelic frequencies and value for D in the following equation Dmax = min [ (p1q2) or  (p2q1)  ] Dmax =  (0.5425 x  0.692)  =  0.375 or  =  (0.4575 x  0.308)  =  0.141
Awais Khan, University of Illinois, Urbana-Champaign Now calculate D’ input value of  D and Dmax calculated in previous step in the following equation D’ = D / Dmax D’ =  0.0699 / 0.141 =  0.496  =  0.5 To calculate coefficient of correlation (r), input value of  D and allele frequencies calculated in previous steps in the following equation r  =  D / (p1p2q1q2)  ½ r  = 0.0699 /(0.5425 x 0.4575 x 0.308 x 0.692) 1/2 r  = 0.0699 / 0.23  =  0.304 r 2  = (0.304) 2   = 0 .092
Awais Khan, University of Illinois, Urbana-Champaign χ 2  = r  2  N χ 2  = 0.092  x  2000  =  184.8  (1 df) At 184.8 and df of 1, P-value is 0.0001 So, we can conclude based on our calculations that there is a significant LD between loci and it is 50% of the theoretical maximum Also note that two SNPs are in complete LD (not separated by recombination) when D ' = 1 or r 2 =1 To check the significance of LD between loci use following equation
Awais Khan, University of Illinois, Urbana-Champaign Reference: Lewontin R.C. 1988. On Measures of Gametic Disequilibrium. Genetics, 120(3):849-852. Devlin B., Risch N. 1995. A Comparison of Linkage Disequilibrium Measures for Fine-Scale Mapping. Genomics 29 (2): 311-322.

More Related Content

What's hot

Genome wide Association studies.pptx
Genome wide Association studies.pptxGenome wide Association studies.pptx
Genome wide Association studies.pptxAkshitaAwasthi3
 
Whole Genome Selection
Whole Genome SelectionWhole Genome Selection
Whole Genome SelectionRaghav N.R
 
Association mapping
Association mapping Association mapping
Association mapping Preeti Kapoor
 
Association mapping in plants
Association mapping in plantsAssociation mapping in plants
Association mapping in plantsWaseem Hussain
 
Linkage mapping and QTL analysis_Lecture
Linkage mapping and QTL analysis_LectureLinkage mapping and QTL analysis_Lecture
Linkage mapping and QTL analysis_LectureSameer Khanal
 
effective population size and its applications in plant breeding
effective population size and its applications in plant breeding effective population size and its applications in plant breeding
effective population size and its applications in plant breeding Anilkumar C
 
Qtl analysis and its mapping
Qtl analysis and its mappingQtl analysis and its mapping
Qtl analysis and its mappingVikas Verma
 
Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.Varsha Gayatonde
 
Statistical analysis of genetics parameter
Statistical analysis of genetics parameterStatistical analysis of genetics parameter
Statistical analysis of genetics parameterPawan Nagar
 
Association mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingAssociation mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingMahesh Biradar
 

What's hot (20)

Qtl and its mapping
Qtl and its mappingQtl and its mapping
Qtl and its mapping
 
Genome wide Association studies.pptx
Genome wide Association studies.pptxGenome wide Association studies.pptx
Genome wide Association studies.pptx
 
Whole Genome Selection
Whole Genome SelectionWhole Genome Selection
Whole Genome Selection
 
Association mapping
Association mapping Association mapping
Association mapping
 
Association mapping in plants
Association mapping in plantsAssociation mapping in plants
Association mapping in plants
 
Genome wide association mapping
Genome wide association mappingGenome wide association mapping
Genome wide association mapping
 
Linkage mapping and QTL analysis_Lecture
Linkage mapping and QTL analysis_LectureLinkage mapping and QTL analysis_Lecture
Linkage mapping and QTL analysis_Lecture
 
Basics of association_mapping
Basics of association_mappingBasics of association_mapping
Basics of association_mapping
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
Genome Mapping
Genome MappingGenome Mapping
Genome Mapping
 
Qtl mapping
 Qtl mapping  Qtl mapping
Qtl mapping
 
effective population size and its applications in plant breeding
effective population size and its applications in plant breeding effective population size and its applications in plant breeding
effective population size and its applications in plant breeding
 
Association mapping
Association mappingAssociation mapping
Association mapping
 
GWAS
GWASGWAS
GWAS
 
Qtl analysis and its mapping
Qtl analysis and its mappingQtl analysis and its mapping
Qtl analysis and its mapping
 
Report- Genome wide association studies.
Report- Genome wide association studies.Report- Genome wide association studies.
Report- Genome wide association studies.
 
SNp mining in crops
SNp mining in cropsSNp mining in crops
SNp mining in crops
 
Statistical analysis of genetics parameter
Statistical analysis of genetics parameterStatistical analysis of genetics parameter
Statistical analysis of genetics parameter
 
Association mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingAssociation mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mapping
 
Vr wr graph
Vr wr  graphVr wr  graph
Vr wr graph
 

Viewers also liked

Estimation of Linkage Disequilibrium using GGT2 Software
Estimation of Linkage Disequilibrium using GGT2 SoftwareEstimation of Linkage Disequilibrium using GGT2 Software
Estimation of Linkage Disequilibrium using GGT2 SoftwareAwais Khan
 
Single nucleotide polymorphisms (sn ps), haplotypes,
Single nucleotide polymorphisms (sn ps), haplotypes,Single nucleotide polymorphisms (sn ps), haplotypes,
Single nucleotide polymorphisms (sn ps), haplotypes,Karan Veer Singh
 
Sperm DNA Fragmentation
Sperm DNA FragmentationSperm DNA Fragmentation
Sperm DNA Fragmentationsasicrea
 
Haplotype resolved structural variation assembly with long reads
Haplotype resolved structural variation assembly with long readsHaplotype resolved structural variation assembly with long reads
Haplotype resolved structural variation assembly with long readsGenome Reference Consortium
 
Tetrad analysis, positive and negative interference, mapping through somatic ...
Tetrad analysis, positive and negative interference, mapping through somatic ...Tetrad analysis, positive and negative interference, mapping through somatic ...
Tetrad analysis, positive and negative interference, mapping through somatic ...Promila Sheoran
 

Viewers also liked (10)

Estimation of Linkage Disequilibrium using GGT2 Software
Estimation of Linkage Disequilibrium using GGT2 SoftwareEstimation of Linkage Disequilibrium using GGT2 Software
Estimation of Linkage Disequilibrium using GGT2 Software
 
Single nucleotide polymorphisms (sn ps), haplotypes,
Single nucleotide polymorphisms (sn ps), haplotypes,Single nucleotide polymorphisms (sn ps), haplotypes,
Single nucleotide polymorphisms (sn ps), haplotypes,
 
Sperm DNA Fragmentation
Sperm DNA FragmentationSperm DNA Fragmentation
Sperm DNA Fragmentation
 
Tetrad analysis
Tetrad analysisTetrad analysis
Tetrad analysis
 
linkage
linkagelinkage
linkage
 
Lytic cycle
Lytic cycleLytic cycle
Lytic cycle
 
Haplotype resolved structural variation assembly with long reads
Haplotype resolved structural variation assembly with long readsHaplotype resolved structural variation assembly with long reads
Haplotype resolved structural variation assembly with long reads
 
Recombination
RecombinationRecombination
Recombination
 
Tetrad analysis, positive and negative interference, mapping through somatic ...
Tetrad analysis, positive and negative interference, mapping through somatic ...Tetrad analysis, positive and negative interference, mapping through somatic ...
Tetrad analysis, positive and negative interference, mapping through somatic ...
 
Snp
SnpSnp
Snp
 

Similar to Measures of Linkage Disequilibrium

path coeeficient
path coeeficientpath coeeficient
path coeeficientkanaka K K
 
Categorical data analysis full lecture note PPT.pptx
Categorical data analysis full lecture note  PPT.pptxCategorical data analysis full lecture note  PPT.pptx
Categorical data analysis full lecture note PPT.pptxMinilikDerseh1
 
Ning_Mei.ASSIGN01
Ning_Mei.ASSIGN01Ning_Mei.ASSIGN01
Ning_Mei.ASSIGN01宁 梅
 
Reading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsReading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsKeisuke OTAKI
 
Values and Means - Part 2
Values and Means - Part 2Values and Means - Part 2
Values and Means - Part 2SyedShaanz
 
Chapter-4 combined.pptx
Chapter-4 combined.pptxChapter-4 combined.pptx
Chapter-4 combined.pptxHamzaHaji6
 
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docxChapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docxtiffanyd4
 
Tables and Formulas for Sullivan, Statistics Informed Decisio.docx
Tables and Formulas for Sullivan, Statistics Informed Decisio.docxTables and Formulas for Sullivan, Statistics Informed Decisio.docx
Tables and Formulas for Sullivan, Statistics Informed Decisio.docxmattinsonjanel
 
When is undersampling effective in unbalanced classification tasks?
When is undersampling effective in unbalanced classification tasks?When is undersampling effective in unbalanced classification tasks?
When is undersampling effective in unbalanced classification tasks?Andrea Dal Pozzolo
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxKIRUTHIKAAR2
 
InnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxKIRUTHIKAAR2
 
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...Pioneer Natural Resources
 

Similar to Measures of Linkage Disequilibrium (20)

path coeeficient
path coeeficientpath coeeficient
path coeeficient
 
Categorical data analysis full lecture note PPT.pptx
Categorical data analysis full lecture note  PPT.pptxCategorical data analysis full lecture note  PPT.pptx
Categorical data analysis full lecture note PPT.pptx
 
Ning_Mei.ASSIGN01
Ning_Mei.ASSIGN01Ning_Mei.ASSIGN01
Ning_Mei.ASSIGN01
 
Reading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGsReading Seminar (140515) Spectral Learning of L-PCFGs
Reading Seminar (140515) Spectral Learning of L-PCFGs
 
Values and Means - Part 2
Values and Means - Part 2Values and Means - Part 2
Values and Means - Part 2
 
Microeconomics-Help-Experts.pptx
Microeconomics-Help-Experts.pptxMicroeconomics-Help-Experts.pptx
Microeconomics-Help-Experts.pptx
 
Sample size estimation
Sample size estimationSample size estimation
Sample size estimation
 
Chapter-4 combined.pptx
Chapter-4 combined.pptxChapter-4 combined.pptx
Chapter-4 combined.pptx
 
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docxChapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
Chapter 9 Two-Sample Inference 265 Chapter 9 Two-Sam.docx
 
Statistics
StatisticsStatistics
Statistics
 
Tables and Formulas for Sullivan, Statistics Informed Decisio.docx
Tables and Formulas for Sullivan, Statistics Informed Decisio.docxTables and Formulas for Sullivan, Statistics Informed Decisio.docx
Tables and Formulas for Sullivan, Statistics Informed Decisio.docx
 
Quantitative genetics
Quantitative geneticsQuantitative genetics
Quantitative genetics
 
Chap04
Chap04Chap04
Chap04
 
When is undersampling effective in unbalanced classification tasks?
When is undersampling effective in unbalanced classification tasks?When is undersampling effective in unbalanced classification tasks?
When is undersampling effective in unbalanced classification tasks?
 
Datacompression1
Datacompression1Datacompression1
Datacompression1
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
 
InnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas helpInnerSoft STATS - Methods and formulas help
InnerSoft STATS - Methods and formulas help
 
Waveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptxWaveform_codingUNIT-II_DC_-PPT.pptx
Waveform_codingUNIT-II_DC_-PPT.pptx
 
Chapt 2
Chapt 2Chapt 2
Chapt 2
 
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
OPTIMIZED RATE ALLOCATION OF HYPERSPECTRAL IMAGES IN COMPRESSED DOMAIN USING ...
 

Recently uploaded

Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsanshu789521
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitolTechU
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon AUnboundStockton
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxEyham Joco
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17Celine George
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfMahmoud M. Sallam
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementmkooblal
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxOH TEIK BIN
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxJiesonDelaCerna
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersSabitha Banu
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfUjwalaBharambe
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatYousafMalik24
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentInMediaRes1
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdfssuser54595a
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 

Recently uploaded (20)

Presiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha electionsPresiding Officer Training module 2024 lok sabha elections
Presiding Officer Training module 2024 lok sabha elections
 
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Tilak Nagar Delhi reach out to us at 🔝9953056974🔝
 
Capitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptxCapitol Tech U Doctoral Presentation - April 2024.pptx
Capitol Tech U Doctoral Presentation - April 2024.pptx
 
Crayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon ACrayon Activity Handout For the Crayon A
Crayon Activity Handout For the Crayon A
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Types of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptxTypes of Journalistic Writing Grade 8.pptx
Types of Journalistic Writing Grade 8.pptx
 
How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17How to Configure Email Server in Odoo 17
How to Configure Email Server in Odoo 17
 
Pharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdfPharmacognosy Flower 3. Compositae 2023.pdf
Pharmacognosy Flower 3. Compositae 2023.pdf
 
Hierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of managementHierarchy of management that covers different levels of management
Hierarchy of management that covers different levels of management
 
Solving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptxSolving Puzzles Benefits Everyone (English).pptx
Solving Puzzles Benefits Everyone (English).pptx
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
CELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptxCELL CYCLE Division Science 8 quarter IV.pptx
CELL CYCLE Division Science 8 quarter IV.pptx
 
DATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginnersDATA STRUCTURE AND ALGORITHM for beginners
DATA STRUCTURE AND ALGORITHM for beginners
 
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdfFraming an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
Framing an Appropriate Research Question 6b9b26d93da94caf993c038d9efcdedb.pdf
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
Earth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice greatEarth Day Presentation wow hello nice great
Earth Day Presentation wow hello nice great
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
Meghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media ComponentMeghan Sutherland In Media Res Media Component
Meghan Sutherland In Media Res Media Component
 
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
18-04-UA_REPORT_MEDIALITERAСY_INDEX-DM_23-1-final-eng.pdf
 
internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 

Measures of Linkage Disequilibrium

  • 1. Introduction to different measures of linkage disequilibrium (LD) and their calculation Awais Khan, University of Illinois, Urbana-Champaign By Dr. M. Awais Khan University of Illinois, Urbana-Champaign
  • 2. Awais Khan, University of Illinois, Urbana-Champaign To understand the calculation of linkage disequilibrium consider following example Suppose there are two genes on Chromosome 5 of apple, each with two alleles ACT G GTAT………….GATCA A CCAG ACT C GTAT………….GATCA A CCAG ACT C GTAT………….GATCA T CCAG SNP1 SNP2 Showing only alleles for both SNPs Calculation of linkage disequilibrium Alleles SNP1 SNP2 Allele 1 G A Allele 2 C T
  • 3. Awais Khan, University of Illinois, Urbana-Champaign For better understanding of LD calculation, it is divided into five steps If p1 and p2 =frequency of the alleles at SNP1 and q1 and q2 =frequency of the alleles at SNP2 then in tabular form it could be written as follows Steps in LD calculation Step 1) Calculate allele frequencies SNP1 SNP2 Allele Frequency Allele Frequency G p1 A q1 C p2 T q2
  • 4. Awais Khan, University of Illinois, Urbana-Champaign Suppose haplotype frequencies are as follows From our example of two SNPs each with two alleles all possible haplotypes are Step 2) Calculate haplotype frequencies SNP2 Allele A T SNP1 G GA GT C CA CT Haplotype Frequency Haplotype Frequency GA p11 GT p12 CA p21 CT q22
  • 5. Awais Khan, University of Illinois, Urbana-Champaign When haplotype frequencies are equal to the product of their corresponding allele frequencies, it means the loci are in linkage equilibrium Step 3) Linkage equilibrium Haplotype frequency Product of allelic frequency p11 = p1q1 p12 = p1q2 p21 = p2q1 p22 = p2q2
  • 6. Awais Khan, University of Illinois, Urbana-Champaign D11=p11-p1q1 D12=p12-p1q2 D21=p21-p2q1 D22=p22-p2q2 We can deduce linkage disequilibrium for each haplotype as the deviation of observed haplotype frequency from its corresponding allelic frequencies expected under equilibrium Step 4) Linkage disequilibrium After solving above for D, we get as follows: SNP2 1 2 SNP1 1 p1q1+D p1q2-D p1 2 p2q1-D p2q2+D p2 q1 q2 1
  • 7. Awais Khan, University of Illinois, Urbana-Champaign p11 p22 =(p1 q1 + D )(p2 q2 + D ) = p1q1p2q2+ p1q1D + p2q2D + D 2 p12 p21 = (p1 q2 − D )(p2 q1 − D ) = p1q1p2q2− p2q1D − p1q2D + D 2 Subtracting (p1q1p2q2+ p1q1D + p2q2D + D 2 ) – (p1q1p2q2− p2q1D − p1q2D + D 2 ) p11 p22 − p12 p21 = D (p1 q1 + p2 q1 + p2 q2 + p1 q2 ) = D × (1) = D Step 5) Calculation of Linkage disequilibrium measure D a) b) c)
  • 8. Awais Khan, University of Illinois, Urbana-Champaign P11 = p1q1= 0.5 x 0.5 = 0.25 P22 = p2q2= 0.5 x 0.5 = 0.25 P12 = p1q2= 0.5 x 0.5 = 0.25 P21 = p2q1= 0.5 x 0.5 = 0.25 D = (P11)(P22) -(P12)(P21) D = (0.25) (0.25) - (0.25) (0.25) = 0 Estimate of D in case of Linkage Equilibrium If allele frequencies of p1 and q1 are both 0.5 and equilibrium occurs (only Ab and aB exist in the population)
  • 9. Awais Khan, University of Illinois, Urbana-Champaign P11 = p1q1+ D = 0.25 + D = 0.5 P22 = p2q2+ D = 0.25 + D = 0.5 P12 = p1q2-D = 0.25 -D = 0 P21 = p2q1-D = 0.25 -D = 0 D = (P11)(P22) -(P12)(P21) D = (0.5) (0.5) - (0) (0) = 0.25 Estimate of D in case of Linkage Disequilibrium If allele frequency of p1 and q1 are both 0.5 and there is complete non-random association (only AB and ab exist in the population) with equal allele frequencies at all loci
  • 10. Awais Khan, University of Illinois, Urbana-Champaign Sometimes, depending on allele frequency of two loci, the value of D can be negative, but actual gametic frequencies cannot be negative To overcome this issue, standardization methods have been proposed Standardization of D
  • 11. Awais Khan, University of Illinois, Urbana-Champaign In a common standardization method, a relative measure of disequilibrium (D) compared to its maximum is used: D’ = D / Dmax When D is positive Dmax= min [ (p1q2) or (p2q1) ] When D is negative Dmax= min [ (p1q1) or (p2q2) ] This standardization makes D-values range between 0 and 1 Standardization of D
  • 12. Awais Khan, University of Illinois, Urbana-Champaign Another commonly used measure to calculate LD between loci is Pearson coefficient of correlation (r) r = D/ (p1p2q1q2) 1/2 However, squared coefficient of correlation (r 2 ) is often used to remove the arbitrary sign introduced Correlation coefficient as a measure of LD
  • 13. Awais Khan, University of Illinois, Urbana-Champaign However, r can be conveniently used for chi-test, as χ 2 = r 2 N where N is the number of chromosomes in the sample To test if LD is statistically significant we can do a χ 2 test χ 2 =Σ(obs-exp) 2 / exp expected is random associations between alleles Testing significance of LD
  • 14. Awais Khan, University of Illinois, Urbana-Champaign Genotypic data GA = 474 GT = 611 CA = 142 CT = 773 Total = 2000 Haplotype Frequencies GA = 474 / 2000 = .2370 GT = 611 / 2000 = .3055 CA = 142 / 2000 = .0710 CT = 773 / 2000 = .3865 Example Let’s assume that we have genotypic data for the two SNPs with two alleles each (same example used to deduce the equations for different LD measures) Calculation of haplotype and allele frequencies Allele frequencies G = 0.542 C = 0.457 A = 0.308 T = 0.692
  • 15. Awais Khan, University of Illinois, Urbana-Champaign D = (P11 P22) - (P12 P21) D = (0.2370 x 0.3865) - (0.3055 x 0.0710) = 0.0699 Input values in the equation for D to calculate linkage disequilibrium To estimate Dmax input allelic frequencies and value for D in the following equation Dmax = min [ (p1q2) or (p2q1) ] Dmax = (0.5425 x 0.692) = 0.375 or = (0.4575 x 0.308) = 0.141
  • 16. Awais Khan, University of Illinois, Urbana-Champaign Now calculate D’ input value of D and Dmax calculated in previous step in the following equation D’ = D / Dmax D’ = 0.0699 / 0.141 = 0.496 = 0.5 To calculate coefficient of correlation (r), input value of D and allele frequencies calculated in previous steps in the following equation r = D / (p1p2q1q2) ½ r = 0.0699 /(0.5425 x 0.4575 x 0.308 x 0.692) 1/2 r = 0.0699 / 0.23 = 0.304 r 2 = (0.304) 2 = 0 .092
  • 17. Awais Khan, University of Illinois, Urbana-Champaign χ 2 = r 2 N χ 2 = 0.092 x 2000 = 184.8 (1 df) At 184.8 and df of 1, P-value is 0.0001 So, we can conclude based on our calculations that there is a significant LD between loci and it is 50% of the theoretical maximum Also note that two SNPs are in complete LD (not separated by recombination) when D ' = 1 or r 2 =1 To check the significance of LD between loci use following equation
  • 18. Awais Khan, University of Illinois, Urbana-Champaign Reference: Lewontin R.C. 1988. On Measures of Gametic Disequilibrium. Genetics, 120(3):849-852. Devlin B., Risch N. 1995. A Comparison of Linkage Disequilibrium Measures for Fine-Scale Mapping. Genomics 29 (2): 311-322.