Lecture
Concepts: Linkage and Linkage Mapping
Linkage (of genes):
“the association of genes that results from their
being on the same chromosome (i.e., physically
associated)”. For example, genes A and B on
chromosome Chr1 (Fig. 1a).
Linkage group:
“all genes in one chromosome form one linkage
group”. For example: Chr1 and Chr2 are two different
linkage groups (Fig. 1a).
Linked (genes):
“a pair of linked genes (specifically, their alleles) tend
to be transmitted together during meiotic cycle and
progenies deviate from Mendelian ratios depending
upon recombination fraction (r) between the two
genes”. For example, genes A and B in Fig. 1b.
Fig. 1a. A and B linked; C unlinked to A and B (chromosomes Chr1 and Chr2)
Fig. 1b. Test cross frequencies
P:          AABB X aabb
Test cross: AaBb X aabb (tester gamete: ab)
F1 gamete   Progeny   Frequency (unlinked)   Frequency (linked)
AB          AaBb      1/4                    (1-r)/2
Ab          Aabb      1/4                    r/2
aB          aaBb      1/4                    r/2
ab          aabb      1/4                    (1-r)/2
Source: R.H.J. Schlegel, Encyclopedic Dictionary of Plant Breeding
Linkage map:
- “is a map of the frequencies of
recombination that occur between markers
on homologous chromosomes during
meiosis.”
- distance is measured in centimorgans (cM).
Physical map:
- “shows the physical locations of genes and
other DNA sequences of interest.”
- distance is measured in base pairs
Comparative map:
- a map that compares linkage maps or
physical maps of related species based on
shared markers or sequences, respectively
(Fig. 2)
Fig. 2. Comparative map
Source: Fig. 2 - www.pnas.org/content/102/37/13206/F3.expansion.html
Concepts: QTL Analysis
Qualitative traits
1. Monogenic or oligogenic
2. Discrete phenotypic classes (nominal scale)
3. Typically, environmental effect on trait expression is absent or low
4. Discontinuous variation (Fig. 3)
5. Genes have large effects
6. Mapped as visible markers (i.e., linkage mapping)
Quantitative traits
1. Polygenic (quantitative trait loci)
2. Continuum of measures (interval scale)
3. Trait expression may show profound environmental effect
4. Continuous variation (Fig. 4)
5. Genes have smaller effects
6. Mapping requires QTL analysis
cubocube.com
Fig. 3. Discrete trait
Fig. 4. Fruit shape: a quantitative trait
www.nature.com
Lecture Outline: Linkage Mapping
1. A peek into the history of linkage mapping
1.1. Mendel’s work: rediscovery, validation and exceptions
1.2. Early genetic linkage maps
- natural mutants as genetic markers
- two-point and three-point linkage analysis
1.3. Mapping functions
2. Molecular era and revolution in genetic linkage mapping
2.1. Molecular markers
- isozymes, RFLPs, SSRs and SNPs
2.2. Mapping populations in plants
- F2, RILs, BC
2.3. Methods and tools for linkage mapping in plants
- maximum likelihood, LOD support, multipoint linkage mapping
2.4. Mapping polyploid genomes and outcrossing species
1. A peek into the history of linkage mapping
1.1. Mendel’s work: rediscovery,
validation and exceptions
- Experiments in Plant Hybridization
(1865). Crosses between natural
mutants (Fig. 5)
- Rediscovered in 1900
- Laws of segregation (Fig. 6) and
independent assortment (Fig. 7)
- Wide validity in diverse organisms
for unlinked qualitative traits
Source: monohybrid cross - www.desktopclass.com
Fig. 6. Monohybrid
Cross
Fig. 5. Mendel’s traits
Source: Mendel’s traits -www.nature.com
Fig. 7
Source: Punnett square - sites.saschina.org
- Bateson and Punnett (1904)
- Deviation from Mendelian inheritance
(Fig. 8)
www.cas.miamioh.edu
1865  Gregor Mendel: proposed basic laws of inheritance
1900  H. de Vries, E. von Tschermak, C. Correns: rediscovered Mendel’s work
1902  Boveri and Sutton: chromosome theory of inheritance
1904  Bateson and Punnett: linkage
Fig. 8
1.2. Early genetic linkage maps
- 1900 – 1910: concepts of gene, allele,
genotype, phenotype, homozygote,
heterozygote
Thomas Hunt Morgan:
i. studied Drosophila genetics
ii. genes responsible for discrete
phenotypic differences are located on
chromosomes
iii. likelihood of co-transmission and
reshuffling (due to recombination)
depends on linkage between
genes (Fig. 9)
iv. linkages can be quantified
(i.e., linkage mapping is a possibility)
Fig. 9. An illustration of Morgan’s
study in Drosophila
Source: Fig. 9. - http://bio.vtn2.com/bio-home/harvey/lect/images/morgan15.4.gif
Quantifying genetic linkages:
- mostly dihybrid test crosses and F2
populations (Fig. 10)
- segregating for wild-type (+) and mutant
(-) alleles
- sex-linked genes (X-linked)
Sturtevant’s (Morgan’s student) first genetic linkage map:
- Series of dihybrid crosses. Example:
Fig. 10
- Map distance between body color and
eye color genes
= Recombination frequency, RF (%)
= [(0 + 2)/373]*100 ≈ 0.5
Fig. 10. An illustration of a dihybrid
cross, based on Sturtevant
(1913)
Source: Fig 10 - http://www.esp.org/foundations/genetics/classical/holdings/s/ahs-13.pdf
RF (%) = (number of recombinant types)*100/total
Fig. 10 labels: (+) wild-type allele; (-) mutant allele; parental type
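The RF formula above can be checked with a few lines of arithmetic. The sketch below recomputes two entries using crossover counts quoted in this lecture (2/373 for body color and eye color, 17/573 for the PR pair); the function name is an assumption made for illustration.

```python
def rf_percent(recombinants, total):
    """Recombination frequency in percent: recombinant types * 100 / total."""
    if total <= 0:
        raise ValueError("total must be positive")
    return 100.0 * recombinants / total


bo = rf_percent(0 + 2, 373)  # body color vs eye color, as in the worked example
pr = rf_percent(17, 573)     # factors P and R from Sturtevant's table
print(round(bo, 1), round(pr, 1))
```

Rounded to one decimal, these give 0.5 and 3.0, matching the percentages reported by Sturtevant.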
Sturtevant’s (Morgan’s student) first genetic linkage map (Fig. 11):
- a series of two-point recombination
frequencies (%) between 6 genes, from
19 different populations (Fig. 12)
- marker ordering started from the closest
linkages; other loci were added manually
Fig. 11. First genetic linkage map. Sturtevant (1913)
Factors concerned   Proportion of crossovers   % of crossovers
BCO                 193 / 16278                1.2
BO                  2 / 373                    0.5
BP                  1464 / 4551                32.2
BR                  115 / 324                  35.5
BM                  260 / 693                  37.5
COP                 224 / 748                  29.9
COR                 1643 / 4749                34.6
COM                 76 / 161                   47.2
OP                  247 / 836                  29.5
OR                  183 / 538                  34.0
OM                  218 / 404                  54.0
CR                  236 / 829                  28.5
CM                  112 / 333                  33.6
B(C,O)              214 / 21736                1.0
(C,O)P              471 / 1584                 29.7
(C,O)R              2062 / 6116                33.7
(C,O)M              406 / 898                  45.2
PR                  17 / 573                   3.0
PM                  109 / 405                  26.9
Source: Fig.11, Fig. 12 - www.nature.com/scitable/content/The-linear-arrangement-of-six-sex-linked-16655
Fig. 12. Sturtevant table of RF (%)
Limitations of two-point linkage
analysis
- Consider 2 genes that are far enough
apart that 2 crossovers (XOs) occasionally
occur between them, involving:
i. the same two nonsister chromatids for
both XOs (Fig. 13)
ii. different nonsister chromatids for
the two XOs (Fig. 14)
- Result: either underestimation or
overestimation of RF
Fig. 13. Double crossover (same). Gametes: AB, AB, ab, ab
Fig. 14. Double crossover (different). Gametes: Ab, Ab, aB, aB
The three point test cross
- Using trihybrid crosses
- more efficient; accounts for double XOs
- allows calculation of XO interference
Example (Fig. 15):
i. First, test linkage. Here, the genes are linked
ii. Most frequent classes are the parental types
iii. Four single crossover (SCO) classes and two double crossover (DCO) classes
Cross: triple heterozygote (X- Z+ Y+ / X+ Z- Y-) X triple homozygote (X- Z- Y- / X- Z- Y-)
Offspring phenotype   No. of individuals   Parental/Recombinant   XO type
X+ Y- Z+              1                    Recombinant            DCO
X- Y+ Z+              440                  Parental
X- Y- Z+              26                   Recombinant            SCO #1
X- Y- Z-              61                   Recombinant            SCO #2
X+ Y+ Z-              32                   Recombinant            SCO #1
X+ Y- Z-              442                  Parental
X+ Y+ Z+              58                   Recombinant            SCO #2
X- Y+ Z-              2                    Recombinant            DCO
Total                 1062
Fig. 15. Three point test cross freq.
Example (Fig. 16), continued:
iv. Compare either parental type to
the double XO types
v. Conclusion: gene Z is in the center
vi. Map distance (X-Z)
= [SCO (X-Z) + DCOs]*100/total
vii. Coefficient of coincidence (C)
= observed DCO freq./expected DCO
freq.
where expected DCO freq.
= (X-Z recombination freq. * Z-Y recombination freq.)
viii. Interference = 1 - C
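Steps i to viii can be followed numerically with the offspring counts of the three point test cross shown in this lecture. The sketch below is one possible way to organize the arithmetic, not the lecturer's own procedure. It assumes the usual textbook convention that the expected DCO frequency is the product of the two interval recombination frequencies (each including the DCO classes); all function and variable names are illustrative.

```python
# Offspring counts of the three point test cross, keyed by (X, Y, Z) alleles.
counts = {
    ("+", "-", "+"): 1,
    ("-", "+", "+"): 440,
    ("-", "-", "+"): 26,
    ("-", "-", "-"): 61,
    ("+", "+", "-"): 32,
    ("+", "-", "-"): 442,
    ("+", "+", "+"): 58,
    ("-", "+", "-"): 2,
}
genes = ("X", "Y", "Z")
total = sum(counts.values())

# The two most frequent classes are parental, the two rarest are DCOs.
ranked = sorted(counts, key=counts.get, reverse=True)
parentals, dcos = ranked[:2], ranked[-2:]


def differing(a, b):
    """Genes at which two phenotype classes carry different alleles."""
    return [g for g, x, y in zip(genes, a, b) if x != y]


# The gene that alone distinguishes a parental type from a DCO type is central.
middle = next(
    differing(parentals[0], d)[0]
    for d in dcos
    if len(differing(parentals[0], d)) == 1
)
left, right = [g for g in genes if g != middle]


def recombination_freq(g1, g2):
    """Fraction of offspring recombinant between g1 and g2 (DCOs included)."""
    i, j = genes.index(g1), genes.index(g2)
    parental_pairs = {(p[i], p[j]) for p in parentals}
    recombinant = sum(
        n for geno, n in counts.items() if (geno[i], geno[j]) not in parental_pairs
    )
    return recombinant / total


rf_left = recombination_freq(left, middle)    # X-Z interval
rf_right = recombination_freq(middle, right)  # Z-Y interval
observed_dco = sum(counts[d] for d in dcos) / total
coincidence = observed_dco / (rf_left * rf_right)
interference = 1 - coincidence

print(middle, round(100 * rf_left, 1), round(100 * rf_right, 1))
print(round(coincidence, 2), round(interference, 2))
```

Under these assumptions the central gene is Z, the X-Z and Z-Y distances come out at about 11.5 and 5.7 map units, and the coefficient of coincidence is about 0.43 (interference about 0.57).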
Fig. 16. Three point test cross freq. (same cross and offspring counts as Fig. 15)
Comparison of parental (P) and DCO types (S = same allele, D = different allele):
P     X-  Y+  Z+
DCO   X+  Y-  Z+
      D   D   S
P     X+  Y-  Z-
DCO   X+  Y-  Z+
      S   S   D
1.3. Mapping functions
- “for more than three loci,
relationship among possible
recombination fractions is complex”
- “RFs between loci flanking a region
are not simple sum of recombination
fractions for adjacent loci within the
region”
- “conversion of recombination
fractions to additive map distances
requires mapping functions” (Fig. 17):
i. Haldane
ii. Kosambi
Fig. 17. Table: Haldane and Kosambi
mapping functions. Chart:
comparison of mapping functions.
“r” is recombination fraction and
“d” is map distance.
Source: Ben Hui Liu, Statistical Genomics; Rongling Wu et al., Statistical Genetics of Quantitative Traits
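The table in Fig. 17 is not reproduced in this transcript, so the sketch below writes out the standard forms of the Haldane and Kosambi mapping functions, with map distance d expressed in cM. The function names are illustrative assumptions.

```python
import math


def haldane_cm(r):
    """Haldane map distance (cM) from recombination fraction r, 0 <= r < 0.5."""
    return -50.0 * math.log(1.0 - 2.0 * r)


def kosambi_cm(r):
    """Kosambi map distance (cM) from recombination fraction r, 0 <= r < 0.5."""
    return 25.0 * math.log((1.0 + 2.0 * r) / (1.0 - 2.0 * r))


def haldane_r(d_cm):
    """Inverse Haldane function: recombination fraction from distance in cM."""
    return 0.5 * (1.0 - math.exp(-2.0 * d_cm / 100.0))


def kosambi_r(d_cm):
    """Inverse Kosambi function: recombination fraction from distance in cM."""
    return 0.5 * math.tanh(2.0 * d_cm / 100.0)


for r in (0.01, 0.10, 0.30):
    print(r, round(haldane_cm(r), 2), round(kosambi_cm(r), 2))
```

For small r both functions give a distance close to 100*r cM; they diverge as r grows because Haldane assumes no crossover interference while Kosambi allows for it.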
1. A peek into the history of linkage mapping
Summary:
- Paucity of visible natural markers
(phenotypic mutants)
- Radiation mutants offered additional traits,
but lethality and sterility were problems
- Nevertheless, two point and three point
linkage maps persisted for several decades
(~70 years)
- Example:
i. tomato: 258 morphological and
physiological markers (Rick 1975)
Fig. 18. An illustration of a tomato
linkage map made in 1952
Source: Fig. 18 – An introduction to Genetic Analysis, 5th edition.
2. Molecular era and revolution in genetic linkage mapping
2.1. Molecular markers
- gel electrophoresis brought isozyme markers into the
picture
- restriction endonucleases and Southern blot
techniques brought RFLPs
- DNA sequencing and PCR brought SSRs and
SNPs
- virtually unlimited number of “visible markers”
- gaps in genetic linkage maps could be filled
- comparative mapping, gene cloning, QTL
analysis and marker assisted selection (MAS) could be done
Fig. 19. Classes of molecular
markers (RFLP, SSR)
Source: Fig. 19 - nature.berkeley.edu/brunslab/tour/tour2.html
2.2. Mapping populations in
plants - considerations:
1st: marker polymorphism
- adequate polymorphic markers
between parents
- contrasting traits of interest
2nd: reproductive mode
- If inbreeding is possible:
F2, recombinant inbred lines
(RIL), backcross (BC)
- Mostly outcrossing (or self-incompatible),
long generation time:
pseudo-testcross, backcross
Fig. 20a. F2 population
Fig. 20b. RIL population
Fig. 20c. BC population
Fig. 20d. pseudo-testcross population
Source: Fig. 20 - K. Meksem and G. Kahl, The Handbook of Plant Genome Mapping
2.3. Methods and tools for linkage mapping in plants
Steps:
i. Data generation: genotype the mapping population and prepare the input file
for mapping
ii. Calculating recombination fractions (RFs): maximum likelihood estimates
of pair-wise RFs
iii. Locus grouping: grouping of markers into prospective linkage groups based
on a maximum recombination fraction threshold and a minimum LOD (logarithm
of odds) score threshold
iv. Locus ordering: finding the best possible order, i.e., the one with the highest
multipoint likelihood among the different probable orders
v. Multilocus distance estimation
Detailed procedural discourse on MapMaker
i. Data generation:
MapMaker input file format (Fig. 21)
Type of cross:
F2 intercross
F2 backcross
F3 self
RI self
RI sib
Genotype scores (default symbols):
A : homozygous for parent A
H : heterozygous
B : homozygous for parent B
C : not homozygous for parent A
D : not homozygous for parent B
- : missing
Fig. 21. MapMaker input format (labelled parts: population size, number of markers, marker names, scores)
ii. Calculating recombination fractions (RFs): in backcross mating design (BC1)
- progenies can be distinctly
categorized into parental
or recombinant (Fig. 22a)
- recombination fraction is
simply the frequency of
recombinant type
(Fig 22b)
Fig. 22a. Freq. of gametes in BC mating
Fig. 22b. RF estimation is plain
and simple for a
backcross mating design
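For a backcross, the estimate of r is the observed fraction of recombinant progeny, and support for linkage is commonly expressed as a LOD score: the log10 ratio of the likelihood at the estimated r to the likelihood at r = 0.5. The sketch below illustrates this calculation under a simple binomial model. It is not MapMaker's code, and the counts (10 recombinants out of 100) are hypothetical.

```python
import math


def backcross_rf_and_lod(n_recombinant, n_total):
    """Two-point RF estimate and LOD score for backcross progeny counts.

    The estimate is truncated at 0.5, the value expected for unlinked loci.
    """
    if n_total <= 0 or not 0 <= n_recombinant <= n_total:
        raise ValueError("counts must satisfy 0 <= n_recombinant <= n_total, n_total > 0")
    r_hat = min(n_recombinant / n_total, 0.5)
    n_parental = n_total - n_recombinant

    # log10 likelihood of the data at r_hat (terms with zero counts drop out)
    log_l_hat = 0.0
    if n_recombinant > 0:
        log_l_hat += n_recombinant * math.log10(r_hat)
    if n_parental > 0:
        log_l_hat += n_parental * math.log10(1.0 - r_hat)

    # log10 likelihood under the null hypothesis of no linkage (r = 0.5)
    log_l_null = n_total * math.log10(0.5)
    return r_hat, log_l_hat - log_l_null


r_hat, lod = backcross_rf_and_lod(10, 100)  # hypothetical counts
print(r_hat, round(lod, 2))
```

A LOD of 3 means the data are 1000 times more likely under linkage at the estimated r than under no linkage, which is why thresholds around 3 are often used when grouping markers.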
ii. Calculating recombination fractions (RFs): in F2 mating design (Fig. 23a)
- progenies cannot be distinctly
categorized. For illustration, four
possible genotypes shown in Fig. 23b
belong to same genotype class
A1A2B1B2, but may come from parental
gametes without XO or recombinant
gametes (with XO) in both parents
Fig. 23a. F2 mating design and F2 genotypes
Fig. 23b. The counts (in parenthesis)
and frequencies of the 16
possible genotypes in an F2
family
ii. Calculating recombination fractions (RFs): in F2 mating design
- 16 possible genotypes coalesce into 9 observable genotypic classes
Fig. 24. Frequencies of the nine observed genotypes in
an F2 population
ii. Calculating recombination fractions (RFs): in F2 mating design
- likelihood function for estimating RF (r) (Fig. 25)
- “Maximum likelihood for r is
obtained by setting S(r) = 0 and
solving for r”, where S(r) is the score
(first derivative of the log-likelihood)
- “however, there is no explicit
solution for r”
- different ways to invoke an iterative
algorithm to solve for r:
a. Grid search
b. Newton-Raphson method
Fig. 25. Likelihood function of r
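The grid search option can be sketched as follows. Because the frequency table of Fig. 24 is not reproduced in this transcript, the nine class probabilities below are the standard ones for two codominant markers in a coupling-phase F2, which is an assumption; the counts are hypothetical and the function names are illustrative.

```python
import math


def f2_class_probs(r):
    """Probabilities of the nine observable two-locus F2 genotype classes.

    Assumes codominant markers and an F1 in coupling phase (AB/ab).
    """
    p, q = 1.0 - r, r
    return {
        "AABB": p * p / 4, "AABb": p * q / 2, "AAbb": q * q / 4,
        "AaBB": p * q / 2, "AaBb": (p * p + q * q) / 2, "Aabb": p * q / 2,
        "aaBB": q * q / 4, "aaBb": p * q / 2, "aabb": p * p / 4,
    }


def log_likelihood(r, counts):
    """Multinomial log-likelihood of r, dropping the constant term."""
    probs = f2_class_probs(r)
    return sum(n * math.log(probs[g]) for g, n in counts.items() if n > 0)


def grid_search_r(counts, steps=500):
    """Return the grid value of r in (0, 0.5] with the highest log-likelihood."""
    grid = [i / (2 * steps) for i in range(1, steps + 1)]
    return max(grid, key=lambda r: log_likelihood(r, counts))


# Hypothetical counts for 400 F2 individuals
counts = {
    "AABB": 81, "AABb": 18, "AAbb": 1,
    "AaBB": 18, "AaBb": 164, "Aabb": 18,
    "aaBB": 1, "aaBb": 18, "aabb": 81,
}
r_hat = grid_search_r(counts)
print(r_hat)
```

The grid resolution (here 0.001) bounds the precision of the estimate; Newton-Raphson reaches a comparable answer in fewer likelihood evaluations but needs derivatives of the log-likelihood.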
iii. Locus grouping :
- MapMaker’s “GROUP” command builds preliminary linkage groups based on
maximum-likelihood estimates of RF and corresponding LOD score between
marker pairs
- maximum allowable RF and minimum LOD score thresholds can be manually
updated to track changes in grouping structure with corresponding changes in
thresholds
- finally, linkage groups are formed by marker associations. For example, if A is
linked to B, and B is linked to C, all three belong to a group (remember, RF and
LOD thresholds are there for minimizing spurious linkages)
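The transitive grouping rule (A linked to B and B linked to C puts all three in one group) can be sketched with a union-find structure. This illustrates the idea only and is not MapMaker's implementation: the threshold defaults, marker names and pairwise values are hypothetical.

```python
def group_markers(markers, pairwise, max_rf=0.4, min_lod=3.0):
    """Group markers transitively from pairwise (RF, LOD) estimates.

    pairwise maps a pair of marker names to an (rf, lod) tuple. Two markers
    are considered linked when rf <= max_rf and lod >= min_lod.
    """
    parent = {m: m for m in markers}

    def find(m):
        # Follow parent links to the group representative, compressing the path.
        while parent[m] != m:
            parent[m] = parent[parent[m]]
            m = parent[m]
        return m

    for (a, b), (rf, lod) in pairwise.items():
        if rf <= max_rf and lod >= min_lod:
            parent[find(a)] = find(b)  # merge the two groups

    groups = {}
    for m in markers:
        groups.setdefault(find(m), []).append(m)
    return sorted(sorted(g) for g in groups.values())


pairwise = {
    ("A", "B"): (0.10, 8.0),
    ("B", "C"): (0.15, 6.0),
    ("A", "C"): (0.24, 2.5),  # below the LOD threshold on its own
    ("C", "D"): (0.48, 0.1),  # not linked
}
groups = group_markers(["A", "B", "C", "D"], pairwise)
print(groups)
```

Here A and C fail the LOD threshold as a pair but still end up in the same group through B, while D stays alone. Tightening the thresholds splits groups and loosening them merges groups, which is the behaviour tracked when thresholds are updated manually.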
iv. Locus ordering:
- “ordering is the central problem in linkage mapping, and also the most
interesting in the sense that for groups of even modest size there is no sure
way to find the best of the (N!/2) possible orders”
- MapMaker’s “COMPARE” command is exhaustive - computes maximum
likelihood score for all possible orders and reports a subset of most likely ones
- however, ordering more than 5-7 markers with “COMPARE” is not practical
(time issue!)
Source: Meksem and Kahl, The Handbook of Plant Genome Mapping
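The growth of N!/2 and the idea of an exhaustive search can be illustrated with a short sketch. Note the substitution: instead of the multipoint likelihood that COMPARE computes, this example scores each order by the sum of adjacent recombination fractions (SARF), a simpler criterion chosen only to keep the example short. Marker names and pairwise values are hypothetical.

```python
import itertools
import math


def count_orders(n_markers):
    """Number of distinct orders when an order and its mirror image count once."""
    return math.factorial(n_markers) // 2


def best_order_by_sarf(markers, rf):
    """Exhaustively search all orders; return the one with the smallest SARF.

    rf maps frozenset({m1, m2}) to the pairwise recombination fraction.
    """
    best_order, best_score = None, math.inf
    for perm in itertools.permutations(markers):
        if perm[0] > perm[-1]:
            continue  # skip the mirror image of an order already scored
        score = sum(rf[frozenset(pair)] for pair in zip(perm, perm[1:]))
        if score < best_score:
            best_order, best_score = perm, score
    return best_order, best_score


rf = {
    frozenset(("M1", "M2")): 0.09,
    frozenset(("M2", "M3")): 0.13,
    frozenset(("M3", "M4")): 0.05,
    frozenset(("M1", "M3")): 0.20,
    frozenset(("M2", "M4")): 0.17,
    frozenset(("M1", "M4")): 0.23,
}
order, sarf = best_order_by_sarf(["M3", "M1", "M4", "M2"], rf)
print(order, round(sarf, 2))
print([count_orders(n) for n in (5, 7, 10)])
```

Five markers give 60 orders and seven give 2,520, but ten already give 1,814,400, each of which would need its own likelihood evaluation. This is why exhaustive comparison is limited to small marker sets.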
iv. Locus ordering:
- therefore, one has to resort to faster algorithms. For example, MapMaker’s
“ORDER” command:
a. identifies the most informative subset of markers (default 5 markers)
b. performs an exhaustive order search on that subset (akin to COMPARE) and finds one order
c. tries to add the remaining markers individually (at default RF = 0.5 and LOD =
3.0)
d. drops the LOD threshold to 2.0 and tries the remaining ones
e. reports any markers that still cannot be assigned a particular position
f. such markers can be tried manually with the “TRY” command and dropped if placement fails
Source: Meksem and Kahl, The Handbook of Plant Genome Mapping
v. Multipoint distance estimation:
- MapMaker uses MAP command for multipoint estimates (not two-point
estimates)
- it employs the EM (expectation-maximization) algorithm, in which
mutually dependent unknown parameters are alternately updated to converge
to a maximum
- E step: an initial (two-point) estimate of r (θold = θ1, θ2, …, θl-1, where l is
the number of loci) is used to compute the expected number of recombinant types
for each interval
- M step: using the new expected values, the MLE θnew is computed
- the E and M steps are iterated until θnew ≈ θold (the likelihood converges to a maximum)
- map distances are calculated using a mapping function (default:
Haldane)
Source: Ben Hui Liu, Statistical Genomics
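The E and M steps can be made concrete with a deliberately reduced case: a single interval between two codominant markers in a coupling-phase F2, where only the double heterozygote class is ambiguous. This is a two-point simplification for illustration, not MapMaker's multipoint implementation. The counts are hypothetical and the function name is an assumption.

```python
def em_rf_f2(counts, r_init=0.25, tol=1e-10, max_iter=1000):
    """EM estimate of r from nine-class F2 counts (coupling phase, codominant).

    Each individual carries two gametes. Classes such as AABb contain exactly
    one recombinant gamete, AAbb and aaBB contain two, and AaBb contains either
    zero or two, which is the missing information handled by the E step.
    """
    n_total = sum(counts.values())
    one_rec = sum(counts.get(g, 0) for g in ("AABb", "AaBB", "Aabb", "aaBb"))
    two_rec = counts.get("AAbb", 0) + counts.get("aaBB", 0)
    double_het = counts.get("AaBb", 0)

    r = r_init
    for _ in range(max_iter):
        # E step: probability that a double heterozygote carries two
        # recombinant gametes, given the current value of r
        w = r * r / ((1.0 - r) ** 2 + r * r)
        expected_recombinant = one_rec + 2 * two_rec + 2 * double_het * w
        # M step: recombinant gametes divided by the total number of gametes
        r_new = expected_recombinant / (2 * n_total)
        if abs(r_new - r) < tol:
            return r_new
        r = r_new
    return r


counts = {
    "AABB": 81, "AABb": 18, "AAbb": 1,
    "AaBB": 18, "AaBb": 164, "Aabb": 18,
    "aaBB": 1, "aaBb": 18, "aabb": 81,
}
r_em = em_rf_f2(counts)
print(round(r_em, 4))
```

In the multipoint setting the same alternation is applied to all intervals at once, with the expected recombinant counts for each interval computed from the whole set of marker genotypes.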
Revisiting tomato genetic linkage maps:
-Example:
Tomato: (Sim et al. 2012)
Fig. 26a and 26b
- 7,666 SNPs
Fig. 26a. SNP
distribution
Fig. 26b. Two tomato linkage maps
compared to draft genome
assembly
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0040563
2.4. Mapping polyploid genomes
- Allopolyploids show disomic segregation.
Hence, linkage mapping in allopolyploids is
similar to diploid linkage mapping
- Autopolyploids (e.g., potato, sugarcane)
show polysomic segregation (Fig. 27a).
Hence, linkage mapping in autopolyploids
employs different mapping techniques
- For example, single dose markers (SDMs)
segregating in a 1:1 ratio (Fig. 27b) are used in the
pseudo-testcross mapping strategy
- Also, biparental and double-dose markers
can be integrated using TetraploidMap
software
Fig. 27a. Single locus segregation
Fig. 27b. Segregation of a SDM in an autotetraploid: Aaaa X aaaa gives 1/2 Aaaa : 1/2 aaaa
- Example TetraploidMap: four homologous chromosomes and a consensus
map
Source: TetraploidMap manual
Linkage Mapping
Summary
i. Genetic linkage maps were originally built to map phenotypic
mutants
ii. Modern linkage maps use molecular markers (predominantly, DNA
markers)
iii. Different types of mapping populations are used
iv. Mapping studies in diploids and allopolyploids use similar tools and
techniques
v. Linkage maps in autopolyploids necessitate different mapping
strategies
vi. Linkage maps are useful for
- tagging markers along chromosomes
- identifying markers linked to genes and cloning genes
- identifying quantitative trait loci for traits of interest
- marker assisted selection
- comparative mapping and evolutionary studies
Lecture Outline: QTL Analysis
3. QTL mapping: models and methods
3.1. Single QTL model
3.1.1. Single marker analysis (SMA)
- t-tests, ANOVA, linear regression
3.1.2. Simple interval mapping (SIM)
3.2. Multiple QTL model
3.2.1. Multiple regression
3.2.2. Composite interval mapping (CIM)
3.3. QTL mapping in polyploid genomes
3. QTL Mapping: Models and Methods
3.1. Single QTL model
- Assessing marker-trait associations at individual marker locus
- gene effects for single QTL model:
Backcross: g = 0.5 (µ1 - µ2), where
µ1 = mean of the homozygotes
µ2 = mean of the heterozygotes
F2: additive (α) = 0.5 (µ1 - µ3) and
dominance (d) = 0.5 (2µ2 - µ1 - µ3), where
µ1 = mean of the homozygotes for parent A alleles,
µ2 = mean of the heterozygotes and
µ3 = mean of the homozygotes for parent B alleles
- Employs single marker analysis (SMA) techniques
Source: Ben Hui Liu, Statistical Genomics
3.1.1. Single marker analysis (SMA)
- based on linear model:
yj = µ + f (markerj) + ɛj, where
yj is trait value of the jth individual in the population
µ is population mean
f (markerj) is a function of marker genotype
ɛj is the residual associated with the jth individual
Different methods:
a. marker genotypes treated as classification variable
- for a backcross (2 genotypes): use t-test
- for F2 population (up to 3 genotypes): use ANOVA
b. marker genotypes treated as dummy variables
- use marker-trait regression
c. likelihood ratio test and maximum likelihood estimation
Source: Ben Hui Liu, Statistical Genomics
3.1.1. SMA
- t-test and ANOVA
Steps (given alleles A and a at a marker locus):
a. sort marker genotype classes into groups
- “AA” and “Aa” in backcross; “AA”, “Aa”, and
“aa” in F2
b. test for a significant difference in means
- t statistic (in backcross), F statistic (in F2)
- Linear regression approach
BC: yj = β0 + β1xj + ɛj, where
yj is the trait value for the jth individual in
the population, xj is the dummy variable
taking 1 if the individual is AA and -1 if it is
Aa. β0 is the intercept for the regression
which is the overall mean for the trait. β1
is the slope for the regression line and ɛj
is the random error.
F2: yj = β0 + β1x1j + β2x2j + ɛj, where
yj is the trait value for the jth individual in the population,
x1j is the dummy variable for the marker additive effect
taking 1, 0, and -1 for marker genotypes AA, Aa and aa,
respectively. x2j is the dummy variable for the marker
dominance effect taking 0, 1, and 0 for marker genotypes
AA, Aa and aa, respectively. β0 is the intercept for the regression
which is the overall mean for the trait. β1 and β2 are the
slopes for the additive and dominance regression lines,
respectively. ɛj is the random error.
Source: Ben Hui Liu, Statistical Genomics
Fig. 27. One way analysis
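For the backcross case, the marker-trait regression with the 1 / -1 dummy coding can be sketched with plain least squares. The data below are hypothetical, and the function name and coding dictionary are illustrative assumptions rather than the interface of any QTL package.

```python
import math
from statistics import mean

# Dummy coding for a backcross marker: AA -> 1, Aa -> -1
CODES = {"AA": 1.0, "Aa": -1.0}


def backcross_marker_regression(genotypes, trait):
    """Least-squares fit of y = b0 + b1*x; returns (b0, b1, t statistic for b1)."""
    x = [CODES[g] for g in genotypes]
    n = len(x)
    x_bar, y_bar = mean(x), mean(trait)
    sxx = sum((xi - x_bar) ** 2 for xi in x)
    sxy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, trait))
    b1 = sxy / sxx
    b0 = y_bar - b1 * x_bar
    # Residual variance with n - 2 degrees of freedom
    rss = sum((yi - b0 - b1 * xi) ** 2 for xi, yi in zip(x, trait))
    se_b1 = math.sqrt(rss / (n - 2) / sxx)
    return b0, b1, b1 / se_b1


genotypes = ["AA", "AA", "AA", "AA", "Aa", "Aa", "Aa", "Aa"]
trait = [12.1, 11.8, 12.5, 12.0, 10.2, 9.9, 10.4, 10.1]  # hypothetical values
b0, b1, t_stat = backcross_marker_regression(genotypes, trait)
print(round(b0, 3), round(b1, 3), round(t_stat, 1))
```

With this coding the slope b1 equals half the difference between the two genotype class means, and the t statistic for b1 is the same as the pooled two-sample t statistic, so the t-test and regression approaches agree for a backcross.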
3.1.1. SMA
Advantages
1. Conceptually and computationally simple
2. Genetic linkage map information not needed
3. Easily incorporates covariates
4. Informative when markers sufficiently cover the genome
5. Can be extended to multiple regression for a multiple QTL model
Limitations
1. Location and effects of detected QTLs are confounded: a larger apparent
QTL effect could arise because the marker is close to a QTL, or because the
marker is farther from a QTL that contributes much more to the trait
2. QTL position cannot be precisely detected
3. Power to detect QTL is low when marker density is low
4. Multiple comparisons increase false positives
5. Missing genotypes are totally excluded from analysis
6. Limited ability to separate linked QTLs and no ability to assess interacting QTLs
3.1.1. SMA
Software tools
Basic statistical analysis platforms:
Excel
JMP
SAS
R etc.
QTL mapping platforms:
WinQTLCartographer
R/qtl
JoinMap
MapMaker/QTL etc.
Windows QTL Cartographer:
SMA fits the data to the simple linear
regression model
y = b0 + b1 x + e
Results reported include b0, b1 and the F statistic
for each marker
The F statistic tests the hypotheses
H0: b1 = 0; H1: b1 ≠ 0
The pr(F) is a measure of how much support there
is for H0
A smaller pr(F) indicates less support for H0 and
thus more support for H1
The likelihood ratio test statistic compares two nested
hypotheses H0 and H1 with likelihoods L0 and L1,
respectively. The likelihood ratio test
statistic is: -2ln(L0/L1)
3.1.2. Simple interval mapping (SIM)
- “Mapping Mendelian Factors Underlying Quantitative Traits
Using RFLP Linkage Maps” (Lander and Botstein 1989)
- Concept:
Based on joint segregation of a pair of adjacent markers and a
putative QTL within the interval flanked by the marker pair (Fig.
28)
Methods:
a. Likelihood approach (preferred over regression)
b. Regression approach (faster computation than ML)
Source: Ben Hui Liu, Statistical Genomics
Fig. 28. Linkage relationship of a QTL and two
flanking markers
3.1.2. SIM
Likelihood approach (employed in WinQTLCart):
Source: Course notes, QTL mapping and Discovery
Terms of the likelihood:
- The density function for the normal distribution with mean μQk and variance σ2. There are k = 1 to N genotypes.
- Probability of the QTL genotype, given the jth genotypes of the flanking markers
- Likelihood of phenotypic value z, given the jth genotypes of the flanking markers
- MLE under the reduced model of no QTL: μQQ = μQq = μqq
- MLE under the full model including a QTL
- LOD score (log10 of the odds ratio), or the likelihood ratio (LR), where LR = 4.6 LOD
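The factor 4.6 comes from the change of logarithm base: LR = -2 ln(L0/L1) and LOD = log10(L1/L0), so LR = 2 ln(10) LOD, which is approximately 4.6 LOD. A minimal sketch of the conversion follows; the log-likelihood values are hypothetical and the function names are illustrative.

```python
import math

LR_PER_LOD = 2.0 * math.log(10.0)  # about 4.605


def likelihood_ratio(loglik_h0, loglik_h1):
    """LR statistic -2 ln(L0/L1) from natural-log likelihoods."""
    return -2.0 * (loglik_h0 - loglik_h1)


def lod_from_lr(lr):
    """Convert a likelihood ratio statistic to a LOD score."""
    return lr / LR_PER_LOD


def lr_from_lod(lod):
    """Convert a LOD score to a likelihood ratio statistic."""
    return lod * LR_PER_LOD


# Hypothetical log-likelihoods for the no-QTL and QTL models at one position
lr = likelihood_ratio(loglik_h0=-250.0, loglik_h1=-243.1)
lod = lod_from_lr(lr)
print(round(lr, 1), round(lod, 2))
```

A LOD threshold of 3 therefore corresponds to an LR statistic of about 13.8.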
3.1.2. SIM
Advantages
1. Conceptually and computationally simple
2. Genetic linkage map information not needed
3. Easily incorporates covariates
4. Informative when markers sufficiently cover the genome
5. Can be extended to multiple regression for multiple QTL model
Limitations
1. Location and effects of detected QTLs are confounded: a larger QTL effect
could be because the marker is close to a QTL, or because it is farther from
the QTL but the QTL contributes much more to the trait
2. QTL positions cannot be precisely detected
3. Power to detect QTL is low when marker density is low
4. Multiple comparisons increase false positives
5. Missing genotypes are totally excluded from analysis
3.2. Composite interval mapping (CIM)
Source: Course notes, QTL mapping and Discovery
Figure labels: test interval (between left marker and right marker); blocked region (cofactors)
CIM is a combination of IM and multiple regression (multiple QTL model)
- Fits both the effects of a QTL as well as the effects of covariates (subset of
selected genetic markers)
- CIM adds background loci to simple interval mapping (IM).
- It fits parameters for a target QTL in one interval while simultaneously fitting
partial regression coefficients for "background markers" to account for
variance caused by non-target QTL.
- Background markers are usually 20-40 cM apart
3.2. CIM
The general CIM statistical model is written in terms of:
- Phenotypic trait value of subject i
- Overall mean
- Row vector of predictor variables corresponding to the effects of the putative QTL (Zi1α: additive effect; Zi1d: dominance effect)
- Row vector of predictor variables corresponding to the rth cofactor marker
- Column vector with the coefficient of the rth cofactor marker
- Residual distributed as N(0, δ2)
3.2. CIM
Set of statistical models evaluated in the CIM analysis
(WinQTLCartographer):
- For backcross, recombinant inbred lines, and doubled haploids,
only Model 0 and Model 1 are generated and tested
- For F2 design, all four models are generated and tested
Comparison of SMA, SIM and CIM
Much more precise location
http://solcap.msu.edu/pdf%20files/5PAA_Douches_2_Mapping_Populations.pdf
3.3. QTL mapping in polyploid genomes
- Generally, QTL mapping in allopolyploid genomes is the same as in
diploids
- However, QTL mapping in autopolyploid genomes requires
different strategies
- Example:
QTL mapping in
autotetraploids using
TetraploidMap
3.3. QTL mapping in polyploid genomes
Summary
- Single marker analysis (SMA) involves a t-test, ANOVA, or linear
regression approach
- Interval mapping is based on joint segregation of a pair of
adjacent markers and a putative QTL between them
- CIM is a combination of IM and multiple regression and is the most
desirable among the three
- QTL mapping in autopolyploids requires different analytical
strategies
Linkage mapping and QTL analysis_Lecture

More Related Content

What's hot

Mapping and QTL
Mapping and QTLMapping and QTL
Mapping and QTL
FAO
 
CYTOPLASMIC INHERITANCE
CYTOPLASMIC INHERITANCECYTOPLASMIC INHERITANCE
CYTOPLASMIC INHERITANCE
Ankit Kumar Dubey
 
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...Mapping and Applications of Linkage Disequilibrium and Association Mapping in...
Mapping and Applications of Linkage Disequilibrium and Association Mapping in...
FAO
 
QTL
QTLQTL
Mapping
MappingMapping
Population genetics
Population geneticsPopulation genetics
Population genetics
Jwalit93
 
Association mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mappingAssociation mapping, GWAS, Mapping, natural population mapping
Association mapping, GWAS, Mapping, natural population mapping
Mahesh Biradar
 
genetic linkage and gene mapping
genetic linkage and gene mappinggenetic linkage and gene mapping
genetic linkage and gene mapping
Mahammed Faizan
 
Quantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breedingQuantitative trait loci (QTL) analysis and its applications in plant breeding
Quantitative trait loci (QTL) analysis and its applications in plant breeding
PGS
 
Gene Mapping Methods:Linkage Maps & Mapping with Molecular Markers
Gene  Mapping  Methods:Linkage Maps & Mapping with Molecular MarkersGene  Mapping  Methods:Linkage Maps & Mapping with Molecular Markers
Gene Mapping Methods:Linkage Maps & Mapping with Molecular Markers
Assam University, Silchar, Assam, India
 
Association mapping
Association mappingAssociation mapping
Association mapping
Senthil Natesan
 
monosomics and their role in cytogenetics
monosomics and their role in cytogeneticsmonosomics and their role in cytogenetics
monosomics and their role in cytogenetics
SANJAY KUMAR SANADYA
 
Intervarietal chromosomal substitution
Intervarietal chromosomal  substitutionIntervarietal chromosomal  substitution
Intervarietal chromosomal substitution
Kartik Madankar
 
Chromosomal aberrations, utilization of aneuploids, chimeras and role of allo...
Chromosomal aberrations, utilization of aneuploids, chimeras and role of allo...Chromosomal aberrations, utilization of aneuploids, chimeras and role of allo...
Chromosomal aberrations, utilization of aneuploids, chimeras and role of allo...
GauravRajSinhVaghela
 
TILLING & ECO-TILLING
TILLING & ECO-TILLINGTILLING & ECO-TILLING
TILLING & ECO-TILLING
Rachana Bagudam
 
Self incompatability in plants,pseudoalleles and isoalleles
Self incompatability in plants,pseudoalleles and isoallelesSelf incompatability in plants,pseudoalleles and isoalleles
Self incompatability in plants,pseudoalleles and isoalleles
Kanimoli Mathivathana
 
Hardy weinberg law
Hardy  weinberg lawHardy  weinberg law
Hardy weinberg law
Kanimoli Mathivathana
 
Numerical changes in chromosome
Numerical changes in chromosomeNumerical changes in chromosome
Numerical changes in chromosome
Jaleelkabdul Jaleel
 
Genome wide association studies seminar
Genome wide association studies seminarGenome wide association studies seminar
Genome wide association studies seminarVarsha Gayatonde
 

What's hot (20)

Mapping and QTL
Mapping and QTLMapping and QTL
Mapping and QTL
 
CYTOPLASMIC INHERITANCE
Linkage mapping and QTL analysis_Lecture

  • 2. Concepts: Linkage and Linkage Mapping
    Linkage (of genes): “the association of genes that results from their being on the same chromosome (i.e., physically associated)”. For example, genes A and B on Chr1; gene C on Chr2 is unlinked to both (Fig. 1a).
    Linkage group: “all genes in one chromosome form one linkage group”. For example, Chr1 and Chr2 are two different linkage groups (Fig. 1a).
    Linked (genes): “a pair of linked genes (specifically, their alleles) tend to be transmitted together during meiotic cycle and progenies deviate from Mendelian ratios depending upon recombination fraction (r) between the two genes”. For example, genes A and B in Fig. 1b.
    Fig. 1a. A and B linked (Chr1); C unlinked to A and B (Chr2)
    Fig. 1b. Test cross frequencies (AABB × aabb gives F1 AaBb; F1 AaBb × aabb)
        Progeny   Unlinked   Linked
        AaBb      1/4        (1-r)/2
        Aabb      1/4        r/2
        aaBb      1/4        r/2
        aabb      1/4        (1-r)/2
    Source: R.H.J. Schlegel, Encyclopedic Dictionary of Plant Breeding
  • 3. Concepts: Linkage and Linkage Mapping
    Linkage map:
    - “is a map of the frequencies of recombination that occur between markers on homologous chromosomes during meiosis.”
    - distance is measured in centimorgans (cM)
    Physical map:
    - “shows the physical locations of genes and other DNA sequences of interest.”
    - distance is measured in base pairs
    Comparative map:
    - a map that compares linkage maps or physical maps of related species based on shared markers or sequences, respectively (Fig. 2)
    Fig. 2. Test cross frequencies
    Source: Fig. 2 - www.pnas.org/content/102/37/13206/F3.expansion.html
  • 4. Concepts: QTL Analysis
    Qualitative traits:
    1. Monogenic or oligogenic
    2. Discrete phenotypic classes (nominal scale)
    3. Typically, environmental effect on trait expression is absent or low
    4. Discontinuous variation (Fig. 3)
    5. Genes have large effects
    6. Mapped as visible markers (i.e., linkage mapping)
    Quantitative traits:
    1. Polygenic (quantitative trait loci)
    2. Continuum of measures (interval scale)
    3. Trait expression may show profound environmental effect
    4. Continuous variation (Fig. 4)
    5. Genes have smaller effects
    6. Mapping requires QTL analysis
    Fig. 3. Discrete trait. Source: cubocube.com
    Fig. 4. Fruit shape: a quantitative trait. Source: www.nature.com
  • 5. Lecture Outline: Linkage Mapping
    1. A peek into the history of linkage mapping
       1.1. Mendel’s work: rediscovery, validation and exceptions
       1.2. Early genetic linkage maps
            - natural mutants as genetic markers
            - two-point and three-point linkage analysis
       1.3. Mapping functions
    2. Molecular era and revolution in genetic linkage mapping
       2.1. Molecular markers
            - isozymes, RFLPs, SSRs and SNPs
       2.2. Mapping populations in plants
            - F2, RILs, BC
       2.3. Methods and tools for linkage mapping in plants
            - maximum likelihood, LOD support, multipoint linkage mapping
       2.4. Mapping polyploid genomes and outcrossing species
  • 6. 1. A peek into the history of linkage mapping
    1.1. Mendel’s work: rediscovery, validation and exceptions
    - Experiments in Plant Hybridization (1865). Crosses between natural mutants (Fig. 5)
    - Rediscovered in 1900
    - Laws of segregation (Fig. 6) and independent assortment (Fig. 7)
    - Wide validity in diverse organisms for unlinked qualitative traits
    Fig. 5. Mendel’s traits. Source: www.nature.com
    Fig. 6. Monohybrid cross. Source: www.desktopclass.com
    Fig. 7. Punnett square. Source: sites.saschina.org
  • 7. 1. A peek into the history of linkage mapping
    1.1. Mendel’s work: rediscovery, validation and exceptions
    - Bateson and Punnett (1904)
    - Deviation from Mendelian inheritance (Fig. 8)
    Timeline:
    - 1865: Gregor Mendel proposed the basic laws of inheritance
    - 1900: H. de Vries, E. von Tschermak and C. Correns rediscovered Mendel’s work
    - 1902: Boveri and Sutton proposed the chromosome theory of inheritance
    - 1904: Bateson and Punnett described linkage
    Fig. 8. Source: www.cas.miamioh.edu
  • 8. 1. A peek into the history of linkage mapping
    1.2. Early genetic linkage maps
    - 1900–1910: concepts of gene, allele, genotype, phenotype, homozygote, heterozygote
    Thomas Hunt Morgan:
    i. studied Drosophila genetics
    ii. genes responsible for discrete phenotypic differences are located on chromosomes
    iii. the likelihood of co-transmission and of reshuffling (due to recombination) depends on the linkage between genes (Fig. 9)
    iv. linkages can be quantified (i.e., linkage mapping is a possibility)
    Fig. 9. An illustration of Morgan’s study in Drosophila
    Source: Fig. 9 - http://bio.vtn2.com/bio-home/harvey/lect/images/morgan15.4.gif
  • 9. 1. A peek into the history of linkage mapping
    1.2. Early genetic linkage maps
    Quantifying genetic linkages:
    - mostly dihybrid test crosses and F2 populations (Fig. 10)
    - segregating for wild-type (+) and mutant (-) alleles
    - sex-linked genes (X-linked)
    First genetic linkage map of Sturtevant (Morgan’s student):
    - Series of dihybrid crosses. Example: Fig. 10
    - RF (%) = (number of recombinant types × 100) / total
    - Map distance between the body color and eye color genes = recombination frequency, RF (%) = [(0 + 2) / 373] × 100 ≈ 0.5
    Fig. 10. An illustration of a dihybrid cross, based on Sturtevant (1913), showing parental and recombinant types for the (+) and (-) alleles
    Source: Fig. 10 - http://www.esp.org/foundations/genetics/classical/holdings/s/ahs-13.pdf
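As a minimal sketch of the RF arithmetic above, the following Python snippet recomputes the body color and eye color example (2 recombinants out of 373 progeny). The function name `recombination_frequency` is an illustrative choice, not something from the lecture.

```python
def recombination_frequency(recombinants, total):
    """Return the recombination frequency (RF) as a percentage."""
    if total <= 0 or not 0 <= recombinants <= total:
        raise ValueError("need 0 <= recombinants <= total and total > 0")
    return 100.0 * recombinants / total

# Body color vs. eye color: (0 + 2) recombinants among 373 progeny.
rf_body_eye = recombination_frequency(0 + 2, 373)
print(f"RF = {rf_body_eye:.1f} %")
```

With so few recombinants the estimate rests on only two observed events, so a larger progeny is needed to pin down short distances precisely.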
  • 10. 1. A peek into the history of linkage mapping
    1.2. Early genetic linkage maps
    First genetic linkage map of Sturtevant (Morgan’s student) (Fig. 11):
    - a series of two-point recombination frequencies (%) between 6 genes (Fig. 12). Here, 19 different populations
    - started marker order from closest linkages and manually added other loci
    Fig. 11. First genetic linkage map. Sturtevant (1913)
    Fig. 12. Sturtevant table of RF (%)
        Factors concerned   Proportion of crossovers   % of crossovers
        BCO                 193 / 16278                 1.2
        BO                  2 / 373                     0.5
        BP                  1464 / 4551                32.2
        BR                  115 / 324                  35.5
        BM                  260 / 693                  37.5
        COP                 224 / 748                  29.9
        COR                 1643 / 4749                34.6
        COM                 76 / 161                   47.2
        OP                  247 / 836                  29.5
        OR                  183 / 538                  34.0
        OM                  218 / 404                  54.0
        CR                  236 / 829                  28.5
        CM                  112 / 333                  33.6
        B(C,O)              214 / 21736                 1.0
        (C,O)P              471 / 1584                 29.7
        (C,O)R              2062 / 6116                33.7
        (C,O)M              406 / 898                  45.2
        PR                  17 / 573                    3.0
        PM                  109 / 405                  26.9
    Source: Fig. 11, Fig. 12 - www.nature.com/scitable/content/The-linear-arrangement-of-six-sex-linked-16655
  • 11. 1. A peek into the history of linkage mapping
    1.2. Early genetic linkage maps
    Limitations of two-point linkage analysis
    - Consider two genes that are far enough apart that two crossovers (XOs) occasionally occur between them. The two crossovers may involve:
      i. the same two nonsister chromatids for both (Fig. 13)
      ii. different nonsister chromatids for both (Fig. 14)
    - Result: either underestimation or overestimation of RF
    Fig. 13. Double crossover (same chromatids). Gametes: AB, AB, ab, ab
    Fig. 14. Double crossover (different chromatids). Gametes: Ab, Ab, aB, aB
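The effect of undetected double crossovers shows up in Sturtevant's own numbers (Fig. 12). The sketch below sums the short adjacent intervals and compares the result with the direct two-point RF from B. The gene order B, (C,O), P, R, M is an assumption taken from Sturtevant's published map rather than from the slide text, and the key "CO" is shorthand for the combined (C,O) locus.

```python
# Two-point RF values (%) copied from Sturtevant's table (Fig. 12).
rf = {
    ("B", "CO"): 1.0, ("CO", "P"): 29.7, ("P", "R"): 3.0, ("P", "M"): 26.9,
    ("B", "P"): 32.2, ("B", "R"): 35.5, ("B", "M"): 37.5,
}

# Assumed order B, (C,O), P, R, M: build map positions from short intervals.
position = {"B": 0.0}
position["CO"] = position["B"] + rf[("B", "CO")]
position["P"] = position["CO"] + rf[("CO", "P")]
position["R"] = position["P"] + rf[("P", "R")]
position["M"] = position["P"] + rf[("P", "M")]

# Compare summed intervals with the direct two-point estimate from B.
for gene in ("P", "R", "M"):
    summed = position[gene] - position["B"]
    direct = rf[("B", gene)]
    print(f"B-{gene}: summed intervals = {summed:.1f}, direct RF = {direct:.1f}")
```

For the most distant pair, B and M, the direct RF (37.5%) falls well short of the summed intervals. This is the underestimation expected when double crossovers between distant genes go undetected.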
  • 12. 1. A peek into the history of linkage mapping
    1.2. Early genetic linkage maps
    The three-point test cross
    - uses trihybrid crosses
    - more efficient; detects double crossovers (2 XOs)
    - allows calculation of XO interference
    Example (Fig. 15):
    i. First, test linkage. Here, the genes are linked.
    ii. The most frequent classes are the parental types.
    iii. The remaining classes are four single crossovers (SCOs) and two double crossovers (DCOs).
    Cross: triple heterozygote (X- Z+ Y+ / X+ Z- Y-) × triple homozygote (X- Z- Y- / X- Z- Y-)
    Fig. 15. Three-point test cross frequencies
        Offspring phenotype   No. of individuals   Parental/Recombinant   XO type
        X+ Y- Z+                1                  Recombinant            DCO
        X- Y+ Z+              440                  Parental
        X- Y- Z+               26                  Recombinant            SCO #1
        X- Y- Z-               61                  Recombinant            SCO #2
        X+ Y+ Z-               32                  Recombinant            SCO #1
        X+ Y- Z-              442                  Parental
        X+ Y+ Z+               58                  Recombinant            SCO #2
        X- Y+ Z-                2                  Recombinant            DCO
        Total                1062
  • 13. 1. A peek into the history of linkage mapping 1.2. Early genetic linkage maps
    Example (Fig. 16) continued:
      iv. Compare either parental type to the double XO types; the gene whose allele differs is the middle one
      v. Conclusion: gene Z is in the center
      vi. Map distance (X-Z) = [SCO (X-Z) + DCOs] * 100 / total
      vii. Coefficient of coincidence (C) = observed DCO freq. / expected DCO freq., where expected DCO freq. = (X-Z recombination freq. * Z-Y recombination freq.)
      viii. Interference = 1 - C
    Fig. 16. Three-point test cross frequencies (same cross and offspring counts as Fig. 15), comparing each parental type (P) with a DCO type; D = different allele, S = same allele
      Parental (P)   DCO        X  Y  Z
      X- Y+ Z+       X+ Y- Z+   D  D  S
      X+ Y- Z-       X+ Y- Z+   S  S  D
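The arithmetic of this example can be checked with a short script. This is only a sketch using the offspring counts from the table; the variable and function names are illustrative, and the expected DCO frequency is taken here as the product of the two interval recombination fractions (each including the DCO classes), which is an assumption about how the slide's formula is meant to be applied.

```python
# Offspring counts from the three-point test cross table (Fig. 15).
# Each key lists the alleles at loci X, Y, Z in that order.
counts = {
    ("X+", "Y-", "Z+"): 1,
    ("X-", "Y+", "Z+"): 440,
    ("X-", "Y-", "Z+"): 26,
    ("X-", "Y-", "Z-"): 61,
    ("X+", "Y+", "Z-"): 32,
    ("X+", "Y-", "Z-"): 442,
    ("X+", "Y+", "Z+"): 58,
    ("X-", "Y+", "Z-"): 2,
}
LOCI = ("X", "Y", "Z")
total = sum(counts.values())

# The two most frequent classes are parental, the two rarest are DCOs.
ranked = sorted(counts, key=counts.get, reverse=True)
parentals, dcos = ranked[:2], ranked[-2:]

def middle_locus():
    """Return the locus that alone differs between a parental and a DCO class."""
    for dco in dcos:
        differing = [LOCI[i] for i in range(3) if parentals[0][i] != dco[i]]
        if len(differing) == 1:
            return differing[0]
    raise ValueError("no DCO class differs from the parental at exactly one locus")

def recombination_fraction(locus_a, locus_b):
    """Fraction of offspring carrying a non-parental allele pair at two loci."""
    i, j = LOCI.index(locus_a), LOCI.index(locus_b)
    parental_pairs = {(p[i], p[j]) for p in parentals}
    recombinants = sum(n for g, n in counts.items()
                       if (g[i], g[j]) not in parental_pairs)
    return recombinants / total

middle = middle_locus()                      # Z for these data
rf_xz = recombination_fraction("X", "Z")     # SCO (X-Z) + DCO classes
rf_zy = recombination_fraction("Z", "Y")     # SCO (Z-Y) + DCO classes
observed_dco = sum(counts[g] for g in dcos) / total
expected_dco = rf_xz * rf_zy
coincidence = observed_dco / expected_dco
interference = 1 - coincidence

print(f"middle locus: {middle}")
print(f"X-Z: {100 * rf_xz:.2f} cM, Z-Y: {100 * rf_zy:.2f} cM")
print(f"C = {coincidence:.3f}, interference = {interference:.3f}")
```

With these counts the X-Z interval comes out at roughly 11.5 cM and the Z-Y interval at roughly 5.7 cM, with positive interference (C below 1).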
  • 14. 1. A peek into the history of linkage mapping 1.3. Mapping functions
    - “for more than three loci, relationship among possible recombination fractions is complex”
    - “RFs between loci flanking a region are not simple sum of recombination fractions for adjacent loci within the region”
    - “conversion of recombination fractions to additive map distances requires mapping functions” (Fig. 17):
      i. Haldane
      ii. Kosambi
    Fig. 17. Table: Haldane and Kosambi mapping functions. Chart: comparison of mapping functions. “r” is recombination fraction and “d” is map distance.
    Source: Ben Hui Liu, Statistical Genomics; Rongling Wu et al., Statistical Genetics of Quantitative Traits
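As the table in Fig. 17 is not reproduced in this transcript, the two mapping functions can be sketched in their standard textbook forms: Haldane, d = -(1/2) ln(1 - 2r), and Kosambi, d = (1/4) ln((1 + 2r) / (1 - 2r)), with d in Morgans. The function names below are illustrative, and distances are returned in cM.

```python
import math

def haldane_cM(r):
    """Haldane map distance in cM (assumes no crossover interference)."""
    if not 0 <= r < 0.5:
        raise ValueError("r must be in [0, 0.5)")
    return -50.0 * math.log(1.0 - 2.0 * r)

def kosambi_cM(r):
    """Kosambi map distance in cM (allows for some interference)."""
    if not 0 <= r < 0.5:
        raise ValueError("r must be in [0, 0.5)")
    return 25.0 * math.log((1.0 + 2.0 * r) / (1.0 - 2.0 * r))

def haldane_r(d_cM):
    """Inverse Haldane: recombination fraction for a distance in cM."""
    return 0.5 * (1.0 - math.exp(-2.0 * d_cM / 100.0))

def kosambi_r(d_cM):
    """Inverse Kosambi: recombination fraction for a distance in cM."""
    return 0.5 * math.tanh(2.0 * d_cM / 100.0)

for r in (0.01, 0.10, 0.20, 0.30, 0.40):
    print(f"r = {r:.2f}  Haldane = {haldane_cM(r):6.2f} cM  "
          f"Kosambi = {kosambi_cM(r):6.2f} cM")
```

For small r both functions give d close to 100r; they diverge as r approaches 0.5, which is why raw recombination fractions are not additive over long intervals.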
  • 15. 1. A peek into the history of linkage mapping
    Summary:
    - Paucity of visible natural markers (phenotypic mutants)
    - Radiation mutants offered additional traits, but lethality and sterility were problems
    - Nevertheless, two-point and three-point linkage maps persisted for several decades (~70 years)
    - Example:
      i. tomato: 258 morphological and physiological markers (Rick 1975)
    Fig. 18. An illustration of a tomato linkage map made in 1952
    Source: Fig. 18 – An Introduction to Genetic Analysis, 5th edition.
  • 16. 2. Molecular era and revolution in genetic linkage mapping 2.1. Molecular markers
    - gel electrophoresis brought isozyme markers into the picture
    - restriction endonuclease and Southern blot techniques brought RFLPs
    - DNA sequencing and PCR brought SSRs and SNPs
    - virtually unlimited number of “visible markers”
    - gaps in genetic linkage maps could be filled
    - comparative mapping, gene cloning, QTL analysis and MAS could be done
    Fig. 19. Classes of molecular markers (panels: RFLP, SSR)
    Source: Fig. 19 - nature.berkeley.edu/brunslab/tour/tour2.html
  • 17. 2. Molecular era and revolution in genetic linkage mapping 2.2. Mapping populations in plants
    - considerations:
      1st: marker polymorphism
        - adequate polymorphic markers between parents
        - contrasting traits of interest
      2nd: reproductive mode
        - if inbreeding is a possibility: F2, recombinant inbred lines (RIL), backcross (BC)
        - mostly outcrossing (or self-incompatible), long generation time: pseudo-testcross, backcross
    Fig. 20a. F2 population
    Fig. 20b. RIL population
    Fig. 20c. BC population
    Fig. 20d. Pseudo-testcross population
    Source: Fig. 20 – K. Meksem and G. Kahl, The Handbook of Plant Genome Mapping
  • 18. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Steps:
      i. Data generation: genotype the mapping population and prepare the input format for mapping
      ii. Calculating recombination fractions (RFs): maximum likelihood estimates of pairwise RFs
      iii. Locus grouping: grouping markers into prospective linkage groups based on two thresholds, a maximum recombination fraction and a minimum LOD score (level of support for linkage)
      iv. Locus ordering: finding the best possible order, the one with the highest multipoint likelihood among the different probable orders
      v. Multilocus distance estimation
  • 19. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Detailed procedural discourse on MapMaker
    i. Data generation: MapMaker input file format (Fig. 21)
    Fig. 21. MapMaker input format, with its labeled parts:
    - Type of cross: F2 intercross, F2 backcross, F3 self, RI self, RI sib
    - Population size
    - Number of markers
    - Marker names
    - Genotype scores. Default symbols are:
        A : homozygous for parent A
        H : heterozygous
        B : homozygous for parent B
        C : not homozygous for parent A
        D : not homozygous for parent B
        - : missing
  • 20. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Detailed procedural discourse on MapMaker
    ii. Calculating recombination fractions (RFs): in backcross mating design (BC1)
    - progenies can be distinctly categorized as parental or recombinant (Fig. 22a)
    - the recombination fraction is simply the frequency of the recombinant types (Fig. 22b)
    Fig. 22a. Freq. of gametes in BC mating
    Fig. 22b. RF estimation is plain and simple for a backcross mating design
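For a backcross, the estimate and its support can be computed directly from counts. The sketch below assumes the usual two-point LOD, log10 of the likelihood at the estimated r over the likelihood at r = 0.5 (no linkage); the function name and the example counts are hypothetical.

```python
import math

def backcross_rf_lod(n_recombinant, n_total):
    """Return (r_hat, LOD) for a two-point backcross comparison."""
    r_hat = n_recombinant / n_total

    def log10_term(count, prob):
        # Treat 0 * log10(0) as 0 so that r_hat = 0 or 1 does not fail.
        return count * math.log10(prob) if count > 0 else 0.0

    # log10 likelihood at the ML estimate (binomial kernel).
    log_l1 = (log10_term(n_recombinant, r_hat)
              + log10_term(n_total - n_recombinant, 1.0 - r_hat))
    # log10 likelihood under free recombination, r = 0.5.
    log_l0 = n_total * math.log10(0.5)
    return r_hat, log_l1 - log_l0

# Hypothetical counts: 10 recombinants among 100 backcross progeny.
r_hat, lod = backcross_rf_lod(10, 100)
print(f"r = {r_hat:.3f}, LOD = {lod:.2f}")
```

An estimate above 0.5 would also give a positive LOD from this formula, so in practice such estimates are usually treated as unlinked.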
  • 21. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Detailed procedural discourse on MapMaker
    ii. Calculating recombination fractions (RFs): in F2 mating design (Fig. 23a)
    - progenies cannot be distinctly categorized. For illustration, the four possible genotypes shown in Fig. 23b belong to the same genotype class A1A2B1B2, but may come from parental gametes (without XO) or recombinant gametes (with XO) in both parents
    Fig. 23a. F2 mating design and F2 genotypes
    Fig. 23b. The counts (in parentheses) and frequencies of the 16 possible genotypes in an F2 family
  • 22. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants Detailed procedural discourse on MapMaker ii. Calculating recombination fractions (RFs): in F2 mating design - 16 possible genotypes coalesce into 9 observable genotypic classes Fig. 24. Frequencies of the nine observed genotypes in an F2 population
  • 23. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Detailed procedural discourse on MapMaker
    ii. Calculating recombination fractions (RFs): in F2 mating design
    - likelihood function for estimating RF (Fig. 25)
    - “Maximum likelihood for r is obtained by setting S(r) = 0 and solving for r”, where S(r) is the score, the derivative of the log-likelihood with respect to r
    - “however, there is no explicit solution for r”
    - different iterative algorithms can be used to solve for r:
      a. Grid search
      b. Newton-Raphson method
    Fig. 25. Likelihood function of r
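A grid search over r can be sketched as follows. The sketch assumes two codominant markers in coupling phase, so that the nine observable genotype classes have the standard expected frequencies written in `f2_class_probs`; the class labels (AABB etc.), the function names, and the counts are all hypothetical and not taken from Fig. 24 or Fig. 25.

```python
import math

def f2_class_probs(r):
    """Expected frequencies of the nine observable F2 genotype classes."""
    p = (1 - r) ** 2 / 4             # AABB, aabb: two parental gametes
    q = r * (1 - r) / 2              # heterozygous at exactly one locus
    s = r ** 2 / 4                   # AAbb, aaBB: two recombinant gametes
    h = ((1 - r) ** 2 + r ** 2) / 2  # AaBb: mixes parental and recombinant origins
    return {"AABB": p, "AABb": q, "AAbb": s,
            "AaBB": q, "AaBb": h, "Aabb": q,
            "aaBB": s, "aaBb": q, "aabb": p}

def log_likelihood(r, counts):
    """Multinomial log-likelihood of r given counts per genotype class."""
    probs = f2_class_probs(r)
    return sum(n * math.log(probs[g]) for g, n in counts.items() if n > 0)

def grid_search_rf(counts, step=0.001):
    """Evaluate the log-likelihood on a grid of r values and keep the best."""
    n_points = int(round(0.5 / step))
    grid = [i * step for i in range(1, n_points)]   # open interval (0, 0.5)
    return max(grid, key=lambda r: log_likelihood(r, counts))

# Hypothetical counts for 200 F2 individuals.
counts = {"AABB": 32, "AABb": 16, "AAbb": 2,
          "AaBB": 16, "AaBb": 68, "Aabb": 16,
          "aaBB": 2, "aaBb": 16, "aabb": 32}
r_hat = grid_search_rf(counts)
print(f"grid-search estimate of r: {r_hat:.3f}")
```

A grid search is simple and cannot diverge, at the cost of evaluating the likelihood at every grid point; Newton-Raphson reaches the same maximum in fewer evaluations but needs derivatives and a reasonable starting value.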
  • 24. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Detailed procedural discourse on MapMaker
    iii. Locus grouping:
    - MapMaker’s “GROUP” command builds preliminary linkage groups based on maximum-likelihood estimates of RF and the corresponding LOD score between marker pairs
    - the maximum allowable RF and minimum LOD score thresholds can be changed manually to track how the grouping structure responds to the thresholds
    - finally, linkage groups are formed by marker associations. For example, if A is linked to B, and B is linked to C, all three belong to one group (remember, the RF and LOD thresholds are there to minimize spurious linkages)
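The transitive rule (A linked to B, B linked to C, so A, B and C share a group) can be illustrated with a small union-find sketch. This is not MapMaker's code; the function name, the threshold values, and the pairwise estimates are hypothetical.

```python
def group_markers(markers, pairwise, max_rf=0.40, min_lod=3.0):
    """Group markers transitively from pairwise (RF, LOD) estimates.

    pairwise maps a (marker_a, marker_b) tuple to (rf, lod).
    The default thresholds are illustrative values only.
    """
    parent = {m: m for m in markers}

    def find(m):
        # Follow parent links to the group representative (path halving).
        while parent[m] != m:
            parent[m] = parent[parent[m]]
            m = parent[m]
        return m

    for (a, b), (rf, lod) in pairwise.items():
        if rf <= max_rf and lod >= min_lod:
            parent[find(a)] = find(b)   # merge the two groups

    groups = {}
    for m in markers:
        groups.setdefault(find(m), []).append(m)
    return sorted(groups.values())

# Hypothetical two-point results: A-C alone fails the LOD threshold,
# but A and C still end up together through B.
pairwise = {
    ("A", "B"): (0.10, 8.0),
    ("B", "C"): (0.15, 6.0),
    ("A", "C"): (0.24, 2.5),
    ("A", "D"): (0.50, 0.0),
    ("B", "D"): (0.49, 0.0),
    ("C", "D"): (0.48, 0.1),
}
print(group_markers(["A", "B", "C", "D"], pairwise))
```

Raising `min_lod` or lowering `max_rf` splits groups apart, which is the behaviour the slide suggests tracking when the thresholds are changed.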
  • 25. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Detailed procedural discourse on MapMaker
    iv. Locus ordering:
    - “ordering is the central problem in linkage mapping, and also the most interesting in the sense that for groups of even modest size there is no sure way to find the best (N!/2) possible order”
    - MapMaker’s “COMPARE” command is exhaustive: it computes the maximum likelihood score for all possible orders and reports a subset of the most likely ones
    - however, ordering more than 5-7 markers with “COMPARE” is not practical (time issue!)
    Source: Meksem and Kahl, The Handbook of Plant Genome Mapping
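The cost of an exhaustive search can be seen in a toy version. The sketch below enumerates all N!/2 distinct orders, but it scores each order by the sum of adjacent recombination fractions rather than by the multipoint likelihood that COMPARE uses; that scoring criterion is a simpler stand-in chosen for brevity, and the marker names and RF values are hypothetical.

```python
import math
from itertools import permutations

def best_order(markers, rf):
    """Exhaustively score every distinct order (mirror images count once)."""
    best, best_score, n_evaluated = None, float("inf"), 0
    for perm in permutations(markers):
        if perm[0] > perm[-1]:
            continue  # skip the mirror image of an order that is scored anyway
        n_evaluated += 1
        # Stand-in criterion: sum of adjacent recombination fractions.
        score = sum(rf[frozenset(pair)] for pair in zip(perm, perm[1:]))
        if score < best_score:
            best, best_score = perm, score
    return best, best_score, n_evaluated

# Hypothetical pairwise recombination fractions for four markers.
rf = {
    frozenset("AB"): 0.10, frozenset("BC"): 0.10, frozenset("CD"): 0.10,
    frozenset("AC"): 0.18, frozenset("BD"): 0.18, frozenset("AD"): 0.24,
}
order, score, n_evaluated = best_order(["C", "A", "D", "B"], rf)
print(order, round(score, 2), n_evaluated)

# Number of distinct orders grows factorially with the number of markers.
for n in (5, 7, 10, 20):
    print(n, math.factorial(n) // 2)
```

Five markers give 60 orders and seven give 2,520, but ten already give 1,814,400, each of which would need its own likelihood maximization in a real exhaustive search.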
  • 26. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Detailed procedural discourse on MapMaker
    iv. Locus ordering:
    - therefore, faster algorithms are needed. For example, MapMaker’s “ORDER” command:
      a. identifies the most informative subset of markers (default 5 markers)
      b. performs an exhaustive order search on that subset (akin to COMPARE) and finds the best order
      c. tries to add the remaining markers individually (at default RF = 0.5 and LOD = 3.0)
      d. drops the LOD threshold to 2.0 and tries the remaining ones again
      e. reports any markers that still cannot be assigned a particular position
      f. such markers can be tried manually with the “TRY” command and dropped if that fails
    Source: Meksem and Kahl, The Handbook of Plant Genome Mapping
  • 27. 2. Molecular era and revolution in genetic linkage mapping 2.3. Methods and tools for linkage mapping in plants
    Detailed procedural discourse on MapMaker
    v. Multipoint distance estimation:
    - MapMaker uses the MAP command for multipoint estimates (not two-point estimates)
    - it employs the EM (expectation-maximization) algorithm, in which mutually dependent unknown parameters are alternately updated until they converge to a maximum
    - E step: an initial (two-point) estimate of r (θold = θ1, θ2, … θl-1, where l is the number of loci) is used to compute the expected number of recombinant types for each interval
    - M step: the MLE θnew is computed from the new expected values
    - the E and M steps are iterated until θnew ≈ θold (the likelihood converges to a maximum)
    - map distances are calculated using different mapping functions (default Haldane)
    Source: Ben Hui Liu, Statistical Genomics
  • 28. 2. Molecular era and revolution in genetic linkage mapping
    Revisiting tomato genetic linkage maps:
    - Example: tomato (Sim et al. 2012), Fig. 26a and 26b
      - 7,666 SNPs
    Fig. 26a. SNP distribution
    Fig. 26b. Two tomato linkage maps compared to draft genome assembly
    Source: http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0040563
  • 29. 2. Molecular era and revolution in genetic linkage mapping 2.4. Mapping polyploid genomes
    - Allopolyploids show disomic segregation. Hence, linkage mapping in allopolyploids is similar to diploid linkage mapping
    - Autopolyploids (e.g., potato, sugarcane, etc.) show polysomic segregation (Fig. 27a). Hence, linkage mapping in autopolyploids employs different mapping techniques
    - For example, single dose markers (SDMs) segregating in a 1:1 ratio (Fig. 27b) are used in the pseudo-testcross mapping strategy
    - Also, biparental and double-dose markers can be integrated using the TetraploidMap software
    Fig. 27a. Single locus segregation (autotetraploid)
    Fig. 27b. Segregation of an SDM: Aaaa X aaaa gives 1/2 Aaaa and 1/2 aaaa
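Whether a marker behaves as a single dose marker can be screened with a goodness-of-fit test against the 1:1 expectation. The slide does not name a test; the chi-square test below is an assumed, commonly used choice, and the function name and counts are hypothetical.

```python
# Chi-square critical value for 1 degree of freedom at the 5% level.
CRITICAL_5PCT_1DF = 3.841

def chi_square_1to1(n_present, n_absent):
    """Chi-square statistic for a 1:1 segregation ratio (1 df)."""
    expected = (n_present + n_absent) / 2
    return ((n_present - expected) ** 2
            + (n_absent - expected) ** 2) / expected

# Hypothetical marker scores in progeny of an Aaaa X aaaa cross.
for present, absent in ((52, 48), (70, 30)):
    chi2 = chi_square_1to1(present, absent)
    verdict = "consistent with 1:1" if chi2 < CRITICAL_5PCT_1DF else "deviates from 1:1"
    print(f"{present}:{absent}  chi2 = {chi2:.2f}  {verdict}")
```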
  • 30. 2. Molecular era and revolution in genetic linkage mapping 2.4. Mapping polyploid genomes - Example TetraploidMap: four homologous chromosomes and a consensus map Source: TetraploidMap manual
  • 31. Linkage Mapping Summary
    i. Genetic linkage maps were originally built to map phenotypic mutants
    ii. Modern linkage maps use molecular markers (predominantly DNA markers)
    iii. Different types of mapping populations are used
    iv. Mapping studies in diploids and allopolyploids use similar tools and techniques
    v. Linkage maps in autopolyploids necessitate different mapping strategies
    vi. Linkage maps are useful for
      - tagging markers along chromosomes
      - identifying markers linked to genes and cloning genes
      - identifying quantitative trait loci for traits of interest
      - marker assisted selection
      - comparative mapping and evolutionary studies
  • 32. Lecture Outline: QTL Analysis 3. QTL mapping: models and methods 3.1. Single QTL model 3.1.1. Single marker analysis (SMA) - t-tests, ANOVA, linear regression 3.1.2. Simple interval mapping (SIM) 3.2. Multiple QTL model 3.2.1. Multiple regression 3.2.2. Composite interval mapping (CIM) 3.3. QTL mapping in polyploid genomes
  • 33. 3. QTL Mapping: Models and Methods 3.1. Single QTL model
    - Assesses marker-trait associations at an individual marker locus
    - gene effects for the single QTL model:
      Backcross: g = 0.5 (µ1 - µ2), where µ1 = mean of the homozygous class and µ2 = mean of the heterozygous class
      F2: additive (α) = 0.5 (µ1 - µ3) and dominance (d) = 0.5 (2µ2 - µ1 - µ3), where µ3 = mean of the class homozygous for parent B alleles
    - Employs single marker analysis (SMA) techniques
    Source: Ben Hui Liu, Statistical Genomics
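These effect formulas translate directly into code. The sketch below takes genotype class means as inputs; the function names and the example means are hypothetical.

```python
def backcross_effect(mu1, mu2):
    """g = 0.5 * (mu1 - mu2): homozygous vs. heterozygous class means."""
    return 0.5 * (mu1 - mu2)

def f2_effects(mu1, mu2, mu3):
    """Return (additive, dominance) effects from the three F2 class means.

    mu1 and mu3 are the means of the two homozygous classes,
    mu2 is the mean of the heterozygous class.
    """
    additive = 0.5 * (mu1 - mu3)
    dominance = 0.5 * (2 * mu2 - mu1 - mu3)
    return additive, dominance

# Hypothetical class means for a trait.
print(backcross_effect(12.0, 11.0))
print(f2_effects(12.0, 11.0, 8.0))
```

The dominance effect is the deviation of the heterozygote mean from the midpoint of the two homozygote means, so it is zero when the heterozygote is exactly intermediate.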
  • 34. 3.1.1. Single marker analysis (SMA)
    - based on the linear model yj = µ + f (markerj) + ɛj, where
        yj is the trait value of the jth individual in the population
        µ is the population mean
        f (markerj) is a function of the marker genotype
        ɛj is the residual associated with the jth individual
    Different methods:
      a. marker genotypes treated as a classification variable
        - for a backcross (2 genotypes): use a t-test
        - for an F2 population (up to 3 genotypes): use ANOVA
      b. marker genotypes treated as dummy variables
        - use marker-trait regression
      c. likelihood ratio test and maximum likelihood estimation
    Source: Ben Hui Liu, Statistical Genomics
  • 35. 3.1.1. SMA
    - t-test and ANOVA
      Steps (given alleles A and a at a marker locus):
      a. sort marker genotype classes into groups: “AA” and “Aa” in backcross; “AA”, “Aa”, and “aa” in F2
      b. test for a significant difference in means: t statistic (in backcross), F statistic (in F2)
      Fig. 27. One way analysis
    - Linear regression approach
      BC: yj = β0 + β1xj + ɛj, where yj is the trait value for the jth individual in the population, and xj is the dummy variable taking 1 if the individual is AA and -1 if it is Aa. β0 is the intercept of the regression, which is the overall mean for the trait. β1 is the slope of the regression line and ɛj is the random error.
      F2: yj = β0 + β1x1j + β2x2j + ɛj, where yj is the trait value for the jth individual in the population, x1j is the dummy variable for the marker additive effect taking 1, 0, and -1 for marker genotypes AA, Aa and aa, respectively, and x2j is the dummy variable for the marker dominance effect taking 0, 1, and 0 for marker genotypes AA, Aa and aa, respectively. β0 is the intercept of the regression, which is the overall mean for the trait. β1 and β2 are the slopes for the additive and dominance regression lines, respectively. ɛj is the random error.
    Source: Ben Hui Liu, Statistical Genomics
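For a backcross, the t-test and the regression give the same picture of one marker. The sketch below uses only the Python standard library, a pooled-variance two-sample t statistic, and the +1 / -1 coding of the BC model; the function name, genotype labels, and phenotype values are hypothetical.

```python
import math
from statistics import mean, variance

def sma_backcross(genotypes, phenotypes):
    """Single marker analysis for a backcross: t statistic and regression fit."""
    aa_hom = [y for g, y in zip(genotypes, phenotypes) if g == "AA"]
    aa_het = [y for g, y in zip(genotypes, phenotypes) if g == "Aa"]
    n1, n2 = len(aa_hom), len(aa_het)

    # Pooled-variance two-sample t statistic (equal variances assumed).
    pooled = ((n1 - 1) * variance(aa_hom)
              + (n2 - 1) * variance(aa_het)) / (n1 + n2 - 2)
    t_stat = (mean(aa_hom) - mean(aa_het)) / math.sqrt(pooled * (1 / n1 + 1 / n2))

    # Least-squares regression of phenotype on x = +1 (AA) or -1 (Aa).
    x = [1 if g == "AA" else -1 for g in genotypes]
    x_bar, y_bar = mean(x), mean(phenotypes)
    s_xy = sum((xi - x_bar) * (yi - y_bar) for xi, yi in zip(x, phenotypes))
    s_xx = sum((xi - x_bar) ** 2 for xi in x)
    b1 = s_xy / s_xx
    b0 = y_bar - b1 * x_bar
    return t_stat, b0, b1

# Hypothetical data: marker genotype and trait value for eight individuals.
genotypes = ["AA", "AA", "AA", "AA", "Aa", "Aa", "Aa", "Aa"]
phenotypes = [10.0, 12.0, 11.0, 13.0, 8.0, 9.0, 7.0, 10.0]
t_stat, b0, b1 = sma_backcross(genotypes, phenotypes)
print(f"t = {t_stat:.3f}, b0 = {b0:.2f}, b1 = {b1:.2f}")
```

With this coding the slope b1 equals half the difference between the two genotype class means, which is the backcross gene effect g = 0.5 (µ1 - µ2).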
  • 36. 3.1.1. SMA
    Advantages
      1. Conceptually and computationally simple
      2. Genetic linkage map information not needed
      3. Easily incorporates covariates
      4. Informative when markers sufficiently cover the genome
      5. Can be extended to multiple regression for a multiple QTL model
    Limitations
      1. Location and effects of detected QTLs are confounded: a larger apparent QTL effect could arise because the marker is close to a QTL, or because the marker is farther from a QTL that contributes substantially to the trait
      2. QTL position cannot be precisely detected
      3. Power to detect QTL is low when marker density is low
      4. Multiple comparisons increase false positives
      5. Individuals with missing genotypes are totally excluded from the analysis
      6. Limited ability to separate linked QTLs and no ability to assess interacting QTLs
  • 37. 3.1.1. SMA Software tools
    Basic statistical analysis platforms: Excel, JMP, SAS, R, etc.
    QTL mapping platforms: WinQTLCartographer, R/QTL, JoinMap, MapMaker/QTL, etc.
    Windows QTL Cartographer
    - SMA analysis fits the data to the simple linear regression model y = b0 + b1 x + e
    - Results reported include b0, b1 and the F statistic for each marker
    - The F statistic tests the hypotheses H0: b1 = 0 versus H1: b1 ≠ 0
    - pr(F) is a measure of how much support there is for H0
    - A smaller pr(F) indicates less support for H0 and thus more support for H1
    - The likelihood ratio test statistic compares two nested hypotheses H0 and H1 with likelihoods L0 and L1. Then, the “Likelihood Ratio Test Statistic” is -2ln(L0/L1)
  • 38. 3.1.2. Simple interval mapping (SIM)
    - “Mapping Mendelian factors underlying quantitative traits using RFLP linkage maps” (Lander and Botstein 1989)
    - Concept: based on the joint segregation of a pair of adjacent markers and a putative QTL within the interval flanked by the marker pair (Fig. 28)
    Methods:
      a. Likelihood approach (preferred over regression)
      b. Regression approach (faster computation than ML)
    Fig. 28. Linkage relationship of a QTL and two flanking markers
    Source: Ben Hui Liu, Statistical Genomics
  • 39. 3.1.2. SIM
    Likelihood approach (employed in WinQTLCart); the quantities involved are:
    - the density function of the normal distribution with mean μQk and variance σ², for QTL genotypes k = 1 to N
    - the probability of the QTL genotype, given the jth genotype of the flanking markers
    - the likelihood of phenotypic value z, given the jth genotype of the flanking markers
    - the MLE under the reduced model of no QTL: μQQ = μQq = μqq
    - the MLE under the full model including a QTL
    - LOD scores (log10 of the odds ratio) or the likelihood ratio (LR), where LR = 4.6 LOD
    Source: Course notes, QTL mapping and Discovery
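The factor 4.6 follows from the definitions: with LR = -2 ln(L0/L1) and LOD = log10(L1/L0), LR = 2 ln(10) × LOD, and 2 ln(10) is about 4.605. A small sketch, with hypothetical function names and log-likelihood values:

```python
import math

def likelihood_ratio(ln_l0, ln_l1):
    """LR = -2 ln(L0 / L1), computed from the two log-likelihoods."""
    return -2.0 * (ln_l0 - ln_l1)

def lod_score(ln_l0, ln_l1):
    """LOD = log10(L1 / L0), computed from natural-log likelihoods."""
    return (ln_l1 - ln_l0) / math.log(10)

# Hypothetical maximized log-likelihoods: reduced (no QTL) and full model.
ln_l0, ln_l1 = -512.4, -503.1
lr = likelihood_ratio(ln_l0, ln_l1)
lod = lod_score(ln_l0, ln_l1)
print(f"LR = {lr:.2f}, LOD = {lod:.2f}, LR / LOD = {lr / lod:.3f}")
```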
  • 40. 3.1.2. SIM
    Advantages
      1. Conceptually and computationally simple
      2. Genetic linkage map information not needed
      3. Easily incorporates covariates
      4. Informative when markers sufficiently cover the genome
      5. Can be extended to multiple regression for a multiple QTL model
    Limitations
      1. Location and effects of detected QTLs are confounded: a larger apparent QTL effect could arise because the marker is close to a QTL, or because the marker is farther from a QTL that contributes substantially to the trait
      2. QTL positions cannot be precisely detected
      3. Power to detect QTL is low when marker density is low
      4. Multiple comparisons increase false positives
      5. Individuals with missing genotypes are totally excluded from the analysis
  • 41. 3.2. Composite interval mapping (CIM)
    CIM is a combination of IM and multiple regression (multiple QTL model)
    - Fits both the effects of a QTL and the effects of covariates (a subset of selected genetic markers)
    - CIM adds background loci to simple interval mapping (IM)
    - It fits parameters for a target QTL in one interval while simultaneously fitting partial regression coefficients for "background markers" to account for variance caused by non-target QTL
    - Background markers are usually 20-40 cM apart
    Figure labels: Test Interval, Left Marker, Right Marker, Blocked Region (Cofactors)
    Source: Course notes, QTL mapping and Discovery
  • 42. 3.2. CIM
    The general CIM statistical model has the following terms:
    - phenotypic trait value of subject i
    - overall mean
    - row vector of predictor variables corresponding to the effects of the putative QTL (Zi1α: additive effect; Zi1d: dominance effect)
    - row vector of predictor variables corresponding to the rth cofactor marker
    - column vector with the coefficient of the rth cofactor marker
    - residual, distributed as N(0, δ²)
  • 43. 3.2. CIM Set of statistical models evaluated in the CIM analysis (WinQTLCartographer): - For backcross, recombinant inbred lines, and doubled haploids, only Model 0 and Model 1 are generated and tested - For F2 design, all four models are generated and tested
  • 44. Comparison of SMA, SIM and CIM
    Figure label: much more precise location
    Source: http://solcap.msu.edu/pdf%20files/5PAA_Douches_2_Mapping_Populations.pdf
  • 45. 3.3. QTL mapping in polyploid genomes - Generally, QTL mapping in allopolyploid genomes is the same as in diploids - However, QTL mapping in autopolyploid genomes requires different strategies - Example: QTL mapping in autotetraploids using TetraploidMap
  • 46. 3.3. QTL mapping in polyploid genomes
    Summary
    - Single marker analysis (SMA) involves a t-test, ANOVA, or linear regression approach
    - Interval mapping is based on the joint segregation of a pair of adjacent markers and a putative QTL between them
    - CIM is a combination of IM and multiple regression and is the most desirable of the three
    - QTL mapping in autopolyploids requires different analytical strategies