SlideShare a Scribd company logo
• We developed a pipeline for building and annotating
Sanger-454 transcript assemblies and developed a groundnut
EST database (Figure 1).
EST-SSR Marker Resources for Groundnut
Sameer Khanal1, Shunxue Tang1, Yufang Guo1, Yan Li2, Vadim Beilinson3, Phillip San Miguel4, Baozhu Guo2,
Niels Nielsen3, Thomas Stalker3, Marie-Michele Cordonnier-Pratt5, Lee H. Pratt5, Virgil Ed Johnson5,
Christopher A. Taylor1, Bruce Roe6, David Hoisington7, Rajeev Varshney7, and Steven J. Knapp1
1Institute of Plant Breeding, Genetics, and Genomics, The University of Georgia, Athens, Georgia, 30602, USA
2USDA-ARS, Tifton, Georgia, 31793, USA
3Crop Science Department, North Carolina State University, Raleigh, North Carolina, 27695, USA
4Genomics Center, Purdue University, West Lafayette, Indiana, 47907, USA
5Laboratory for Genomics and Bioinformatics, The University of Georgia, Athens, Georgia, 30602, USA
6Advanced Center for Genome Technology (ACGT), University of Oklahoma, Norman, OK 73019, USA
7International Crops Research Institute for Semi Arid Tropics (ICRISAT), Patancheru, 502 324, AP, INDIA
Background
Narrow genetic diversity and a deficiency of polymorphic DNA
markers have hindered genetic mapping, forward genetics,
and the application of genomics and molecular breeding
approaches in groundnut (Arachis hypogaea). To further
break the DNA marker bottleneck, we mined an EST database
for simple sequence repeats (SSRs) and developed 2,100 EST-
SSR markers for groundnut. The EST database was developed
by Sanger and next-generation (454-FLX) DNA sequencing
technologies. Here, we describe a pipeline for assembling and
annotating Sanger-454 ESTs, the development of the
groundnut EST database (ESTdb), SSR markers developed
from ESTs, and polymorphisms of EST-SSRs among 28 elite
tetraploid and four wild diploid inbred lines.
Mining a Groundnut EST Database for SSRs
• Sanger and 454 ESTs were produced from normalized and
non-normalized leaf and developing-seed cDNA libraries of
Tifrunner, GTC20, NC12C, A13, and New Mexico Valencia A
(Table 1).
Table 1. Groundnut ESTs and Transcript Assembly Statistics
ESTs Unigenes
Sanger 454 Total Singletons Contigs Total
71,448 304,215 375,663 63,218 37,914 101,132
Figure 1. Uniscript Alignment in Groundnut ESTdb for a
Sanger-454 Contig Harboring an Trinculeotide (CAA5)
Simple Sequence Repeat.
Figure 2. Simple Sequence Repeats Identified by Mining
the Groundnut EST Database (101,132 Uniscripts).
Pilot Study: EST-SSR Polymorphisms in Groundnut
• To gain an understanding of the potential utility of EST-
SSRs for genotyping applications in groundnut, particularly in
modern cultivars, 58 EST-SSR markers were developed by
targeting a broad range of repeat motifs and repeat lengths
identified in the database.
• The 58 EST-SSR markers were screened for allele length
polymorphisms among 28 elite tetraploid, two A. duranensis,
and two A. batizocoi lines. The diploid lines are the parents of
intraspecific A- and B-genome diploid mapping populations.
The tetraploid lines are the parents of elite x elite mapping
populations and important founders of modern cultivars
(Figure 3).
• We identified 7,314 perfect SSRs in 6,371 unigenes and
selected 2,134 for EST-SSR marker development (Figure 2),
96% of which were dinucleotide and trinucleotide repeats.
These are currently being screened for polymorphisms and
genetically mapped in several elite x elite tetraploid and wild
diploid mapping populations.
Figure 3. Genetic Distance-Based Dendrograms Estimated
from Genotypes for 58 EST-SSR Markers.
• 80% of the EST-SSR markers were polymorphic in one or
both of the diploid mapping populations (DUR25 x DUR35 and
BAT3 x BAT9) (Figure 3).
• Half of the EST-SSR markers were polymorphic among
tetraploid germplasm accessions; however, the mean
heterozygosity (H) or polymorphic information content was
low (H = 0.17). Trinucleotide repeats (H = 0.20) were more
polymorphic than dinucleotide repeats (H = 0.10).
• The well-known diversity bottleneck in modern cultivars is
illustrated by the tight cluster of tetraploid nodes in the
diploid-tetraploid dendrogram (Figure 3).
• SSR length was positively correlated with allelic diversity (r
= 0.45; p < 0.0001) (Figure 4). SSRs longer than 26 bp (H =
0.26) were significantly more polymorphic than SSRs shorter
than 26 bp (H = 0.10).
Conclusions
• Twenty to 30% of the 2,134 EST-SSR markers developed in
the present study are predicted to be polymorphic and have
utility in modern cultivars, whereas 80-90% are predicted to
be polymorphic in two diploid mapping populations.
• Large-scale DNA resequencing approaches are needed to
identify DNA sequence variants in modern cultivars of
groundnut to further increase the supply of polymorphic DNA
markers for molecular breeding and other genotyping
applications.

More Related Content

What's hot

Assesment of genetic divergence in chickpea kabuli cultivars
Assesment of genetic divergence in chickpea kabuli cultivarsAssesment of genetic divergence in chickpea kabuli cultivars
Assesment of genetic divergence in chickpea kabuli cultivars
Naveen Jakhar
 
ZFN-Science-Rats
ZFN-Science-RatsZFN-Science-Rats
ZFN-Science-RatsGreg Davis
 
Transcriptome Analysis of Spontaneous PDF
Transcriptome Analysis of Spontaneous PDFTranscriptome Analysis of Spontaneous PDF
Transcriptome Analysis of Spontaneous PDFJanaya Shelly
 
Databases ii
Databases iiDatabases ii
Databases ii
Sucheta Tripathy
 
EMBL
EMBLEMBL
Biological database by kk sahu
Biological database by kk sahuBiological database by kk sahu
Biological database by kk sahu
KAUSHAL SAHU
 
Human genome project slides
Human genome project slidesHuman genome project slides
Human genome project slides
Tracy Neal Spence
 

What's hot (10)

Assesment of genetic divergence in chickpea kabuli cultivars
Assesment of genetic divergence in chickpea kabuli cultivarsAssesment of genetic divergence in chickpea kabuli cultivars
Assesment of genetic divergence in chickpea kabuli cultivars
 
ZFN-Science-Rats
ZFN-Science-RatsZFN-Science-Rats
ZFN-Science-Rats
 
Transcriptome Analysis of Spontaneous PDF
Transcriptome Analysis of Spontaneous PDFTranscriptome Analysis of Spontaneous PDF
Transcriptome Analysis of Spontaneous PDF
 
387.full
387.full387.full
387.full
 
Databases ii
Databases iiDatabases ii
Databases ii
 
EMBL
EMBLEMBL
EMBL
 
Biological database by kk sahu
Biological database by kk sahuBiological database by kk sahu
Biological database by kk sahu
 
0032-Ijabpt-Imed pub
0032-Ijabpt-Imed pub0032-Ijabpt-Imed pub
0032-Ijabpt-Imed pub
 
ownership nbt.3393
ownership nbt.3393ownership nbt.3393
ownership nbt.3393
 
Human genome project slides
Human genome project slidesHuman genome project slides
Human genome project slides
 

Similar to Poster_GCP_Knapp

GCP_abstract_Khanal_2008
GCP_abstract_Khanal_2008GCP_abstract_Khanal_2008
GCP_abstract_Khanal_2008Sameer Khanal
 
PAG Poster
PAG PosterPAG Poster
PAG Poster
Sameer Khanal
 
Yam: Recent Advances in Genetic and Genomic Resources
Yam: Recent Advances in Genetic and Genomic Resources Yam: Recent Advances in Genetic and Genomic Resources
Yam: Recent Advances in Genetic and Genomic Resources
CGIAR Research Program on Roots, Tubers and Bananas
 
MS thesis presentation_FINAL
MS thesis presentation_FINALMS thesis presentation_FINAL
MS thesis presentation_FINALTom Hajek
 
Gene expression group presentation at GAW 19
Gene expression group presentation at GAW 19Gene expression group presentation at GAW 19
Gene expression group presentation at GAW 19
Francesco Gadaleta
 
Genetic diversity in wild and cultivated peanut_Khanal_2008
Genetic diversity in wild and cultivated peanut_Khanal_2008Genetic diversity in wild and cultivated peanut_Khanal_2008
Genetic diversity in wild and cultivated peanut_Khanal_2008Sameer Khanal
 
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Databricks
 
Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...
Laurence Dawkins-Hall
 
21 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-21721 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-217
Alexander Decker
 
A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...
Laurence Dawkins-Hall
 
Data as research output, data as part of the scholarly record
Data as research output, data as part of the scholarly recordData as research output, data as part of the scholarly record
Data as research output, data as part of the scholarly record
Todd Vision
 
Frequency of Polyploids of Solanum tuberosum Dihaploids in 2X × 2X Crosses
Frequency of Polyploids of Solanum tuberosum Dihaploids in 2X × 2X CrossesFrequency of Polyploids of Solanum tuberosum Dihaploids in 2X × 2X Crosses
Frequency of Polyploids of Solanum tuberosum Dihaploids in 2X × 2X Crosses
Journal of Agriculture and Crops
 
Genotyping an invasive vine
Genotyping an invasive vineGenotyping an invasive vine
Genotyping an invasive vine
Roy Moger-Reischer
 
EVE 161 Winter 2018 Class 13
EVE 161 Winter 2018 Class 13EVE 161 Winter 2018 Class 13
EVE 161 Winter 2018 Class 13
Jonathan Eisen
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattle
Laurence Dawkins-Hall
 
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsWhole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Thermo Fisher Scientific
 
Molecular phylogenetics
Molecular phylogeneticsMolecular phylogenetics
Molecular phylogenetics
Ajay Kumar Chandra
 
Development of SSR markers in mungbean
Development of SSR markers in mungbeanDevelopment of SSR markers in mungbean
Development of SSR markers in mungbean
Nidhi Singh
 
SURCA 2016 poster
SURCA 2016 posterSURCA 2016 poster
SURCA 2016 posterMitchell Go
 
Striving for excellence in yam breeding using genomics tools
Striving for excellence in yam breeding using genomics toolsStriving for excellence in yam breeding using genomics tools
Striving for excellence in yam breeding using genomics tools
International Institute of Tropical Agriculture
 

Similar to Poster_GCP_Knapp (20)

GCP_abstract_Khanal_2008
GCP_abstract_Khanal_2008GCP_abstract_Khanal_2008
GCP_abstract_Khanal_2008
 
PAG Poster
PAG PosterPAG Poster
PAG Poster
 
Yam: Recent Advances in Genetic and Genomic Resources
Yam: Recent Advances in Genetic and Genomic Resources Yam: Recent Advances in Genetic and Genomic Resources
Yam: Recent Advances in Genetic and Genomic Resources
 
MS thesis presentation_FINAL
MS thesis presentation_FINALMS thesis presentation_FINAL
MS thesis presentation_FINAL
 
Gene expression group presentation at GAW 19
Gene expression group presentation at GAW 19Gene expression group presentation at GAW 19
Gene expression group presentation at GAW 19
 
Genetic diversity in wild and cultivated peanut_Khanal_2008
Genetic diversity in wild and cultivated peanut_Khanal_2008Genetic diversity in wild and cultivated peanut_Khanal_2008
Genetic diversity in wild and cultivated peanut_Khanal_2008
 
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
Building Genomic Data Processing and Machine Learning Workflows Using Apache ...
 
Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...Systemic analysis of data combined from genetic qtl's and gene expression dat...
Systemic analysis of data combined from genetic qtl's and gene expression dat...
 
21 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-21721 kebere bezaweletaw 207-217
21 kebere bezaweletaw 207-217
 
A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...A systematic, data driven approach to the combined analysis of microarray and...
A systematic, data driven approach to the combined analysis of microarray and...
 
Data as research output, data as part of the scholarly record
Data as research output, data as part of the scholarly recordData as research output, data as part of the scholarly record
Data as research output, data as part of the scholarly record
 
Frequency of Polyploids of Solanum tuberosum Dihaploids in 2X × 2X Crosses
Frequency of Polyploids of Solanum tuberosum Dihaploids in 2X × 2X CrossesFrequency of Polyploids of Solanum tuberosum Dihaploids in 2X × 2X Crosses
Frequency of Polyploids of Solanum tuberosum Dihaploids in 2X × 2X Crosses
 
Genotyping an invasive vine
Genotyping an invasive vineGenotyping an invasive vine
Genotyping an invasive vine
 
EVE 161 Winter 2018 Class 13
EVE 161 Winter 2018 Class 13EVE 161 Winter 2018 Class 13
EVE 161 Winter 2018 Class 13
 
Genome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattleGenome responses of trypanosome infected cattle
Genome responses of trypanosome infected cattle
 
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell TumorsWhole Transcriptome Analysis of Testicular Germ Cell Tumors
Whole Transcriptome Analysis of Testicular Germ Cell Tumors
 
Molecular phylogenetics
Molecular phylogeneticsMolecular phylogenetics
Molecular phylogenetics
 
Development of SSR markers in mungbean
Development of SSR markers in mungbeanDevelopment of SSR markers in mungbean
Development of SSR markers in mungbean
 
SURCA 2016 poster
SURCA 2016 posterSURCA 2016 poster
SURCA 2016 poster
 
Striving for excellence in yam breeding using genomics tools
Striving for excellence in yam breeding using genomics toolsStriving for excellence in yam breeding using genomics tools
Striving for excellence in yam breeding using genomics tools
 

More from Sameer Khanal

Khanal beltwide conference_2021
Khanal beltwide conference_2021Khanal beltwide conference_2021
Khanal beltwide conference_2021
Sameer Khanal
 
Beltwide poster 2021
Beltwide poster 2021Beltwide poster 2021
Beltwide poster 2021
Sameer Khanal
 
Linkage mapping lab
Linkage mapping labLinkage mapping lab
Linkage mapping lab
Sameer Khanal
 
Transcriptomics of RKN resitance in Upland Cotton
Transcriptomics of RKN resitance in Upland CottonTranscriptomics of RKN resitance in Upland Cotton
Transcriptomics of RKN resitance in Upland Cotton
Sameer Khanal
 
Cotton QTL pyramiding
Cotton QTL pyramidingCotton QTL pyramiding
Cotton QTL pyramiding
Sameer Khanal
 
Beltwide poster main
Beltwide poster mainBeltwide poster main
Beltwide poster main
Sameer Khanal
 
Effects of exotic alleles and genetic backgrounds on fiber quality traits in ...
Effects of exotic alleles and genetic backgrounds on fiber quality traits in ...Effects of exotic alleles and genetic backgrounds on fiber quality traits in ...
Effects of exotic alleles and genetic backgrounds on fiber quality traits in ...
Sameer Khanal
 
Dissecting quantitative variation introgressed into bermudagrass and Upland c...
Dissecting quantitative variation introgressed into bermudagrass and Upland c...Dissecting quantitative variation introgressed into bermudagrass and Upland c...
Dissecting quantitative variation introgressed into bermudagrass and Upland c...
Sameer Khanal
 
Microsatellite markers in bermudagrass
Microsatellite markers in bermudagrassMicrosatellite markers in bermudagrass
Microsatellite markers in bermudagrass
Sameer Khanal
 
Cytogenetics of arachis_spp.
Cytogenetics of arachis_spp.Cytogenetics of arachis_spp.
Cytogenetics of arachis_spp.
Sameer Khanal
 
PAG XXV POSTER KHANAL
PAG XXV POSTER KHANALPAG XXV POSTER KHANAL
PAG XXV POSTER KHANALSameer Khanal
 
Peanut NIILs proposal
Peanut NIILs proposalPeanut NIILs proposal
Peanut NIILs proposal
Sameer Khanal
 
NBS-LRR proposal
NBS-LRR proposalNBS-LRR proposal
NBS-LRR proposal
Sameer Khanal
 
Algodones dune sunflower
Algodones dune sunflowerAlgodones dune sunflower
Algodones dune sunflower
Sameer Khanal
 
Linkage mapping and QTL analysis_Lab
Linkage mapping and QTL analysis_LabLinkage mapping and QTL analysis_Lab
Linkage mapping and QTL analysis_Lab
Sameer Khanal
 
Linkage mapping and QTL analysis_Lecture
Linkage mapping and QTL analysis_LectureLinkage mapping and QTL analysis_Lecture
Linkage mapping and QTL analysis_Lecture
Sameer Khanal
 
Winning the future with GMOs
Winning the future with GMOsWinning the future with GMOs
Winning the future with GMOs
Sameer Khanal
 
Sunflower domestication khanal_2010
Sunflower domestication khanal_2010Sunflower domestication khanal_2010
Sunflower domestication khanal_2010
Sameer Khanal
 
Comfy cotton clothes and their genetics khanal 2013
Comfy cotton clothes and their genetics khanal 2013Comfy cotton clothes and their genetics khanal 2013
Comfy cotton clothes and their genetics khanal 2013
Sameer Khanal
 
Genetic Diversity in Wild and Cultivated Peanut
Genetic Diversity in Wild and Cultivated PeanutGenetic Diversity in Wild and Cultivated Peanut
Genetic Diversity in Wild and Cultivated PeanutSameer Khanal
 

More from Sameer Khanal (20)

Khanal beltwide conference_2021
Khanal beltwide conference_2021Khanal beltwide conference_2021
Khanal beltwide conference_2021
 
Beltwide poster 2021
Beltwide poster 2021Beltwide poster 2021
Beltwide poster 2021
 
Linkage mapping lab
Linkage mapping labLinkage mapping lab
Linkage mapping lab
 
Transcriptomics of RKN resitance in Upland Cotton
Transcriptomics of RKN resitance in Upland CottonTranscriptomics of RKN resitance in Upland Cotton
Transcriptomics of RKN resitance in Upland Cotton
 
Cotton QTL pyramiding
Cotton QTL pyramidingCotton QTL pyramiding
Cotton QTL pyramiding
 
Beltwide poster main
Beltwide poster mainBeltwide poster main
Beltwide poster main
 
Effects of exotic alleles and genetic backgrounds on fiber quality traits in ...
Effects of exotic alleles and genetic backgrounds on fiber quality traits in ...Effects of exotic alleles and genetic backgrounds on fiber quality traits in ...
Effects of exotic alleles and genetic backgrounds on fiber quality traits in ...
 
Dissecting quantitative variation introgressed into bermudagrass and Upland c...
Dissecting quantitative variation introgressed into bermudagrass and Upland c...Dissecting quantitative variation introgressed into bermudagrass and Upland c...
Dissecting quantitative variation introgressed into bermudagrass and Upland c...
 
Microsatellite markers in bermudagrass
Microsatellite markers in bermudagrassMicrosatellite markers in bermudagrass
Microsatellite markers in bermudagrass
 
Cytogenetics of arachis_spp.
Cytogenetics of arachis_spp.Cytogenetics of arachis_spp.
Cytogenetics of arachis_spp.
 
PAG XXV POSTER KHANAL
PAG XXV POSTER KHANALPAG XXV POSTER KHANAL
PAG XXV POSTER KHANAL
 
Peanut NIILs proposal
Peanut NIILs proposalPeanut NIILs proposal
Peanut NIILs proposal
 
NBS-LRR proposal
NBS-LRR proposalNBS-LRR proposal
NBS-LRR proposal
 
Algodones dune sunflower
Algodones dune sunflowerAlgodones dune sunflower
Algodones dune sunflower
 
Linkage mapping and QTL analysis_Lab
Linkage mapping and QTL analysis_LabLinkage mapping and QTL analysis_Lab
Linkage mapping and QTL analysis_Lab
 
Linkage mapping and QTL analysis_Lecture
Linkage mapping and QTL analysis_LectureLinkage mapping and QTL analysis_Lecture
Linkage mapping and QTL analysis_Lecture
 
Winning the future with GMOs
Winning the future with GMOsWinning the future with GMOs
Winning the future with GMOs
 
Sunflower domestication khanal_2010
Sunflower domestication khanal_2010Sunflower domestication khanal_2010
Sunflower domestication khanal_2010
 
Comfy cotton clothes and their genetics khanal 2013
Comfy cotton clothes and their genetics khanal 2013Comfy cotton clothes and their genetics khanal 2013
Comfy cotton clothes and their genetics khanal 2013
 
Genetic Diversity in Wild and Cultivated Peanut
Genetic Diversity in Wild and Cultivated PeanutGenetic Diversity in Wild and Cultivated Peanut
Genetic Diversity in Wild and Cultivated Peanut
 

Poster_GCP_Knapp

  • 1. • We developed a pipeline for building and annotating Sanger-454 transcript assemblies and developed a groundnut EST database (Figure 1). EST-SSR Marker Resources for Groundnut Sameer Khanal1, Shunxue Tang1, Yufang Guo1, Yan Li2, Vadim Beilinson3, Phillip San Miguel4, Baozhu Guo2, Niels Nielsen3, Thomas Stalker3, Marie-Michele Cordonnier-Pratt5, Lee H. Pratt5, Virgil Ed Johnson5, Christopher A. Taylor1, Bruce Roe6, David Hoisington7, Rajeev Varshney7, and Steven J. Knapp1 1Institute of Plant Breeding, Genetics, and Genomics, The University of Georgia, Athens, Georgia, 30602, USA 2USDA-ARS, Tifton, Georgia, 31793, USA 3Crop Science Department, North Carolina State University, Raleigh, North Carolina, 27695, USA 4Genomics Center, Purdue University, West Lafayette, Indiana, 47907, USA 5Laboratory for Genomics and Bioinformatics, The University of Georgia, Athens, Georgia, 30602, USA 6Advanced Center for Genome Technology (ACGT), University of Oklahoma, Norman, OK 73019, USA 7International Crops Research Institute for Semi Arid Tropics (ICRISAT), Patancheru, 502 324, AP, INDIA Background Narrow genetic diversity and a deficiency of polymorphic DNA markers have hindered genetic mapping, forward genetics, and the application of genomics and molecular breeding approaches in groundnut (Arachis hypogaea). To further break the DNA marker bottleneck, we mined an EST database for simple sequence repeats (SSRs) and developed 2,100 EST- SSR markers for groundnut. The EST database was developed by Sanger and next-generation (454-FLX) DNA sequencing technologies. Here, we describe a pipeline for assembling and annotating Sanger-454 ESTs, the development of the groundnut EST database (ESTdb), SSR markers developed from ESTs, and polymorphisms of EST-SSRs among 28 elite tetraploid and four wild diploid inbred lines. Mining a Groundnut EST Database for SSRs • Sanger and 454 ESTs were produced from normalized and non-normalized leaf and developing-seed cDNA libraries of Tifrunner, GTC20, NC12C, A13, and New Mexico Valencia A (Table 1). Table 1. Groundnut ESTs and Transcript Assembly Statistics ESTs Unigenes Sanger 454 Total Singletons Contigs Total 71,448 304,215 375,663 63,218 37,914 101,132 Figure 1. Uniscript Alignment in Groundnut ESTdb for a Sanger-454 Contig Harboring an Trinculeotide (CAA5) Simple Sequence Repeat. Figure 2. Simple Sequence Repeats Identified by Mining the Groundnut EST Database (101,132 Uniscripts). Pilot Study: EST-SSR Polymorphisms in Groundnut • To gain an understanding of the potential utility of EST- SSRs for genotyping applications in groundnut, particularly in modern cultivars, 58 EST-SSR markers were developed by targeting a broad range of repeat motifs and repeat lengths identified in the database. • The 58 EST-SSR markers were screened for allele length polymorphisms among 28 elite tetraploid, two A. duranensis, and two A. batizocoi lines. The diploid lines are the parents of intraspecific A- and B-genome diploid mapping populations. The tetraploid lines are the parents of elite x elite mapping populations and important founders of modern cultivars (Figure 3). • We identified 7,314 perfect SSRs in 6,371 unigenes and selected 2,134 for EST-SSR marker development (Figure 2), 96% of which were dinucleotide and trinucleotide repeats. These are currently being screened for polymorphisms and genetically mapped in several elite x elite tetraploid and wild diploid mapping populations. Figure 3. Genetic Distance-Based Dendrograms Estimated from Genotypes for 58 EST-SSR Markers. • 80% of the EST-SSR markers were polymorphic in one or both of the diploid mapping populations (DUR25 x DUR35 and BAT3 x BAT9) (Figure 3). • Half of the EST-SSR markers were polymorphic among tetraploid germplasm accessions; however, the mean heterozygosity (H) or polymorphic information content was low (H = 0.17). Trinucleotide repeats (H = 0.20) were more polymorphic than dinucleotide repeats (H = 0.10). • The well-known diversity bottleneck in modern cultivars is illustrated by the tight cluster of tetraploid nodes in the diploid-tetraploid dendrogram (Figure 3). • SSR length was positively correlated with allelic diversity (r = 0.45; p < 0.0001) (Figure 4). SSRs longer than 26 bp (H = 0.26) were significantly more polymorphic than SSRs shorter than 26 bp (H = 0.10). Conclusions • Twenty to 30% of the 2,134 EST-SSR markers developed in the present study are predicted to be polymorphic and have utility in modern cultivars, whereas 80-90% are predicted to be polymorphic in two diploid mapping populations. • Large-scale DNA resequencing approaches are needed to identify DNA sequence variants in modern cultivars of groundnut to further increase the supply of polymorphic DNA markers for molecular breeding and other genotyping applications.