SlideShare a Scribd company logo
THE NEXT DIMENSION IN STR SEQUENCING:
Polymorphisms in Flanking Regions
& their Allelic Associations
Katherine Butler Gettings
Rachel Aponte
Kevin Kiesler
Peter Vallone
September 2, 2015
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Disclaimer
I will mention commercial STR kit names and information, but I am not
attempting to endorse any specific products.
NIST Disclaimer: Certain commercial equipment, instruments and materials
are identified in order to specify experimental procedures as completely as
possible. In no case does such identification imply a recommendation or it
imply that any of the materials, instruments or equipment identified are
necessarily the best available for the purpose.
Information presented does not necessarily represent the official position of
the National Institute of Standards and Technology or the U.S. Department of
Justice.
Our group receives or has received funding from the FBI Laboratory and the
National Institute of Justice.
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Population
Samples
• N=183
•Caucasian - 70
•Hispanic - 45
•African American - 68
Amplification
& Library Prep
•PowerSeq Auto
System (Beta)
•PPFusion Loci
•Illumina TruSeq
HS PCR-Free
Sequencing
•MiSeq
Bioinformatics
•STRait Razor
•ExactID (Battelle)
•CE concordance
Analysis
•Repeat Region
•Flanking Region
•Stutter
Data GenerationNIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
N=183
Length
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Forensic STR Sequence Diversity
N=183
Simple Repeats
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
150 bp
flanks
110 bp
flanks
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Categories of Flanking Regions
1. No polymorphisms
2. Polymorphisms Associated with a Sequence Variant
3. Rare polymorphisms
4. Population or Allele Specific polymorphisms
5. “Old” polymorphisms (not population or allele specific)
6. Multiple polymorphisms in Haplotype
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
D3S1358
FGA
D12S391
Penta E
D19S433
D21S11
D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0
rs11063971 0.044 0.086 0.078 1 1 1
rs11063970 0.044 0.086 0.078 1 1 1
rs11063969 0.044 0.086 0.078 1 1 1
rs75219269 0.044 0.086 0.078 1 1 1
rs199970098 0.029 - 0.011 2 - 1
D2S441 rs74640515 0.015 0.014 0.067 1 1 2 2
CSF1PO C-T 36bp 5' - - 0.011 - - 1 1
D8S1179 rs138862078 0.007 - - 1 - - 1
D10S1248 T-G 2bp 3' - 0.007 - - 1 - 1
D18S51 rs535833682 0.029 - - 3 - - 1
Penta D rs7279663 0.022 0.014 0.011 1 2 1 2
D22S1045 rs190864081 0.007 - - 1 - - 1
rs13422969 0.135 - 0.011 2 - 1
rs115644759 0.022 - - 2 - -
rs149212737 - 0.007 - - 1 -
TH01 rs79373318 0.148 - - 3 - - 3
rs1728369 0.404 0.171 0.278 4 4 4
G-A 94bp 5' - - 0.011 - - 1
D2S1338 rs6736691 0.221 0.264 0.300 8 6 4 3
G-T 4bp 3' 0.316 0.257 0.189 7 5 6
rs25768 0.228 0.257 0.156 5 5 5
rs146841551 0.029 - - 1 - -
rs541272009 0.007 - - 1 - -
rs9546005 0.309 0.286 0.333 5 4 7
rs73525369 0.066 - - 3 - -
rs73250432 - 0.014 0.011 - 2 1
rs146621667 0.015 - - 2 - -
4bp del 8bp 3' - - 0.022 - - 2
4bp del 21bp 3' 0.007 - 0.022 1 - 1
rs16887642 0.177 0.050 0.022 4 2 2
rs7789995 0.015 0.164 0.122 2 4 4
rs7786079 0.176 0.021 0.011 5 2 1
1bp del 21bp 3' - - 0.011 - - 1
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
No SNPs
0vWA
SNPs Associated with
Repeat Region
Sequence Variant
5D16S539
Single "Old" SNP
(not population or
allele specific)
TPOXPopulation / Allele
Specific SNPs
Rare SNPs
4
16D13S317
D5S818 17
D7S820 15
Multiple SNPs in
Haplotype
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
No SNPs were identified
in the sequenced flanking regions of:
D3S1358 Penta E
FGA D19S433
D12S391 D21S11
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
D3S1358
FGA
D12S391
Penta E
D19S433
D21S11
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
No SNPs
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0
rs11063971 0.044 0.086 0.078 1 1 1
rs11063970 0.044 0.086 0.078 1 1 1
rs11063969 0.044 0.086 0.078 1 1 1
rs75219269 0.044 0.086 0.078 1 1 1
rs199970098 0.029 - 0.011 2 - 1
0vWA
SNPs Associated with
Repeat Region
Sequence Variant
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
D1S1656 has one SNP, rs4847015
• Well distributed across populations…
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0
rs11063971 0.044 0.086 0.078 1 1 1
rs11063970 0.044 0.086 0.078 1 1 1
rs11063969 0.044 0.086 0.078 1 1 1
rs75219269 0.044 0.086 0.078 1 1 1
rs199970098 0.029 - 0.011 2 - 1
0vWA
SNPs Associated with
Repeat Region
Sequence Variant
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
D1S1656 has one SNP, rs4847015
• Well distributed across populations
• Well distributed across STR alleles
• Does NOT increase the number of alleles…
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
D1S1656 has one SNP, rs4847015
and it is always associated with x.3 alleles:
15.3
16.3
17.3 rs4847015 = T
18.3
19.3
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0
rs11063971 0.044 0.086 0.078 1 1 1
rs11063970 0.044 0.086 0.078 1 1 1
rs11063969 0.044 0.086 0.078 1 1 1
rs75219269 0.044 0.086 0.078 1 1 1
rs199970098 0.029 - 0.011 2 - 1
0vWA
SNPs Associated with
Repeat Region
Sequence Variant
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Rare SNPs were identified
in the sequenced flanking regions of:
D2S441 D18S51
CSF1PO Penta D
D8S1179 D22S1045
D10S1248
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
D2S441 rs74640515 0.015 0.014 0.067 1 1 2 2
CSF1PO C-T 36bp 5' - - 0.011 - - 1 1
D8S1179 rs138862078 0.007 - - 1 - - 1
D10S1248 T-G 2bp 3' - 0.007 - - 1 - 1
D18S51 rs535833682 0.029 - - 3 - - 1
Penta D rs7279663 0.022 0.014 0.011 1 2 1 2
D22S1045 rs190864081 0.007 - - 1 - - 1
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
Rare SNPs
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
rs13422969 0.135 - 0.011 2 - 1
rs115644759 0.022 - - 2 - -
rs149212737 - 0.007 - - 1 -
TH01 rs79373318 0.148 - - 3 - - 3
TPOX
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
Population / Allele
Specific SNPs
4
TPOX: rs13422969 and two additional rare SNPs
• Well distributed across Africa, rare elsewhere
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
rs13422969 0.135 - 0.011 2 - 1
rs115644759 0.022 - - 2 - -
rs149212737 - 0.007 - - 1 -
TH01 rs79373318 0.148 - - 3 - - 3
TPOX
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
Population / Allele
Specific SNPs
4
TPOX: rs13422969 and two additional rare SNPs
• Well distributed across Africa, rare elsewhere
• Associated with “9” allele
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
D16S539: rs1728369 and one additional rare SNP
• Well distributed across populations and alleles
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
rs1728369 0.404 0.171 0.278 4 4 4
G-A 94bp 5' - - 0.011 - - 1
D2S1338 rs6736691 0.221 0.264 0.300 8 6 4 3
5D16S539
Single "Old" SNP
(not population or
allele specific)
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Category Locus rs Number
African
American
Caucasian Hispanic
African
American
Caucasian Hispanic
G-T 4bp 3' 0.316 0.257 0.189 7 5 6
rs25768 0.228 0.257 0.156 5 5 5
rs146841551 0.029 - - 1 - -
rs541272009 0.007 - - 1 - -
rs9546005 0.309 0.286 0.333 5 4 7
rs73525369 0.066 - - 3 - -
rs73250432 - 0.014 0.011 - 2 1
rs146621667 0.015 - - 2 - -
4bp del 8bp 3' - - 0.022 - - 2
4bp del 21bp 3' 0.007 - 0.022 1 - 1
rs16887642 0.177 0.050 0.022 4 2 2
rs7789995 0.015 0.164 0.122 2 4 4
rs7786079 0.176 0.021 0.011 5 2 1
1bp del 21bp 3' - - 0.011 - - 1
D7S820 15
Multiple SNPs in
Haplotype
16D13S317
D5S818 17
Minor Allele Frequency Number of Associated STR Alleles Alleles Gained
with Flanking
SNPs
D5S818:
• G→T 4bp 3’
• rs25768
• two additional rare SNPs
Well distributed across
populations and alleles
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
*[AGAT]n[ACAT][AGAT]
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Conclusions
• Ideal SNPs for increasing allelic diversity are neither population nor
allele specific
• Multiple SNPs in haplotype may substantially increase allelic diversity
• Population studies are needed to associate flanking region
polymorphisms with STR alleles
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Flanking Region Polymorphisms in Practice
Problem is ambiguity
Full string is unambiguous,
intractable for reporting
Report differences from a reference genome
• Delineate length of amplicons
• Agree upon a reference genome
“My Wife and My Mother-in-Law”
William Ely Hill, 1915
Sample …CCAGTTACGACGCTAACTACTCTGAG…
Reference…CCAGTTACGACGCTAACTACTTTGAG…
Lab 1
Sample …CCAGTTACGACGCTAACT
Reference…CCAGTTACGACGCTAACTACTTTGAG…
Lab 3
Sample …CCAGTTACGACGCTAACTACTCTGAG…
Reference…CCAGTTACGACGCTAACTACTCTGAG…
Lab 2
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Flanking Region Polymorphisms in Practice
Problem is ambiguity
Full string is unambiguous,
intractable for reporting
Report differences from a reference genome
• Delineate length of amplicons
• Agree upon a reference genome
• Report rs number (not always available)
• Report position and base differences,
e.g. Chr6:1004589:T
– No allowance for new genome versions
• InDel alignment parameters
“My Wife and My Mother-in-Law”
William Ely Hill, 1915
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Looking Forward
Central repository for population sequence data at forensic STR loci
Labs generating worldwide population data will upload full strings
Database converts strings to tractable notation
• Bracketed repeat region
• Flank polymorphisms with length delineated
Database is searchable by repeat motif or string
Gatekeeper for
unusual sequences:
consistency and
back-compatible
allele calling
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
Acknowledgements
NIST
Pete Vallone
Kevin Kiesler
Nate Olson
Jo Lynne Harenza
Mike Coble
Becky Hill
Margaret Kline
Student Interns
Rachel Aponte - George Washington University
Harish Swaminathan - Rutgers University
Anna Blendermann - Mongomery College
Promega
Doug Storts
Jay Patel
Contact Information
katherine.gettings@nist.gov
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group
NIST Applied Genetics Group

More Related Content

What's hot

An Introduction to Crispr Genome Editing
An Introduction to Crispr Genome EditingAn Introduction to Crispr Genome Editing
An Introduction to Crispr Genome Editing
Chris Thorne
 
Ashg sedlazeck grc_share
Ashg sedlazeck grc_shareAshg sedlazeck grc_share
Ashg sedlazeck grc_share
Genome Reference Consortium
 
GENASSIST™ CRISPR & rAAV Genome Editing Tools
GENASSIST™ CRISPR & rAAV Genome Editing ToolsGENASSIST™ CRISPR & rAAV Genome Editing Tools
GENASSIST™ CRISPR & rAAV Genome Editing Tools
Candy Smellie
 
Molecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics PipelineMolecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics Pipeline
Candy Smellie
 
Research Program Genetic Gains (RPGG) Review Meeting 2021: Groundnut genomic ...
Research Program Genetic Gains (RPGG) Review Meeting 2021: Groundnut genomic ...Research Program Genetic Gains (RPGG) Review Meeting 2021: Groundnut genomic ...
Research Program Genetic Gains (RPGG) Review Meeting 2021: Groundnut genomic ...
ICRISAT
 
Genome Editing Comes of Age; CRISPR, rAAV and the new landscape of molecular ...
Genome Editing Comes of Age; CRISPR, rAAV and the new landscape of molecular ...Genome Editing Comes of Age; CRISPR, rAAV and the new landscape of molecular ...
Genome Editing Comes of Age; CRISPR, rAAV and the new landscape of molecular ...
Candy Smellie
 
Striving for excellence in yam breeding using genomics tools
Striving for excellence in yam breeding using genomics toolsStriving for excellence in yam breeding using genomics tools
Striving for excellence in yam breeding using genomics tools
International Institute of Tropical Agriculture
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
Genome Reference Consortium
 
Church dm grc_workshop
Church dm grc_workshopChurch dm grc_workshop
Church dm grc_workshop
Genome Reference Consortium
 
Gene Editing - Challenges and Future of CRISPR in Clinical Development
Gene Editing - Challenges and Future of CRISPR in Clinical DevelopmentGene Editing - Challenges and Future of CRISPR in Clinical Development
Gene Editing - Challenges and Future of CRISPR in Clinical Development
Medpace
 
Characterizing Alzheimer’s Disease candidate genes and transcripts with targe...
Characterizing Alzheimer’s Disease candidate genes and transcripts with targe...Characterizing Alzheimer’s Disease candidate genes and transcripts with targe...
Characterizing Alzheimer’s Disease candidate genes and transcripts with targe...
Integrated DNA Technologies
 
Translating Genomes | Personalizing Medicine
Translating Genomes | Personalizing MedicineTranslating Genomes | Personalizing Medicine
Translating Genomes | Personalizing Medicine
Candy Smellie
 
Why Y Chromosome Markers are an Ever Expanding Essential Tool in Sexual Assau...
Why Y Chromosome Markers are an Ever Expanding Essential Tool in Sexual Assau...Why Y Chromosome Markers are an Ever Expanding Essential Tool in Sexual Assau...
Why Y Chromosome Markers are an Ever Expanding Essential Tool in Sexual Assau...
Thermo Fisher Scientific
 
Rewriting the Genome Using CRISPR and Synthetic Biology
Rewriting the Genome Using CRISPR and Synthetic Biology Rewriting the Genome Using CRISPR and Synthetic Biology
Rewriting the Genome Using CRISPR and Synthetic Biology
Integrated DNA Technologies
 
Genome Editing Comes of Age
Genome Editing Comes of AgeGenome Editing Comes of Age
Genome Editing Comes of Age
Candy Smellie
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and How
HorizonDiscovery
 
140127 platinum genomes pedigree analyses
140127 platinum genomes pedigree analyses140127 platinum genomes pedigree analyses
140127 platinum genomes pedigree analyses
GenomeInABottle
 
Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.
Nathan Olson
 
26072016 uc davis_small
26072016 uc davis_small26072016 uc davis_small
26072016 uc davis_small
Benjamin Schwessinger
 
Next Generation Sequencing application in virology
Next Generation Sequencing application in virologyNext Generation Sequencing application in virology
Next Generation Sequencing application in virology
Eben Titus
 

What's hot (20)

An Introduction to Crispr Genome Editing
An Introduction to Crispr Genome EditingAn Introduction to Crispr Genome Editing
An Introduction to Crispr Genome Editing
 
Ashg sedlazeck grc_share
Ashg sedlazeck grc_shareAshg sedlazeck grc_share
Ashg sedlazeck grc_share
 
GENASSIST™ CRISPR & rAAV Genome Editing Tools
GENASSIST™ CRISPR & rAAV Genome Editing ToolsGENASSIST™ CRISPR & rAAV Genome Editing Tools
GENASSIST™ CRISPR & rAAV Genome Editing Tools
 
Molecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics PipelineMolecular QC: Interpreting your Bioinformatics Pipeline
Molecular QC: Interpreting your Bioinformatics Pipeline
 
Research Program Genetic Gains (RPGG) Review Meeting 2021: Groundnut genomic ...
Research Program Genetic Gains (RPGG) Review Meeting 2021: Groundnut genomic ...Research Program Genetic Gains (RPGG) Review Meeting 2021: Groundnut genomic ...
Research Program Genetic Gains (RPGG) Review Meeting 2021: Groundnut genomic ...
 
Genome Editing Comes of Age; CRISPR, rAAV and the new landscape of molecular ...
Genome Editing Comes of Age; CRISPR, rAAV and the new landscape of molecular ...Genome Editing Comes of Age; CRISPR, rAAV and the new landscape of molecular ...
Genome Editing Comes of Age; CRISPR, rAAV and the new landscape of molecular ...
 
Striving for excellence in yam breeding using genomics tools
Striving for excellence in yam breeding using genomics toolsStriving for excellence in yam breeding using genomics tools
Striving for excellence in yam breeding using genomics tools
 
171017 giab for giab grc workshop
171017 giab for giab grc workshop171017 giab for giab grc workshop
171017 giab for giab grc workshop
 
Church dm grc_workshop
Church dm grc_workshopChurch dm grc_workshop
Church dm grc_workshop
 
Gene Editing - Challenges and Future of CRISPR in Clinical Development
Gene Editing - Challenges and Future of CRISPR in Clinical DevelopmentGene Editing - Challenges and Future of CRISPR in Clinical Development
Gene Editing - Challenges and Future of CRISPR in Clinical Development
 
Characterizing Alzheimer’s Disease candidate genes and transcripts with targe...
Characterizing Alzheimer’s Disease candidate genes and transcripts with targe...Characterizing Alzheimer’s Disease candidate genes and transcripts with targe...
Characterizing Alzheimer’s Disease candidate genes and transcripts with targe...
 
Translating Genomes | Personalizing Medicine
Translating Genomes | Personalizing MedicineTranslating Genomes | Personalizing Medicine
Translating Genomes | Personalizing Medicine
 
Why Y Chromosome Markers are an Ever Expanding Essential Tool in Sexual Assau...
Why Y Chromosome Markers are an Ever Expanding Essential Tool in Sexual Assau...Why Y Chromosome Markers are an Ever Expanding Essential Tool in Sexual Assau...
Why Y Chromosome Markers are an Ever Expanding Essential Tool in Sexual Assau...
 
Rewriting the Genome Using CRISPR and Synthetic Biology
Rewriting the Genome Using CRISPR and Synthetic Biology Rewriting the Genome Using CRISPR and Synthetic Biology
Rewriting the Genome Using CRISPR and Synthetic Biology
 
Genome Editing Comes of Age
Genome Editing Comes of AgeGenome Editing Comes of Age
Genome Editing Comes of Age
 
CRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and HowCRISPR Screening: the What, Why and How
CRISPR Screening: the What, Why and How
 
140127 platinum genomes pedigree analyses
140127 platinum genomes pedigree analyses140127 platinum genomes pedigree analyses
140127 platinum genomes pedigree analyses
 
Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.Evaluation of the impact of error correction algorithms on SNP calling.
Evaluation of the impact of error correction algorithms on SNP calling.
 
26072016 uc davis_small
26072016 uc davis_small26072016 uc davis_small
26072016 uc davis_small
 
Next Generation Sequencing application in virology
Next Generation Sequencing application in virologyNext Generation Sequencing application in virology
Next Generation Sequencing application in virology
 

Similar to The Next Dimension in STR Sequencing: Polymorphisms in Flanking Regions and Their Allelic Associations

Genome Wide SNPs for Admixture Analysis and Selection Signatures
Genome Wide SNPs for Admixture Analysis and Selection SignaturesGenome Wide SNPs for Admixture Analysis and Selection Signatures
Genome Wide SNPs for Admixture Analysis and Selection Signatures
firdous ahmad
 
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
John Blue
 
150224 giab 30 min generic slides
150224 giab 30 min generic slides150224 giab 30 min generic slides
150224 giab 30 min generic slides
GenomeInABottle
 
A SNP array for human population genetics studies
A SNP array for human population genetics studiesA SNP array for human population genetics studies
A SNP array for human population genetics studies
Affymetrix
 
Eshg poster roman-naranjo
Eshg poster roman-naranjoEshg poster roman-naranjo
Eshg poster roman-naranjo
Pablo Román-Naranjo Varela
 
Forensics: Human Identity Testing in the Applied Genetics Group
Forensics: Human Identity Testing in the Applied Genetics GroupForensics: Human Identity Testing in the Applied Genetics Group
Forensics: Human Identity Testing in the Applied Genetics Group
Nathan Olson
 
Shapiro Webinar V9
Shapiro Webinar V9Shapiro Webinar V9
Shapiro Webinar V9
Brian Shapiro
 
2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練
Abner Huang
 
Tair workshop stanford2017
Tair workshop stanford2017Tair workshop stanford2017
Tair workshop stanford2017
Phoenix Bioinformatics
 
Pete thorpe wp1 april 2018
Pete thorpe wp1 april 2018Pete thorpe wp1 april 2018
Pete thorpe wp1 april 2018
Forest Research
 
Massively Parallel Sequencing - integrating the Ion PGM™ sequencer into your ...
Massively Parallel Sequencing - integrating the Ion PGM™ sequencer into your ...Massively Parallel Sequencing - integrating the Ion PGM™ sequencer into your ...
Massively Parallel Sequencing - integrating the Ion PGM™ sequencer into your ...
Thermo Fisher Scientific
 
Y_Workshop_WI_planz (3).ppt12345789999987543
Y_Workshop_WI_planz (3).ppt12345789999987543Y_Workshop_WI_planz (3).ppt12345789999987543
Y_Workshop_WI_planz (3).ppt12345789999987543
alizain9604
 
Y_Workshop_WI_planz (1).ppt123457800kjbvc
Y_Workshop_WI_planz (1).ppt123457800kjbvcY_Workshop_WI_planz (1).ppt123457800kjbvc
Y_Workshop_WI_planz (1).ppt123457800kjbvc
alizain9604
 

Similar to The Next Dimension in STR Sequencing: Polymorphisms in Flanking Regions and Their Allelic Associations (13)

Genome Wide SNPs for Admixture Analysis and Selection Signatures
Genome Wide SNPs for Admixture Analysis and Selection SignaturesGenome Wide SNPs for Admixture Analysis and Selection Signatures
Genome Wide SNPs for Admixture Analysis and Selection Signatures
 
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
Dr. Igor Paploski - Making Epidemiological Sense Out of Large Datasets of PRR...
 
150224 giab 30 min generic slides
150224 giab 30 min generic slides150224 giab 30 min generic slides
150224 giab 30 min generic slides
 
A SNP array for human population genetics studies
A SNP array for human population genetics studiesA SNP array for human population genetics studies
A SNP array for human population genetics studies
 
Eshg poster roman-naranjo
Eshg poster roman-naranjoEshg poster roman-naranjo
Eshg poster roman-naranjo
 
Forensics: Human Identity Testing in the Applied Genetics Group
Forensics: Human Identity Testing in the Applied Genetics GroupForensics: Human Identity Testing in the Applied Genetics Group
Forensics: Human Identity Testing in the Applied Genetics Group
 
Shapiro Webinar V9
Shapiro Webinar V9Shapiro Webinar V9
Shapiro Webinar V9
 
2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練2009 CSBB LAB 新生訓練
2009 CSBB LAB 新生訓練
 
Tair workshop stanford2017
Tair workshop stanford2017Tair workshop stanford2017
Tair workshop stanford2017
 
Pete thorpe wp1 april 2018
Pete thorpe wp1 april 2018Pete thorpe wp1 april 2018
Pete thorpe wp1 april 2018
 
Massively Parallel Sequencing - integrating the Ion PGM™ sequencer into your ...
Massively Parallel Sequencing - integrating the Ion PGM™ sequencer into your ...Massively Parallel Sequencing - integrating the Ion PGM™ sequencer into your ...
Massively Parallel Sequencing - integrating the Ion PGM™ sequencer into your ...
 
Y_Workshop_WI_planz (3).ppt12345789999987543
Y_Workshop_WI_planz (3).ppt12345789999987543Y_Workshop_WI_planz (3).ppt12345789999987543
Y_Workshop_WI_planz (3).ppt12345789999987543
 
Y_Workshop_WI_planz (1).ppt123457800kjbvc
Y_Workshop_WI_planz (1).ppt123457800kjbvcY_Workshop_WI_planz (1).ppt123457800kjbvc
Y_Workshop_WI_planz (1).ppt123457800kjbvc
 

Recently uploaded

Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
SSR02
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
Anagha Prasad
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
muralinath2
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
RitabrataSarkar3
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
Sérgio Sacani
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
Abdul Wali Khan University Mardan,kP,Pakistan
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
University of Hertfordshire
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
University of Maribor
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
TinyAnderson
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
University of Maribor
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
KrushnaDarade1
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
Nistarini College, Purulia (W.B) India
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
Sharon Liu
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
Texas Alliance of Groundwater Districts
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
kejapriya1
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
University of Rennes, INSA Rennes, Inria/IRISA, CNRS
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
MAGOTI ERNEST
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
Daniel Tubbenhauer
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
moosaasad1975
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
AbdullaAlAsif1
 

Recently uploaded (20)

Nucleophilic Addition of carbonyl compounds.pptx
Nucleophilic Addition of carbonyl  compounds.pptxNucleophilic Addition of carbonyl  compounds.pptx
Nucleophilic Addition of carbonyl compounds.pptx
 
molar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptxmolar-distalization in orthodontics-seminar.pptx
molar-distalization in orthodontics-seminar.pptx
 
Oedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptxOedema_types_causes_pathophysiology.pptx
Oedema_types_causes_pathophysiology.pptx
 
Eukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptxEukaryotic Transcription Presentation.pptx
Eukaryotic Transcription Presentation.pptx
 
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
EWOCS-I: The catalog of X-ray sources in Westerlund 1 from the Extended Weste...
 
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...THEMATIC  APPERCEPTION  TEST(TAT) cognitive abilities, creativity, and critic...
THEMATIC APPERCEPTION TEST(TAT) cognitive abilities, creativity, and critic...
 
Applied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdfApplied Science: Thermodynamics, Laws & Methodology.pdf
Applied Science: Thermodynamics, Laws & Methodology.pdf
 
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
Comparing Evolved Extractive Text Summary Scores of Bidirectional Encoder Rep...
 
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdfTopic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
Topic: SICKLE CELL DISEASE IN CHILDREN-3.pdf
 
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
Remote Sensing and Computational, Evolutionary, Supercomputing, and Intellige...
 
SAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdfSAR of Medicinal Chemistry 1st by dk.pdf
SAR of Medicinal Chemistry 1st by dk.pdf
 
Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.Nucleic Acid-its structural and functional complexity.
Nucleic Acid-its structural and functional complexity.
 
20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx20240520 Planning a Circuit Simulator in JavaScript.pptx
20240520 Planning a Circuit Simulator in JavaScript.pptx
 
Bob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdfBob Reedy - Nitrate in Texas Groundwater.pdf
Bob Reedy - Nitrate in Texas Groundwater.pdf
 
bordetella pertussis.................................ppt
bordetella pertussis.................................pptbordetella pertussis.................................ppt
bordetella pertussis.................................ppt
 
Deep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless ReproducibilityDeep Software Variability and Frictionless Reproducibility
Deep Software Variability and Frictionless Reproducibility
 
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptxThe use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
The use of Nauplii and metanauplii artemia in aquaculture (brine shrimp).pptx
 
Equivariant neural networks and representation theory
Equivariant neural networks and representation theoryEquivariant neural networks and representation theory
Equivariant neural networks and representation theory
 
What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.What is greenhouse gasses and how many gasses are there to affect the Earth.
What is greenhouse gasses and how many gasses are there to affect the Earth.
 
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
Unlocking the mysteries of reproduction: Exploring fecundity and gonadosomati...
 

The Next Dimension in STR Sequencing: Polymorphisms in Flanking Regions and Their Allelic Associations

  • 1. THE NEXT DIMENSION IN STR SEQUENCING: Polymorphisms in Flanking Regions & their Allelic Associations Katherine Butler Gettings Rachel Aponte Kevin Kiesler Peter Vallone September 2, 2015 NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 2. Disclaimer I will mention commercial STR kit names and information, but I am not attempting to endorse any specific products. NIST Disclaimer: Certain commercial equipment, instruments and materials are identified in order to specify experimental procedures as completely as possible. In no case does such identification imply a recommendation or it imply that any of the materials, instruments or equipment identified are necessarily the best available for the purpose. Information presented does not necessarily represent the official position of the National Institute of Standards and Technology or the U.S. Department of Justice. Our group receives or has received funding from the FBI Laboratory and the National Institute of Justice. NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 3. NIST Population Samples • N=183 •Caucasian - 70 •Hispanic - 45 •African American - 68 Amplification & Library Prep •PowerSeq Auto System (Beta) •PPFusion Loci •Illumina TruSeq HS PCR-Free Sequencing •MiSeq Bioinformatics •STRait Razor •ExactID (Battelle) •CE concordance Analysis •Repeat Region •Flanking Region •Stutter Data GenerationNIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 4. N=183 Length NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 5. Forensic STR Sequence Diversity N=183 Simple Repeats NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 6. 150 bp flanks 110 bp flanks NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 7. Categories of Flanking Regions 1. No polymorphisms 2. Polymorphisms Associated with a Sequence Variant 3. Rare polymorphisms 4. Population or Allele Specific polymorphisms 5. “Old” polymorphisms (not population or allele specific) 6. Multiple polymorphisms in Haplotype NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 8. Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic D3S1358 FGA D12S391 Penta E D19S433 D21S11 D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0 rs11063971 0.044 0.086 0.078 1 1 1 rs11063970 0.044 0.086 0.078 1 1 1 rs11063969 0.044 0.086 0.078 1 1 1 rs75219269 0.044 0.086 0.078 1 1 1 rs199970098 0.029 - 0.011 2 - 1 D2S441 rs74640515 0.015 0.014 0.067 1 1 2 2 CSF1PO C-T 36bp 5' - - 0.011 - - 1 1 D8S1179 rs138862078 0.007 - - 1 - - 1 D10S1248 T-G 2bp 3' - 0.007 - - 1 - 1 D18S51 rs535833682 0.029 - - 3 - - 1 Penta D rs7279663 0.022 0.014 0.011 1 2 1 2 D22S1045 rs190864081 0.007 - - 1 - - 1 rs13422969 0.135 - 0.011 2 - 1 rs115644759 0.022 - - 2 - - rs149212737 - 0.007 - - 1 - TH01 rs79373318 0.148 - - 3 - - 3 rs1728369 0.404 0.171 0.278 4 4 4 G-A 94bp 5' - - 0.011 - - 1 D2S1338 rs6736691 0.221 0.264 0.300 8 6 4 3 G-T 4bp 3' 0.316 0.257 0.189 7 5 6 rs25768 0.228 0.257 0.156 5 5 5 rs146841551 0.029 - - 1 - - rs541272009 0.007 - - 1 - - rs9546005 0.309 0.286 0.333 5 4 7 rs73525369 0.066 - - 3 - - rs73250432 - 0.014 0.011 - 2 1 rs146621667 0.015 - - 2 - - 4bp del 8bp 3' - - 0.022 - - 2 4bp del 21bp 3' 0.007 - 0.022 1 - 1 rs16887642 0.177 0.050 0.022 4 2 2 rs7789995 0.015 0.164 0.122 2 4 4 rs7786079 0.176 0.021 0.011 5 2 1 1bp del 21bp 3' - - 0.011 - - 1 Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs No SNPs 0vWA SNPs Associated with Repeat Region Sequence Variant 5D16S539 Single "Old" SNP (not population or allele specific) TPOXPopulation / Allele Specific SNPs Rare SNPs 4 16D13S317 D5S818 17 D7S820 15 Multiple SNPs in Haplotype NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 9. No SNPs were identified in the sequenced flanking regions of: D3S1358 Penta E FGA D19S433 D12S391 D21S11 Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic D3S1358 FGA D12S391 Penta E D19S433 D21S11 Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs No SNPs NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 10. Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0 rs11063971 0.044 0.086 0.078 1 1 1 rs11063970 0.044 0.086 0.078 1 1 1 rs11063969 0.044 0.086 0.078 1 1 1 rs75219269 0.044 0.086 0.078 1 1 1 rs199970098 0.029 - 0.011 2 - 1 0vWA SNPs Associated with Repeat Region Sequence Variant Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs D1S1656 has one SNP, rs4847015 • Well distributed across populations… NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 11. Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0 rs11063971 0.044 0.086 0.078 1 1 1 rs11063970 0.044 0.086 0.078 1 1 1 rs11063969 0.044 0.086 0.078 1 1 1 rs75219269 0.044 0.086 0.078 1 1 1 rs199970098 0.029 - 0.011 2 - 1 0vWA SNPs Associated with Repeat Region Sequence Variant Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs D1S1656 has one SNP, rs4847015 • Well distributed across populations • Well distributed across STR alleles • Does NOT increase the number of alleles… NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 12. D1S1656 has one SNP, rs4847015 and it is always associated with x.3 alleles: 15.3 16.3 17.3 rs4847015 = T 18.3 19.3 Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic D1S1656 rs4847015 0.191 0.336 0.311 5 5 4 0 rs11063971 0.044 0.086 0.078 1 1 1 rs11063970 0.044 0.086 0.078 1 1 1 rs11063969 0.044 0.086 0.078 1 1 1 rs75219269 0.044 0.086 0.078 1 1 1 rs199970098 0.029 - 0.011 2 - 1 0vWA SNPs Associated with Repeat Region Sequence Variant Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 13. Rare SNPs were identified in the sequenced flanking regions of: D2S441 D18S51 CSF1PO Penta D D8S1179 D22S1045 D10S1248 Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic D2S441 rs74640515 0.015 0.014 0.067 1 1 2 2 CSF1PO C-T 36bp 5' - - 0.011 - - 1 1 D8S1179 rs138862078 0.007 - - 1 - - 1 D10S1248 T-G 2bp 3' - 0.007 - - 1 - 1 D18S51 rs535833682 0.029 - - 3 - - 1 Penta D rs7279663 0.022 0.014 0.011 1 2 1 2 D22S1045 rs190864081 0.007 - - 1 - - 1 Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs Rare SNPs NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 14. Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic rs13422969 0.135 - 0.011 2 - 1 rs115644759 0.022 - - 2 - - rs149212737 - 0.007 - - 1 - TH01 rs79373318 0.148 - - 3 - - 3 TPOX Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs Population / Allele Specific SNPs 4 TPOX: rs13422969 and two additional rare SNPs • Well distributed across Africa, rare elsewhere NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 15. Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic rs13422969 0.135 - 0.011 2 - 1 rs115644759 0.022 - - 2 - - rs149212737 - 0.007 - - 1 - TH01 rs79373318 0.148 - - 3 - - 3 TPOX Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs Population / Allele Specific SNPs 4 TPOX: rs13422969 and two additional rare SNPs • Well distributed across Africa, rare elsewhere • Associated with “9” allele NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 16. D16S539: rs1728369 and one additional rare SNP • Well distributed across populations and alleles Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic rs1728369 0.404 0.171 0.278 4 4 4 G-A 94bp 5' - - 0.011 - - 1 D2S1338 rs6736691 0.221 0.264 0.300 8 6 4 3 5D16S539 Single "Old" SNP (not population or allele specific) Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 17. Category Locus rs Number African American Caucasian Hispanic African American Caucasian Hispanic G-T 4bp 3' 0.316 0.257 0.189 7 5 6 rs25768 0.228 0.257 0.156 5 5 5 rs146841551 0.029 - - 1 - - rs541272009 0.007 - - 1 - - rs9546005 0.309 0.286 0.333 5 4 7 rs73525369 0.066 - - 3 - - rs73250432 - 0.014 0.011 - 2 1 rs146621667 0.015 - - 2 - - 4bp del 8bp 3' - - 0.022 - - 2 4bp del 21bp 3' 0.007 - 0.022 1 - 1 rs16887642 0.177 0.050 0.022 4 2 2 rs7789995 0.015 0.164 0.122 2 4 4 rs7786079 0.176 0.021 0.011 5 2 1 1bp del 21bp 3' - - 0.011 - - 1 D7S820 15 Multiple SNPs in Haplotype 16D13S317 D5S818 17 Minor Allele Frequency Number of Associated STR Alleles Alleles Gained with Flanking SNPs D5S818: • G→T 4bp 3’ • rs25768 • two additional rare SNPs Well distributed across populations and alleles NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 18. NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 19. *[AGAT]n[ACAT][AGAT] NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 20. Conclusions • Ideal SNPs for increasing allelic diversity are neither population nor allele specific • Multiple SNPs in haplotype may substantially increase allelic diversity • Population studies are needed to associate flanking region polymorphisms with STR alleles NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 21. Flanking Region Polymorphisms in Practice Problem is ambiguity Full string is unambiguous, intractable for reporting Report differences from a reference genome • Delineate length of amplicons • Agree upon a reference genome “My Wife and My Mother-in-Law” William Ely Hill, 1915 Sample …CCAGTTACGACGCTAACTACTCTGAG… Reference…CCAGTTACGACGCTAACTACTTTGAG… Lab 1 Sample …CCAGTTACGACGCTAACT Reference…CCAGTTACGACGCTAACTACTTTGAG… Lab 3 Sample …CCAGTTACGACGCTAACTACTCTGAG… Reference…CCAGTTACGACGCTAACTACTCTGAG… Lab 2 NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 22. Flanking Region Polymorphisms in Practice Problem is ambiguity Full string is unambiguous, intractable for reporting Report differences from a reference genome • Delineate length of amplicons • Agree upon a reference genome • Report rs number (not always available) • Report position and base differences, e.g. Chr6:1004589:T – No allowance for new genome versions • InDel alignment parameters “My Wife and My Mother-in-Law” William Ely Hill, 1915 NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 23. Looking Forward Central repository for population sequence data at forensic STR loci Labs generating worldwide population data will upload full strings Database converts strings to tractable notation • Bracketed repeat region • Flank polymorphisms with length delineated Database is searchable by repeat motif or string Gatekeeper for unusual sequences: consistency and back-compatible allele calling NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group
  • 24. Acknowledgements NIST Pete Vallone Kevin Kiesler Nate Olson Jo Lynne Harenza Mike Coble Becky Hill Margaret Kline Student Interns Rachel Aponte - George Washington University Harish Swaminathan - Rutgers University Anna Blendermann - Mongomery College Promega Doug Storts Jay Patel Contact Information katherine.gettings@nist.gov NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group NIST Applied Genetics Group