SlideShare a Scribd company logo
1 of 21
4th International Conference on Bioinformatics and Computational Biology (BICoB)

Detecting STR peaks in degraded
DNA samples
Emanuela Marasco, Arun Ross, Jeremy Dawson, Tina Moroose, Tanya Ambrose
Lane Department of Computer Science and Electrical Engineering
&
Forensic and Investigative Science Program

March 13, 2012 - Las Vegas, Nevada, USA

Thanks to
Prof. Tom O’Haver
1
Outline
•
•
•
•
•
•

DNA typing
Short Tandem Repeat (STR)
DNA analysis via GeneMapperID
The proposed signal processing approach
Results
Conclusions and future research

2
DNA typing
• Technologies used for
performing DNA analysis
differ in their ability to
differentiate two
individuals for 2 aspects:
• The speed
• The sensitivity

• STR typing offers the
best trade-off

• Our contribution: improve the sensitivity of the DNA
processing methodologies in the presence of degraded samples
J. Butler, Forensic DNA typing, Biology, Technology, and Genetic of STR markers, Second Edition

3
The human genome
• The human genome contained
in every cell consists of 23
pairs chromosomes

• Chromosomes which are dense
packets of DNA and proteins
• Four bases make up DNA:
Adenine (A), Guanine (G),
Cytosine (C) and Thymine (T)

• Most human identity testing is performed using markers on
autosomal chromosomes
• A DNA marker or locus refers to the chromosomal position
J. Butler, Forensic DNA typing, Biology, Technology, and Genetic of STR markers, Second Edition
Alleles
• The 99.7% of our DNA molecules is the
same between people; a small fraction of
0.3% makes us unique individuals
• DNA variation is exhibited in the form of
different alleles; an allele is a variant of the
DNA sequence or length at a given locus
Sequence
polymorphism

Length
polymorphism

5
What is a Short Tandem Repeat (STR)?
ATCTTCTAACACATGACCGATCATGCATGCATGCATGCATGC
ATGCATGCATGCATGCATGCATGTTCCATGATAGCACAT
• An STR is the repetitive section of the sequence
(2-5 base pair)
• STRs are short and fast to be processed
(all together at a time)

• Human identification from DNA is
typically based on 13 core loci
(american system)
• They are very discriminative
among individuals
• There is no overlapping among
loci
6
Steps in DNA sample processing
Biological perspective

DNA
extraction

DNA
quantitation

PCR
amplification
of multiple
STR markers

PCR
products

Technological perspective
Separation and
detection of PCR product
(STR alleles)

•

Sample
genotype
determination

DNA
profile

Amplified data are displayed as fluorescent peaks
7
DNA analysis via GeneMapperID

• the Internal Lane Standard (ILS)
is used to assign the size the peaks

Internal Lane Standard
• only peaks above an analytical threshold are
considered real data (lower peaks are noise)
• actual peaks are higher than a stochastic
threshold
8
Effects of degradation on STR peaks
•

•

Degradation process occurs

rapidly when samples are
exposed to the enviromental
conditions: UV light, humidity,
high temperature and bacterial
contamination
As samples age, DNA begins to
break down (or degrade)

Nondegraded

45 secs
• Degradation can reduce the
height of some peaks or making
them disappear entirely
• Peaks are shifted

http://www.bioforensics.com/articles/champion1/champion1.html

75 secs

9
The proposed signal processing scheme
• A DNA sample is represented by a DNA signal in which x-axis
indicates data point and y-axis the amplitude
• A degraded DNA sample is represented by a weak signal
confounded by several noise sources

Input:
• Let x= {xj} with j=1, …N
be the input DNA signal
• Let NC= {NCj} with
j=1, …N be the signal
due to the Negative
Control
• Let t= {tj} with j=1, …N
be the instances in which
the signals are sampled

Positive Control
(Reagents, DNA, ILS)

Negative Control
(Reagents, ILS)

10
Differentiation of signals
Peak-type signal

Derivative
• Given a peak-type
signal such as those
used in STR analysis,
the location of the
maximum can be
computed as location of
the zero-crossing points
in its first derivative

Peak-type signal

Derivative
• Given two peaks
having the same
height, the wider
peak results in a
lower amplitude in
the first derivative
11
Our proposed method
1. De-noise the DNA signal x:

PC-NC

Negative Control (NC) contains
only reagents and no DNA

2. Compute the first derivative
of the enhanced signal:

Diff(PC-NC)

False
Positive
12
Our proposed method
3. Smooth the derivative of the signal
each point in the signal is replaced with the average of
m adiacent points (m is the smooth width)

http://terpconnect.umd.edu/~toh/spectrum/PeakFindingandMeasurement.htm

13
Our proposed method
4. Peak detection

based on an amplitude threshold and a slope threshold
5. Estimate peaks details (location, height and width) on the
un-smoothed signal by using a curve fitting function:
a polynomial of degree 2 is fitted through the detected peaks

and

are mean and standard deviation of a set of
points in the vicinity of the detected peak

14
Dataset 1: ultraviolet degradation
• The first dataset was collected at the WVU Department of
Forensic and Investigative Sciences by performing a controlled
artificial DNA degradation using ultraviolet radiation
• Positive Control
• Degraded samples obtained after an exposure to UV of
35secs, 75secs, 150secs and 240secs

Pang B. C. M. and Cheung B. K. K. One-step generation of degraded DNA by UV irradiation.
Analytical Biochemistry 2007; 360: 163-165

15
Dataset 2: Low Copy Number (LCN) data
• Low Copy Number (LCN) are samples which contain
less than 100pg of DNA template
• One of the reason for LCN is degradation
• The second dataset was provided by NIST obtained
by varying cycle counts for the PCR processing step
•
•

Positive Control, 1 ng/ μL
Low Copy Number (LCN):
100pg/ μL, 30pg/ μL and 10pg/ μL
at 28 cycles and 31 cycles

16
Results on degraded data with UV
GeneMapper results
In the presence of degraded samples,
for th=100
• 1 detected peak for a degradation level
obtained after an UV exposure of 75 secs
• 0 detected peaks for a obtained after an
exposure of 150 secs onward 0

Peak finding results
For Th= 0.37*max(x(t))

7 peaks detected for the sample obtained
after an UV exposure of 150 seconds
• 3 peaks for the sample obtained after an
UV exposure of 240 seconds

17
Results on Low Copy Number (LCN) data
GeneMapper results
Th = 100

Peak finding results

• The success rate of GeneMapper typing system decreases when
decreasing the DNA amount present in the analyzed samples
• The amount of DNA factoring the sample presents a non-significant
impact on the performance detection of the peak finding algorithm

18
Obtained improvement
• Under UV degradation
PC

0%

35 secs

0%

75 secs

80%

150 secs

100%

240 secs

100%

• In the presence of LCN
1ng

-50.0%

100 pg

3.1%

30 pg

34.4%

10 pg

25.0%

• Results are reported for blue dye data by setting the
GeneMapperID threshold to 100 and averaging on two
samples for LCN data

• The peak detection rate improves in the presence of
degraded samples since the algorithm has been designed
to deal with noise signals
19
Conclusions
•

Strength of the proposed approach:
– it uses an adaptive threshold
– High discrimination power against wider peaks provided by
differentiation

•

Our experiments show the robustness of the proposed peak
finding algorithm to high level UV degradation and when dealing
with critical amount of DNA (less than 100pg)

•

Limitation: the peak detection algorithm uses a global threshold

•

Coming up:
– to model the degradation process
– designing a local threshold
– since the adopted derivative was first-order, we will carry out
experiments with higher order
20
Thanks!
Any questions?
Acknowledgements
•
•
•
•
•

The work was funded by Citer
Many thanks to Raghunandan Pasula, Lane Department of Computer
Science and Electrical Engineering, West Virginia University, for his
assistance during the development of the project;
Prof. Thomas C. O’Haver, Department of Chemistry and Biochemistry,
University of Maryland for his assistance with our queries regarding
peak detection;
National Forensic Science Technology Center (NFSTC) for providing
scientific training services.
The Peak Finding code developed by Prof. O’Haver was used in this work
21

More Related Content

What's hot

What's hot (20)

STR IMP.pptx
STR IMP.pptxSTR IMP.pptx
STR IMP.pptx
 
Forensic DNA Typing-M. Asif
Forensic DNA Typing-M. AsifForensic DNA Typing-M. Asif
Forensic DNA Typing-M. Asif
 
Dna forensic
Dna forensicDna forensic
Dna forensic
 
DNA FORENSIC ANALYSIS
DNA FORENSIC ANALYSISDNA FORENSIC ANALYSIS
DNA FORENSIC ANALYSIS
 
Interpretation of dna typing results and codis
Interpretation of dna typing results and codis Interpretation of dna typing results and codis
Interpretation of dna typing results and codis
 
understanding Dna as an evidence
 understanding Dna as an evidence understanding Dna as an evidence
understanding Dna as an evidence
 
Quantitation of DNA
Quantitation of DNAQuantitation of DNA
Quantitation of DNA
 
Dna in forensics
Dna in forensicsDna in forensics
Dna in forensics
 
Dna typing methods
Dna typing methodsDna typing methods
Dna typing methods
 
Forensic Serology
Forensic SerologyForensic Serology
Forensic Serology
 
Collection of DNA evidence at crime scene
Collection of DNA evidence at crime sceneCollection of DNA evidence at crime scene
Collection of DNA evidence at crime scene
 
Automated Fingerprint Identification System (AFIS)
Automated Fingerprint Identification System (AFIS)Automated Fingerprint Identification System (AFIS)
Automated Fingerprint Identification System (AFIS)
 
An introduction to fluorescent in situ hybridization.
An introduction to fluorescent in situ hybridization.An introduction to fluorescent in situ hybridization.
An introduction to fluorescent in situ hybridization.
 
Overview History of Forensic Science
Overview History of Forensic ScienceOverview History of Forensic Science
Overview History of Forensic Science
 
forensic molecular biology
forensic molecular biologyforensic molecular biology
forensic molecular biology
 
Physical evidence in forensic science-1.pptx
Physical evidence in forensic science-1.pptxPhysical evidence in forensic science-1.pptx
Physical evidence in forensic science-1.pptx
 
Forensic Science around the world
Forensic Science around the worldForensic Science around the world
Forensic Science around the world
 
DNA Analysis
DNA Analysis DNA Analysis
DNA Analysis
 
Forensic Serology: BLOOD
Forensic Serology: BLOODForensic Serology: BLOOD
Forensic Serology: BLOOD
 
DNA TYPING
DNA TYPINGDNA TYPING
DNA TYPING
 

Viewers also liked

Gender Estimation from Fingerprints / Image De-identification for Gender
Gender Estimation from Fingerprints / Image De-identification for Gender Gender Estimation from Fingerprints / Image De-identification for Gender
Gender Estimation from Fingerprints / Image De-identification for Gender Emanuela Marasco
 
Dnaprofiling
DnaprofilingDnaprofiling
Dnaprofilingallyjer
 
DNA extraction presentation
DNA extraction presentationDNA extraction presentation
DNA extraction presentationnortje
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2teamchaotex
 
Sherlyn's genetic epidemiology
Sherlyn's genetic epidemiologySherlyn's genetic epidemiology
Sherlyn's genetic epidemiologyvavaponnu
 
State v. Mott: A Case Study in Forensic Science
State v. Mott: A Case Study in Forensic ScienceState v. Mott: A Case Study in Forensic Science
State v. Mott: A Case Study in Forensic Sciencegcpolando
 
DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification Senthil Natesan
 
Single Nucleotide Polymorphism Analysis (SNPs)
Single Nucleotide Polymorphism Analysis (SNPs)Single Nucleotide Polymorphism Analysis (SNPs)
Single Nucleotide Polymorphism Analysis (SNPs)Data Science Thailand
 
Types of pcr
Types of pcr Types of pcr
Types of pcr Asma Gul
 
DNA fingerprinting 7 jan 2015
DNA fingerprinting 7 jan 2015DNA fingerprinting 7 jan 2015
DNA fingerprinting 7 jan 2015ICHHA PURAK
 
Molecular markers used in biotechnology
Molecular markers used in biotechnology Molecular markers used in biotechnology
Molecular markers used in biotechnology sana sana
 
Dna Fingerprinting And Forensic Applications
Dna Fingerprinting And Forensic ApplicationsDna Fingerprinting And Forensic Applications
Dna Fingerprinting And Forensic Applicationsdheva B
 
MICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSES
MICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSESMICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSES
MICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSESKaran Veer Singh
 
Find the CAT - DNA Profiling Worksheet
Find the CAT - DNA Profiling WorksheetFind the CAT - DNA Profiling Worksheet
Find the CAT - DNA Profiling WorksheetStephen Taylor
 
VNTR and RFLP
VNTR and RFLPVNTR and RFLP
VNTR and RFLPUlaa Iman
 

Viewers also liked (20)

Gender Estimation from Fingerprints / Image De-identification for Gender
Gender Estimation from Fingerprints / Image De-identification for Gender Gender Estimation from Fingerprints / Image De-identification for Gender
Gender Estimation from Fingerprints / Image De-identification for Gender
 
Dnaprofiling
DnaprofilingDnaprofiling
Dnaprofiling
 
DNA extraction presentation
DNA extraction presentationDNA extraction presentation
DNA extraction presentation
 
Fs Ch 14
Fs Ch 14Fs Ch 14
Fs Ch 14
 
Ch12 notes
Ch12 notesCh12 notes
Ch12 notes
 
Dna profiling
Dna profilingDna profiling
Dna profiling
 
Dna profiling presentation x2
Dna profiling presentation x2Dna profiling presentation x2
Dna profiling presentation x2
 
Sherlyn's genetic epidemiology
Sherlyn's genetic epidemiologySherlyn's genetic epidemiology
Sherlyn's genetic epidemiology
 
State v. Mott: A Case Study in Forensic Science
State v. Mott: A Case Study in Forensic ScienceState v. Mott: A Case Study in Forensic Science
State v. Mott: A Case Study in Forensic Science
 
DNA Fingerprinting
DNA FingerprintingDNA Fingerprinting
DNA Fingerprinting
 
DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification DNA Markers Techniques for Plant Varietal Identification
DNA Markers Techniques for Plant Varietal Identification
 
Single Nucleotide Polymorphism Analysis (SNPs)
Single Nucleotide Polymorphism Analysis (SNPs)Single Nucleotide Polymorphism Analysis (SNPs)
Single Nucleotide Polymorphism Analysis (SNPs)
 
Types of pcr
Types of pcr Types of pcr
Types of pcr
 
DNA fingerprinting 7 jan 2015
DNA fingerprinting 7 jan 2015DNA fingerprinting 7 jan 2015
DNA fingerprinting 7 jan 2015
 
Molecular markers used in biotechnology
Molecular markers used in biotechnology Molecular markers used in biotechnology
Molecular markers used in biotechnology
 
Dna Fingerprinting And Forensic Applications
Dna Fingerprinting And Forensic ApplicationsDna Fingerprinting And Forensic Applications
Dna Fingerprinting And Forensic Applications
 
MICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSES
MICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSESMICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSES
MICROSATELITE Markers for LIVESTOCK Genetic DIVERSITY ANALYSES
 
Find the CAT - DNA Profiling Worksheet
Find the CAT - DNA Profiling WorksheetFind the CAT - DNA Profiling Worksheet
Find the CAT - DNA Profiling Worksheet
 
VNTR and RFLP
VNTR and RFLPVNTR and RFLP
VNTR and RFLP
 
Dna profiling
Dna profilingDna profiling
Dna profiling
 

Similar to Detecting STR Peaks in Degraded DNA samples

Microarray technique
Microarray techniqueMicroarray technique
Microarray techniquearunchacko14
 
Microarry andd NGS.pdf
Microarry andd NGS.pdfMicroarry andd NGS.pdf
Microarry andd NGS.pdfnedalalazzwy
 
DNA MICROARRAY TECHNOLOGY FOR PRINCIPLE OF DRUG DISCOVERY
DNA  MICROARRAY  TECHNOLOGY FOR  PRINCIPLE OF DRUG DISCOVERYDNA  MICROARRAY  TECHNOLOGY FOR  PRINCIPLE OF DRUG DISCOVERY
DNA MICROARRAY TECHNOLOGY FOR PRINCIPLE OF DRUG DISCOVERYDhanashri Prakash Sonavane
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods Mrinal Vashisth
 
Dna fingerprinting
Dna fingerprintingDna fingerprinting
Dna fingerprintingmantoshrock
 
Lecture 10 2023Lecture 10 2023Lecture 10 2023.ppt
Lecture 10 2023Lecture 10 2023Lecture 10 2023.pptLecture 10 2023Lecture 10 2023Lecture 10 2023.ppt
Lecture 10 2023Lecture 10 2023Lecture 10 2023.pptAbdelrhman Abooda
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomicsAjit Shinde
 
Useful.ppt
Useful.pptUseful.ppt
Useful.pptaaaa bbb
 
DNA Sequencing - DNA sequencing is like reading the instructions inside a cell
DNA Sequencing -  DNA sequencing is like reading the instructions inside a cellDNA Sequencing -  DNA sequencing is like reading the instructions inside a cell
DNA Sequencing - DNA sequencing is like reading the instructions inside a cellAmitSamadhiya1
 
DNA Sequencing: History, methods and NGS
DNA Sequencing: History, methods and NGSDNA Sequencing: History, methods and NGS
DNA Sequencing: History, methods and NGS4RTPCRAnand
 
DNA Microarray introdution and application
DNA Microarray introdution and applicationDNA Microarray introdution and application
DNA Microarray introdution and applicationNeeraj Sharma
 
DNA based diagnosis of geneticdiseases - by Chinmayi Upadhyaya
DNA based diagnosis of geneticdiseases - by Chinmayi UpadhyayaDNA based diagnosis of geneticdiseases - by Chinmayi Upadhyaya
DNA based diagnosis of geneticdiseases - by Chinmayi UpadhyayaChinmayi Upadhyaya
 
Next generation sequencing methods (final edit)
Next generation sequencing methods (final edit)Next generation sequencing methods (final edit)
Next generation sequencing methods (final edit)Mrinal Vashisth
 
Nucleic acid techniques in diagnostic microbiology
Nucleic acid techniques in diagnostic microbiologyNucleic acid techniques in diagnostic microbiology
Nucleic acid techniques in diagnostic microbiologymohit kumar
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0Computer Science Club
 

Similar to Detecting STR Peaks in Degraded DNA samples (20)

Microarray technique
Microarray techniqueMicroarray technique
Microarray technique
 
Microarry andd NGS.pdf
Microarry andd NGS.pdfMicroarry andd NGS.pdf
Microarry andd NGS.pdf
 
Presentation (4).pptx
Presentation (4).pptxPresentation (4).pptx
Presentation (4).pptx
 
DNA MICROARRAY TECHNOLOGY FOR PRINCIPLE OF DRUG DISCOVERY
DNA  MICROARRAY  TECHNOLOGY FOR  PRINCIPLE OF DRUG DISCOVERYDNA  MICROARRAY  TECHNOLOGY FOR  PRINCIPLE OF DRUG DISCOVERY
DNA MICROARRAY TECHNOLOGY FOR PRINCIPLE OF DRUG DISCOVERY
 
Next generation sequencing methods
Next generation sequencing methods Next generation sequencing methods
Next generation sequencing methods
 
12 arrays
12 arrays12 arrays
12 arrays
 
12 arrays
12 arrays12 arrays
12 arrays
 
Dna fingerprinting
Dna fingerprintingDna fingerprinting
Dna fingerprinting
 
Lecture 10 2023Lecture 10 2023Lecture 10 2023.ppt
Lecture 10 2023Lecture 10 2023Lecture 10 2023.pptLecture 10 2023Lecture 10 2023Lecture 10 2023.ppt
Lecture 10 2023Lecture 10 2023Lecture 10 2023.ppt
 
Functional genomics
Functional genomicsFunctional genomics
Functional genomics
 
Useful.ppt
Useful.pptUseful.ppt
Useful.ppt
 
DNA Sequencing - DNA sequencing is like reading the instructions inside a cell
DNA Sequencing -  DNA sequencing is like reading the instructions inside a cellDNA Sequencing -  DNA sequencing is like reading the instructions inside a cell
DNA Sequencing - DNA sequencing is like reading the instructions inside a cell
 
DNA Sequencing: History, methods and NGS
DNA Sequencing: History, methods and NGSDNA Sequencing: History, methods and NGS
DNA Sequencing: History, methods and NGS
 
DNA Microarray introdution and application
DNA Microarray introdution and applicationDNA Microarray introdution and application
DNA Microarray introdution and application
 
DNA based diagnosis of geneticdiseases - by Chinmayi Upadhyaya
DNA based diagnosis of geneticdiseases - by Chinmayi UpadhyayaDNA based diagnosis of geneticdiseases - by Chinmayi Upadhyaya
DNA based diagnosis of geneticdiseases - by Chinmayi Upadhyaya
 
Next generation sequencing methods (final edit)
Next generation sequencing methods (final edit)Next generation sequencing methods (final edit)
Next generation sequencing methods (final edit)
 
Hamas 1
Hamas 1Hamas 1
Hamas 1
 
Nucleic acid techniques in diagnostic microbiology
Nucleic acid techniques in diagnostic microbiologyNucleic acid techniques in diagnostic microbiology
Nucleic acid techniques in diagnostic microbiology
 
20100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_020100509 bioinformatics kapushesky_lecture03-04_0
20100509 bioinformatics kapushesky_lecture03-04_0
 
DNA Chip
DNA ChipDNA Chip
DNA Chip
 

Recently uploaded

Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Association for Project Management
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxHumphrey A Beña
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWQuiz Club NITW
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptxmary850239
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxMichelleTuguinay1
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseCeline George
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operationalssuser3e220a
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4JOYLYNSAMANIEGO
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvRicaMaeCastro1
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)lakshayb543
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxlancelewisportillo
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxDhatriParmar
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management SystemChristalin Nelson
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptxmary850239
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...Nguyen Thanh Tu Collection
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Developmentchesterberbo7
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Seán Kennedy
 

Recently uploaded (20)

Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
Team Lead Succeed – Helping you and your team achieve high-performance teamwo...
 
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptxINTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
INTRODUCTION TO CATHOLIC CHRISTOLOGY.pptx
 
Mythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITWMythology Quiz-4th April 2024, Quiz Club NITW
Mythology Quiz-4th April 2024, Quiz Club NITW
 
4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx4.11.24 Poverty and Inequality in America.pptx
4.11.24 Poverty and Inequality in America.pptx
 
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptxDIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
DIFFERENT BASKETRY IN THE PHILIPPINES PPT.pptx
 
How to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 DatabaseHow to Make a Duplicate of Your Odoo 17 Database
How to Make a Duplicate of Your Odoo 17 Database
 
prashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Professionprashanth updated resume 2024 for Teaching Profession
prashanth updated resume 2024 for Teaching Profession
 
Expanded definition: technical and operational
Expanded definition: technical and operationalExpanded definition: technical and operational
Expanded definition: technical and operational
 
Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4Daily Lesson Plan in Mathematics Quarter 4
Daily Lesson Plan in Mathematics Quarter 4
 
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of EngineeringFaculty Profile prashantha K EEE dept Sri Sairam college of Engineering
Faculty Profile prashantha K EEE dept Sri Sairam college of Engineering
 
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnvESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
ESP 4-EDITED.pdfmmcncncncmcmmnmnmncnmncmnnjvnnv
 
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
Visit to a blind student's school🧑‍🦯🧑‍🦯(community medicine)
 
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptxQ4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
Q4-PPT-Music9_Lesson-1-Romantic-Opera.pptx
 
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptxMan or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
Man or Manufactured_ Redefining Humanity Through Biopunk Narratives.pptx
 
Transaction Management in Database Management System
Transaction Management in Database Management SystemTransaction Management in Database Management System
Transaction Management in Database Management System
 
4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx4.11.24 Mass Incarceration and the New Jim Crow.pptx
4.11.24 Mass Incarceration and the New Jim Crow.pptx
 
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
31 ĐỀ THI THỬ VÀO LỚP 10 - TIẾNG ANH - FORM MỚI 2025 - 40 CÂU HỎI - BÙI VĂN V...
 
Using Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea DevelopmentUsing Grammatical Signals Suitable to Patterns of Idea Development
Using Grammatical Signals Suitable to Patterns of Idea Development
 
Paradigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTAParadigm shift in nursing research by RS MEHTA
Paradigm shift in nursing research by RS MEHTA
 
Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...Student Profile Sample - We help schools to connect the data they have, with ...
Student Profile Sample - We help schools to connect the data they have, with ...
 

Detecting STR Peaks in Degraded DNA samples

  • 1. 4th International Conference on Bioinformatics and Computational Biology (BICoB) Detecting STR peaks in degraded DNA samples Emanuela Marasco, Arun Ross, Jeremy Dawson, Tina Moroose, Tanya Ambrose Lane Department of Computer Science and Electrical Engineering & Forensic and Investigative Science Program March 13, 2012 - Las Vegas, Nevada, USA Thanks to Prof. Tom O’Haver 1
  • 2. Outline • • • • • • DNA typing Short Tandem Repeat (STR) DNA analysis via GeneMapperID The proposed signal processing approach Results Conclusions and future research 2
  • 3. DNA typing • Technologies used for performing DNA analysis differ in their ability to differentiate two individuals for 2 aspects: • The speed • The sensitivity • STR typing offers the best trade-off • Our contribution: improve the sensitivity of the DNA processing methodologies in the presence of degraded samples J. Butler, Forensic DNA typing, Biology, Technology, and Genetic of STR markers, Second Edition 3
  • 4. The human genome • The human genome contained in every cell consists of 23 pairs chromosomes • Chromosomes which are dense packets of DNA and proteins • Four bases make up DNA: Adenine (A), Guanine (G), Cytosine (C) and Thymine (T) • Most human identity testing is performed using markers on autosomal chromosomes • A DNA marker or locus refers to the chromosomal position J. Butler, Forensic DNA typing, Biology, Technology, and Genetic of STR markers, Second Edition
  • 5. Alleles • The 99.7% of our DNA molecules is the same between people; a small fraction of 0.3% makes us unique individuals • DNA variation is exhibited in the form of different alleles; an allele is a variant of the DNA sequence or length at a given locus Sequence polymorphism Length polymorphism 5
  • 6. What is a Short Tandem Repeat (STR)? ATCTTCTAACACATGACCGATCATGCATGCATGCATGCATGC ATGCATGCATGCATGCATGCATGTTCCATGATAGCACAT • An STR is the repetitive section of the sequence (2-5 base pair) • STRs are short and fast to be processed (all together at a time) • Human identification from DNA is typically based on 13 core loci (american system) • They are very discriminative among individuals • There is no overlapping among loci 6
  • 7. Steps in DNA sample processing Biological perspective DNA extraction DNA quantitation PCR amplification of multiple STR markers PCR products Technological perspective Separation and detection of PCR product (STR alleles) • Sample genotype determination DNA profile Amplified data are displayed as fluorescent peaks 7
  • 8. DNA analysis via GeneMapperID • the Internal Lane Standard (ILS) is used to assign the size the peaks Internal Lane Standard • only peaks above an analytical threshold are considered real data (lower peaks are noise) • actual peaks are higher than a stochastic threshold 8
  • 9. Effects of degradation on STR peaks • • Degradation process occurs rapidly when samples are exposed to the enviromental conditions: UV light, humidity, high temperature and bacterial contamination As samples age, DNA begins to break down (or degrade) Nondegraded 45 secs • Degradation can reduce the height of some peaks or making them disappear entirely • Peaks are shifted http://www.bioforensics.com/articles/champion1/champion1.html 75 secs 9
  • 10. The proposed signal processing scheme • A DNA sample is represented by a DNA signal in which x-axis indicates data point and y-axis the amplitude • A degraded DNA sample is represented by a weak signal confounded by several noise sources Input: • Let x= {xj} with j=1, …N be the input DNA signal • Let NC= {NCj} with j=1, …N be the signal due to the Negative Control • Let t= {tj} with j=1, …N be the instances in which the signals are sampled Positive Control (Reagents, DNA, ILS) Negative Control (Reagents, ILS) 10
  • 11. Differentiation of signals Peak-type signal Derivative • Given a peak-type signal such as those used in STR analysis, the location of the maximum can be computed as location of the zero-crossing points in its first derivative Peak-type signal Derivative • Given two peaks having the same height, the wider peak results in a lower amplitude in the first derivative 11
  • 12. Our proposed method 1. De-noise the DNA signal x: PC-NC Negative Control (NC) contains only reagents and no DNA 2. Compute the first derivative of the enhanced signal: Diff(PC-NC) False Positive 12
  • 13. Our proposed method 3. Smooth the derivative of the signal each point in the signal is replaced with the average of m adiacent points (m is the smooth width) http://terpconnect.umd.edu/~toh/spectrum/PeakFindingandMeasurement.htm 13
  • 14. Our proposed method 4. Peak detection based on an amplitude threshold and a slope threshold 5. Estimate peaks details (location, height and width) on the un-smoothed signal by using a curve fitting function: a polynomial of degree 2 is fitted through the detected peaks and are mean and standard deviation of a set of points in the vicinity of the detected peak 14
  • 15. Dataset 1: ultraviolet degradation • The first dataset was collected at the WVU Department of Forensic and Investigative Sciences by performing a controlled artificial DNA degradation using ultraviolet radiation • Positive Control • Degraded samples obtained after an exposure to UV of 35secs, 75secs, 150secs and 240secs Pang B. C. M. and Cheung B. K. K. One-step generation of degraded DNA by UV irradiation. Analytical Biochemistry 2007; 360: 163-165 15
  • 16. Dataset 2: Low Copy Number (LCN) data • Low Copy Number (LCN) are samples which contain less than 100pg of DNA template • One of the reason for LCN is degradation • The second dataset was provided by NIST obtained by varying cycle counts for the PCR processing step • • Positive Control, 1 ng/ μL Low Copy Number (LCN): 100pg/ μL, 30pg/ μL and 10pg/ μL at 28 cycles and 31 cycles 16
  • 17. Results on degraded data with UV GeneMapper results In the presence of degraded samples, for th=100 • 1 detected peak for a degradation level obtained after an UV exposure of 75 secs • 0 detected peaks for a obtained after an exposure of 150 secs onward 0 Peak finding results For Th= 0.37*max(x(t)) 7 peaks detected for the sample obtained after an UV exposure of 150 seconds • 3 peaks for the sample obtained after an UV exposure of 240 seconds 17
  • 18. Results on Low Copy Number (LCN) data GeneMapper results Th = 100 Peak finding results • The success rate of GeneMapper typing system decreases when decreasing the DNA amount present in the analyzed samples • The amount of DNA factoring the sample presents a non-significant impact on the performance detection of the peak finding algorithm 18
  • 19. Obtained improvement • Under UV degradation PC 0% 35 secs 0% 75 secs 80% 150 secs 100% 240 secs 100% • In the presence of LCN 1ng -50.0% 100 pg 3.1% 30 pg 34.4% 10 pg 25.0% • Results are reported for blue dye data by setting the GeneMapperID threshold to 100 and averaging on two samples for LCN data • The peak detection rate improves in the presence of degraded samples since the algorithm has been designed to deal with noise signals 19
  • 20. Conclusions • Strength of the proposed approach: – it uses an adaptive threshold – High discrimination power against wider peaks provided by differentiation • Our experiments show the robustness of the proposed peak finding algorithm to high level UV degradation and when dealing with critical amount of DNA (less than 100pg) • Limitation: the peak detection algorithm uses a global threshold • Coming up: – to model the degradation process – designing a local threshold – since the adopted derivative was first-order, we will carry out experiments with higher order 20
  • 21. Thanks! Any questions? Acknowledgements • • • • • The work was funded by Citer Many thanks to Raghunandan Pasula, Lane Department of Computer Science and Electrical Engineering, West Virginia University, for his assistance during the development of the project; Prof. Thomas C. O’Haver, Department of Chemistry and Biochemistry, University of Maryland for his assistance with our queries regarding peak detection; National Forensic Science Technology Center (NFSTC) for providing scientific training services. The Peak Finding code developed by Prof. O’Haver was used in this work 21