SlideShare a Scribd company logo
1 of 4
Download to read offline
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 5 Issue: 7 57 – 60
_______________________________________________________________________________________________
57
IJRITCC | July 2017, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
Sequence Similarity between Genetic Codes using Improved Longest Common
Subsequence Algorithm
A. Murugan
Associate Professor, PG & Research Department
of Computer Science
DrAmbedkar Government Arts College,
Vyasarpadi, Chennai – 600 039, Tamilnadu.
e-mail: amurugan1972@gmail.com
U. Udayakumar
M.Phil Research Scholar, PG & Research Department
of Computer Science
DrAmbedkar Government Arts College,
Vyasarpadi, Chennai – 600 039, Tamilnadu.
e-mail: udaya.vijay13@gmail.com
Abstract— Finding the sequence similarity between two genetic codes is an important problem in computational biology. In this paper, we
developed an efficient algorithm to find sequence similarity between genetic codes using longest common subsequence algorithm. The algorithm
takes the advantages over the edit distance algorithm and improves the performance. The proposed algorithm is tested on randomly generated
DNA sequence and finding the exact DNA sequence comparison. The DNA genetic code sequence comparison can be used to discover
information such as evolutionary divergence and ways to apply genetic codes from one DNA sequence to another sequence.
Keywords-Dynamic Programming, DNA sequence comparison, DNA Similarity algorithms, Longest Common Subsequence.
__________________________________________________*****_________________________________________________
I. INTRODUCTION
DNA is the hereditary material found in almost all
organisms [1]. DNA resides in every cell in the body of
organism. The DNA is a double helix like structure made up of
two twisted strands. A single strand carries a nucleotide bases,
these bases are adenine (A), guanine (G), cytosine (C), thymine
(T). These chemical letters are in the form of triplets which
defines a meaningful code. The triplets give arise to specific
amino acid which later helps in the formation of protein[2]. An
important property of DNA is that it can replicate, or make
copies of itself. Each strand of DNA in the double helix can
serve as a pattern for duplicating the sequence of bases as
shown in Fig.1. This is critical when cells divide because each
new cell needs to have an exact copy of the DNA present in the
old cell.
Figure 1. Structure of DNA
The knowledge of a DNA sequence and gene analysis can be
used in several biological, medicine and agriculture research
fields such as: possible disease or abnormality diagnoses,
forensic science, pattern matching, biotechnology, etc[3][4].
The analysis and comparison studies for DNA sequences
connected information technology tools and methods to
accelerate findings and knowledge in biological related
sciences.
II. LITERATURE SURVEY
There are many traditional pattern matching methodologies
available in literature, they are Naive Brute force, Boyer
Moore, Knuth Morris Pratt and Dynamic algorithms[2]. Pattern
matching is used in various processes, like codon optimization
which is carried out to enhance the efficiency of the DNA
expression vectors used in DNA vaccination by increasing
protein expression[5].
A. Naive Brute force
It is one of the simplest algorithm having complexity
O(m*n). In this, first character of pattern P(with length m) is
aligned with first character of text T (with length n). Then
scanning is done from left to right. As shifting is done at each
step it gives less efficiency[6].
B. Boyer-Moore Algorithm
It performs larger shift-increment whenever mismatch is
detected. It differs from Naïve in the way of scanning. It scans
the string from right to left; unlike Naive i.e. P is aligned with
T such that last character of P will be matched to first character
of T. If character is matched then pointer is shifted to left to
very rest of the characters of the pattern[7][8]. If a mismatch is
detected at character c, in T which is not in P, then P is shifted
right to m positions and P is aligned to the next character after
c. If c is part of P, then P is shifted right so that c is aligned
with the right most occurrence of c in P. The worst time
complexity is still O(m+n).
C. Knuth-Morris-Pratt
This algorithm is based on automaton theory. First a finite
state automata model M is being created for the given pattern
P. The input string T with Σ= {A, C, T, G} is processed
through the model. If the pattern is present in text, the text is
accepted otherwise rejected.
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 5 Issue: 7 57 – 60
_______________________________________________________________________________________________
58
IJRITCC | July 2017, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
III. EFFICIENT LCS ALGORITHM
The longest common subsequence problem is to find a
substring that is common to the two given strings. Given two
sequences, find the length of the longest subsequence present in
both of them. A subsequence is a sequence that appears in the
same relative order, but not necessarily contiguous [16].
In LCS problem, we eliminate the operation of substitution
and allow only insertions and deletions. A Subsequence of a
string v is simply an ordered sequence of characters from v. For
example, if v = ATTGCTA, then AGCA and ATTA are
subsequences of v whereas TGTT and TCG are not. Where
ATGC denotes Adenine, Thymine, Guanine, Cytosine.
Formally, we define the common subsequence of strings[9].
1 ≤ i1 ≤ i2< . . . ik ≤ n (1)
and a sequence of positions in w,
1 ≤ j1 ≤ j2< . . . jk ≤ m (2)
Let si,j be the length of an LCS between v1...vi, the i-prefix of v
and w1...wj, the j-prefix of w. Clearly, si,0 = s0,j = 0 for all 1 ≤ i
≤ n and 1 ≤ i ≤ n one can see that si,j satisfy the following
recurrence:
si,j = max
si-1,j
si,j-1
si-1,j-1+1, if vi=wj
(3)
Note that one can "rewrite" these recurrences by adding some
zeros
si,j=max
si-1,j+0
si,j-1+0
si-1,j-1+1, if vi=wj
(4)
The LCS computation is like the recurrence given at the
equations (3) and (4), if we were to build particularly gnarly
version of Manhattan and gave horizontal and vertical edges
weights of 0, and set the weights of diagonal edges equal to +
1[10].
The following is the efficient algorithm for solving longest
common subsequence problem [15] that improves the time
complexity and performance.
Algorithm Backtracking Pointers and stored three values up,
right and diagonal in Multidimensional array
while x! := ―0‖ then
ifsolna,b == ―Diagonal‖ then
a := a - 1
b := b - 1
else
b := b - 1
a := a - 1
end if
return LCS[A.length][B.length]
end while
The following Fig. 2 shows Multidimensional array
representation of the LCS algorithm
Figure 2. Multidimensional Array representation for efficient
LCS Algorithm
The following Fig. 3 shows LCS Time Complexity and
Performance
Figure 3. LCS Time Complexity and Performance
IV. DNA SEQUENCE SIMILARITY
We proposed an algorithm to find the similarity between
DNA genetic code sequences. In DNA sequence similarity,
find the matching percentage between DNA genetic codes. In
this paper, we proposed a new method for DNA sequence
similarity.
A modified Longest Common Subsequences algorithm is
proposed in [15] and which is applied in the edit distance
algorithm, which is an existing algorithm.
The following is the modified edit distance algorithm used
to find the similarity between two genetic codes.
Algorithm 1: Edit Distance for DNA genetic code
AlgorithmeditDistance(s1,s2)
cost[] : = s2.length + 1
for i:= 0 to s1.length do
lastvalue := i
for j:= 0 to s2.length do
if(i == j)
cost[j] : = j
elseif(j > 0)
newValue : = costs[j - 1]
costs[j - 1] : = lastValue
lastValue : = newValue
end if
end for
end for
end algorithm
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 5 Issue: 7 57 – 60
_______________________________________________________________________________________________
59
IJRITCC | July 2017, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
The following Fig. 4 explain the entire process of finding
DNA sequence similarity
Figure 4. Implementing LCS Algorithm in DNA Genetic Code
Sequence Similarity
The computation of the similarity score s(v,w) between v and
w, while the table on the right presents the computation of the
edit distance between v and w under the assumption that
insertions and deletions are the only allowed operations[14].
The following TABLE Ishows DNA genetic code sequence
similarity
TABLE I. GENETIC CODE SIMILARITY BETWEEN SEQUENCES
Human DNA Animals DNA
% of
matching
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCACGATTTGGGGGATGC
TTCTGGCTC------A
100.00%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCACGATTTGGGGGA
TGCTTCTGGCTC------A-
100.00%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCACGATTTGGGAGA
TGCTTCTGGCTC-----G-
91.67%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCAGAATTTGGGGGA
TGCTTCTGGCTC-----T-
88.89%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCAGAATTTGGGGGA
TGCTTCTGGCTC-----T-
88.89%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCAGAATTTGGGGGA
TGCTTCTGGCTC-----T-
88.89%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
ATCACAGTTGGGGGAT
GCCACTGGCCT-----C-
75.00%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
ATCACAA-TTGGGGG-
TGCCACGGTCCT-----C-
69.44%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
ATCACAATTTGGGGAA
CACCACTGGCAT-----C-
69.44%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCACAATTTGGAGGA
TGTTACTGGCAT-----C-
77.78%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCACAGTTTGGAGGA
TGTTACTGACAT-----C-
72.22%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCATAGTTT----
GATTATATGGGCTT-----
C-
58.33%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCACAATTTGGGGGA
TACTACTGGCAT-----C-
80.56%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCACAGTTTAGGGGG
TACTACTGGCAT-----C-
72.22%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GTCACAATTTAGGAAG
TGCCACTGGCCT-----C-
72.22%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
GCCTCTCTTT-----------
CTGCCCTGCAGGC-
33.33%
GTCACGATTTGGGGGATGC
TTCTGGCTC------A-
----------------
TGCTACTAATAT-----T-
36.11%
Algorithm 2: genetic code similarity
Algorithmsimilarity(s1, s2)
longer := s1, shorter := s2
if (s1.length() < s2.length())
longer := s2
shorter : = s1
longerLength := longer.length();
if (longerLength == 0)
return (longerLength - editDistance(longer, shorter)) /
(double) longerLength;
end if
end algorithm
The following Fig. 6 shows the Output for Similarity
Sequences between genetic codes
Figure 5. Output for DNA Genetic Code Similarity Sequences
A. Time Complexity
The Time Complexity of Sequence Similarity Algorithms
TC(DSSA) = max{TC(LCS), TC(EDA), TC(SS)}
TC(DSSA) = max{O(n2
), O(n2
), O(n2
)}
So the time complexity of the DNA Sequence Similarity
Algorithm is O(n2
)
The best case and average case time complexity
T(n) = max((O(n2
) and O(levelnumber)), O(|n|).
The Worst case time complexity
T(n) = max((O(n3
) and O(levelnumber)), O(|n|).
The following Figure 7 shows Graphical Representation of
DNA Genetic Code between Human and Animals
International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169
Volume: 5 Issue: 7 57 – 60
_______________________________________________________________________________________________
60
IJRITCC | July 2017, Available @ http://www.ijritcc.org
_______________________________________________________________________________________
Figure 6. Graphical Representation of DNA Genetic Code
Similarity Sequences
V. DNA SEQUENCE SIMILARITY
In this paper, we proposed an algorithm for finding similarity
between two strings. It calculate the matching percentage
between two strings by using longest common subsequence
(LCS) and edit distance approach by avoiding unnecessary
comparisons that reduces its time complexity. The running
time is better than dynamic programming based algorithms
and it is due to time control parameter. The DNA sequence
similarity algorithm is tested on 50 samples with two input
DNA genetic code sequence strings and selected randomly in
Pentium processor machine is used and it shows good results.
In future the algorithm is implemented in Multiple Longest
Common Subsequences which will also have many
applications.
REFERENCES
[1] Neil C.Jones and Pavel A.Pevzner, "An Introduction to Bioinformatics
Algorithms" (2004)
[2] Jiaoyun Yang, Yun Xu, Yi Shang, ―An Efficient Parallel Algorithm for
Longest Common Subsequence Problem on GPUs‖, World Congress
Engineering, 2010.
[3] Maier. D, ―The Complexity of some problems on subsequences and
super sequences‖, ACM, (25), 1978, 332-336.
[4] B.Lavanya and A.Murugan, "Mining Longest Common Subsequence
and other related patterns using DNA operations", International Journal
of Computer Application, (18), 2012, 38-44.
[5] Hirosawa et al, "Comprehensive study on iterative algorithms of
multiple sequence alignment". Computational Applications in
Biosciences, 1995, 13-18.
[6] Mahdi Esmaieli and Mansour Tarafdar, "Sequential Pattern Mining from
multidimensional sequence data in parallel", International journal of
Computer theory and engineering, 2010, 730-733.
[7] Stormo. G, "DNA binding sites: representation and discovery".
Bioinformatics, 2000, 16:16-23
[8] Kyle Jensen. L., Mark Styczynski. P., Isidore Rigoutsos, and Gregory
Stephanopoulos. V, "A generic motif discovery algorithm for sequential
data". Bioinformatics, 22(1) 2006, 21-28.
[9] Wang. L. and Jiang. T, "On the complexity of multiple sequence
alignment". Journal of Computational Biology, 1994, 337-348.
[10] Nan Li and Tompa. M, "Analysis of computational tools for motif
discovery". Algorithms of molecular biology, 2006, 1-8.
[11] Suyama. M., Nishioka. T., and Junichi. O, "Searching for common
sequence patterns among distantly related proteins". Protein
Engineering, 1995,1075-1080.
[12] Smith. H. O., Annau. T. M., and Chandrasegaran.S, "Finding sequence
motifs in groups of functionally related proteins". Proceedings of
National Academy (USA), 1990, 826-830.
[13] Bin Ma, "A polynomial time approximation scheme for the closest
substring problem". LCNS Springer, 2000, 99-107.
[14] Martinez. M, "An efficient method to find repeats in molecular
sequences". Nucleic Acid Research, 1983, 4629-4634.
[15] U. Udayakumar and A. Murugan "An Efficient Algorithm For Solving
Longest Common Subsequence Problem", IJESMR, 2017,4(5).
[16] www.geeksforgeeks.org/dynamic-programming-set-4-longest-common-
subsequence

More Related Content

What's hot

Heptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodHeptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodYogeshIJTSRD
 
Computationally simple approach for solving fully intuitionistic fuzzy real l...
Computationally simple approach for solving fully intuitionistic fuzzy real l...Computationally simple approach for solving fully intuitionistic fuzzy real l...
Computationally simple approach for solving fully intuitionistic fuzzy real l...Navodaya Institute of Technology
 
A note on 'a new approach for solving intuitionistic fuzzy transportation pro...
A note on 'a new approach for solving intuitionistic fuzzy transportation pro...A note on 'a new approach for solving intuitionistic fuzzy transportation pro...
A note on 'a new approach for solving intuitionistic fuzzy transportation pro...Navodaya Institute of Technology
 
The Analysis of Performance Measures of Generalized Trapezoidal Fuzzy Queuing...
The Analysis of Performance Measures of Generalized Trapezoidal Fuzzy Queuing...The Analysis of Performance Measures of Generalized Trapezoidal Fuzzy Queuing...
The Analysis of Performance Measures of Generalized Trapezoidal Fuzzy Queuing...IJERA Editor
 
AN ADVANCED TOOL FOR MANAGING FUZZY COMPLEX TEMPORAL INFORMATION
AN ADVANCED TOOL FOR MANAGING FUZZY COMPLEX TEMPORAL INFORMATIONAN ADVANCED TOOL FOR MANAGING FUZZY COMPLEX TEMPORAL INFORMATION
AN ADVANCED TOOL FOR MANAGING FUZZY COMPLEX TEMPORAL INFORMATIONcsandit
 
Comparison of Several Numerical Algorithms with the use of Predictor and Corr...
Comparison of Several Numerical Algorithms with the use of Predictor and Corr...Comparison of Several Numerical Algorithms with the use of Predictor and Corr...
Comparison of Several Numerical Algorithms with the use of Predictor and Corr...ijtsrd
 
Mild balanced Intuitionistic Fuzzy Graphs
Mild balanced Intuitionistic Fuzzy Graphs Mild balanced Intuitionistic Fuzzy Graphs
Mild balanced Intuitionistic Fuzzy Graphs IJERA Editor
 
SENSE DISAMBIGUATION TECHNIQUE FOR PROVIDING MORE ACCURATE RESULTS IN WEB SEARCH
SENSE DISAMBIGUATION TECHNIQUE FOR PROVIDING MORE ACCURATE RESULTS IN WEB SEARCHSENSE DISAMBIGUATION TECHNIQUE FOR PROVIDING MORE ACCURATE RESULTS IN WEB SEARCH
SENSE DISAMBIGUATION TECHNIQUE FOR PROVIDING MORE ACCURATE RESULTS IN WEB SEARCHijwscjournal
 
The transportation problem in an intuitionistic fuzzy environment
The transportation problem in an intuitionistic fuzzy environmentThe transportation problem in an intuitionistic fuzzy environment
The transportation problem in an intuitionistic fuzzy environmentNavodaya Institute of Technology
 
COMPARISON OF DIFFERENT APPROXIMATIONS OF FUZZY NUMBERS
COMPARISON OF DIFFERENT APPROXIMATIONS OF FUZZY NUMBERSCOMPARISON OF DIFFERENT APPROXIMATIONS OF FUZZY NUMBERS
COMPARISON OF DIFFERENT APPROXIMATIONS OF FUZZY NUMBERSijfls
 
A survey on parallel corpora alignment
A survey on parallel corpora alignment A survey on parallel corpora alignment
A survey on parallel corpora alignment andrefsantos
 

What's hot (17)

Lesson 31
Lesson 31Lesson 31
Lesson 31
 
Heptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min MethodHeptagonal Fuzzy Numbers by Max Min Method
Heptagonal Fuzzy Numbers by Max Min Method
 
Computationally simple approach for solving fully intuitionistic fuzzy real l...
Computationally simple approach for solving fully intuitionistic fuzzy real l...Computationally simple approach for solving fully intuitionistic fuzzy real l...
Computationally simple approach for solving fully intuitionistic fuzzy real l...
 
A note on 'a new approach for solving intuitionistic fuzzy transportation pro...
A note on 'a new approach for solving intuitionistic fuzzy transportation pro...A note on 'a new approach for solving intuitionistic fuzzy transportation pro...
A note on 'a new approach for solving intuitionistic fuzzy transportation pro...
 
The Analysis of Performance Measures of Generalized Trapezoidal Fuzzy Queuing...
The Analysis of Performance Measures of Generalized Trapezoidal Fuzzy Queuing...The Analysis of Performance Measures of Generalized Trapezoidal Fuzzy Queuing...
The Analysis of Performance Measures of Generalized Trapezoidal Fuzzy Queuing...
 
AN ADVANCED TOOL FOR MANAGING FUZZY COMPLEX TEMPORAL INFORMATION
AN ADVANCED TOOL FOR MANAGING FUZZY COMPLEX TEMPORAL INFORMATIONAN ADVANCED TOOL FOR MANAGING FUZZY COMPLEX TEMPORAL INFORMATION
AN ADVANCED TOOL FOR MANAGING FUZZY COMPLEX TEMPORAL INFORMATION
 
Comparison of Several Numerical Algorithms with the use of Predictor and Corr...
Comparison of Several Numerical Algorithms with the use of Predictor and Corr...Comparison of Several Numerical Algorithms with the use of Predictor and Corr...
Comparison of Several Numerical Algorithms with the use of Predictor and Corr...
 
Lesson 30
Lesson 30Lesson 30
Lesson 30
 
F5233444
F5233444F5233444
F5233444
 
AI Lesson 34
AI Lesson 34AI Lesson 34
AI Lesson 34
 
Mild balanced Intuitionistic Fuzzy Graphs
Mild balanced Intuitionistic Fuzzy Graphs Mild balanced Intuitionistic Fuzzy Graphs
Mild balanced Intuitionistic Fuzzy Graphs
 
SENSE DISAMBIGUATION TECHNIQUE FOR PROVIDING MORE ACCURATE RESULTS IN WEB SEARCH
SENSE DISAMBIGUATION TECHNIQUE FOR PROVIDING MORE ACCURATE RESULTS IN WEB SEARCHSENSE DISAMBIGUATION TECHNIQUE FOR PROVIDING MORE ACCURATE RESULTS IN WEB SEARCH
SENSE DISAMBIGUATION TECHNIQUE FOR PROVIDING MORE ACCURATE RESULTS IN WEB SEARCH
 
Bm35359363
Bm35359363Bm35359363
Bm35359363
 
The transportation problem in an intuitionistic fuzzy environment
The transportation problem in an intuitionistic fuzzy environmentThe transportation problem in an intuitionistic fuzzy environment
The transportation problem in an intuitionistic fuzzy environment
 
Introduction to fuzzy logic
Introduction to fuzzy logicIntroduction to fuzzy logic
Introduction to fuzzy logic
 
COMPARISON OF DIFFERENT APPROXIMATIONS OF FUZZY NUMBERS
COMPARISON OF DIFFERENT APPROXIMATIONS OF FUZZY NUMBERSCOMPARISON OF DIFFERENT APPROXIMATIONS OF FUZZY NUMBERS
COMPARISON OF DIFFERENT APPROXIMATIONS OF FUZZY NUMBERS
 
A survey on parallel corpora alignment
A survey on parallel corpora alignment A survey on parallel corpora alignment
A survey on parallel corpora alignment
 

Similar to DNA Sequence Similarity Algorithm Improves LCS

Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...eSAT Journals
 
Comparative analysis of dynamic programming
Comparative analysis of dynamic programmingComparative analysis of dynamic programming
Comparative analysis of dynamic programmingeSAT Publishing House
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...ijcseit
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ijcseit
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...ijcseit
 
An Index Based K-Partitions Multiple Pattern Matching Algorithm
An Index Based K-Partitions Multiple Pattern Matching AlgorithmAn Index Based K-Partitions Multiple Pattern Matching Algorithm
An Index Based K-Partitions Multiple Pattern Matching AlgorithmIDES Editor
 
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...ijscmcj
 
Behavior study of entropy in a digital image through an iterative algorithm
Behavior study of entropy in a digital image through an iterative algorithmBehavior study of entropy in a digital image through an iterative algorithm
Behavior study of entropy in a digital image through an iterative algorithmijscmcj
 
An Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAn Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAngie Miller
 
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSijcsit
 
Double layered dna based cryptography
Double layered dna based cryptographyDouble layered dna based cryptography
Double layered dna based cryptographyeSAT Journals
 
User_42751212015Module1and2pagestocompetework.pdf.docx
User_42751212015Module1and2pagestocompetework.pdf.docxUser_42751212015Module1and2pagestocompetework.pdf.docx
User_42751212015Module1and2pagestocompetework.pdf.docxdickonsondorris
 
International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)CSCJournals
 
Combining text and pattern preprocessing in an adaptive dna pattern matcher
Combining text and pattern preprocessing in an adaptive dna pattern matcherCombining text and pattern preprocessing in an adaptive dna pattern matcher
Combining text and pattern preprocessing in an adaptive dna pattern matcherIAEME Publication
 
Performance Efficient DNA Sequence Detectionalgo
Performance Efficient DNA Sequence DetectionalgoPerformance Efficient DNA Sequence Detectionalgo
Performance Efficient DNA Sequence DetectionalgoRahul Shirude
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...IRJET Journal
 
Optimized Parameter of Wavelet Neural Network (WNN) using INGA
Optimized Parameter of Wavelet Neural Network (WNN) using INGAOptimized Parameter of Wavelet Neural Network (WNN) using INGA
Optimized Parameter of Wavelet Neural Network (WNN) using INGArahulmonikasharma
 

Similar to DNA Sequence Similarity Algorithm Improves LCS (20)

Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...
 
Comparative analysis of dynamic programming
Comparative analysis of dynamic programmingComparative analysis of dynamic programming
Comparative analysis of dynamic programming
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
 
An Index Based K-Partitions Multiple Pattern Matching Algorithm
An Index Based K-Partitions Multiple Pattern Matching AlgorithmAn Index Based K-Partitions Multiple Pattern Matching Algorithm
An Index Based K-Partitions Multiple Pattern Matching Algorithm
 
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
BEHAVIOR STUDY OF ENTROPY IN A DIGITAL IMAGE THROUGH AN ITERATIVE ALGORITHM O...
 
40220140505002
4022014050500240220140505002
40220140505002
 
Behavior study of entropy in a digital image through an iterative algorithm
Behavior study of entropy in a digital image through an iterative algorithmBehavior study of entropy in a digital image through an iterative algorithm
Behavior study of entropy in a digital image through an iterative algorithm
 
4 report format
4 report format4 report format
4 report format
 
4 report format
4 report format4 report format
4 report format
 
An Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer DesignAn Algorithm For Vector Quantizer Design
An Algorithm For Vector Quantizer Design
 
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKSEVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
EVOLUTIONARY CENTRALITY AND MAXIMAL CLIQUES IN MOBILE SOCIAL NETWORKS
 
Double layered dna based cryptography
Double layered dna based cryptographyDouble layered dna based cryptography
Double layered dna based cryptography
 
User_42751212015Module1and2pagestocompetework.pdf.docx
User_42751212015Module1and2pagestocompetework.pdf.docxUser_42751212015Module1and2pagestocompetework.pdf.docx
User_42751212015Module1and2pagestocompetework.pdf.docx
 
International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)International Journal of Computer Science and Security Volume (2) Issue (5)
International Journal of Computer Science and Security Volume (2) Issue (5)
 
Combining text and pattern preprocessing in an adaptive dna pattern matcher
Combining text and pattern preprocessing in an adaptive dna pattern matcherCombining text and pattern preprocessing in an adaptive dna pattern matcher
Combining text and pattern preprocessing in an adaptive dna pattern matcher
 
Performance Efficient DNA Sequence Detectionalgo
Performance Efficient DNA Sequence DetectionalgoPerformance Efficient DNA Sequence Detectionalgo
Performance Efficient DNA Sequence Detectionalgo
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
 
Optimized Parameter of Wavelet Neural Network (WNN) using INGA
Optimized Parameter of Wavelet Neural Network (WNN) using INGAOptimized Parameter of Wavelet Neural Network (WNN) using INGA
Optimized Parameter of Wavelet Neural Network (WNN) using INGA
 

More from rahulmonikasharma

Data Mining Concepts - A survey paper
Data Mining Concepts - A survey paperData Mining Concepts - A survey paper
Data Mining Concepts - A survey paperrahulmonikasharma
 
A Review on Real Time Integrated CCTV System Using Face Detection for Vehicle...
A Review on Real Time Integrated CCTV System Using Face Detection for Vehicle...A Review on Real Time Integrated CCTV System Using Face Detection for Vehicle...
A Review on Real Time Integrated CCTV System Using Face Detection for Vehicle...rahulmonikasharma
 
Considering Two Sides of One Review Using Stanford NLP Framework
Considering Two Sides of One Review Using Stanford NLP FrameworkConsidering Two Sides of One Review Using Stanford NLP Framework
Considering Two Sides of One Review Using Stanford NLP Frameworkrahulmonikasharma
 
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication SystemsA New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systemsrahulmonikasharma
 
Broadcasting Scenario under Different Protocols in MANET: A Survey
Broadcasting Scenario under Different Protocols in MANET: A SurveyBroadcasting Scenario under Different Protocols in MANET: A Survey
Broadcasting Scenario under Different Protocols in MANET: A Surveyrahulmonikasharma
 
Sybil Attack Analysis and Detection Techniques in MANET
Sybil Attack Analysis and Detection Techniques in MANETSybil Attack Analysis and Detection Techniques in MANET
Sybil Attack Analysis and Detection Techniques in MANETrahulmonikasharma
 
A Landmark Based Shortest Path Detection by Using A* and Haversine Formula
A Landmark Based Shortest Path Detection by Using A* and Haversine FormulaA Landmark Based Shortest Path Detection by Using A* and Haversine Formula
A Landmark Based Shortest Path Detection by Using A* and Haversine Formularahulmonikasharma
 
Processing Over Encrypted Query Data In Internet of Things (IoTs) : CryptDBs,...
Processing Over Encrypted Query Data In Internet of Things (IoTs) : CryptDBs,...Processing Over Encrypted Query Data In Internet of Things (IoTs) : CryptDBs,...
Processing Over Encrypted Query Data In Internet of Things (IoTs) : CryptDBs,...rahulmonikasharma
 
Quality Determination and Grading of Tomatoes using Raspberry Pi
Quality Determination and Grading of Tomatoes using Raspberry PiQuality Determination and Grading of Tomatoes using Raspberry Pi
Quality Determination and Grading of Tomatoes using Raspberry Pirahulmonikasharma
 
Comparative of Delay Tolerant Network Routings and Scheduling using Max-Weigh...
Comparative of Delay Tolerant Network Routings and Scheduling using Max-Weigh...Comparative of Delay Tolerant Network Routings and Scheduling using Max-Weigh...
Comparative of Delay Tolerant Network Routings and Scheduling using Max-Weigh...rahulmonikasharma
 
DC Conductivity Study of Cadmium Sulfide Nanoparticles
DC Conductivity Study of Cadmium Sulfide NanoparticlesDC Conductivity Study of Cadmium Sulfide Nanoparticles
DC Conductivity Study of Cadmium Sulfide Nanoparticlesrahulmonikasharma
 
A Survey on Peak to Average Power Ratio Reduction Methods for LTE-OFDM
A Survey on Peak to Average Power Ratio Reduction Methods for LTE-OFDMA Survey on Peak to Average Power Ratio Reduction Methods for LTE-OFDM
A Survey on Peak to Average Power Ratio Reduction Methods for LTE-OFDMrahulmonikasharma
 
IOT Based Home Appliance Control System, Location Tracking and Energy Monitoring
IOT Based Home Appliance Control System, Location Tracking and Energy MonitoringIOT Based Home Appliance Control System, Location Tracking and Energy Monitoring
IOT Based Home Appliance Control System, Location Tracking and Energy Monitoringrahulmonikasharma
 
Thermal Radiation and Viscous Dissipation Effects on an Oscillatory Heat and ...
Thermal Radiation and Viscous Dissipation Effects on an Oscillatory Heat and ...Thermal Radiation and Viscous Dissipation Effects on an Oscillatory Heat and ...
Thermal Radiation and Viscous Dissipation Effects on an Oscillatory Heat and ...rahulmonikasharma
 
Advance Approach towards Key Feature Extraction Using Designed Filters on Dif...
Advance Approach towards Key Feature Extraction Using Designed Filters on Dif...Advance Approach towards Key Feature Extraction Using Designed Filters on Dif...
Advance Approach towards Key Feature Extraction Using Designed Filters on Dif...rahulmonikasharma
 
Alamouti-STBC based Channel Estimation Technique over MIMO OFDM System
Alamouti-STBC based Channel Estimation Technique over MIMO OFDM SystemAlamouti-STBC based Channel Estimation Technique over MIMO OFDM System
Alamouti-STBC based Channel Estimation Technique over MIMO OFDM Systemrahulmonikasharma
 
Empirical Mode Decomposition Based Signal Analysis of Gear Fault Diagnosis
Empirical Mode Decomposition Based Signal Analysis of Gear Fault DiagnosisEmpirical Mode Decomposition Based Signal Analysis of Gear Fault Diagnosis
Empirical Mode Decomposition Based Signal Analysis of Gear Fault Diagnosisrahulmonikasharma
 
Short Term Load Forecasting Using ARIMA Technique
Short Term Load Forecasting Using ARIMA TechniqueShort Term Load Forecasting Using ARIMA Technique
Short Term Load Forecasting Using ARIMA Techniquerahulmonikasharma
 
Impact of Coupling Coefficient on Coupled Line Coupler
Impact of Coupling Coefficient on Coupled Line CouplerImpact of Coupling Coefficient on Coupled Line Coupler
Impact of Coupling Coefficient on Coupled Line Couplerrahulmonikasharma
 
Design Evaluation and Temperature Rise Test of Flameproof Induction Motor
Design Evaluation and Temperature Rise Test of Flameproof Induction MotorDesign Evaluation and Temperature Rise Test of Flameproof Induction Motor
Design Evaluation and Temperature Rise Test of Flameproof Induction Motorrahulmonikasharma
 

More from rahulmonikasharma (20)

Data Mining Concepts - A survey paper
Data Mining Concepts - A survey paperData Mining Concepts - A survey paper
Data Mining Concepts - A survey paper
 
A Review on Real Time Integrated CCTV System Using Face Detection for Vehicle...
A Review on Real Time Integrated CCTV System Using Face Detection for Vehicle...A Review on Real Time Integrated CCTV System Using Face Detection for Vehicle...
A Review on Real Time Integrated CCTV System Using Face Detection for Vehicle...
 
Considering Two Sides of One Review Using Stanford NLP Framework
Considering Two Sides of One Review Using Stanford NLP FrameworkConsidering Two Sides of One Review Using Stanford NLP Framework
Considering Two Sides of One Review Using Stanford NLP Framework
 
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication SystemsA New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
A New Detection and Decoding Technique for (2×N_r ) MIMO Communication Systems
 
Broadcasting Scenario under Different Protocols in MANET: A Survey
Broadcasting Scenario under Different Protocols in MANET: A SurveyBroadcasting Scenario under Different Protocols in MANET: A Survey
Broadcasting Scenario under Different Protocols in MANET: A Survey
 
Sybil Attack Analysis and Detection Techniques in MANET
Sybil Attack Analysis and Detection Techniques in MANETSybil Attack Analysis and Detection Techniques in MANET
Sybil Attack Analysis and Detection Techniques in MANET
 
A Landmark Based Shortest Path Detection by Using A* and Haversine Formula
A Landmark Based Shortest Path Detection by Using A* and Haversine FormulaA Landmark Based Shortest Path Detection by Using A* and Haversine Formula
A Landmark Based Shortest Path Detection by Using A* and Haversine Formula
 
Processing Over Encrypted Query Data In Internet of Things (IoTs) : CryptDBs,...
Processing Over Encrypted Query Data In Internet of Things (IoTs) : CryptDBs,...Processing Over Encrypted Query Data In Internet of Things (IoTs) : CryptDBs,...
Processing Over Encrypted Query Data In Internet of Things (IoTs) : CryptDBs,...
 
Quality Determination and Grading of Tomatoes using Raspberry Pi
Quality Determination and Grading of Tomatoes using Raspberry PiQuality Determination and Grading of Tomatoes using Raspberry Pi
Quality Determination and Grading of Tomatoes using Raspberry Pi
 
Comparative of Delay Tolerant Network Routings and Scheduling using Max-Weigh...
Comparative of Delay Tolerant Network Routings and Scheduling using Max-Weigh...Comparative of Delay Tolerant Network Routings and Scheduling using Max-Weigh...
Comparative of Delay Tolerant Network Routings and Scheduling using Max-Weigh...
 
DC Conductivity Study of Cadmium Sulfide Nanoparticles
DC Conductivity Study of Cadmium Sulfide NanoparticlesDC Conductivity Study of Cadmium Sulfide Nanoparticles
DC Conductivity Study of Cadmium Sulfide Nanoparticles
 
A Survey on Peak to Average Power Ratio Reduction Methods for LTE-OFDM
A Survey on Peak to Average Power Ratio Reduction Methods for LTE-OFDMA Survey on Peak to Average Power Ratio Reduction Methods for LTE-OFDM
A Survey on Peak to Average Power Ratio Reduction Methods for LTE-OFDM
 
IOT Based Home Appliance Control System, Location Tracking and Energy Monitoring
IOT Based Home Appliance Control System, Location Tracking and Energy MonitoringIOT Based Home Appliance Control System, Location Tracking and Energy Monitoring
IOT Based Home Appliance Control System, Location Tracking and Energy Monitoring
 
Thermal Radiation and Viscous Dissipation Effects on an Oscillatory Heat and ...
Thermal Radiation and Viscous Dissipation Effects on an Oscillatory Heat and ...Thermal Radiation and Viscous Dissipation Effects on an Oscillatory Heat and ...
Thermal Radiation and Viscous Dissipation Effects on an Oscillatory Heat and ...
 
Advance Approach towards Key Feature Extraction Using Designed Filters on Dif...
Advance Approach towards Key Feature Extraction Using Designed Filters on Dif...Advance Approach towards Key Feature Extraction Using Designed Filters on Dif...
Advance Approach towards Key Feature Extraction Using Designed Filters on Dif...
 
Alamouti-STBC based Channel Estimation Technique over MIMO OFDM System
Alamouti-STBC based Channel Estimation Technique over MIMO OFDM SystemAlamouti-STBC based Channel Estimation Technique over MIMO OFDM System
Alamouti-STBC based Channel Estimation Technique over MIMO OFDM System
 
Empirical Mode Decomposition Based Signal Analysis of Gear Fault Diagnosis
Empirical Mode Decomposition Based Signal Analysis of Gear Fault DiagnosisEmpirical Mode Decomposition Based Signal Analysis of Gear Fault Diagnosis
Empirical Mode Decomposition Based Signal Analysis of Gear Fault Diagnosis
 
Short Term Load Forecasting Using ARIMA Technique
Short Term Load Forecasting Using ARIMA TechniqueShort Term Load Forecasting Using ARIMA Technique
Short Term Load Forecasting Using ARIMA Technique
 
Impact of Coupling Coefficient on Coupled Line Coupler
Impact of Coupling Coefficient on Coupled Line CouplerImpact of Coupling Coefficient on Coupled Line Coupler
Impact of Coupling Coefficient on Coupled Line Coupler
 
Design Evaluation and Temperature Rise Test of Flameproof Induction Motor
Design Evaluation and Temperature Rise Test of Flameproof Induction MotorDesign Evaluation and Temperature Rise Test of Flameproof Induction Motor
Design Evaluation and Temperature Rise Test of Flameproof Induction Motor
 

Recently uploaded

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerAnamika Sarkar
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2RajaP95
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLDeelipZope
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)dollysharma2066
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...srsj9000
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionDr.Costas Sachpazis
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfme23b1001
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxDeepakSakkari2
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girlsssuser7cb4ff
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and usesDevarapalliHaritha
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineeringmalavadedarshan25
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024Mark Billinghurst
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfAsst.prof M.Gokilavani
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfAsst.prof M.Gokilavani
 

Recently uploaded (20)

Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube ExchangerStudy on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
Study on Air-Water & Water-Water Heat Exchange in a Finned Tube Exchanger
 
Design and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdfDesign and analysis of solar grass cutter.pdf
Design and analysis of solar grass cutter.pdf
 
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2HARMONY IN THE HUMAN BEING - Unit-II UHV-2
HARMONY IN THE HUMAN BEING - Unit-II UHV-2
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Current Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCLCurrent Transformer Drawing and GTP for MSETCL
Current Transformer Drawing and GTP for MSETCL
 
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
Call Us ≽ 8377877756 ≼ Call Girls In Shastri Nagar (Delhi)
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
young call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Serviceyoung call girls in Green Park🔝 9953056974 🔝 escort Service
young call girls in Green Park🔝 9953056974 🔝 escort Service
 
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
Gfe Mayur Vihar Call Girls Service WhatsApp -> 9999965857 Available 24x7 ^ De...
 
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective IntroductionSachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
Sachpazis Costas: Geotechnical Engineering: A student's Perspective Introduction
 
Electronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdfElectronically Controlled suspensions system .pdf
Electronically Controlled suspensions system .pdf
 
Biology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptxBiology for Computer Engineers Course Handout.pptx
Biology for Computer Engineers Course Handout.pptx
 
Call Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call GirlsCall Girls Narol 7397865700 Independent Call Girls
Call Girls Narol 7397865700 Independent Call Girls
 
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptxExploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
Exploring_Network_Security_with_JA3_by_Rakesh Seal.pptx
 
power system scada applications and uses
power system scada applications and usespower system scada applications and uses
power system scada applications and uses
 
Internship report on mechanical engineering
Internship report on mechanical engineeringInternship report on mechanical engineering
Internship report on mechanical engineering
 
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCRCall Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
Call Us -/9953056974- Call Girls In Vikaspuri-/- Delhi NCR
 
IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024IVE Industry Focused Event - Defence Sector 2024
IVE Industry Focused Event - Defence Sector 2024
 
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdfCCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
CCS355 Neural Network & Deep Learning Unit II Notes with Question bank .pdf
 
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdfCCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
CCS355 Neural Network & Deep Learning UNIT III notes and Question bank .pdf
 

DNA Sequence Similarity Algorithm Improves LCS

  • 1. International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169 Volume: 5 Issue: 7 57 – 60 _______________________________________________________________________________________________ 57 IJRITCC | July 2017, Available @ http://www.ijritcc.org _______________________________________________________________________________________ Sequence Similarity between Genetic Codes using Improved Longest Common Subsequence Algorithm A. Murugan Associate Professor, PG & Research Department of Computer Science DrAmbedkar Government Arts College, Vyasarpadi, Chennai – 600 039, Tamilnadu. e-mail: amurugan1972@gmail.com U. Udayakumar M.Phil Research Scholar, PG & Research Department of Computer Science DrAmbedkar Government Arts College, Vyasarpadi, Chennai – 600 039, Tamilnadu. e-mail: udaya.vijay13@gmail.com Abstract— Finding the sequence similarity between two genetic codes is an important problem in computational biology. In this paper, we developed an efficient algorithm to find sequence similarity between genetic codes using longest common subsequence algorithm. The algorithm takes the advantages over the edit distance algorithm and improves the performance. The proposed algorithm is tested on randomly generated DNA sequence and finding the exact DNA sequence comparison. The DNA genetic code sequence comparison can be used to discover information such as evolutionary divergence and ways to apply genetic codes from one DNA sequence to another sequence. Keywords-Dynamic Programming, DNA sequence comparison, DNA Similarity algorithms, Longest Common Subsequence. __________________________________________________*****_________________________________________________ I. INTRODUCTION DNA is the hereditary material found in almost all organisms [1]. DNA resides in every cell in the body of organism. The DNA is a double helix like structure made up of two twisted strands. A single strand carries a nucleotide bases, these bases are adenine (A), guanine (G), cytosine (C), thymine (T). These chemical letters are in the form of triplets which defines a meaningful code. The triplets give arise to specific amino acid which later helps in the formation of protein[2]. An important property of DNA is that it can replicate, or make copies of itself. Each strand of DNA in the double helix can serve as a pattern for duplicating the sequence of bases as shown in Fig.1. This is critical when cells divide because each new cell needs to have an exact copy of the DNA present in the old cell. Figure 1. Structure of DNA The knowledge of a DNA sequence and gene analysis can be used in several biological, medicine and agriculture research fields such as: possible disease or abnormality diagnoses, forensic science, pattern matching, biotechnology, etc[3][4]. The analysis and comparison studies for DNA sequences connected information technology tools and methods to accelerate findings and knowledge in biological related sciences. II. LITERATURE SURVEY There are many traditional pattern matching methodologies available in literature, they are Naive Brute force, Boyer Moore, Knuth Morris Pratt and Dynamic algorithms[2]. Pattern matching is used in various processes, like codon optimization which is carried out to enhance the efficiency of the DNA expression vectors used in DNA vaccination by increasing protein expression[5]. A. Naive Brute force It is one of the simplest algorithm having complexity O(m*n). In this, first character of pattern P(with length m) is aligned with first character of text T (with length n). Then scanning is done from left to right. As shifting is done at each step it gives less efficiency[6]. B. Boyer-Moore Algorithm It performs larger shift-increment whenever mismatch is detected. It differs from Naïve in the way of scanning. It scans the string from right to left; unlike Naive i.e. P is aligned with T such that last character of P will be matched to first character of T. If character is matched then pointer is shifted to left to very rest of the characters of the pattern[7][8]. If a mismatch is detected at character c, in T which is not in P, then P is shifted right to m positions and P is aligned to the next character after c. If c is part of P, then P is shifted right so that c is aligned with the right most occurrence of c in P. The worst time complexity is still O(m+n). C. Knuth-Morris-Pratt This algorithm is based on automaton theory. First a finite state automata model M is being created for the given pattern P. The input string T with Σ= {A, C, T, G} is processed through the model. If the pattern is present in text, the text is accepted otherwise rejected.
  • 2. International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169 Volume: 5 Issue: 7 57 – 60 _______________________________________________________________________________________________ 58 IJRITCC | July 2017, Available @ http://www.ijritcc.org _______________________________________________________________________________________ III. EFFICIENT LCS ALGORITHM The longest common subsequence problem is to find a substring that is common to the two given strings. Given two sequences, find the length of the longest subsequence present in both of them. A subsequence is a sequence that appears in the same relative order, but not necessarily contiguous [16]. In LCS problem, we eliminate the operation of substitution and allow only insertions and deletions. A Subsequence of a string v is simply an ordered sequence of characters from v. For example, if v = ATTGCTA, then AGCA and ATTA are subsequences of v whereas TGTT and TCG are not. Where ATGC denotes Adenine, Thymine, Guanine, Cytosine. Formally, we define the common subsequence of strings[9]. 1 ≤ i1 ≤ i2< . . . ik ≤ n (1) and a sequence of positions in w, 1 ≤ j1 ≤ j2< . . . jk ≤ m (2) Let si,j be the length of an LCS between v1...vi, the i-prefix of v and w1...wj, the j-prefix of w. Clearly, si,0 = s0,j = 0 for all 1 ≤ i ≤ n and 1 ≤ i ≤ n one can see that si,j satisfy the following recurrence: si,j = max si-1,j si,j-1 si-1,j-1+1, if vi=wj (3) Note that one can "rewrite" these recurrences by adding some zeros si,j=max si-1,j+0 si,j-1+0 si-1,j-1+1, if vi=wj (4) The LCS computation is like the recurrence given at the equations (3) and (4), if we were to build particularly gnarly version of Manhattan and gave horizontal and vertical edges weights of 0, and set the weights of diagonal edges equal to + 1[10]. The following is the efficient algorithm for solving longest common subsequence problem [15] that improves the time complexity and performance. Algorithm Backtracking Pointers and stored three values up, right and diagonal in Multidimensional array while x! := ―0‖ then ifsolna,b == ―Diagonal‖ then a := a - 1 b := b - 1 else b := b - 1 a := a - 1 end if return LCS[A.length][B.length] end while The following Fig. 2 shows Multidimensional array representation of the LCS algorithm Figure 2. Multidimensional Array representation for efficient LCS Algorithm The following Fig. 3 shows LCS Time Complexity and Performance Figure 3. LCS Time Complexity and Performance IV. DNA SEQUENCE SIMILARITY We proposed an algorithm to find the similarity between DNA genetic code sequences. In DNA sequence similarity, find the matching percentage between DNA genetic codes. In this paper, we proposed a new method for DNA sequence similarity. A modified Longest Common Subsequences algorithm is proposed in [15] and which is applied in the edit distance algorithm, which is an existing algorithm. The following is the modified edit distance algorithm used to find the similarity between two genetic codes. Algorithm 1: Edit Distance for DNA genetic code AlgorithmeditDistance(s1,s2) cost[] : = s2.length + 1 for i:= 0 to s1.length do lastvalue := i for j:= 0 to s2.length do if(i == j) cost[j] : = j elseif(j > 0) newValue : = costs[j - 1] costs[j - 1] : = lastValue lastValue : = newValue end if end for end for end algorithm
  • 3. International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169 Volume: 5 Issue: 7 57 – 60 _______________________________________________________________________________________________ 59 IJRITCC | July 2017, Available @ http://www.ijritcc.org _______________________________________________________________________________________ The following Fig. 4 explain the entire process of finding DNA sequence similarity Figure 4. Implementing LCS Algorithm in DNA Genetic Code Sequence Similarity The computation of the similarity score s(v,w) between v and w, while the table on the right presents the computation of the edit distance between v and w under the assumption that insertions and deletions are the only allowed operations[14]. The following TABLE Ishows DNA genetic code sequence similarity TABLE I. GENETIC CODE SIMILARITY BETWEEN SEQUENCES Human DNA Animals DNA % of matching GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCACGATTTGGGGGATGC TTCTGGCTC------A 100.00% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCACGATTTGGGGGA TGCTTCTGGCTC------A- 100.00% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCACGATTTGGGAGA TGCTTCTGGCTC-----G- 91.67% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCAGAATTTGGGGGA TGCTTCTGGCTC-----T- 88.89% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCAGAATTTGGGGGA TGCTTCTGGCTC-----T- 88.89% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCAGAATTTGGGGGA TGCTTCTGGCTC-----T- 88.89% GTCACGATTTGGGGGATGC TTCTGGCTC------A- ATCACAGTTGGGGGAT GCCACTGGCCT-----C- 75.00% GTCACGATTTGGGGGATGC TTCTGGCTC------A- ATCACAA-TTGGGGG- TGCCACGGTCCT-----C- 69.44% GTCACGATTTGGGGGATGC TTCTGGCTC------A- ATCACAATTTGGGGAA CACCACTGGCAT-----C- 69.44% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCACAATTTGGAGGA TGTTACTGGCAT-----C- 77.78% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCACAGTTTGGAGGA TGTTACTGACAT-----C- 72.22% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCATAGTTT---- GATTATATGGGCTT----- C- 58.33% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCACAATTTGGGGGA TACTACTGGCAT-----C- 80.56% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCACAGTTTAGGGGG TACTACTGGCAT-----C- 72.22% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GTCACAATTTAGGAAG TGCCACTGGCCT-----C- 72.22% GTCACGATTTGGGGGATGC TTCTGGCTC------A- GCCTCTCTTT----------- CTGCCCTGCAGGC- 33.33% GTCACGATTTGGGGGATGC TTCTGGCTC------A- ---------------- TGCTACTAATAT-----T- 36.11% Algorithm 2: genetic code similarity Algorithmsimilarity(s1, s2) longer := s1, shorter := s2 if (s1.length() < s2.length()) longer := s2 shorter : = s1 longerLength := longer.length(); if (longerLength == 0) return (longerLength - editDistance(longer, shorter)) / (double) longerLength; end if end algorithm The following Fig. 6 shows the Output for Similarity Sequences between genetic codes Figure 5. Output for DNA Genetic Code Similarity Sequences A. Time Complexity The Time Complexity of Sequence Similarity Algorithms TC(DSSA) = max{TC(LCS), TC(EDA), TC(SS)} TC(DSSA) = max{O(n2 ), O(n2 ), O(n2 )} So the time complexity of the DNA Sequence Similarity Algorithm is O(n2 ) The best case and average case time complexity T(n) = max((O(n2 ) and O(levelnumber)), O(|n|). The Worst case time complexity T(n) = max((O(n3 ) and O(levelnumber)), O(|n|). The following Figure 7 shows Graphical Representation of DNA Genetic Code between Human and Animals
  • 4. International Journal on Recent and Innovation Trends in Computing and Communication ISSN: 2321-8169 Volume: 5 Issue: 7 57 – 60 _______________________________________________________________________________________________ 60 IJRITCC | July 2017, Available @ http://www.ijritcc.org _______________________________________________________________________________________ Figure 6. Graphical Representation of DNA Genetic Code Similarity Sequences V. DNA SEQUENCE SIMILARITY In this paper, we proposed an algorithm for finding similarity between two strings. It calculate the matching percentage between two strings by using longest common subsequence (LCS) and edit distance approach by avoiding unnecessary comparisons that reduces its time complexity. The running time is better than dynamic programming based algorithms and it is due to time control parameter. The DNA sequence similarity algorithm is tested on 50 samples with two input DNA genetic code sequence strings and selected randomly in Pentium processor machine is used and it shows good results. In future the algorithm is implemented in Multiple Longest Common Subsequences which will also have many applications. REFERENCES [1] Neil C.Jones and Pavel A.Pevzner, "An Introduction to Bioinformatics Algorithms" (2004) [2] Jiaoyun Yang, Yun Xu, Yi Shang, ―An Efficient Parallel Algorithm for Longest Common Subsequence Problem on GPUs‖, World Congress Engineering, 2010. [3] Maier. D, ―The Complexity of some problems on subsequences and super sequences‖, ACM, (25), 1978, 332-336. [4] B.Lavanya and A.Murugan, "Mining Longest Common Subsequence and other related patterns using DNA operations", International Journal of Computer Application, (18), 2012, 38-44. [5] Hirosawa et al, "Comprehensive study on iterative algorithms of multiple sequence alignment". Computational Applications in Biosciences, 1995, 13-18. [6] Mahdi Esmaieli and Mansour Tarafdar, "Sequential Pattern Mining from multidimensional sequence data in parallel", International journal of Computer theory and engineering, 2010, 730-733. [7] Stormo. G, "DNA binding sites: representation and discovery". Bioinformatics, 2000, 16:16-23 [8] Kyle Jensen. L., Mark Styczynski. P., Isidore Rigoutsos, and Gregory Stephanopoulos. V, "A generic motif discovery algorithm for sequential data". Bioinformatics, 22(1) 2006, 21-28. [9] Wang. L. and Jiang. T, "On the complexity of multiple sequence alignment". Journal of Computational Biology, 1994, 337-348. [10] Nan Li and Tompa. M, "Analysis of computational tools for motif discovery". Algorithms of molecular biology, 2006, 1-8. [11] Suyama. M., Nishioka. T., and Junichi. O, "Searching for common sequence patterns among distantly related proteins". Protein Engineering, 1995,1075-1080. [12] Smith. H. O., Annau. T. M., and Chandrasegaran.S, "Finding sequence motifs in groups of functionally related proteins". Proceedings of National Academy (USA), 1990, 826-830. [13] Bin Ma, "A polynomial time approximation scheme for the closest substring problem". LCNS Springer, 2000, 99-107. [14] Martinez. M, "An efficient method to find repeats in molecular sequences". Nucleic Acid Research, 1983, 4629-4634. [15] U. Udayakumar and A. Murugan "An Efficient Algorithm For Solving Longest Common Subsequence Problem", IJESMR, 2017,4(5). [16] www.geeksforgeeks.org/dynamic-programming-set-4-longest-common- subsequence