SlideShare a Scribd company logo
1 of 9
Download to read offline
TELKOMNIKA, Vol.15, No.4, December 2017, pp. 1785~1793
ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013
DOI:10.12928/TELKOMNIKA.v15i4.5813  1785
Received August 1, 2017; Revised November 8, 2017; Accepted November 23, 2017
Pairwise Sequence Alignment between HBV and HCC
Using Modified Needleman-Wunsch Algorithm
Lailil Muflikhah*, Edy Santoso
Faculty of Computer Science, Brawijaya University, 8 Veteran Road, Malang Indonesia
*Corresponding author, e-mail: laililmf@gmail.com
Abstract
Ths paper aims to find similarity of Hepatitis B virus (HBV) and Hepatocelluler Carcinoma (HCC)
DNA sequences. It is very important in bioformatics task. The similarity of sequence allignments indicates
that they have similarity of chemical and physical properties. Mutation of the virus DNA in X region has
potential role in HCC. It is observed using pairwise sequence alignment of genotype-A in HBV. The
complexity of DNA sequence using dynamic programming, Needleman-Wunsch algorithm, is very high.
Therefore, it is purpose to modifiy the method of Needleman Wunsch algorithm for optimum global DNA
sequence alignment. The main idea is to optimize filling matrix and backtracking proccess of DNA
components.This method can also solve various length of the both sequence alignment.
This research is applied to DNA sequence of 858 hepatitis B virus and 12 carcinoma patient, so
that there are 10,296 pairwis of sequences. They are aligned globally using the purposed method and as
a result, it is achieved high similarity of 96.547% and validity of 99.854%. Furhthermore, this method has
reduced the complexity of original Needleman-Wunsch algorithm The reduction of computational time is as
34.6% and space complexity is as 42.52%.
Keywords: hepatitis, sequence alignment, DNA, Needleman-Wunsch, optimum global
Copyright © 2017 Universitas Ahmad Dahlan. All rights reserved.
1. Introduction
Hepatitis B virus (HBV) is a DNA virus which can causes acute and chronic hepatitis in
humans. The chronic hepatitis can further progress to liver cirrhosis and carcinoma of the liver
(hepatocellular carcinoma, HCC). There is prediction of more than 500,000 incidents in 2000,
HCC is the most common malignant tumor in the world and the attack rate is rising in many
countries, including Indonesia. It is the third rank in cause of death after lung cancer and
stomach [1].
Information technology in the field of molecular biology which is known as bioinformatics
is growing very rapidly. The activities include mapping and DNA analysis, protein sequencing,
aligning different DNA, constructing and performing models of protein structures in three
dimensions. The correction for DNA sequence can be observed before analyze the sequence.
This reseearch has been conducted using Fuzzy-based spectral alignment [2]. Another related
research of DNA sequence is applied to identify Breast Cancer disease [3].
However, the growth of DNA sequence databases makes the sequence alignment
computation, as one of bioinformatics tasks, increase significantly [4]. The complexity of
sequence alignment is O (LN), where L is the length of each sequence and N is the number of
sequences [5]. The longer and the more sequences to be aligned the longer it’s computational
time. The optimal time of sequence alignment is increased exponentially with increaseing of the
number and length of sequences [6].
Basically, there are two methods of sequence alignment including dynamic
programming method and heuristic method approach. The Needleman-Wunsch algorithm is a
method of optimal global sequence alignment, but Smith-Waterman is a method of optimal local
alignment which are based on dinamic programming approach algorithm. Meanwhile, several
tools that use heuristic approach are FASTA and Basic Local Alignment Search Tool (BLAST).
The produced sequent alignment is not optimal eventhough their computational speed is fast
[7]. Several methods or tools are needed to obtain optimal computational process. The previous
research to optimize the computational protein sequence alignment used a dynamic
programming algorithm but the computational complexity is high of O (mn) by Zhou and Chen
 ISSN: 1693-6930
TELKOMNIKA Vol. 15, No. 4, December 2017 : 1785 – 1793
1786
[8]. Furthermore, the developing performance of dynamic programming has been applied
through sharing memory to speed up the alignment process. The dynamic programming method
for sequence alignment has developed with share memory system using four different data
partitioning schemas: blocked columnwise, rowwise, antidiagonal, and revised blocked
columnwise [9]. Another research is also parallel computing utilized on clusters of computers
known as Distributed Memory. This research used star algorithms in parallel environment using
MPI to distribute computing of DNA Multiple sequence Alignment [10].
This research is purposed to improve the previous work conducted by Shehab et,al.
using Fast Dynamic Algorithm without sharing memory. The previous research had limitation in
scalability,which difference of sequence length cannot more than of 10 [11]. Therefore, we
develop this prevous reseach by modifying Needleman-Wunsch method as a dynamic
programming approarch in single memory for pairwise sequence alignment in order to get the
similarity rate between HBV and HCC. This method is address to get high performance in
computational process with limited reseource of memory.
2. Research Method
Implementation of pairwise sequence alignment is designed within three parts as shown
in Figure 1. First, the input sequences is data set of HBV and HCC and parameter for
evaluation. The second is applied the purposed method to align the sequences. The pusposed
method is address to improve the performance, including computaional time and space
complexity.
Figure 1. Block diagram of pairwise sequence alignment
2.1. Data Sets
There are various ways to identify the disease, either isolating the virus in the laboratory
or analysing DNA sequences of the viruses. Hepatitis B virus is a DNA virus to mutate and it
influences to be chronic disease. It is known as hepatitis with liver cancer (HCC). One way to
know that the HBV DNA is mutated, can be applied sequence alignment based on the similarity
percentage.
This research is examined to data set of HBV DNA sequences, with spesific genotype-A
in X region. It can be downloaded from website of HBV database at url:
https://hbvdb.ibcp.fr/HBVdb/HBVdbDataset?view=/data/nucleic/fasta/A_X.fas&seqtype=0
gnl|hbvcds|AB014370_X_P-A. There 858 HBV DNA sequences and 12 HCC DNA sequences.
They are applied to sequence in pairwise, so that there are totally 10296 in pairs.
Input Process Output
Data set of HBV
and HCC DNA
sequences
Evaluation
Parameter
Pairwise global sequence alignment
using modified Needleman Wunch
Sequence
alignment
Scoring, gap,
similarity
TELKOMNIKA ISSN: 1693-6930 
Pairwise Sequence Alignment between HBV and HCC Using Modified … (Lailil Muflikhah)
1787
2.2. Sequence Alignment
Sequence Alignment is the process whereby a sequence compared by looking for
patterns of of the most common characters and inter-related sequences. Pairwise sequence
alignment is a alignment process between two sequences based on similarity in order to search
on the database and multiple sequence alignment. This similarity can be expressed in
percentages. It means that the sequences have similarities to physical-chemical properties.
Another term, the degree of similarity in amino acid sequence is similar between the two protein
sequences [12]. The similarity may indicate a functional relationship, or the structural and
evolutionary relationships [13].
To perform sequence alignment, it is necessary to shift sequence at certain positions in
order to get the maximum similarity. There are two method of sequence alignment, ie global and
local alignment. The global alignment method is to compare the whole sequences. This method
is suitable to perform sequence alignment that has a high degree of similarity of the whole part
of the sequence. Another hand, the local alignment method is to compare a local area or part of
the sequence.
2.1.1. Global Alignmnt
Global alignment creates end-to-end alignment even though there is a difference in
some region. This approach is suitable to align similar sequence [8]. As illustration, it can be
showed in Figure 2.
EARDFNQYYSSIKRSGSI
:...:..:::::::....
EPKLFIQYYSSIKRTMGH
Figure 2. Global alignment
2.1.2. Local Alignment
Local alignment only aligns the most similar region within a sequence. There is no need
to align all part of sequence, only the region with big similarity according to some criteria. By
using the same sequence is as Figure 3, local alignment becomes as follow:
EARDFNQYYSSIKRSGSI
:::::::
EPKLFIQYYSSIKRTMGH
Figure 3. Local alignment
2.2. Needleman Wunch Algorithm
Needleman-Wunsch algorithm is a dynamic programming method to find global optimal
sequence alignment and it allows a gap. This algorithm is widely used in bioinformatics to align
nucleotide sequences. It works by creating an optimal sequence alignment. As illustration, there
are two DNA sequences of length m and n where a = a1a2 ... am and b = b1b2 ... bn. To find a
parallel sequence with the highest score, it must be constructed a matrix F where F (i, j) is the
value of the best sequence alignment between a1: i and b1: j. The first, it is constructed matrix F
and is filled in F (0,0) = 0, F (i, 0) = -i and F (0, j) = j. Furthermore, it is applied as Eq.1 [14]:
𝐹(𝑖, 𝑗) = 𝑚𝑎𝑥 {
𝐹(𝑖 − 1, 𝑗 − 1) + 𝑠(𝑎𝑖, 𝑏𝑗, 𝑝)
𝐹(𝑖 − 1, 𝑗)
𝐹(𝑖, 𝑗 − 1)
Eq. (1)
It is.defined 𝑠(𝑎𝑖, 𝑏𝑗, 𝑝) which is the returned function to give scoring scheme for a match
if ai = bj and a mismatch if ai ≠ bj. The final score can be obtained by looking the value of
F(M,N). The alignment is build by doing backtrack from F(M,N) to F(0,0). The procedure is
compare to the value on F(i,j) to its top left and diagonal using equation (1). For example if F(i,j)
 ISSN: 1693-6930
TELKOMNIKA Vol. 15, No. 4, December 2017 : 1785 – 1793
1788
= F(i,j-1), then backtrack is recorded an insertion in bj [2]. To get better performance, the data of
an insertion, deletion or gap can be recorded during filling the matrix. Sometimes, several path
can be generated from this process. This indicates that there are more than one alignment that
can be generated with same score. The steps of the Needleman Wunsch is illustrated in
Figure 4.
Figure 4. The steps of Needleman Wunch Algorithm
3. The purposed Method
The Needleman Wunsch algrithm is a dynamic programming approach to align global
optimally. How ever, this algoritm needs high computational time and space complexity.
Therefore, it is need to optimize the filling and backtracking process. The main deferrence of the
purposed method is filling process of matrix alignment with spesific area as in Figure 5. If the
both sequences have the same length, then filling mattrix is on the main diagonal (D), (D-1) and
(D + 1). The (D-1) is a diagonal below the main diagonal and (D + 1) is a diagonal above the
main diagonal. In this case, we use the same with FAST Needleman Wunsch algorithm [5].
Figure 5. Ilustration of filling matrix with the same sequence
The other hand, if the input sequence has different length, the matrix will have two main
diagonals. In this case, the marked area are including the first main diagonal (D1), the second
main diagonal (D2), area between D1 and D2 (x), one diagonal above D2 (D2+1) and one
diagonal below D1 (D1-1) as shown in Figure 6.
Figure 6. Marked area if the sequences input have different length
TELKOMNIKA ISSN: 1693-6930 
Pairwise Sequence Alignment between HBV and HCC Using Modified … (Lailil Muflikhah)
1789
Furthermore, the stages of the modified Needleman Wunsch are as follows:
1
st
step: Matrix Initialization
In this step, it is to construct matrix NW and to put a certain value for marking into
specific area as in Figure 5 or Figure 6. It will be given parameter value of gap openning and
gap extension.
2nd step: Filling to the matrix
There are two operation in this step as follows:
1. To count Linear evaluation
Each nucleotide is given the predetermined parameter value, including match, mismatch,
gap opening and gap extention.
As illustration, it is given the sequence A = GESKC and sequence B = GTASC. The
scoring scheme for the alignment is match = 2, mismatch = -1 and gap = -2. Filling F(0,0) by 0
and also filling the first row and first column of marked area is iterated gap value as in Figure 7.
In this case study, the data where is the value come form is represent by arrow (↑, ← and ↖).
This arrow is useful for backtrack process.
Figure. 7. Initialization stage
2. To fill NW matrix and tracer
After all the rows and columns of the marked matrix NW are counted, then the next
stage is to fill the matrix NW and tracer. To perform this stage, it takes the output from the
previous process in the form of a direction to tracer process later (D=diagonal), (H=horizontal)
and (V=vertical). After all marked area has been filled, the matrix will be look like Figure 8.
Figure 8. All marked area has been filled
3
rd
step: Backtracking process
Backtracking is the last stage of sequence alignment. At this stage, the system will
perform a trace back from the matrix that has been previously established. Starting from the end
entry of trace process (end position of matrix), it will be executed the alignment and it results
refer to the rules that have been determined. The backtracking process can be seen at Figure 9.
Figure 9. Backtracking path
 ISSN: 1693-6930
TELKOMNIKA Vol. 15, No. 4, December 2017 : 1785 – 1793
1790
The diagonal arrow (↖) means substitution, vertical arrow (↑) means insertion of a gap in B and
horizontal arrow (←) means insertion of a gap in A. From Figure 9 we can construct the
alignment is as follows:
Sequence A : G E - S K C
Sequence B : G T A S - C
Score : 2 -1 -2 2 -2 2
The final score can be computed by sum up 2 + (-1) + (-2) + 2 + (-2) + 2 = 1 or we can
see at F(7,7) = 1
4. Results and Analysis
There are two scenarios in this research. First scenario is finding characteristics for
DNA sequences of HBV and HCC based on similarity using either modified or original
Needeman Wunsch. The second scenario is comparing performance result between the
modified method and the original. This scenario is also to analysis the parameter value against
the performance.
In the first scenario, it is used the default parameter (match=2; mismatch=-3; gap
opening=-5; gap extension=-2). As a result, it is achieved using the modified method’s
performance including average similarity rate=96.549% and score=849.8 with computational
time=1370 ms. How ever, the performance is achieved using original Needleman Wunsch, i.e.
similarity rate= 96.548% and score = 849.8 with total computational time=64131ms. Therefore,
the accuracy rate of the both method is 99.854% and there is time reduction as 78.95%. The
performance of experimental result is partially showed in Table 1
Table 1. Comparisson of Performance the Both Algorithm for Pairwise Sequence Alignment
Sequence no
Modified Needleman-Wunsch Original Needleman-Wunsch
score
Similarity
(%)
Time (ms) score
Similarity
(%)
Time (ms)
1 900 98,71 50 900 98,71 90
2 860 96,99 38 860 96,99 28
3 875 97,63 37 875 97,63 19
4 900 98,71 5 900 98,71 10
5 900 98,71 3 900 98,71 11
6 905 98,92 36 905 98,92 8
7 895 98,49 4 895 98,49 10
8 895 98,49 7 895 98,49 11
9 895 98,49 3 895 98,49 13
10 840 96,13 6 840 96,13 13
Then, the second scenario is analisys the parameter against the computational
performance. By using various parameters including match, mismatch, gap opening and gap
extention is obtained similarity rate of the original method as shown in Figure 10. The match,
gap opening and gap extention show that the larger parameter value is given, the lower
similarity is. Otherwise, the lower parameter value of mismatch, the higher similarity is.
The performance measure including computational time and space complexity is
evaluated. Using parameter randomly, there is reduction of computational time in average of
34.60% as in Table 2. In space compexity, it is evaluated by filling the matrix of sequence
alignment and the partial result is showed as in Table 3.
TELKOMNIKA ISSN: 1693-6930 
Pairwise Sequence Alignment between HBV and HCC Using Modified … (Lailil Muflikhah)
1791
Figure 10. The effect of parameter value to similarity performance
Table 2. Computational Time for Sequence Alignment
Match Mismatch
Gap
Openning
Gap
Extention
Time requirement for sequence alignment
Modified
NW
Original
NW
Reduction
(ms)
Reduction
(%)
1 -1 -5 -2 252474 322590 70116 21,74
1 -2 -5 -2 509065 618041 108976 17,63
1 -1 -2 -4 155513 306827 151314 49,32
1 -2 -2 -4 160639 313108 152469 48,70
2 -3 -2 -2 228456 320816 92360 28,79
2 -3 -2 -4 159961 311777 151816 48,69
2 -3 -4 -4 242231 373504 131273 35,15
2 -3 -5 -2 203316 387511 184195 47,53
2 -3 -6 -2 235820 329285 93465 28,38
Average 238608,33 364828,78 126220,44 34,60
Table 3. The Number of Filled Matrix Area
Sequence # Modified NW Original NW
1 52339 99645
2 42484 89790
3 71173 118479
4 47521 94827
5 224692 271998
6 67450 114756
7 84094 131400
8 9070 36792
9 28906 76212
10 92635 139941
99.30
99.40
99.50
99.60
99.70
99.80
99.90
100.00
1 2 3 4 5 6 7 8 9 10
Similarity(%)
Match score
-
20.00
40.00
60.00
80.00
100.00
120.00
-10 -9 -8 -7 -6 -5 -4 -3 -2 -1
Similarity(%)
Mismatch score
99.30
99.40
99.50
99.60
99.70
99.80
99.90
100.00
-10 -9 -8 -7 -6 -5 -4 -3 -2 -1
Similarity(%)
Gap Opening score
99.6
99.65
99.7
99.75
99.8
99.85
99.9
99.95
100
100.05
-10 -9 -8 -7 -6 -5 -4 -3 -2 -1
Similarity(%)
Gap Extension score
 ISSN: 1693-6930
TELKOMNIKA Vol. 15, No. 4, December 2017 : 1785 – 1793
1792
Furthermore, the modified method is evaluated for the effect of total sequences
alignment against the performance. How ever, the experimental result is showed that the total
sequence is no effect to the performance as shown in Figure 11 and Figure 12.
Figure 11. The performance of scoring alignment against total sequences
Figure 12. The performance of similarity against total sequences
5. Conclusion
The modified method of Needleman Wunch has been successfully implemented to
perform the pairwise DNA sequence global alignment optimally between HBV and HCC:
1. The main principle in the modified method of Needleman Wunsch is the matrix region of
filling DNA sequences for marking and backtracking process to get the sequence result.
2. The level of space and time complexity of the method are used by O(n) of the original
method of O(mn). It is based on the experimental result that there is a reduction of filled
matrix area of 42.52%.
3. Testing pairwise sequence alignment between hepatitis B virus and hepatocellular
carcinoma DNA sequences is generated at high level of similarity rate in average value of
96 547%
6. Future Work
Based on the experimental result, there are still many shortcomings that needed some
repairs for further research as follows:
- In this study conducted by pairwise alignment (pairs), it is advisable to do multiple sequence
alignment to increase speed up the computational time.
830
840
850
860
870
880
890
10 25 50 100 150 200 300 400 500 1000 1500 2000 3000 4000 5000
Score
Number of DNA sequences
Score-modif
Score-Ori
95.5
96
96.5
97
97.5
98
98.5
Similarity (%)Modif
Similarity (%) Ori
TELKOMNIKA ISSN: 1693-6930 
Pairwise Sequence Alignment between HBV and HCC Using Modified … (Lailil Muflikhah)
1793
- This research is obtained for the scoring and similarity, therefore for further research are
expected to be able to determine the position of mutation in DNA sequences. Furthermore,
it can be used to early detection of carcinoma (HCC) based on the HBV DNA sequence
alignment.
Acknowledgement
This research was financially supported by Faculty of Computer Science, Brawijaya
University, Indonesia under program Superior Research Grant in Faculty. We would like to
thank Nashi Widodo Ph.D., Brawijaya University, for his fruitful and guidance.
References
[1] Lupberger J, Hildt E. Hepatitis B Virus-induced Ocogenesis. World J.Gastroenterol. 2007; 13(1): 74-
81
[2] Saputra S, Kana, Buono, Agus, Kusuma WA. Fuzzy-based Spectral Alighment from Generation
Sequencer. TELKOMNIKA, Telecomunication, Computing, Electronics and Control. 2016; 14(2): 707-
714.
[3] Muflikhah L, Yuliantoro I. Identifying Cancer Disease through Deoxyribonucleic Acid (DNA)
Sequential Pattern Mining. International Journal of Intelligence Science, 2017; 7: 9-23.
[4] Liu Y, Schmidt B, Maskell DL. MSA-CUDA. Multiple Sequence Alignment on Graphics Processing
Units with CUDA. IEEE International Conference on Application-specific Systems, Architectures and
Processors. 2009: 121-128
[5] Lloyd GS. 2010. Parallel Multiple Sequence Alignment: An Overview.
[6] Edgar RC, Batzoglou S. Multiple sequence alignment. Current opinion in structural biology. 2006;
16(3): 368-373.
[7] Kasap S, Benkrid K, Liu Y. Design and Implementation of an FPGA-based Core for Gapped BLAST
Sequence Alignment with the Two-Hit Method. Engineering Letters. 2008; 16(3).
[8] Zhou Zm, Chen Z-w. Dynamic Programming for Protein Sequence Alignment. International Journal of
Bio-Science and Bio-Technology. 2013; 5(2): 141-150.
[9] Rahmad A., Auriza, Sukoco H., Kusuma, A.W. Comparison of Data Partitioning Schema of Parallel
Pairwise Alignment on Shared Memory System. TELKOMNIKA, Telecomunication, Computing,
Electronics and Control. 2015; 13(2): 694-702.
[10] Satra R, Kusuma WA, Sukoco H. Accelerating Computation of DNA Multiple Sequence Alignment in
Distributed Environtment. TELKOMNIKA Indonesian Journal of Electrical Engineering. 2014; 12 (12):
8278-8285.
[11] A Shehab S, Keshk A, Mahgoub H. Fast Dynamic Algorithm for Sequence Alignment based on
Bioinformatics. International Journal of Computer Applications. 2012; 32(7): 54-61.
[12] Xiong J. Essential Bioinformatics. 2006. New York: Cambridge University Press.
[13] EMBL-EBI, 2015. Pairwise Sequence Alignment. [Online] Available at:
http://www.ebi.ac.uk/Tools/psa/ [ available at 8 10 2015].
[14] Nanni L, Lumini A. Generalized Needleman–Wunsch algorithm for the recognition of T-cell epitopes.
Expert Systems with Applications. 2008; 35:1463–1467.

More Related Content

What's hot

Pattern recognition system based on support vector machines
Pattern recognition system based on support vector machinesPattern recognition system based on support vector machines
Pattern recognition system based on support vector machinesAlexander Decker
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)AnkitTiwari354
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-naveed ul mushtaq
 
Heterogeneous Information Network Embedding for Recommendation
Heterogeneous Information Network Embedding for RecommendationHeterogeneous Information Network Embedding for Recommendation
Heterogeneous Information Network Embedding for RecommendationJAYAPRAKASH JPINFOTECH
 
Heterogeneous Information Network Embedding for Recommendation
Heterogeneous Information Network Embedding for RecommendationHeterogeneous Information Network Embedding for Recommendation
Heterogeneous Information Network Embedding for RecommendationJAYAPRAKASH JPINFOTECH
 
An Efficient Clustering Method for Aggregation on Data Fragments
An Efficient Clustering Method for Aggregation on Data FragmentsAn Efficient Clustering Method for Aggregation on Data Fragments
An Efficient Clustering Method for Aggregation on Data FragmentsIJMER
 

What's hot (9)

Pattern recognition system based on support vector machines
Pattern recognition system based on support vector machinesPattern recognition system based on support vector machines
Pattern recognition system based on support vector machines
 
2018 BDSRA Roine CLN3
2018 BDSRA Roine CLN32018 BDSRA Roine CLN3
2018 BDSRA Roine CLN3
 
Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)Sequence homology search and multiple sequence alignment(1)
Sequence homology search and multiple sequence alignment(1)
 
2009 spie hmm
2009 spie hmm2009 spie hmm
2009 spie hmm
 
Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-Sequence alig Sequence Alignment Pairwise alignment:-
Sequence alig Sequence Alignment Pairwise alignment:-
 
Heterogeneous Information Network Embedding for Recommendation
Heterogeneous Information Network Embedding for RecommendationHeterogeneous Information Network Embedding for Recommendation
Heterogeneous Information Network Embedding for Recommendation
 
Heterogeneous Information Network Embedding for Recommendation
Heterogeneous Information Network Embedding for RecommendationHeterogeneous Information Network Embedding for Recommendation
Heterogeneous Information Network Embedding for Recommendation
 
An Efficient Clustering Method for Aggregation on Data Fragments
An Efficient Clustering Method for Aggregation on Data FragmentsAn Efficient Clustering Method for Aggregation on Data Fragments
An Efficient Clustering Method for Aggregation on Data Fragments
 
Topology
TopologyTopology
Topology
 

Similar to Pairwise Sequence Alignment between HBV and HCC Using Modified Needleman-Wunsch Algorithm

GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACHGPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACHijdms
 
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...cscpconf
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...IJCSEIT Journal
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceIJSTA
 
Jcb 2005-12-1103
Jcb 2005-12-1103Jcb 2005-12-1103
Jcb 2005-12-1103Farah Diba
 
BRITEREU_finalposter
BRITEREU_finalposterBRITEREU_finalposter
BRITEREU_finalposterElsa Fecke
 
Novel modelling of clustering for enhanced classification performance on gene...
Novel modelling of clustering for enhanced classification performance on gene...Novel modelling of clustering for enhanced classification performance on gene...
Novel modelling of clustering for enhanced classification performance on gene...IJECEIAES
 
Co-clustering algorithm for the identification of cancer subtypes from gene e...
Co-clustering algorithm for the identification of cancer subtypes from gene e...Co-clustering algorithm for the identification of cancer subtypes from gene e...
Co-clustering algorithm for the identification of cancer subtypes from gene e...TELKOMNIKA JOURNAL
 
Iganfis Data Mining Approach for Forecasting Cancer Threats
Iganfis Data Mining Approach for Forecasting Cancer ThreatsIganfis Data Mining Approach for Forecasting Cancer Threats
Iganfis Data Mining Approach for Forecasting Cancer Threatsijsrd.com
 
Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...eSAT Journals
 
Comparative analysis of dynamic programming
Comparative analysis of dynamic programmingComparative analysis of dynamic programming
Comparative analysis of dynamic programmingeSAT Publishing House
 
Bioinformatics_Sequence Analysis
Bioinformatics_Sequence AnalysisBioinformatics_Sequence Analysis
Bioinformatics_Sequence AnalysisSangeeta Das
 
Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...
Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...
Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...IOSRjournaljce
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...ijcseit
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ijcseit
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...ijcseit
 

Similar to Pairwise Sequence Alignment between HBV and HCC Using Modified Needleman-Wunsch Algorithm (20)

Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
 
Sequence alignment
Sequence alignmentSequence alignment
Sequence alignment
 
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACHGPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
GPCODON ALIGNMENT: A GLOBAL PAIRWISE CODON BASED SEQUENCE ALIGNMENT APPROACH
 
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
 
International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...International Journal of Computer Science, Engineering and Information Techno...
International Journal of Computer Science, Engineering and Information Techno...
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
 
Jcb 2005-12-1103
Jcb 2005-12-1103Jcb 2005-12-1103
Jcb 2005-12-1103
 
BRITEREU_finalposter
BRITEREU_finalposterBRITEREU_finalposter
BRITEREU_finalposter
 
1207.2600
1207.26001207.2600
1207.2600
 
Novel modelling of clustering for enhanced classification performance on gene...
Novel modelling of clustering for enhanced classification performance on gene...Novel modelling of clustering for enhanced classification performance on gene...
Novel modelling of clustering for enhanced classification performance on gene...
 
Co-clustering algorithm for the identification of cancer subtypes from gene e...
Co-clustering algorithm for the identification of cancer subtypes from gene e...Co-clustering algorithm for the identification of cancer subtypes from gene e...
Co-clustering algorithm for the identification of cancer subtypes from gene e...
 
Iganfis Data Mining Approach for Forecasting Cancer Threats
Iganfis Data Mining Approach for Forecasting Cancer ThreatsIganfis Data Mining Approach for Forecasting Cancer Threats
Iganfis Data Mining Approach for Forecasting Cancer Threats
 
Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...
 
Comparative analysis of dynamic programming
Comparative analysis of dynamic programmingComparative analysis of dynamic programming
Comparative analysis of dynamic programming
 
Bioinformatics_Sequence Analysis
Bioinformatics_Sequence AnalysisBioinformatics_Sequence Analysis
Bioinformatics_Sequence Analysis
 
2224d_final
2224d_final2224d_final
2224d_final
 
Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...
Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...
Application of Hybrid Genetic Algorithm Using Artificial Neural Network in Da...
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES
 
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
A COMPARATIVE ANALYSIS OF PROGRESSIVE MULTIPLE SEQUENCE ALIGNMENT APPROACHES ...
 

More from TELKOMNIKA JOURNAL

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...TELKOMNIKA JOURNAL
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...TELKOMNIKA JOURNAL
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...TELKOMNIKA JOURNAL
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...TELKOMNIKA JOURNAL
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...TELKOMNIKA JOURNAL
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaTELKOMNIKA JOURNAL
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireTELKOMNIKA JOURNAL
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkTELKOMNIKA JOURNAL
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsTELKOMNIKA JOURNAL
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...TELKOMNIKA JOURNAL
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesTELKOMNIKA JOURNAL
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...TELKOMNIKA JOURNAL
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemTELKOMNIKA JOURNAL
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...TELKOMNIKA JOURNAL
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorTELKOMNIKA JOURNAL
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...TELKOMNIKA JOURNAL
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...TELKOMNIKA JOURNAL
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksTELKOMNIKA JOURNAL
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingTELKOMNIKA JOURNAL
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...TELKOMNIKA JOURNAL
 

More from TELKOMNIKA JOURNAL (20)

Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...Amazon products reviews classification based on machine learning, deep learni...
Amazon products reviews classification based on machine learning, deep learni...
 
Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...Design, simulation, and analysis of microstrip patch antenna for wireless app...
Design, simulation, and analysis of microstrip patch antenna for wireless app...
 
Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...Design and simulation an optimal enhanced PI controller for congestion avoida...
Design and simulation an optimal enhanced PI controller for congestion avoida...
 
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
Improving the detection of intrusion in vehicular ad-hoc networks with modifi...
 
Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...Conceptual model of internet banking adoption with perceived risk and trust f...
Conceptual model of internet banking adoption with perceived risk and trust f...
 
Efficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antennaEfficient combined fuzzy logic and LMS algorithm for smart antenna
Efficient combined fuzzy logic and LMS algorithm for smart antenna
 
Design and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fireDesign and implementation of a LoRa-based system for warning of forest fire
Design and implementation of a LoRa-based system for warning of forest fire
 
Wavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio networkWavelet-based sensing technique in cognitive radio network
Wavelet-based sensing technique in cognitive radio network
 
A novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bandsA novel compact dual-band bandstop filter with enhanced rejection bands
A novel compact dual-band bandstop filter with enhanced rejection bands
 
Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...Deep learning approach to DDoS attack with imbalanced data at the application...
Deep learning approach to DDoS attack with imbalanced data at the application...
 
Brief note on match and miss-match uncertainties
Brief note on match and miss-match uncertaintiesBrief note on match and miss-match uncertainties
Brief note on match and miss-match uncertainties
 
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
Implementation of FinFET technology based low power 4×4 Wallace tree multipli...
 
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G systemEvaluation of the weighted-overlap add model with massive MIMO in a 5G system
Evaluation of the weighted-overlap add model with massive MIMO in a 5G system
 
Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...Reflector antenna design in different frequencies using frequency selective s...
Reflector antenna design in different frequencies using frequency selective s...
 
Reagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensorReagentless iron detection in water based on unclad fiber optical sensor
Reagentless iron detection in water based on unclad fiber optical sensor
 
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...Impact of CuS counter electrode calcination temperature on quantum dot sensit...
Impact of CuS counter electrode calcination temperature on quantum dot sensit...
 
A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...A progressive learning for structural tolerance online sequential extreme lea...
A progressive learning for structural tolerance online sequential extreme lea...
 
Electroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networksElectroencephalography-based brain-computer interface using neural networks
Electroencephalography-based brain-computer interface using neural networks
 
Adaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imagingAdaptive segmentation algorithm based on level set model in medical imaging
Adaptive segmentation algorithm based on level set model in medical imaging
 
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
Automatic channel selection using shuffled frog leaping algorithm for EEG bas...
 

Recently uploaded

result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college projectTonystark477637
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVRajaP95
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxpurnimasatapathy1234
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxupamatechverse
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...Soham Mondal
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls in Nagpur High Profile
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlysanyuktamishra911
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Dr.Costas Sachpazis
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSKurinjimalarL3
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).pptssuser5c9d4b1
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations120cr0395
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCall Girls in Nagpur High Profile
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingrknatarajan
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...ranjana rawat
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Christo Ananth
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSRajkumarAkumalla
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxpranjaldaimarysona
 

Recently uploaded (20)

result management system report for college project
result management system report for college projectresult management system report for college project
result management system report for college project
 
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IVHARMONY IN THE NATURE AND EXISTENCE - Unit-IV
HARMONY IN THE NATURE AND EXISTENCE - Unit-IV
 
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service NashikCall Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
Call Girls Service Nashik Vaishnavi 7001305949 Independent Escort Service Nashik
 
Microscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptxMicroscopic Analysis of Ceramic Materials.pptx
Microscopic Analysis of Ceramic Materials.pptx
 
Introduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptxIntroduction to IEEE STANDARDS and its different types.pptx
Introduction to IEEE STANDARDS and its different types.pptx
 
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
OSVC_Meta-Data based Simulation Automation to overcome Verification Challenge...
 
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur EscortsCall Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
Call Girls Service Nagpur Tanvi Call 7001035870 Meet With Nagpur Escorts
 
KubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghlyKubeKraft presentation @CloudNativeHooghly
KubeKraft presentation @CloudNativeHooghly
 
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
Sheet Pile Wall Design and Construction: A Practical Guide for Civil Engineer...
 
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICSAPPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
APPLICATIONS-AC/DC DRIVES-OPERATING CHARACTERISTICS
 
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANVI) Koregaon Park Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
247267395-1-Symmetric-and-distributed-shared-memory-architectures-ppt (1).ppt
 
Extrusion Processes and Their Limitations
Extrusion Processes and Their LimitationsExtrusion Processes and Their Limitations
Extrusion Processes and Their Limitations
 
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service NashikCollege Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
College Call Girls Nashik Nehal 7001305949 Independent Escort Service Nashik
 
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and workingUNIT-V FMM.HYDRAULIC TURBINE - Construction and working
UNIT-V FMM.HYDRAULIC TURBINE - Construction and working
 
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
(ANJALI) Dange Chowk Call Girls Just Call 7001035870 [ Cash on Delivery ] Pun...
 
Roadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and RoutesRoadmap to Membership of RICS - Pathways and Routes
Roadmap to Membership of RICS - Pathways and Routes
 
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
Call for Papers - Educational Administration: Theory and Practice, E-ISSN: 21...
 
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICSHARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
HARDNESS, FRACTURE TOUGHNESS AND STRENGTH OF CERAMICS
 
Processing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptxProcessing & Properties of Floor and Wall Tiles.pptx
Processing & Properties of Floor and Wall Tiles.pptx
 

Pairwise Sequence Alignment between HBV and HCC Using Modified Needleman-Wunsch Algorithm

  • 1. TELKOMNIKA, Vol.15, No.4, December 2017, pp. 1785~1793 ISSN: 1693-6930, accredited A by DIKTI, Decree No: 58/DIKTI/Kep/2013 DOI:10.12928/TELKOMNIKA.v15i4.5813  1785 Received August 1, 2017; Revised November 8, 2017; Accepted November 23, 2017 Pairwise Sequence Alignment between HBV and HCC Using Modified Needleman-Wunsch Algorithm Lailil Muflikhah*, Edy Santoso Faculty of Computer Science, Brawijaya University, 8 Veteran Road, Malang Indonesia *Corresponding author, e-mail: laililmf@gmail.com Abstract Ths paper aims to find similarity of Hepatitis B virus (HBV) and Hepatocelluler Carcinoma (HCC) DNA sequences. It is very important in bioformatics task. The similarity of sequence allignments indicates that they have similarity of chemical and physical properties. Mutation of the virus DNA in X region has potential role in HCC. It is observed using pairwise sequence alignment of genotype-A in HBV. The complexity of DNA sequence using dynamic programming, Needleman-Wunsch algorithm, is very high. Therefore, it is purpose to modifiy the method of Needleman Wunsch algorithm for optimum global DNA sequence alignment. The main idea is to optimize filling matrix and backtracking proccess of DNA components.This method can also solve various length of the both sequence alignment. This research is applied to DNA sequence of 858 hepatitis B virus and 12 carcinoma patient, so that there are 10,296 pairwis of sequences. They are aligned globally using the purposed method and as a result, it is achieved high similarity of 96.547% and validity of 99.854%. Furhthermore, this method has reduced the complexity of original Needleman-Wunsch algorithm The reduction of computational time is as 34.6% and space complexity is as 42.52%. Keywords: hepatitis, sequence alignment, DNA, Needleman-Wunsch, optimum global Copyright © 2017 Universitas Ahmad Dahlan. All rights reserved. 1. Introduction Hepatitis B virus (HBV) is a DNA virus which can causes acute and chronic hepatitis in humans. The chronic hepatitis can further progress to liver cirrhosis and carcinoma of the liver (hepatocellular carcinoma, HCC). There is prediction of more than 500,000 incidents in 2000, HCC is the most common malignant tumor in the world and the attack rate is rising in many countries, including Indonesia. It is the third rank in cause of death after lung cancer and stomach [1]. Information technology in the field of molecular biology which is known as bioinformatics is growing very rapidly. The activities include mapping and DNA analysis, protein sequencing, aligning different DNA, constructing and performing models of protein structures in three dimensions. The correction for DNA sequence can be observed before analyze the sequence. This reseearch has been conducted using Fuzzy-based spectral alignment [2]. Another related research of DNA sequence is applied to identify Breast Cancer disease [3]. However, the growth of DNA sequence databases makes the sequence alignment computation, as one of bioinformatics tasks, increase significantly [4]. The complexity of sequence alignment is O (LN), where L is the length of each sequence and N is the number of sequences [5]. The longer and the more sequences to be aligned the longer it’s computational time. The optimal time of sequence alignment is increased exponentially with increaseing of the number and length of sequences [6]. Basically, there are two methods of sequence alignment including dynamic programming method and heuristic method approach. The Needleman-Wunsch algorithm is a method of optimal global sequence alignment, but Smith-Waterman is a method of optimal local alignment which are based on dinamic programming approach algorithm. Meanwhile, several tools that use heuristic approach are FASTA and Basic Local Alignment Search Tool (BLAST). The produced sequent alignment is not optimal eventhough their computational speed is fast [7]. Several methods or tools are needed to obtain optimal computational process. The previous research to optimize the computational protein sequence alignment used a dynamic programming algorithm but the computational complexity is high of O (mn) by Zhou and Chen
  • 2.  ISSN: 1693-6930 TELKOMNIKA Vol. 15, No. 4, December 2017 : 1785 – 1793 1786 [8]. Furthermore, the developing performance of dynamic programming has been applied through sharing memory to speed up the alignment process. The dynamic programming method for sequence alignment has developed with share memory system using four different data partitioning schemas: blocked columnwise, rowwise, antidiagonal, and revised blocked columnwise [9]. Another research is also parallel computing utilized on clusters of computers known as Distributed Memory. This research used star algorithms in parallel environment using MPI to distribute computing of DNA Multiple sequence Alignment [10]. This research is purposed to improve the previous work conducted by Shehab et,al. using Fast Dynamic Algorithm without sharing memory. The previous research had limitation in scalability,which difference of sequence length cannot more than of 10 [11]. Therefore, we develop this prevous reseach by modifying Needleman-Wunsch method as a dynamic programming approarch in single memory for pairwise sequence alignment in order to get the similarity rate between HBV and HCC. This method is address to get high performance in computational process with limited reseource of memory. 2. Research Method Implementation of pairwise sequence alignment is designed within three parts as shown in Figure 1. First, the input sequences is data set of HBV and HCC and parameter for evaluation. The second is applied the purposed method to align the sequences. The pusposed method is address to improve the performance, including computaional time and space complexity. Figure 1. Block diagram of pairwise sequence alignment 2.1. Data Sets There are various ways to identify the disease, either isolating the virus in the laboratory or analysing DNA sequences of the viruses. Hepatitis B virus is a DNA virus to mutate and it influences to be chronic disease. It is known as hepatitis with liver cancer (HCC). One way to know that the HBV DNA is mutated, can be applied sequence alignment based on the similarity percentage. This research is examined to data set of HBV DNA sequences, with spesific genotype-A in X region. It can be downloaded from website of HBV database at url: https://hbvdb.ibcp.fr/HBVdb/HBVdbDataset?view=/data/nucleic/fasta/A_X.fas&seqtype=0 gnl|hbvcds|AB014370_X_P-A. There 858 HBV DNA sequences and 12 HCC DNA sequences. They are applied to sequence in pairwise, so that there are totally 10296 in pairs. Input Process Output Data set of HBV and HCC DNA sequences Evaluation Parameter Pairwise global sequence alignment using modified Needleman Wunch Sequence alignment Scoring, gap, similarity
  • 3. TELKOMNIKA ISSN: 1693-6930  Pairwise Sequence Alignment between HBV and HCC Using Modified … (Lailil Muflikhah) 1787 2.2. Sequence Alignment Sequence Alignment is the process whereby a sequence compared by looking for patterns of of the most common characters and inter-related sequences. Pairwise sequence alignment is a alignment process between two sequences based on similarity in order to search on the database and multiple sequence alignment. This similarity can be expressed in percentages. It means that the sequences have similarities to physical-chemical properties. Another term, the degree of similarity in amino acid sequence is similar between the two protein sequences [12]. The similarity may indicate a functional relationship, or the structural and evolutionary relationships [13]. To perform sequence alignment, it is necessary to shift sequence at certain positions in order to get the maximum similarity. There are two method of sequence alignment, ie global and local alignment. The global alignment method is to compare the whole sequences. This method is suitable to perform sequence alignment that has a high degree of similarity of the whole part of the sequence. Another hand, the local alignment method is to compare a local area or part of the sequence. 2.1.1. Global Alignmnt Global alignment creates end-to-end alignment even though there is a difference in some region. This approach is suitable to align similar sequence [8]. As illustration, it can be showed in Figure 2. EARDFNQYYSSIKRSGSI :...:..:::::::.... EPKLFIQYYSSIKRTMGH Figure 2. Global alignment 2.1.2. Local Alignment Local alignment only aligns the most similar region within a sequence. There is no need to align all part of sequence, only the region with big similarity according to some criteria. By using the same sequence is as Figure 3, local alignment becomes as follow: EARDFNQYYSSIKRSGSI ::::::: EPKLFIQYYSSIKRTMGH Figure 3. Local alignment 2.2. Needleman Wunch Algorithm Needleman-Wunsch algorithm is a dynamic programming method to find global optimal sequence alignment and it allows a gap. This algorithm is widely used in bioinformatics to align nucleotide sequences. It works by creating an optimal sequence alignment. As illustration, there are two DNA sequences of length m and n where a = a1a2 ... am and b = b1b2 ... bn. To find a parallel sequence with the highest score, it must be constructed a matrix F where F (i, j) is the value of the best sequence alignment between a1: i and b1: j. The first, it is constructed matrix F and is filled in F (0,0) = 0, F (i, 0) = -i and F (0, j) = j. Furthermore, it is applied as Eq.1 [14]: 𝐹(𝑖, 𝑗) = 𝑚𝑎𝑥 { 𝐹(𝑖 − 1, 𝑗 − 1) + 𝑠(𝑎𝑖, 𝑏𝑗, 𝑝) 𝐹(𝑖 − 1, 𝑗) 𝐹(𝑖, 𝑗 − 1) Eq. (1) It is.defined 𝑠(𝑎𝑖, 𝑏𝑗, 𝑝) which is the returned function to give scoring scheme for a match if ai = bj and a mismatch if ai ≠ bj. The final score can be obtained by looking the value of F(M,N). The alignment is build by doing backtrack from F(M,N) to F(0,0). The procedure is compare to the value on F(i,j) to its top left and diagonal using equation (1). For example if F(i,j)
  • 4.  ISSN: 1693-6930 TELKOMNIKA Vol. 15, No. 4, December 2017 : 1785 – 1793 1788 = F(i,j-1), then backtrack is recorded an insertion in bj [2]. To get better performance, the data of an insertion, deletion or gap can be recorded during filling the matrix. Sometimes, several path can be generated from this process. This indicates that there are more than one alignment that can be generated with same score. The steps of the Needleman Wunsch is illustrated in Figure 4. Figure 4. The steps of Needleman Wunch Algorithm 3. The purposed Method The Needleman Wunsch algrithm is a dynamic programming approach to align global optimally. How ever, this algoritm needs high computational time and space complexity. Therefore, it is need to optimize the filling and backtracking process. The main deferrence of the purposed method is filling process of matrix alignment with spesific area as in Figure 5. If the both sequences have the same length, then filling mattrix is on the main diagonal (D), (D-1) and (D + 1). The (D-1) is a diagonal below the main diagonal and (D + 1) is a diagonal above the main diagonal. In this case, we use the same with FAST Needleman Wunsch algorithm [5]. Figure 5. Ilustration of filling matrix with the same sequence The other hand, if the input sequence has different length, the matrix will have two main diagonals. In this case, the marked area are including the first main diagonal (D1), the second main diagonal (D2), area between D1 and D2 (x), one diagonal above D2 (D2+1) and one diagonal below D1 (D1-1) as shown in Figure 6. Figure 6. Marked area if the sequences input have different length
  • 5. TELKOMNIKA ISSN: 1693-6930  Pairwise Sequence Alignment between HBV and HCC Using Modified … (Lailil Muflikhah) 1789 Furthermore, the stages of the modified Needleman Wunsch are as follows: 1 st step: Matrix Initialization In this step, it is to construct matrix NW and to put a certain value for marking into specific area as in Figure 5 or Figure 6. It will be given parameter value of gap openning and gap extension. 2nd step: Filling to the matrix There are two operation in this step as follows: 1. To count Linear evaluation Each nucleotide is given the predetermined parameter value, including match, mismatch, gap opening and gap extention. As illustration, it is given the sequence A = GESKC and sequence B = GTASC. The scoring scheme for the alignment is match = 2, mismatch = -1 and gap = -2. Filling F(0,0) by 0 and also filling the first row and first column of marked area is iterated gap value as in Figure 7. In this case study, the data where is the value come form is represent by arrow (↑, ← and ↖). This arrow is useful for backtrack process. Figure. 7. Initialization stage 2. To fill NW matrix and tracer After all the rows and columns of the marked matrix NW are counted, then the next stage is to fill the matrix NW and tracer. To perform this stage, it takes the output from the previous process in the form of a direction to tracer process later (D=diagonal), (H=horizontal) and (V=vertical). After all marked area has been filled, the matrix will be look like Figure 8. Figure 8. All marked area has been filled 3 rd step: Backtracking process Backtracking is the last stage of sequence alignment. At this stage, the system will perform a trace back from the matrix that has been previously established. Starting from the end entry of trace process (end position of matrix), it will be executed the alignment and it results refer to the rules that have been determined. The backtracking process can be seen at Figure 9. Figure 9. Backtracking path
  • 6.  ISSN: 1693-6930 TELKOMNIKA Vol. 15, No. 4, December 2017 : 1785 – 1793 1790 The diagonal arrow (↖) means substitution, vertical arrow (↑) means insertion of a gap in B and horizontal arrow (←) means insertion of a gap in A. From Figure 9 we can construct the alignment is as follows: Sequence A : G E - S K C Sequence B : G T A S - C Score : 2 -1 -2 2 -2 2 The final score can be computed by sum up 2 + (-1) + (-2) + 2 + (-2) + 2 = 1 or we can see at F(7,7) = 1 4. Results and Analysis There are two scenarios in this research. First scenario is finding characteristics for DNA sequences of HBV and HCC based on similarity using either modified or original Needeman Wunsch. The second scenario is comparing performance result between the modified method and the original. This scenario is also to analysis the parameter value against the performance. In the first scenario, it is used the default parameter (match=2; mismatch=-3; gap opening=-5; gap extension=-2). As a result, it is achieved using the modified method’s performance including average similarity rate=96.549% and score=849.8 with computational time=1370 ms. How ever, the performance is achieved using original Needleman Wunsch, i.e. similarity rate= 96.548% and score = 849.8 with total computational time=64131ms. Therefore, the accuracy rate of the both method is 99.854% and there is time reduction as 78.95%. The performance of experimental result is partially showed in Table 1 Table 1. Comparisson of Performance the Both Algorithm for Pairwise Sequence Alignment Sequence no Modified Needleman-Wunsch Original Needleman-Wunsch score Similarity (%) Time (ms) score Similarity (%) Time (ms) 1 900 98,71 50 900 98,71 90 2 860 96,99 38 860 96,99 28 3 875 97,63 37 875 97,63 19 4 900 98,71 5 900 98,71 10 5 900 98,71 3 900 98,71 11 6 905 98,92 36 905 98,92 8 7 895 98,49 4 895 98,49 10 8 895 98,49 7 895 98,49 11 9 895 98,49 3 895 98,49 13 10 840 96,13 6 840 96,13 13 Then, the second scenario is analisys the parameter against the computational performance. By using various parameters including match, mismatch, gap opening and gap extention is obtained similarity rate of the original method as shown in Figure 10. The match, gap opening and gap extention show that the larger parameter value is given, the lower similarity is. Otherwise, the lower parameter value of mismatch, the higher similarity is. The performance measure including computational time and space complexity is evaluated. Using parameter randomly, there is reduction of computational time in average of 34.60% as in Table 2. In space compexity, it is evaluated by filling the matrix of sequence alignment and the partial result is showed as in Table 3.
  • 7. TELKOMNIKA ISSN: 1693-6930  Pairwise Sequence Alignment between HBV and HCC Using Modified … (Lailil Muflikhah) 1791 Figure 10. The effect of parameter value to similarity performance Table 2. Computational Time for Sequence Alignment Match Mismatch Gap Openning Gap Extention Time requirement for sequence alignment Modified NW Original NW Reduction (ms) Reduction (%) 1 -1 -5 -2 252474 322590 70116 21,74 1 -2 -5 -2 509065 618041 108976 17,63 1 -1 -2 -4 155513 306827 151314 49,32 1 -2 -2 -4 160639 313108 152469 48,70 2 -3 -2 -2 228456 320816 92360 28,79 2 -3 -2 -4 159961 311777 151816 48,69 2 -3 -4 -4 242231 373504 131273 35,15 2 -3 -5 -2 203316 387511 184195 47,53 2 -3 -6 -2 235820 329285 93465 28,38 Average 238608,33 364828,78 126220,44 34,60 Table 3. The Number of Filled Matrix Area Sequence # Modified NW Original NW 1 52339 99645 2 42484 89790 3 71173 118479 4 47521 94827 5 224692 271998 6 67450 114756 7 84094 131400 8 9070 36792 9 28906 76212 10 92635 139941 99.30 99.40 99.50 99.60 99.70 99.80 99.90 100.00 1 2 3 4 5 6 7 8 9 10 Similarity(%) Match score - 20.00 40.00 60.00 80.00 100.00 120.00 -10 -9 -8 -7 -6 -5 -4 -3 -2 -1 Similarity(%) Mismatch score 99.30 99.40 99.50 99.60 99.70 99.80 99.90 100.00 -10 -9 -8 -7 -6 -5 -4 -3 -2 -1 Similarity(%) Gap Opening score 99.6 99.65 99.7 99.75 99.8 99.85 99.9 99.95 100 100.05 -10 -9 -8 -7 -6 -5 -4 -3 -2 -1 Similarity(%) Gap Extension score
  • 8.  ISSN: 1693-6930 TELKOMNIKA Vol. 15, No. 4, December 2017 : 1785 – 1793 1792 Furthermore, the modified method is evaluated for the effect of total sequences alignment against the performance. How ever, the experimental result is showed that the total sequence is no effect to the performance as shown in Figure 11 and Figure 12. Figure 11. The performance of scoring alignment against total sequences Figure 12. The performance of similarity against total sequences 5. Conclusion The modified method of Needleman Wunch has been successfully implemented to perform the pairwise DNA sequence global alignment optimally between HBV and HCC: 1. The main principle in the modified method of Needleman Wunsch is the matrix region of filling DNA sequences for marking and backtracking process to get the sequence result. 2. The level of space and time complexity of the method are used by O(n) of the original method of O(mn). It is based on the experimental result that there is a reduction of filled matrix area of 42.52%. 3. Testing pairwise sequence alignment between hepatitis B virus and hepatocellular carcinoma DNA sequences is generated at high level of similarity rate in average value of 96 547% 6. Future Work Based on the experimental result, there are still many shortcomings that needed some repairs for further research as follows: - In this study conducted by pairwise alignment (pairs), it is advisable to do multiple sequence alignment to increase speed up the computational time. 830 840 850 860 870 880 890 10 25 50 100 150 200 300 400 500 1000 1500 2000 3000 4000 5000 Score Number of DNA sequences Score-modif Score-Ori 95.5 96 96.5 97 97.5 98 98.5 Similarity (%)Modif Similarity (%) Ori
  • 9. TELKOMNIKA ISSN: 1693-6930  Pairwise Sequence Alignment between HBV and HCC Using Modified … (Lailil Muflikhah) 1793 - This research is obtained for the scoring and similarity, therefore for further research are expected to be able to determine the position of mutation in DNA sequences. Furthermore, it can be used to early detection of carcinoma (HCC) based on the HBV DNA sequence alignment. Acknowledgement This research was financially supported by Faculty of Computer Science, Brawijaya University, Indonesia under program Superior Research Grant in Faculty. We would like to thank Nashi Widodo Ph.D., Brawijaya University, for his fruitful and guidance. References [1] Lupberger J, Hildt E. Hepatitis B Virus-induced Ocogenesis. World J.Gastroenterol. 2007; 13(1): 74- 81 [2] Saputra S, Kana, Buono, Agus, Kusuma WA. Fuzzy-based Spectral Alighment from Generation Sequencer. TELKOMNIKA, Telecomunication, Computing, Electronics and Control. 2016; 14(2): 707- 714. [3] Muflikhah L, Yuliantoro I. Identifying Cancer Disease through Deoxyribonucleic Acid (DNA) Sequential Pattern Mining. International Journal of Intelligence Science, 2017; 7: 9-23. [4] Liu Y, Schmidt B, Maskell DL. MSA-CUDA. Multiple Sequence Alignment on Graphics Processing Units with CUDA. IEEE International Conference on Application-specific Systems, Architectures and Processors. 2009: 121-128 [5] Lloyd GS. 2010. Parallel Multiple Sequence Alignment: An Overview. [6] Edgar RC, Batzoglou S. Multiple sequence alignment. Current opinion in structural biology. 2006; 16(3): 368-373. [7] Kasap S, Benkrid K, Liu Y. Design and Implementation of an FPGA-based Core for Gapped BLAST Sequence Alignment with the Two-Hit Method. Engineering Letters. 2008; 16(3). [8] Zhou Zm, Chen Z-w. Dynamic Programming for Protein Sequence Alignment. International Journal of Bio-Science and Bio-Technology. 2013; 5(2): 141-150. [9] Rahmad A., Auriza, Sukoco H., Kusuma, A.W. Comparison of Data Partitioning Schema of Parallel Pairwise Alignment on Shared Memory System. TELKOMNIKA, Telecomunication, Computing, Electronics and Control. 2015; 13(2): 694-702. [10] Satra R, Kusuma WA, Sukoco H. Accelerating Computation of DNA Multiple Sequence Alignment in Distributed Environtment. TELKOMNIKA Indonesian Journal of Electrical Engineering. 2014; 12 (12): 8278-8285. [11] A Shehab S, Keshk A, Mahgoub H. Fast Dynamic Algorithm for Sequence Alignment based on Bioinformatics. International Journal of Computer Applications. 2012; 32(7): 54-61. [12] Xiong J. Essential Bioinformatics. 2006. New York: Cambridge University Press. [13] EMBL-EBI, 2015. Pairwise Sequence Alignment. [Online] Available at: http://www.ebi.ac.uk/Tools/psa/ [ available at 8 10 2015]. [14] Nanni L, Lumini A. Generalized Needleman–Wunsch algorithm for the recognition of T-cell epitopes. Expert Systems with Applications. 2008; 35:1463–1467.