SlideShare a Scribd company logo
Indonesian Journal of Electrical Engineering and Computer Science
Vol. 5, No. 2, February 2017, pp. 472 ~ 478
DOI: 10.11591/ijeecs.v5.i2.pp472-478  472
Received September 2, 2016; Revised December 25, 2016; Accepted January 11, 2017
Classification algorithm for Gene Expression Graph
and Manhattan Distance
N. Sevugapandi
1
, C. P. Chandran
2
1
Research and Development Center, Bharathiar University, Coimbatore, Tamilnadu
2
Computer Science, Ayya Nadar Janaki Ammal College, Sivakasi, Tamilnadu
Corresponding author, e-mail: snsevugapandi@gmail.com
1
, drcpchandran@gmail.com
2
Abstract
This proposed method focus on these issues by developing a novel classification algorithm by
combining Gene Expression Graph (GEG) with Manhattan distance. This method will be used to express
the gene expression data. Gene Expression Graph provides the optimal view about the relationship
between normal and unhealthy genes. The method of using a graph-based gene expression to express
gene information was first offered by the authors in [1] and [2], It will permits to construct a classifier based
on an association between graphs represented for well-known classes and graphs represented for
samples to evaluate. Additionally Euclidean distance is used to measure the strength of relationship which
exists between the genes.
Keywords: Data Mining, DNA Micro array, Gene Ontology, KEGG pathway
Copyright © 2017 Institute of Advanced Engineering and Science. All rights reserved.
1. Introduction
1.1. Data Mining
Data mining is defined as the non-trivial process of searching and analyzing data in
order to find implicit but potentially useful information. Let D = {d1... dn} be the dataset to be
analyzed. The data mining process is described as the process of finding a subset D‟ of D and
hypotheses HU(D‟,C) about D‟ that a user U considers useful in an application context C. D‟
have fewer data elements than D, but it also have a lower dimensionality (m‟). In databases the
data is partitioned into relations or object classes. D is considered as a union of relations R1...
Rk each has its own dimensionality (m1... mk)[1].
1.2. Bioinformatics
Bioinformatics is the Science of integrating, managing, mining and interpreting
information from biological datasets at genomic, metabalomic, proteomics, phylogenetic and
cellular or whole organism levels.
According to (National Institute of Health) NIH organization, the Bioinformatics and
Computational Biology have been defined as “Bioinformatics is research and development or
application of computational tools and approaches for expanding the use of biological, medical,
health data including those to acquire store, organize, active, analyze or visualize such data” [2].
1.3. Genomics
DNA (Deoxyribonucleic Acid) is a molecule encoding the genetic instructions used in
the development and functioning of all known living organisms many viruses. DNA is one of the
three major macromolecules that are essential for all known forms of life.
Genetic information is encoded as a sequence of nucleotides
(Guanine, Adenine, Thymine, and Cytosine) recorded using the letters G, A, T, and C. Most
DNA molecules are double-stranded helices, consisting of two long polymers of simple units
called nucleotides with the nucleo bases (G, A, T, C) attached to the sugars. DNA is well-suited
for biological information storage, since the DNA backbone is resistant to cleavage and the
double-stranded structure provides the molecule with a built-in duplicate of the encoded
information [3].
IJEECS ISSN: 2502-4752 
Classification algorithm for Gene Expression Graph and Manhattan… (N. Sevugapandi)
473
Figure 1. Structure of DNA
1.3.1. Nucleic Acids
Nucleic acids are large biological molecules essential for all known forms of life. They
include DNA. Together with proteins, nucleic acids are the most important biological
macromolecules; each is found in abundance in all living things, where they function in
encoding, transmitting and expressing genetic information.
The nucleic acids Deoxyribonucleic acid DNA are polymers of nucleotides, arranged in
a specific sequence. To form macromolecular polymers, nucleotides are joined between the 3‟
and 5‟ carbon atoms in their sugar moiety by a phosphodiester bond, giving rise to a nucleic
acid with a sugar–phosphate „backbone‟ to which is attached a series of bases in a specific
order. Hydrogen bonding between pairs of bases can occur, leading to the formation of double-
stranded polymers if the sequences are complementary [4].
1.3.2. Base pairs
Basepairs are the building blocks of the DNA double helix, and contribute to the folded
structure of both DNA. Dictated by specific hydrogen bonding patterns, Watson-Crick base pairs
(Guanine-Cytosine and Adenine-Thymine) allow the DNA helix to maintain a regular helical
structure that is independent of its nucleotide sequence. The complementary nature of this
based-paired structure provides a backup copy of all genetic information encoded within double-
stranded DNA. Figure 2 shows the pairs of ATGC.
The regular structure and data redundancy provided by the DNA helix make DNA an
optimal molecule for the storage of genetic information, while base-pairing between DNA and
incoming nucleotides provide the mechanism through which DNA polymerase replicates
DNA transcribes. Many DNA-binding proteins can recognize specific base pairing patterns that
identify particular regulatory regions of genes.
 ISSN: 2502-4752
IJEECS Vol. 5, No. 2, February 2017 : 472 – 478
474
Figure 2. Base pairs of ATGC
1.3.3. Gene Expression Data
Gene expression refers to a complex series of processes in which the information
encoded in a gene is used to produce a functional product such as a protein that dictates cell
function. It involves several different steps through which DNA is converted to an RNA which in
turn is converted into a protein or in some cases RNA, for example, genes encoding the
necessary information for transfer RNAs and ribosomal RNAs (tRNAs and rRNAs). The
information flow from DNA to RNA to protein can be controlled at several points helping the cell
to adjust the quality and quantity of resulting proteins and thus self-regulate its functions. Thus,
regulation of gene expression is a critical step in determining what kind of proteins and how
much of each protein is expressed in a cell.
1.3.4. DNA Motifs
Complex designs are often created using a relatively small set of common building
blocks called motifs. DNA self-assembly can exploit this same design principle to hierarchically
create more sophisticated aperiodic structures.
There are many possible DNA motifs and the focus here is on only a few in the context
of the target nanostructure. Motifs include junctions that enable three or more double stranded
helices of DNA to interact and thus form specific structures (e.g., a triangle, a corner, and so
on). Another important motif is a single strand of DNA protruding from a doublestranded helix
called a sticky-end.
Two motifs with complementary sequences on their sticky-ends will bind to form a
composite motif. Composite motifs may also have embedded sticky-end motifs and thus can
also bind with other composite motifs to form another, larger, composite motif. This results in a
hierarchical structure for motifs.
1.3.5. Various Gene Expression Techniques
Analytical methods may be used to examine mRNA expression levels or differential
mRNA expression. Some examples of these techniques are listed below.
Serial Analysis of Gene Expression (SAGE): SAGE is a technique used to create a
library of short sequence tags which can each be used to detect a transcript. The expression
level of the transcript can be determined by assessing how many times each tag is detected.
This technology enables comprehensive expression analysis across the genome [5].
DNA microarray: Also known of as biochip or DNA chip, a DNA microarray is a solid
surface to which a collection of microscopic DNA spots are attached. The microarrays are used
to determine expression levels across a large number of genes or to perform genotyping across
different regions of a genome [6].
RNA Seq: This refers to methods used to measure the sequence of RNA molecules.
Examples include shotgun sequencing of cDNA molecules acquired from RNA through reverse
transcription and technologies used to sequences.
RNA molecules from a biological sample so that the primary sequence and abundance
of each RNA molecule can be determined.
Tiling arrays: A tiling array is a type of microarray chip, with labelled DNA or RNA
targets hybridized to probes attached to a solid surface. However, the probes used differ to
IJEECS ISSN: 2502-4752 
Classification algorithm for Gene Expression Graph and Manhattan… (N. Sevugapandi)
475
those used with traditional microarrays. Rather than known sequences or predicted genes being
probed, tiling arrays probe for sequences known to be present in a contigious region.
2. Previous Works
J. Breiman, et al., [3] proposed model was used for formulating the prediction model
from the car dataset. This prediction model was made by continuously portioning the data in
data space. At the last, the portioned result is shown as a decision tree. Classification tree was
based on Decision tree, which was typically intended for dependent variables to acquire a
countable number of random values along with possible prediction error. The prediction error
was considered in terms of penalty for wrong classification. Regression tree algorithms are likely
as classification tree algorithms for dependent variables to acquire continuous or ordered
distinct values. In regression tree method, prediction error will be calculated by measuring the
squared variation among the actual and predicted values [7].
H. Zhang, et al., [4] proposed a deterministic forest methodology by means of
continuous tree partition from microarray gene expression data. The key attribute for this
methodology is to get better prediction accuracy rate and also to identify the appropriate genes
for selecting tumor cell among datasets. Deterministic classification is more likely as random
forest tree algorithm but deterministic provide accurate classification when compared with
random forest. In deterministic classification, initial level classification of gene population is
carried out by choosing predetermined number for splitting criterion. Then perform scrutinizing
operation on repeatedly occurring pairs of genes to get more needed data from dataset. This
scrutinizing process should disclose the value of the genes in classifying cancer tissues in the
relevant data sets [8].
V. N. Vapnik, et al., [5] proposed a well-known classification algorithm called support
vector machine (SVM) algorithm to collect necessary information from DNA microarray data.
This algorithm is famous for its accuracy and robust nature. The SVM will provide better
performance while using multi category of biological scrutiny in cancer diagnosis from
microarray gene expression data. It is capable of analyzing both sample and gene expression
data in order to explore the wrongly classified data result. SVM provide significant performance.
The main drawback of this method, that SVM failed to detect out of class samples and also
restricts to provide inadequate clinical diagnostics application [9].
J. Khan, et al., [6] purpose of this paper is to develop a model for classifying cancers
into exact diagnostic groups based on gene expression data by means of artificial neural
networks (ANNs). Initially ANNs was trained by using round blue-cell tumors. This cancer
diagnostic can fit into four different categories. Based on this training, ANNs appropriately
classified the entire sample gene and also identify the most relevant gene for classification. This
analysis had shown the possible applications of these ANNs method for cancer diagnosis.
3. Results and Discussion
The proposed weight based gene expression graph performs classification of diseases
by means of gene expression data along with carrying weight at the edge. In graph G(V,E),
where V represents the gene and the E represents the weighted edge. The weight present in
the edge determines the strength of relationship among the gene.
The graph is built from untreated microarray scanned image data to go through the
following stages such as preprocessing, data modeling, classification and validation. In
microarray technology, gene is identified by types of expression levels. The first level, cy3
represents the healthy condition of gene and the second expression level, cy5 represents the
diseased condition of gene. This gene expression is usually attained from the n samples are
arranged as matrix called gene expression matrix. In preprocessing stage, this is done by
means of normalization techniques.
Normalization is particularly useful to train the input values for classification algorithm,
this normalization techniques helps to speed up the learning phase in classification algorithm.
There are many methods for data normalization. For that, we consider the Standardized normal
distribution techniques along with LN ratios of each gene. It can be computed by using equation
(1), (2), (3) and (4).
 ISSN: 2502-4752
IJEECS Vol. 5, No. 2, February 2017 : 472 – 478
476
LNratio(x)=
⁄
(1)
Mg = (2)
Sg = = | | | | | | | | (3)
Z(LNratio(x), , )= (4)
Where, Mg represents the mean, represents the standard deviation and LNratio(x)
represents the input value x. For example, Table 1 Consider the expression levels training set to
compute standardized normal distribution.
Table 1. Expression levels training set
Gene
Tissue
a
Diseased Healthy
b
Diseased Healthy
C
Diseased Healthy
Sample 1 30 10000 100 12000 2000 10
Sample 2 50 30000 50 2000 1000 1099
Sample 3 30 5000 13000 80 4000 150
Sample 4 100 6000 80 15000 6000 25
Table 2. LNratio and Normalization of the training data set
Gene LNratio(x) Normalization(LNratio(x), , ) Mean Standard Deviation
a B A b
S1 -8.31 -6.85 -0.86 -0.64 -2.53 6.74
S2 -9.15 -5.28 -1.37 -0.13 -4.85 3.15
S3 -7.32 7.28 -1.5 0.97 1.55 5.92
S4 -5.86 -7.49 -0.62 -0.88 -1.83 6.45
Then relevancy between genes is computed by finding the relevancy of each gene by
calculating Relevance Count (RC). Based on TRC value, the gene relevancy is categorized into
3 types. They are more relevant gene (1), quieted gene (-1) and irrelevant gene (0). RC is
calculated by using threshold, cy5 and cy3 values. A threshold can be specified by using well
known methods for example, t test. Total Relevancy Count (TRC) is by doing summation of all
RC value in each gene. Table 3 shows the TC value for each gene.
RC ( , cy5, cy3) = {
( )
( )
( )
(5)
Table 3. TRC value for threshold
Gene A b c
Sample 1 -1 -1 1
Sample 2 -1 0 1
Sample 3 -1 1 1
Sample 4 -1 -1 1
TRC -4 -1 4
The weight ( ) between the genes are calculated by applying Modified Manhattan
distance formula. It is defined in equation (6).
IJEECS ISSN: 2502-4752 
Classification algorithm for Gene Expression Graph and Manhattan… (N. Sevugapandi)
477
WG ( ) = | | +| |+| | +….. +| | (6)
Table 4. Represents the weight for gene expression tree as matrix
Gene A B c
A 0 3 4
B 3 0 3
C 4 3 0
5. Conclusion
This paper proposes a Modified Manhattan distance based weighted GEG classifier to
classify a gene expression data. The experimental outcome confirms that the efficiency of the
proposed method gives better result than that of current methods. The proposed method can
also appropriately distinguish out-of-class samples. In addition to that, the proposed method
could properly classify samples in the relevant classes. The proposed method is going to
improve graph based data structure by adding Euclidean distance to determine the relationships
among genes. In this method, relevant genes are extracted by means of weight assigned to
each gene, where larger weights point out a healthier relationship between two genes. As a
result, the proposed method can decrease the cost acquired by classifying inappropriate genes.
One of the major works of the proposed method having ability to appropriately categorizes into
the relevant classes and also properly distinguishes out of class samples. In addition, it will
significantly reduce computation time taken for classifying the data.
References
[1] A Benso, S DiCarlo, G Politano, L Sterpone. A Graph Based Representation of Gene Expression
Profiles in DNA Microarrays. Proc. IEEE Symp. Computational Intelligence in Bioinformatics and
Computational Biology (CIBCB). 2008: 75-82.
[2] Benso, S Di Carlo, G Politano, L Sterpone. Differential Gene Expression Graphs: A Data Structure
for Classification in DNA Microarrays. Proc. Eighth IEEE Int‟l Conf. BioInformatics and BioEng.
(BIBE). 2008.
[3] J Breiman, L ad Friedman, CJ Stone, R Olshen. Classification and Regression Trees. New York,
NY, USA: Talyor and Francis. 1984.
[4] H Zhang, CY Yu, B Singer. Cell and tumor classification using gene expression data: Construction of
forests. Proc. Nat. Acad. Sci. USA. 2003; 100(7): 4168-4172.
[5] VN Vapnik. An Overview of Statistical Learning Theory. IEEE Trans. Neural Networks. 1999; 10(5):
988-999.
[6] J Khan, JS Wei, M Ringner, LH Saal, M Ladanyi, F Westermann, F Berthold, M Schwab, CR
Antonescu, C Peterson, PS Meltzer. Classification and diagnostic prediction of cancers using gene
expression profiling and artificial neural networks. Nature Med. 2001; 7(6): 673-679.
[7] Benso, S Di Carlo, G Politano. A cDNA microarray gene expression data classifier for clinical
diagnostics based on graph theory. IEEE/ACM Trans. Comput. Biol. Bioinformat. 2011; 8(3): 577-
591.
 ISSN: 2502-4752
IJEECS Vol. 5, No. 2, February 2017 : 472 – 478
478

More Related Content

What's hot

Presentation
PresentationPresentation
Presentation
sidra ali
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
IJSTA
 
Particle Swarm Optimization for Gene cluster Identification
Particle Swarm Optimization for Gene cluster IdentificationParticle Swarm Optimization for Gene cluster Identification
Particle Swarm Optimization for Gene cluster Identification
Editor IJCATR
 
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
International Journal of Engineering Inventions www.ijeijournal.com
 
Power spectrum sequence analysis of rheumatic arthritis (ra disease using dsp...
Power spectrum sequence analysis of rheumatic arthritis (ra disease using dsp...Power spectrum sequence analysis of rheumatic arthritis (ra disease using dsp...
Power spectrum sequence analysis of rheumatic arthritis (ra disease using dsp...
eSAT Journals
 
Power spectrum sequence analysis of rheumatic
Power spectrum sequence analysis of rheumaticPower spectrum sequence analysis of rheumatic
Power spectrum sequence analysis of rheumatic
eSAT Publishing House
 
Improving DNA Barcode-based Fish Identification System on Imbalanced Data usi...
Improving DNA Barcode-based Fish Identification System on Imbalanced Data usi...Improving DNA Barcode-based Fish Identification System on Imbalanced Data usi...
Improving DNA Barcode-based Fish Identification System on Imbalanced Data usi...
TELKOMNIKA JOURNAL
 
H43014046
H43014046H43014046
H43014046
IJERA Editor
 
Deep learning for extracting protein-protein interactions from biomedical lit...
Deep learning for extracting protein-protein interactions from biomedical lit...Deep learning for extracting protein-protein interactions from biomedical lit...
Deep learning for extracting protein-protein interactions from biomedical lit...
Yifan Peng
 
A Study on DNA based Computation and Memory Devices
A Study on DNA based Computation and Memory DevicesA Study on DNA based Computation and Memory Devices
A Study on DNA based Computation and Memory Devices
Editor IJCATR
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...
ijdms
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformaticsAbhishek Vatsa
 
Bioinformatics.Assignment
Bioinformatics.AssignmentBioinformatics.Assignment
Bioinformatics.AssignmentNaima Tahsin
 
A clonal based algorithm for the reconstruction of genetic network using s sy...
A clonal based algorithm for the reconstruction of genetic network using s sy...A clonal based algorithm for the reconstruction of genetic network using s sy...
A clonal based algorithm for the reconstruction of genetic network using s sy...
eSAT Journals
 
A clonal based algorithm for the reconstruction of
A clonal based algorithm for the reconstruction ofA clonal based algorithm for the reconstruction of
A clonal based algorithm for the reconstruction of
eSAT Publishing House
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
Ali Kishk
 

What's hot (20)

Predicting Functional Regions in Genomic DNA Sequences Using Artificial Neur...
Predicting Functional Regions in Genomic DNA Sequences Using  Artificial Neur...Predicting Functional Regions in Genomic DNA Sequences Using  Artificial Neur...
Predicting Functional Regions in Genomic DNA Sequences Using Artificial Neur...
 
Presentation
PresentationPresentation
Presentation
 
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA SequenceEfficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
Efficiency of Using Sequence Discovery for Polymorphism in DNA Sequence
 
Particle Swarm Optimization for Gene cluster Identification
Particle Swarm Optimization for Gene cluster IdentificationParticle Swarm Optimization for Gene cluster Identification
Particle Swarm Optimization for Gene cluster Identification
 
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
Automatic Parallelization for Parallel Architectures Using Smith Waterman Alg...
 
Power spectrum sequence analysis of rheumatic arthritis (ra disease using dsp...
Power spectrum sequence analysis of rheumatic arthritis (ra disease using dsp...Power spectrum sequence analysis of rheumatic arthritis (ra disease using dsp...
Power spectrum sequence analysis of rheumatic arthritis (ra disease using dsp...
 
Power spectrum sequence analysis of rheumatic
Power spectrum sequence analysis of rheumaticPower spectrum sequence analysis of rheumatic
Power spectrum sequence analysis of rheumatic
 
10.1.1.80.2149
10.1.1.80.214910.1.1.80.2149
10.1.1.80.2149
 
Improving DNA Barcode-based Fish Identification System on Imbalanced Data usi...
Improving DNA Barcode-based Fish Identification System on Imbalanced Data usi...Improving DNA Barcode-based Fish Identification System on Imbalanced Data usi...
Improving DNA Barcode-based Fish Identification System on Imbalanced Data usi...
 
H43014046
H43014046H43014046
H43014046
 
Deep learning for extracting protein-protein interactions from biomedical lit...
Deep learning for extracting protein-protein interactions from biomedical lit...Deep learning for extracting protein-protein interactions from biomedical lit...
Deep learning for extracting protein-protein interactions from biomedical lit...
 
A Study on DNA based Computation and Memory Devices
A Study on DNA based Computation and Memory DevicesA Study on DNA based Computation and Memory Devices
A Study on DNA based Computation and Memory Devices
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...
 
Myers CV_2015
Myers CV_2015Myers CV_2015
Myers CV_2015
 
Basics of bioinformatics
Basics of bioinformaticsBasics of bioinformatics
Basics of bioinformatics
 
Bioinformatics.Assignment
Bioinformatics.AssignmentBioinformatics.Assignment
Bioinformatics.Assignment
 
A clonal based algorithm for the reconstruction of genetic network using s sy...
A clonal based algorithm for the reconstruction of genetic network using s sy...A clonal based algorithm for the reconstruction of genetic network using s sy...
A clonal based algorithm for the reconstruction of genetic network using s sy...
 
A clonal based algorithm for the reconstruction of
A clonal based algorithm for the reconstruction ofA clonal based algorithm for the reconstruction of
A clonal based algorithm for the reconstruction of
 
MoM2010: Bioinformatics
MoM2010: BioinformaticsMoM2010: Bioinformatics
MoM2010: Bioinformatics
 
AI in Bioinformatics
AI in BioinformaticsAI in Bioinformatics
AI in Bioinformatics
 

Similar to 27 20 dec16 13794 28120-1-sm(edit)genap

Determining Evolutionary Relationships Using BLAST
Determining Evolutionary Relationships Using BLASTDetermining Evolutionary Relationships Using BLAST
Determining Evolutionary Relationships Using BLAST
Danielle Snowflack
 
Analysis of Genomic and Proteomic Sequence Using Fir Filter
Analysis of Genomic and Proteomic Sequence Using Fir FilterAnalysis of Genomic and Proteomic Sequence Using Fir Filter
Analysis of Genomic and Proteomic Sequence Using Fir Filter
IJMER
 
DATA ENCRYPTION USING BIO MOLECULAR INFORMATION
DATA ENCRYPTION USING BIO MOLECULAR INFORMATIONDATA ENCRYPTION USING BIO MOLECULAR INFORMATION
DATA ENCRYPTION USING BIO MOLECULAR INFORMATION
ijcisjournal
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
IJERD Editor
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
rahulmonikasharma
 
Parallel analysis of gene expression
Parallel analysis of gene expressionParallel analysis of gene expression
Parallel analysis of gene expression
Hema Mallika
 
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
cscpconf
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferation
IJAEMSJORNAL
 
Comparative analysis of dynamic programming
Comparative analysis of dynamic programmingComparative analysis of dynamic programming
Comparative analysis of dynamic programming
eSAT Publishing House
 
Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...
eSAT Journals
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
Piyush Mishra
 
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORKCLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
International Research Journal of Modernization in Engineering Technology and Science
 
RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing ResearchTanmay Ghai
 
31931 31941
31931 3194131931 31941
31931 31941
Amit Gupta
 
An Survey On DNA Based Cryptography
An Survey On DNA Based CryptographyAn Survey On DNA Based Cryptography
An Survey On DNA Based Cryptography
Yasmine Anino
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
MSCW Mysore
 
Encode Project
Encode ProjectEncode Project
Encode Project
Anarghya Hegde
 
DNA CHIPS AND MICROARRAY.pptx
DNA CHIPS AND MICROARRAY.pptxDNA CHIPS AND MICROARRAY.pptx
DNA CHIPS AND MICROARRAY.pptx
Shabnum
 
Stock markets and_human_genomics
Stock markets and_human_genomicsStock markets and_human_genomics
Stock markets and_human_genomicsShyam Sarkar
 

Similar to 27 20 dec16 13794 28120-1-sm(edit)genap (20)

Determining Evolutionary Relationships Using BLAST
Determining Evolutionary Relationships Using BLASTDetermining Evolutionary Relationships Using BLAST
Determining Evolutionary Relationships Using BLAST
 
Analysis of Genomic and Proteomic Sequence Using Fir Filter
Analysis of Genomic and Proteomic Sequence Using Fir FilterAnalysis of Genomic and Proteomic Sequence Using Fir Filter
Analysis of Genomic and Proteomic Sequence Using Fir Filter
 
DATA ENCRYPTION USING BIO MOLECULAR INFORMATION
DATA ENCRYPTION USING BIO MOLECULAR INFORMATIONDATA ENCRYPTION USING BIO MOLECULAR INFORMATION
DATA ENCRYPTION USING BIO MOLECULAR INFORMATION
 
International Journal of Engineering Research and Development
International Journal of Engineering Research and DevelopmentInternational Journal of Engineering Research and Development
International Journal of Engineering Research and Development
 
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
Clustering Approaches for Evaluation and Analysis on Formal Gene Expression C...
 
Parallel analysis of gene expression
Parallel analysis of gene expressionParallel analysis of gene expression
Parallel analysis of gene expression
 
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
SIMILARITY ANALYSIS OF DNA SEQUENCES BASED ON THE CHEMICAL PROPERTIES OF NUCL...
 
RT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferationRT-PCR and DNA microarray measurement of mRNA cell proliferation
RT-PCR and DNA microarray measurement of mRNA cell proliferation
 
Comparative analysis of dynamic programming
Comparative analysis of dynamic programmingComparative analysis of dynamic programming
Comparative analysis of dynamic programming
 
Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...Comparative analysis of dynamic programming algorithms to find similarity in ...
Comparative analysis of dynamic programming algorithms to find similarity in ...
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
 
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORKCLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
CLASSIFICATION OF CANCER BY GENE EXPRESSION USING NEURAL NETWORK
 
RNA Sequencing Research
RNA Sequencing ResearchRNA Sequencing Research
RNA Sequencing Research
 
project
projectproject
project
 
31931 31941
31931 3194131931 31941
31931 31941
 
An Survey On DNA Based Cryptography
An Survey On DNA Based CryptographyAn Survey On DNA Based Cryptography
An Survey On DNA Based Cryptography
 
LECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICSLECTURE NOTES ON BIOINFORMATICS
LECTURE NOTES ON BIOINFORMATICS
 
Encode Project
Encode ProjectEncode Project
Encode Project
 
DNA CHIPS AND MICROARRAY.pptx
DNA CHIPS AND MICROARRAY.pptxDNA CHIPS AND MICROARRAY.pptx
DNA CHIPS AND MICROARRAY.pptx
 
Stock markets and_human_genomics
Stock markets and_human_genomicsStock markets and_human_genomics
Stock markets and_human_genomics
 

More from nooriasukmaningtyas

Development of depth map from stereo images using sum of absolute differences...
Development of depth map from stereo images using sum of absolute differences...Development of depth map from stereo images using sum of absolute differences...
Development of depth map from stereo images using sum of absolute differences...
nooriasukmaningtyas
 
Model predictive controller for a retrofitted heat exchanger temperature cont...
Model predictive controller for a retrofitted heat exchanger temperature cont...Model predictive controller for a retrofitted heat exchanger temperature cont...
Model predictive controller for a retrofitted heat exchanger temperature cont...
nooriasukmaningtyas
 
Control of a servo-hydraulic system utilizing an extended wavelet functional ...
Control of a servo-hydraulic system utilizing an extended wavelet functional ...Control of a servo-hydraulic system utilizing an extended wavelet functional ...
Control of a servo-hydraulic system utilizing an extended wavelet functional ...
nooriasukmaningtyas
 
Decentralised optimal deployment of mobile underwater sensors for covering la...
Decentralised optimal deployment of mobile underwater sensors for covering la...Decentralised optimal deployment of mobile underwater sensors for covering la...
Decentralised optimal deployment of mobile underwater sensors for covering la...
nooriasukmaningtyas
 
Evaluation quality of service for internet of things based on fuzzy logic: a ...
Evaluation quality of service for internet of things based on fuzzy logic: a ...Evaluation quality of service for internet of things based on fuzzy logic: a ...
Evaluation quality of service for internet of things based on fuzzy logic: a ...
nooriasukmaningtyas
 
Low power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniquesLow power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniques
nooriasukmaningtyas
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
nooriasukmaningtyas
 
Smart monitoring system using NodeMCU for maintenance of production machines
Smart monitoring system using NodeMCU for maintenance of production machinesSmart monitoring system using NodeMCU for maintenance of production machines
Smart monitoring system using NodeMCU for maintenance of production machines
nooriasukmaningtyas
 
Design and simulation of a software defined networkingenabled smart switch, f...
Design and simulation of a software defined networkingenabled smart switch, f...Design and simulation of a software defined networkingenabled smart switch, f...
Design and simulation of a software defined networkingenabled smart switch, f...
nooriasukmaningtyas
 
Efficient wireless power transmission to remote the sensor in restenosis coro...
Efficient wireless power transmission to remote the sensor in restenosis coro...Efficient wireless power transmission to remote the sensor in restenosis coro...
Efficient wireless power transmission to remote the sensor in restenosis coro...
nooriasukmaningtyas
 
Grid reactive voltage regulation and cost optimization for electric vehicle p...
Grid reactive voltage regulation and cost optimization for electric vehicle p...Grid reactive voltage regulation and cost optimization for electric vehicle p...
Grid reactive voltage regulation and cost optimization for electric vehicle p...
nooriasukmaningtyas
 
Topology network effects for double synchronized switch harvesting circuit on...
Topology network effects for double synchronized switch harvesting circuit on...Topology network effects for double synchronized switch harvesting circuit on...
Topology network effects for double synchronized switch harvesting circuit on...
nooriasukmaningtyas
 
Improving the design of super-lift Luo converter using hybrid switching capac...
Improving the design of super-lift Luo converter using hybrid switching capac...Improving the design of super-lift Luo converter using hybrid switching capac...
Improving the design of super-lift Luo converter using hybrid switching capac...
nooriasukmaningtyas
 
Third harmonic current minimization using third harmonic blocking transformer
Third harmonic current minimization using third harmonic blocking transformerThird harmonic current minimization using third harmonic blocking transformer
Third harmonic current minimization using third harmonic blocking transformer
nooriasukmaningtyas
 
Power quality improvement of distribution systems asymmetry caused by power d...
Power quality improvement of distribution systems asymmetry caused by power d...Power quality improvement of distribution systems asymmetry caused by power d...
Power quality improvement of distribution systems asymmetry caused by power d...
nooriasukmaningtyas
 
Studies enhancement of transient stability by single machine infinite bus sys...
Studies enhancement of transient stability by single machine infinite bus sys...Studies enhancement of transient stability by single machine infinite bus sys...
Studies enhancement of transient stability by single machine infinite bus sys...
nooriasukmaningtyas
 
Renewable energy based dynamic tariff system for domestic load management
Renewable energy based dynamic tariff system for domestic load managementRenewable energy based dynamic tariff system for domestic load management
Renewable energy based dynamic tariff system for domestic load management
nooriasukmaningtyas
 
Energy harvesting maximization by integration of distributed generation based...
Energy harvesting maximization by integration of distributed generation based...Energy harvesting maximization by integration of distributed generation based...
Energy harvesting maximization by integration of distributed generation based...
nooriasukmaningtyas
 
Intelligent fault diagnosis for power distribution systemcomparative studies
Intelligent fault diagnosis for power distribution systemcomparative studiesIntelligent fault diagnosis for power distribution systemcomparative studies
Intelligent fault diagnosis for power distribution systemcomparative studies
nooriasukmaningtyas
 
A deep learning approach based on stochastic gradient descent and least absol...
A deep learning approach based on stochastic gradient descent and least absol...A deep learning approach based on stochastic gradient descent and least absol...
A deep learning approach based on stochastic gradient descent and least absol...
nooriasukmaningtyas
 

More from nooriasukmaningtyas (20)

Development of depth map from stereo images using sum of absolute differences...
Development of depth map from stereo images using sum of absolute differences...Development of depth map from stereo images using sum of absolute differences...
Development of depth map from stereo images using sum of absolute differences...
 
Model predictive controller for a retrofitted heat exchanger temperature cont...
Model predictive controller for a retrofitted heat exchanger temperature cont...Model predictive controller for a retrofitted heat exchanger temperature cont...
Model predictive controller for a retrofitted heat exchanger temperature cont...
 
Control of a servo-hydraulic system utilizing an extended wavelet functional ...
Control of a servo-hydraulic system utilizing an extended wavelet functional ...Control of a servo-hydraulic system utilizing an extended wavelet functional ...
Control of a servo-hydraulic system utilizing an extended wavelet functional ...
 
Decentralised optimal deployment of mobile underwater sensors for covering la...
Decentralised optimal deployment of mobile underwater sensors for covering la...Decentralised optimal deployment of mobile underwater sensors for covering la...
Decentralised optimal deployment of mobile underwater sensors for covering la...
 
Evaluation quality of service for internet of things based on fuzzy logic: a ...
Evaluation quality of service for internet of things based on fuzzy logic: a ...Evaluation quality of service for internet of things based on fuzzy logic: a ...
Evaluation quality of service for internet of things based on fuzzy logic: a ...
 
Low power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniquesLow power architecture of logic gates using adiabatic techniques
Low power architecture of logic gates using adiabatic techniques
 
A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...A review on techniques and modelling methodologies used for checking electrom...
A review on techniques and modelling methodologies used for checking electrom...
 
Smart monitoring system using NodeMCU for maintenance of production machines
Smart monitoring system using NodeMCU for maintenance of production machinesSmart monitoring system using NodeMCU for maintenance of production machines
Smart monitoring system using NodeMCU for maintenance of production machines
 
Design and simulation of a software defined networkingenabled smart switch, f...
Design and simulation of a software defined networkingenabled smart switch, f...Design and simulation of a software defined networkingenabled smart switch, f...
Design and simulation of a software defined networkingenabled smart switch, f...
 
Efficient wireless power transmission to remote the sensor in restenosis coro...
Efficient wireless power transmission to remote the sensor in restenosis coro...Efficient wireless power transmission to remote the sensor in restenosis coro...
Efficient wireless power transmission to remote the sensor in restenosis coro...
 
Grid reactive voltage regulation and cost optimization for electric vehicle p...
Grid reactive voltage regulation and cost optimization for electric vehicle p...Grid reactive voltage regulation and cost optimization for electric vehicle p...
Grid reactive voltage regulation and cost optimization for electric vehicle p...
 
Topology network effects for double synchronized switch harvesting circuit on...
Topology network effects for double synchronized switch harvesting circuit on...Topology network effects for double synchronized switch harvesting circuit on...
Topology network effects for double synchronized switch harvesting circuit on...
 
Improving the design of super-lift Luo converter using hybrid switching capac...
Improving the design of super-lift Luo converter using hybrid switching capac...Improving the design of super-lift Luo converter using hybrid switching capac...
Improving the design of super-lift Luo converter using hybrid switching capac...
 
Third harmonic current minimization using third harmonic blocking transformer
Third harmonic current minimization using third harmonic blocking transformerThird harmonic current minimization using third harmonic blocking transformer
Third harmonic current minimization using third harmonic blocking transformer
 
Power quality improvement of distribution systems asymmetry caused by power d...
Power quality improvement of distribution systems asymmetry caused by power d...Power quality improvement of distribution systems asymmetry caused by power d...
Power quality improvement of distribution systems asymmetry caused by power d...
 
Studies enhancement of transient stability by single machine infinite bus sys...
Studies enhancement of transient stability by single machine infinite bus sys...Studies enhancement of transient stability by single machine infinite bus sys...
Studies enhancement of transient stability by single machine infinite bus sys...
 
Renewable energy based dynamic tariff system for domestic load management
Renewable energy based dynamic tariff system for domestic load managementRenewable energy based dynamic tariff system for domestic load management
Renewable energy based dynamic tariff system for domestic load management
 
Energy harvesting maximization by integration of distributed generation based...
Energy harvesting maximization by integration of distributed generation based...Energy harvesting maximization by integration of distributed generation based...
Energy harvesting maximization by integration of distributed generation based...
 
Intelligent fault diagnosis for power distribution systemcomparative studies
Intelligent fault diagnosis for power distribution systemcomparative studiesIntelligent fault diagnosis for power distribution systemcomparative studies
Intelligent fault diagnosis for power distribution systemcomparative studies
 
A deep learning approach based on stochastic gradient descent and least absol...
A deep learning approach based on stochastic gradient descent and least absol...A deep learning approach based on stochastic gradient descent and least absol...
A deep learning approach based on stochastic gradient descent and least absol...
 

Recently uploaded

Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
JoytuBarua2
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
Osamah Alsalih
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
BrazilAccount1
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
ongomchris
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
AafreenAbuthahir2
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
seandesed
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
Vijay Dialani, PhD
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
AhmedHussein950959
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
Amil Baba Dawood bangali
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
MdTanvirMahtab2
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
Neometrix_Engineering_Pvt_Ltd
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
Massimo Talia
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
fxintegritypublishin
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
gerogepatton
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
karthi keyan
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
Kerry Sado
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
FluxPrime1
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
zwunae
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
Kamal Acharya
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
Kamal Acharya
 

Recently uploaded (20)

Planning Of Procurement o different goods and services
Planning Of Procurement o different goods and servicesPlanning Of Procurement o different goods and services
Planning Of Procurement o different goods and services
 
MCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdfMCQ Soil mechanics questions (Soil shear strength).pdf
MCQ Soil mechanics questions (Soil shear strength).pdf
 
AP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specificAP LAB PPT.pdf ap lab ppt no title specific
AP LAB PPT.pdf ap lab ppt no title specific
 
space technology lecture notes on satellite
space technology lecture notes on satellitespace technology lecture notes on satellite
space technology lecture notes on satellite
 
WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234WATER CRISIS and its solutions-pptx 1234
WATER CRISIS and its solutions-pptx 1234
 
Architectural Portfolio Sean Lockwood
Architectural Portfolio Sean LockwoodArchitectural Portfolio Sean Lockwood
Architectural Portfolio Sean Lockwood
 
ML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptxML for identifying fraud using open blockchain data.pptx
ML for identifying fraud using open blockchain data.pptx
 
ASME IX(9) 2007 Full Version .pdf
ASME IX(9)  2007 Full Version       .pdfASME IX(9)  2007 Full Version       .pdf
ASME IX(9) 2007 Full Version .pdf
 
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
NO1 Uk best vashikaran specialist in delhi vashikaran baba near me online vas...
 
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
Industrial Training at Shahjalal Fertilizer Company Limited (SFCL)
 
Standard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - NeometrixStandard Reomte Control Interface - Neometrix
Standard Reomte Control Interface - Neometrix
 
Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024Nuclear Power Economics and Structuring 2024
Nuclear Power Economics and Structuring 2024
 
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdfHybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
Hybrid optimization of pumped hydro system and solar- Engr. Abdul-Azeez.pdf
 
Immunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary AttacksImmunizing Image Classifiers Against Localized Adversary Attacks
Immunizing Image Classifiers Against Localized Adversary Attacks
 
CME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional ElectiveCME397 Surface Engineering- Professional Elective
CME397 Surface Engineering- Professional Elective
 
Hierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power SystemHierarchical Digital Twin of a Naval Power System
Hierarchical Digital Twin of a Naval Power System
 
DESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docxDESIGN A COTTON SEED SEPARATION MACHINE.docx
DESIGN A COTTON SEED SEPARATION MACHINE.docx
 
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
一比一原版(IIT毕业证)伊利诺伊理工大学毕业证成绩单专业办理
 
Student information management system project report ii.pdf
Student information management system project report ii.pdfStudent information management system project report ii.pdf
Student information management system project report ii.pdf
 
Final project report on grocery store management system..pdf
Final project report on grocery store management system..pdfFinal project report on grocery store management system..pdf
Final project report on grocery store management system..pdf
 

27 20 dec16 13794 28120-1-sm(edit)genap

  • 1. Indonesian Journal of Electrical Engineering and Computer Science Vol. 5, No. 2, February 2017, pp. 472 ~ 478 DOI: 10.11591/ijeecs.v5.i2.pp472-478  472 Received September 2, 2016; Revised December 25, 2016; Accepted January 11, 2017 Classification algorithm for Gene Expression Graph and Manhattan Distance N. Sevugapandi 1 , C. P. Chandran 2 1 Research and Development Center, Bharathiar University, Coimbatore, Tamilnadu 2 Computer Science, Ayya Nadar Janaki Ammal College, Sivakasi, Tamilnadu Corresponding author, e-mail: snsevugapandi@gmail.com 1 , drcpchandran@gmail.com 2 Abstract This proposed method focus on these issues by developing a novel classification algorithm by combining Gene Expression Graph (GEG) with Manhattan distance. This method will be used to express the gene expression data. Gene Expression Graph provides the optimal view about the relationship between normal and unhealthy genes. The method of using a graph-based gene expression to express gene information was first offered by the authors in [1] and [2], It will permits to construct a classifier based on an association between graphs represented for well-known classes and graphs represented for samples to evaluate. Additionally Euclidean distance is used to measure the strength of relationship which exists between the genes. Keywords: Data Mining, DNA Micro array, Gene Ontology, KEGG pathway Copyright © 2017 Institute of Advanced Engineering and Science. All rights reserved. 1. Introduction 1.1. Data Mining Data mining is defined as the non-trivial process of searching and analyzing data in order to find implicit but potentially useful information. Let D = {d1... dn} be the dataset to be analyzed. The data mining process is described as the process of finding a subset D‟ of D and hypotheses HU(D‟,C) about D‟ that a user U considers useful in an application context C. D‟ have fewer data elements than D, but it also have a lower dimensionality (m‟). In databases the data is partitioned into relations or object classes. D is considered as a union of relations R1... Rk each has its own dimensionality (m1... mk)[1]. 1.2. Bioinformatics Bioinformatics is the Science of integrating, managing, mining and interpreting information from biological datasets at genomic, metabalomic, proteomics, phylogenetic and cellular or whole organism levels. According to (National Institute of Health) NIH organization, the Bioinformatics and Computational Biology have been defined as “Bioinformatics is research and development or application of computational tools and approaches for expanding the use of biological, medical, health data including those to acquire store, organize, active, analyze or visualize such data” [2]. 1.3. Genomics DNA (Deoxyribonucleic Acid) is a molecule encoding the genetic instructions used in the development and functioning of all known living organisms many viruses. DNA is one of the three major macromolecules that are essential for all known forms of life. Genetic information is encoded as a sequence of nucleotides (Guanine, Adenine, Thymine, and Cytosine) recorded using the letters G, A, T, and C. Most DNA molecules are double-stranded helices, consisting of two long polymers of simple units called nucleotides with the nucleo bases (G, A, T, C) attached to the sugars. DNA is well-suited for biological information storage, since the DNA backbone is resistant to cleavage and the double-stranded structure provides the molecule with a built-in duplicate of the encoded information [3].
  • 2. IJEECS ISSN: 2502-4752  Classification algorithm for Gene Expression Graph and Manhattan… (N. Sevugapandi) 473 Figure 1. Structure of DNA 1.3.1. Nucleic Acids Nucleic acids are large biological molecules essential for all known forms of life. They include DNA. Together with proteins, nucleic acids are the most important biological macromolecules; each is found in abundance in all living things, where they function in encoding, transmitting and expressing genetic information. The nucleic acids Deoxyribonucleic acid DNA are polymers of nucleotides, arranged in a specific sequence. To form macromolecular polymers, nucleotides are joined between the 3‟ and 5‟ carbon atoms in their sugar moiety by a phosphodiester bond, giving rise to a nucleic acid with a sugar–phosphate „backbone‟ to which is attached a series of bases in a specific order. Hydrogen bonding between pairs of bases can occur, leading to the formation of double- stranded polymers if the sequences are complementary [4]. 1.3.2. Base pairs Basepairs are the building blocks of the DNA double helix, and contribute to the folded structure of both DNA. Dictated by specific hydrogen bonding patterns, Watson-Crick base pairs (Guanine-Cytosine and Adenine-Thymine) allow the DNA helix to maintain a regular helical structure that is independent of its nucleotide sequence. The complementary nature of this based-paired structure provides a backup copy of all genetic information encoded within double- stranded DNA. Figure 2 shows the pairs of ATGC. The regular structure and data redundancy provided by the DNA helix make DNA an optimal molecule for the storage of genetic information, while base-pairing between DNA and incoming nucleotides provide the mechanism through which DNA polymerase replicates DNA transcribes. Many DNA-binding proteins can recognize specific base pairing patterns that identify particular regulatory regions of genes.
  • 3.  ISSN: 2502-4752 IJEECS Vol. 5, No. 2, February 2017 : 472 – 478 474 Figure 2. Base pairs of ATGC 1.3.3. Gene Expression Data Gene expression refers to a complex series of processes in which the information encoded in a gene is used to produce a functional product such as a protein that dictates cell function. It involves several different steps through which DNA is converted to an RNA which in turn is converted into a protein or in some cases RNA, for example, genes encoding the necessary information for transfer RNAs and ribosomal RNAs (tRNAs and rRNAs). The information flow from DNA to RNA to protein can be controlled at several points helping the cell to adjust the quality and quantity of resulting proteins and thus self-regulate its functions. Thus, regulation of gene expression is a critical step in determining what kind of proteins and how much of each protein is expressed in a cell. 1.3.4. DNA Motifs Complex designs are often created using a relatively small set of common building blocks called motifs. DNA self-assembly can exploit this same design principle to hierarchically create more sophisticated aperiodic structures. There are many possible DNA motifs and the focus here is on only a few in the context of the target nanostructure. Motifs include junctions that enable three or more double stranded helices of DNA to interact and thus form specific structures (e.g., a triangle, a corner, and so on). Another important motif is a single strand of DNA protruding from a doublestranded helix called a sticky-end. Two motifs with complementary sequences on their sticky-ends will bind to form a composite motif. Composite motifs may also have embedded sticky-end motifs and thus can also bind with other composite motifs to form another, larger, composite motif. This results in a hierarchical structure for motifs. 1.3.5. Various Gene Expression Techniques Analytical methods may be used to examine mRNA expression levels or differential mRNA expression. Some examples of these techniques are listed below. Serial Analysis of Gene Expression (SAGE): SAGE is a technique used to create a library of short sequence tags which can each be used to detect a transcript. The expression level of the transcript can be determined by assessing how many times each tag is detected. This technology enables comprehensive expression analysis across the genome [5]. DNA microarray: Also known of as biochip or DNA chip, a DNA microarray is a solid surface to which a collection of microscopic DNA spots are attached. The microarrays are used to determine expression levels across a large number of genes or to perform genotyping across different regions of a genome [6]. RNA Seq: This refers to methods used to measure the sequence of RNA molecules. Examples include shotgun sequencing of cDNA molecules acquired from RNA through reverse transcription and technologies used to sequences. RNA molecules from a biological sample so that the primary sequence and abundance of each RNA molecule can be determined. Tiling arrays: A tiling array is a type of microarray chip, with labelled DNA or RNA targets hybridized to probes attached to a solid surface. However, the probes used differ to
  • 4. IJEECS ISSN: 2502-4752  Classification algorithm for Gene Expression Graph and Manhattan… (N. Sevugapandi) 475 those used with traditional microarrays. Rather than known sequences or predicted genes being probed, tiling arrays probe for sequences known to be present in a contigious region. 2. Previous Works J. Breiman, et al., [3] proposed model was used for formulating the prediction model from the car dataset. This prediction model was made by continuously portioning the data in data space. At the last, the portioned result is shown as a decision tree. Classification tree was based on Decision tree, which was typically intended for dependent variables to acquire a countable number of random values along with possible prediction error. The prediction error was considered in terms of penalty for wrong classification. Regression tree algorithms are likely as classification tree algorithms for dependent variables to acquire continuous or ordered distinct values. In regression tree method, prediction error will be calculated by measuring the squared variation among the actual and predicted values [7]. H. Zhang, et al., [4] proposed a deterministic forest methodology by means of continuous tree partition from microarray gene expression data. The key attribute for this methodology is to get better prediction accuracy rate and also to identify the appropriate genes for selecting tumor cell among datasets. Deterministic classification is more likely as random forest tree algorithm but deterministic provide accurate classification when compared with random forest. In deterministic classification, initial level classification of gene population is carried out by choosing predetermined number for splitting criterion. Then perform scrutinizing operation on repeatedly occurring pairs of genes to get more needed data from dataset. This scrutinizing process should disclose the value of the genes in classifying cancer tissues in the relevant data sets [8]. V. N. Vapnik, et al., [5] proposed a well-known classification algorithm called support vector machine (SVM) algorithm to collect necessary information from DNA microarray data. This algorithm is famous for its accuracy and robust nature. The SVM will provide better performance while using multi category of biological scrutiny in cancer diagnosis from microarray gene expression data. It is capable of analyzing both sample and gene expression data in order to explore the wrongly classified data result. SVM provide significant performance. The main drawback of this method, that SVM failed to detect out of class samples and also restricts to provide inadequate clinical diagnostics application [9]. J. Khan, et al., [6] purpose of this paper is to develop a model for classifying cancers into exact diagnostic groups based on gene expression data by means of artificial neural networks (ANNs). Initially ANNs was trained by using round blue-cell tumors. This cancer diagnostic can fit into four different categories. Based on this training, ANNs appropriately classified the entire sample gene and also identify the most relevant gene for classification. This analysis had shown the possible applications of these ANNs method for cancer diagnosis. 3. Results and Discussion The proposed weight based gene expression graph performs classification of diseases by means of gene expression data along with carrying weight at the edge. In graph G(V,E), where V represents the gene and the E represents the weighted edge. The weight present in the edge determines the strength of relationship among the gene. The graph is built from untreated microarray scanned image data to go through the following stages such as preprocessing, data modeling, classification and validation. In microarray technology, gene is identified by types of expression levels. The first level, cy3 represents the healthy condition of gene and the second expression level, cy5 represents the diseased condition of gene. This gene expression is usually attained from the n samples are arranged as matrix called gene expression matrix. In preprocessing stage, this is done by means of normalization techniques. Normalization is particularly useful to train the input values for classification algorithm, this normalization techniques helps to speed up the learning phase in classification algorithm. There are many methods for data normalization. For that, we consider the Standardized normal distribution techniques along with LN ratios of each gene. It can be computed by using equation (1), (2), (3) and (4).
  • 5.  ISSN: 2502-4752 IJEECS Vol. 5, No. 2, February 2017 : 472 – 478 476 LNratio(x)= ⁄ (1) Mg = (2) Sg = = | | | | | | | | (3) Z(LNratio(x), , )= (4) Where, Mg represents the mean, represents the standard deviation and LNratio(x) represents the input value x. For example, Table 1 Consider the expression levels training set to compute standardized normal distribution. Table 1. Expression levels training set Gene Tissue a Diseased Healthy b Diseased Healthy C Diseased Healthy Sample 1 30 10000 100 12000 2000 10 Sample 2 50 30000 50 2000 1000 1099 Sample 3 30 5000 13000 80 4000 150 Sample 4 100 6000 80 15000 6000 25 Table 2. LNratio and Normalization of the training data set Gene LNratio(x) Normalization(LNratio(x), , ) Mean Standard Deviation a B A b S1 -8.31 -6.85 -0.86 -0.64 -2.53 6.74 S2 -9.15 -5.28 -1.37 -0.13 -4.85 3.15 S3 -7.32 7.28 -1.5 0.97 1.55 5.92 S4 -5.86 -7.49 -0.62 -0.88 -1.83 6.45 Then relevancy between genes is computed by finding the relevancy of each gene by calculating Relevance Count (RC). Based on TRC value, the gene relevancy is categorized into 3 types. They are more relevant gene (1), quieted gene (-1) and irrelevant gene (0). RC is calculated by using threshold, cy5 and cy3 values. A threshold can be specified by using well known methods for example, t test. Total Relevancy Count (TRC) is by doing summation of all RC value in each gene. Table 3 shows the TC value for each gene. RC ( , cy5, cy3) = { ( ) ( ) ( ) (5) Table 3. TRC value for threshold Gene A b c Sample 1 -1 -1 1 Sample 2 -1 0 1 Sample 3 -1 1 1 Sample 4 -1 -1 1 TRC -4 -1 4 The weight ( ) between the genes are calculated by applying Modified Manhattan distance formula. It is defined in equation (6).
  • 6. IJEECS ISSN: 2502-4752  Classification algorithm for Gene Expression Graph and Manhattan… (N. Sevugapandi) 477 WG ( ) = | | +| |+| | +….. +| | (6) Table 4. Represents the weight for gene expression tree as matrix Gene A B c A 0 3 4 B 3 0 3 C 4 3 0 5. Conclusion This paper proposes a Modified Manhattan distance based weighted GEG classifier to classify a gene expression data. The experimental outcome confirms that the efficiency of the proposed method gives better result than that of current methods. The proposed method can also appropriately distinguish out-of-class samples. In addition to that, the proposed method could properly classify samples in the relevant classes. The proposed method is going to improve graph based data structure by adding Euclidean distance to determine the relationships among genes. In this method, relevant genes are extracted by means of weight assigned to each gene, where larger weights point out a healthier relationship between two genes. As a result, the proposed method can decrease the cost acquired by classifying inappropriate genes. One of the major works of the proposed method having ability to appropriately categorizes into the relevant classes and also properly distinguishes out of class samples. In addition, it will significantly reduce computation time taken for classifying the data. References [1] A Benso, S DiCarlo, G Politano, L Sterpone. A Graph Based Representation of Gene Expression Profiles in DNA Microarrays. Proc. IEEE Symp. Computational Intelligence in Bioinformatics and Computational Biology (CIBCB). 2008: 75-82. [2] Benso, S Di Carlo, G Politano, L Sterpone. Differential Gene Expression Graphs: A Data Structure for Classification in DNA Microarrays. Proc. Eighth IEEE Int‟l Conf. BioInformatics and BioEng. (BIBE). 2008. [3] J Breiman, L ad Friedman, CJ Stone, R Olshen. Classification and Regression Trees. New York, NY, USA: Talyor and Francis. 1984. [4] H Zhang, CY Yu, B Singer. Cell and tumor classification using gene expression data: Construction of forests. Proc. Nat. Acad. Sci. USA. 2003; 100(7): 4168-4172. [5] VN Vapnik. An Overview of Statistical Learning Theory. IEEE Trans. Neural Networks. 1999; 10(5): 988-999. [6] J Khan, JS Wei, M Ringner, LH Saal, M Ladanyi, F Westermann, F Berthold, M Schwab, CR Antonescu, C Peterson, PS Meltzer. Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks. Nature Med. 2001; 7(6): 673-679. [7] Benso, S Di Carlo, G Politano. A cDNA microarray gene expression data classifier for clinical diagnostics based on graph theory. IEEE/ACM Trans. Comput. Biol. Bioinformat. 2011; 8(3): 577- 591.
  • 7.  ISSN: 2502-4752 IJEECS Vol. 5, No. 2, February 2017 : 472 – 478 478