SlideShare a Scribd company logo
1 of 5
Download to read offline
International Association of Scientific Innovation and Research (IASIR)
(An Association Unifying the Sciences, Engineering, and Applied Research)
International Journal of Emerging Technologies in Computational
and Applied Sciences (IJETCAS)
www.iasir.net
IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 99
ISSN (Print): 2279-0047
ISSN (Online): 2279-0055
SPGUP – Sequence analysis, Phylogenetic trees, Genome Diagrams Using
Python - a Phylogenetic Tool
1
Prof.Dr. P. K. Srimani and 2
Mrs. Kumudavalli M.V
1
Director, R&D Division, B.U., DSI, Bangalore, Karnataka, INDIA
2
Dperatment of MCA (BU), DSI, Bangalore, Karnataka, INDIA and Research Scholar, SCSVMV University,
Kanchipuram, Tamilnadu, INDIA
_______________________________________________________________
Abstract: SPGUP is a software tool and a free GNU/Linux distribution developed by the authors that has the
unique feature of providing a set of Phyloinformatic tools for retrieving data from NCBI, to perform Sequence
analysis, Phylogenetic tree construction and to build the Genome diagrams. The operating system incorporates
most commonly used Phylogenetic software, which has been pre-compiled and pre-configured, allowing the
user for direct application of Phylogenetic methods in a stable Linux environment.
Keywords: Sequence, Phylogenetics, Phyloinformatic, operating system
__________________________________________________________________________________________
I. INTRODUCTION
Phylogenetic methods are having exponential growth nearly in all fields of biology. As the biological data
accumulation rate is increasing, the size of Phylogenetic analyses and the scope of inferences for which they are
utilized have both increased enormously. To this end a diverse array of methods that have been developed by the
researchers. But the availability of these methods, and the sheer number of software packages which are now
regularly employed in Phylogenetic research force academicians, biologists and system administrators to spend
huge amounts of time installing, configuring and maintaining them, rather than focusing on research.
Phyloinformatic research, in particular, depends largely on processing capacity and highly parallel analyses that
are distributed across many computers and processors. As Phylogenetic analysis has become more complex and
advanced, careful calibration of operating systems across computers also becomes more complex. While this
increasing need of computational power gives researchers the ability to be more creative in assaying complex
and time-consuming analyses, installation and configuration of various software tools for these machines
becomes highly repetitive, the process will be error-prone and time consuming. This situation or problem can be
easily handled with development of a tailor made operating systems for each specific purposes of Phylogenetic
analysis. Moreover, in recent days the Phylogenetic methods are experiencing rapid development; hence there is
a need for a distribution specifically focusing on this area. SPGUP is one such software tool and an operating
system by itself developed using Python ([1], [2]) which heads towards filling this need.
II. RELATED WORK
A thorough survey of the literature pertaining to the subject reveals that very sparse literature is available in this
direction. Some recent works include ([1], [2], [3], [4], [5], [6], [7] and [8]). Absolutely no work is available
with regard to the present work. Hence, the present innovative work is carried out.
III. NEED AND IMPORTANCE OF THE PROBLEM
As discussed earlier the Phylogenetic analysis has a major application in the field of Biology. Since there are
very few operating systems/software packages [3] available in this direction which are aimed at generalizing the
methods, and thus do not include many tools or packages essential for Phylogenetic analysis alone. SPGUP is a
tool which incorporates methods which are essential for performing basic studies pertaining to sequences and
Phylogenetic trees in addition it concentrates on Genome diagrams. A comparative analysis of SPGUP with
other available Phylogenetic softwares is essential to ensure and highlight the easy and speedy approach used to
obtain various results using SPGUP as compared to other tools. Since not much work in this area has been done,
the present investigation is carried out to throw light on the qualitative aspects of the tool.
IV. DATA SET DESCRIPTION
Basically, there are two ways of data selection viz., general and through SPGUP.
i. Data set selection (General Method)
A set of 23 sequences belonging to plants kingdom which contain the protein coded gene ‘chalcone synthase’
was being selected from NCBI’s Nucleotide Database using BLAST as in Table1. The set of sequences under
P. K. Srimani et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(1), March-May, 2014, pp.
99-103
IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 100
study was used in CLUSTALW ([4], [5]) to get the resulting tree. Later the tree file was used in Tree View to
get the Phylogram and Rectangular cladogram.
ii. Data set selection ( Using SPGUP )
The BLAST option in SPGUP tool is used to get set of (23) sequences based on the key-id of the primary
sequence having ‘chalcone synthase’. Later it was used in ALIGNMNET option to produce ‘.dnd’ file which
is later used to construct various forms of Phylogenetic trees.
Table I: Contains the sequence details
Sl.No LOCUS/ ACCESSION ORGANISM
1 JN830647.1 Pyrus pyrifolia
2 HQ853494.1 Malus toringoides
3 DQ286037.1 Sorbus aucuparia
4 AF400567.1 Rubus idaeus
5 HQ423171.1 chalcone synthase
6 AB201756.1 Fragaria x ananassa
7 JQ247184.1 Camellia sinensis
8 AM263200.1 Humulus lupulus
9 AJ413277.1 Rhododendron simsii
10 X94706.1 Juglans nigra
11 JN654702.1 Vaccinium corymbosum
12 JQ627646.1 Lonicera japonica
13 AB009350.1 Citrus sinensis
14 JF795272.1 Gossypium hirsutum
15 HQ127337.1 Phlox drummondii
16 EU430077.1 Senna tora
17 AY237728.1 Glycine max
18 L24517.1 Trifolium subterraneum
19 FJ705842.1 Capsicum annuum
20 AY170347.1 Arachis hypogaea
21 DQ208973.1 Cardamine maritime
22 AF112108.1 Barbarea vulgaris
23 AF144530.1 Rorippa amphibian
V. METHODOLOGY
The traditional methods which use the existing software tools require the following steps for Phylogenetic tree
construction:
Step1: The initial sequence having ‘chalcone synthase’ is being selected form NCBI’s data base, it is then used
in BLAST program of NCBI to get the set of similar sequences [4].
Step2: The set of sequences under study is pasted onto the CLUSTALW tool window for alignment. The tool
also gives the Tree output in NEWICK format, and as a Phylogram.
Step3: The tree file in NEWICK format is later used in TREE VIEW software tool, which is to be pre-
configured before use to get various types of pictorial representations of the tree.
This approach does not predict anything about the different tree formats viz., ASCII tree, Unrooted and Neto
format tree, but in case where SPGUP tool is used, exhaustive results pertaining to the study are obtained with
minimum effort and maximum efficiency. These are illustrated in the forthcoming sections.
However, in the present case only the following steps are needed for the results pertaining to Phylogenetic trees
construction.
Pictorial representation of the workflow of SPGUP
Key sequence id is entered into
the BLAST option screen of
SPGUP
A set of sequences of interest are
selected from BLAST output and
is used in ALIGN option to get
tree file in NEWICK
format('.dnd 'file)
Phylogenetic tree is constructed
using PHYLOGENETIC TREE
option
It produces 5 types of trees viz,
ASCII (plain text) tree, Rooted
tree, Rooted with colour,
Unrooted tree and Neto tree.
P. K. Srimani et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(1), March-May, 2014, pp.
99-103
IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 101
VI. EXPERIMENTS AND RESULTS
In this section, the results of the present investigation using both the traditional method (Figures 1, 2 and 3)
and the SPGUP tool (Figures 4, 5, 6, 7 and 8) are presented. The experiments are conducted as per the steps
mentioned in section V. Figure1 shows the Phylogenetic tree as a cladogram which is obtained by using the
CLUSTALW tool. A cladogram is a simple tree depicting only relationships between terminal nodes. A
cladogram can also show inferred character changes [6].
Fig. 1 Cladogram using CLUSTALW
The aligned sequences of the data set are further used in TREE VIEW tool which is exclusively used for tree
drawing. It takes the input file in NEWICK format. Figures 2 and 3 represent the Phylogenetic tree in a
Phylogram and Rectangular cladogram format.
Fig.2 Phylogram Using TREE VIEW
Fig. 3 Rectangular Cladogram using TREE VIEW
P. K. Srimani et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(1), March-May, 2014, pp.
99-103
IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 102
Following are the results obtained from SPGUP tool. The SPGUP tool allows the user to construct various
forms of Phylogenetic tree with minimum effort. Figure4 represents a basic Phylogram. A Phylogram or
additive tree which has additional information, in the sense that edge lengths are drawn proportional to some
attributes. The evolutionary tree for the group of 23 sequences under study is as below
Fig. 4 ASCII tree (Phylogram) using SPGUP Fig.5 Rooted tree (Phylogram) with color using SPGUP
A Phylogram can be rooted/unrooted , Figure 5 presents the Phylogram which is a rooted tree drawn with
colored edges representing the node to which they belong. Figures 6 and 7 depict the rooted and unrooted trees
respectively in the radial format [7] using SPGUP tool.
Fig. 6 Rooted tree using SPGUP Fig. 7 Unrooted tree using SPGUP
Figure 8 is a special format of Phylogenetic tree generally drawn for a very large data set.
Fig. 8 Neto format tree (for very large data set) using SPGUP
P. K. Srimani et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(1), March-May, 2014, pp.
99-103
IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 103
VII. CONCLUSION
The present investigation with the dataset of 23 sequences was carried out to throw some light on the existing
traditional methods/software tools which require the knowledge of many tools with their setup and usage as
compared to the tool SPGUP developed by the authors.
In the traditional methods, the user has to follow a lot of steps in order to construct the Phylogenetic trees i.e.
use of BLAST for finding the similar sequences, followed by CLUSTALW tool for aligning the sequences, in
order to get the Newick format tree. Later the tree file in ‘.dnd’ form is used in another tool called TREE VIEW
for getting different forms of Phylogenetic trees. These methods take a huge computational time with
dependencies on various tools, which is of concern to the user. Compared to traditional methods SPGUP is a
Phyloinformatic tool developed by the authors and is a pipeline of tools required for the tree construction, which
has a GUI based approach.
The special features of SPGUP tool are: (i) Different forms of trees can be constructed for the same data set.
(ii) Less number of steps is involved in the computational process. (iii) It can be used for both Nucleotide
(DNA, RNA) and Protein sequences. (iv) In order to increase the scope and applications an integrated approach
is employed in the design of SPGUP, comprising of Python, Networkx, Graphviz, etc. (v) The SPGUP tool is
first of its kind and has lot of scope for further research.
These special features have made it very user friendly for both the biologists and computer/IT professionals. The
versatility of this tool is that, it has better knowledge acquisition, assimilation and dissemination.
ACKNOWLEDGEMENTS
One of the authors Mrs. Kumudavalli M.V acknowledges Dayananda Sagar Institutions, Bangalore, Karnataka
and SCSVMV University, Kanchipuram, Tamilnadu, India for providing the facilities for carrying out the
research work.
REFERENCES
[1] http://www.python.org
[2] http://www.open-bio.org
[3] Stephen A. Smith1,* and Casey W. Dunn2, “Phyutility: a phyloinformatics tool for trees, alignments and molecular data”,
BIOINFORMATICS, APPLICATIONS NOTE Vol. 24 no. 5 2008, pages 715–716 doi:10.1093/bioinformatics/btm619
[4] S. B. Needleman and C. D. Wunsch. “A general method applicable to the search for similarities in the amino acid sequence of
two proteins”. Journal of Molecular Biology, 48:443-453, 1970.
[5] Elodine Nedelec,1
Thomas Moncion,2,3
Elisabeth Gassiat,1
Bruno Bossard,2,4
Guillemette Duchateau-Nguyen,5
Alain Denise,2,5
and Michel Termier5
. “A Pair-wise Alignment Algorithm Which Favors Clusters of Blocks”. Journal of Computational Biology,
12, Number 1, 33-47, 2005.
[6] Thompson J.D., Higgins, D.G., and Gibson, T.J. “Clustalw; improving the sensitivity of progressive multiple sequence alignment
through sequence weighting, position-specific gap penalties and weight matrix choice”. Nucl. Acids Res. 22, 4673- 4680, 1994.
[7] Srimani P. K, Kumudavalli M.V., “A Computational Analysis of the Phylogenetic Trees of Some Eukaryotes Sequences,”
International Journal of Current Research, Vol. 4,Issue, 05, pp. 206-210, May, 2012.
[8] Valdimir Makarenkov1
and Bruno Leclerc2
. “Comparison of Additive Tree using Circular Orders”. Journal of Computational
Biology. 7, 731-744, 2005.

More Related Content

What's hot

Evaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileEvaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileeSAT Publishing House
 
Using data mining methods knowledge discovery for text mining
Using data mining methods knowledge discovery for text miningUsing data mining methods knowledge discovery for text mining
Using data mining methods knowledge discovery for text miningeSAT Journals
 
Using data mining methods knowledge discovery for text mining
Using data mining methods knowledge discovery for text miningUsing data mining methods knowledge discovery for text mining
Using data mining methods knowledge discovery for text miningeSAT Publishing House
 
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...IRJET Journal
 
Decision Support System for Bat Identification using Random Forest and C5.0
Decision Support System for Bat Identification using Random Forest and C5.0Decision Support System for Bat Identification using Random Forest and C5.0
Decision Support System for Bat Identification using Random Forest and C5.0TELKOMNIKA JOURNAL
 
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)IJCSEA Journal
 
Image retrieval based on feature selection method
Image retrieval based on feature selection methodImage retrieval based on feature selection method
Image retrieval based on feature selection methodeSAT Publishing House
 
Ant System and Weighted Voting Method for Multiple Classifier Systems
Ant System and Weighted Voting Method for Multiple Classifier Systems Ant System and Weighted Voting Method for Multiple Classifier Systems
Ant System and Weighted Voting Method for Multiple Classifier Systems IJECEIAES
 
Identification of Plant Types by Leaf Textures Based on the Backpropagation N...
Identification of Plant Types by Leaf Textures Based on the Backpropagation N...Identification of Plant Types by Leaf Textures Based on the Backpropagation N...
Identification of Plant Types by Leaf Textures Based on the Backpropagation N...IJECEIAES
 
Flower Grain Image Classification Using Supervised Classification Algorithm
Flower Grain Image Classification Using Supervised Classification AlgorithmFlower Grain Image Classification Using Supervised Classification Algorithm
Flower Grain Image Classification Using Supervised Classification AlgorithmIJERD Editor
 
Artificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionArtificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionDr. Amarjeet Singh
 
Neighborhood search methods with moth optimization algorithm as a wrapper met...
Neighborhood search methods with moth optimization algorithm as a wrapper met...Neighborhood search methods with moth optimization algorithm as a wrapper met...
Neighborhood search methods with moth optimization algorithm as a wrapper met...IJECEIAES
 
Data reduction techniques for high dimensional biological data
Data reduction techniques for high dimensional biological dataData reduction techniques for high dimensional biological data
Data reduction techniques for high dimensional biological dataeSAT Journals
 
Sample size determination for classification of eeg signals using power analy...
Sample size determination for classification of eeg signals using power analy...Sample size determination for classification of eeg signals using power analy...
Sample size determination for classification of eeg signals using power analy...iaemedu
 
Sample size determination for classification of eeg signals using power analy...
Sample size determination for classification of eeg signals using power analy...Sample size determination for classification of eeg signals using power analy...
Sample size determination for classification of eeg signals using power analy...iaemedu
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETEditor IJMTER
 
Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsIJDKP
 
Approaching Rules Induction CN2 Algorithm in Categorizing of Biodiversity
Approaching Rules Induction CN2 Algorithm in Categorizing of BiodiversityApproaching Rules Induction CN2 Algorithm in Categorizing of Biodiversity
Approaching Rules Induction CN2 Algorithm in Categorizing of Biodiversityijtsrd
 

What's hot (20)

Evaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for fileEvaluating the efficiency of rule techniques for file
Evaluating the efficiency of rule techniques for file
 
Using data mining methods knowledge discovery for text mining
Using data mining methods knowledge discovery for text miningUsing data mining methods knowledge discovery for text mining
Using data mining methods knowledge discovery for text mining
 
Using data mining methods knowledge discovery for text mining
Using data mining methods knowledge discovery for text miningUsing data mining methods knowledge discovery for text mining
Using data mining methods knowledge discovery for text mining
 
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
IRJET- Supervised Learning Approach for Flower Images using Color, Shape and ...
 
Decision Support System for Bat Identification using Random Forest and C5.0
Decision Support System for Bat Identification using Random Forest and C5.0Decision Support System for Bat Identification using Random Forest and C5.0
Decision Support System for Bat Identification using Random Forest and C5.0
 
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
Filter Based Approach for Genomic Feature Set Selection (FBA-GFS)
 
La2418611866
La2418611866La2418611866
La2418611866
 
Image retrieval based on feature selection method
Image retrieval based on feature selection methodImage retrieval based on feature selection method
Image retrieval based on feature selection method
 
Ant System and Weighted Voting Method for Multiple Classifier Systems
Ant System and Weighted Voting Method for Multiple Classifier Systems Ant System and Weighted Voting Method for Multiple Classifier Systems
Ant System and Weighted Voting Method for Multiple Classifier Systems
 
Identification of Plant Types by Leaf Textures Based on the Backpropagation N...
Identification of Plant Types by Leaf Textures Based on the Backpropagation N...Identification of Plant Types by Leaf Textures Based on the Backpropagation N...
Identification of Plant Types by Leaf Textures Based on the Backpropagation N...
 
Flower Grain Image Classification Using Supervised Classification Algorithm
Flower Grain Image Classification Using Supervised Classification AlgorithmFlower Grain Image Classification Using Supervised Classification Algorithm
Flower Grain Image Classification Using Supervised Classification Algorithm
 
Artificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern RecognitionArtificial Intelligence based Pattern Recognition
Artificial Intelligence based Pattern Recognition
 
Neighborhood search methods with moth optimization algorithm as a wrapper met...
Neighborhood search methods with moth optimization algorithm as a wrapper met...Neighborhood search methods with moth optimization algorithm as a wrapper met...
Neighborhood search methods with moth optimization algorithm as a wrapper met...
 
Data reduction techniques for high dimensional biological data
Data reduction techniques for high dimensional biological dataData reduction techniques for high dimensional biological data
Data reduction techniques for high dimensional biological data
 
M43016571
M43016571M43016571
M43016571
 
Sample size determination for classification of eeg signals using power analy...
Sample size determination for classification of eeg signals using power analy...Sample size determination for classification of eeg signals using power analy...
Sample size determination for classification of eeg signals using power analy...
 
Sample size determination for classification of eeg signals using power analy...
Sample size determination for classification of eeg signals using power analy...Sample size determination for classification of eeg signals using power analy...
Sample size determination for classification of eeg signals using power analy...
 
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASETSURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
SURVEY ON CLASSIFICATION ALGORITHMS USING BIG DATASET
 
Predicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithmsPredicting students' performance using id3 and c4.5 classification algorithms
Predicting students' performance using id3 and c4.5 classification algorithms
 
Approaching Rules Induction CN2 Algorithm in Categorizing of Biodiversity
Approaching Rules Induction CN2 Algorithm in Categorizing of BiodiversityApproaching Rules Induction CN2 Algorithm in Categorizing of Biodiversity
Approaching Rules Induction CN2 Algorithm in Categorizing of Biodiversity
 

Viewers also liked (19)

Aijrfans14 214
Aijrfans14 214Aijrfans14 214
Aijrfans14 214
 
Ijetcas14 507
Ijetcas14 507Ijetcas14 507
Ijetcas14 507
 
Ijetcas14 308
Ijetcas14 308Ijetcas14 308
Ijetcas14 308
 
Ijetcas14 546
Ijetcas14 546Ijetcas14 546
Ijetcas14 546
 
Ijetcas14 306
Ijetcas14 306Ijetcas14 306
Ijetcas14 306
 
Red Apple Opportunities
Red Apple Opportunities Red Apple Opportunities
Red Apple Opportunities
 
Aijrfans14 215
Aijrfans14 215Aijrfans14 215
Aijrfans14 215
 
Ijetcas14 309
Ijetcas14 309Ijetcas14 309
Ijetcas14 309
 
Ijetcas14 350
Ijetcas14 350Ijetcas14 350
Ijetcas14 350
 
Ijetcas14 314
Ijetcas14 314Ijetcas14 314
Ijetcas14 314
 
Ijetcas14 317
Ijetcas14 317Ijetcas14 317
Ijetcas14 317
 
Ijetcas14 356
Ijetcas14 356Ijetcas14 356
Ijetcas14 356
 
Ijetcas14 584
Ijetcas14 584Ijetcas14 584
Ijetcas14 584
 
Viengsouvanh 2104
Viengsouvanh 2104Viengsouvanh 2104
Viengsouvanh 2104
 
Ijetcas14 344
Ijetcas14 344Ijetcas14 344
Ijetcas14 344
 
Aijrfans14 289
Aijrfans14 289Aijrfans14 289
Aijrfans14 289
 
Ijetcas14 336
Ijetcas14 336Ijetcas14 336
Ijetcas14 336
 
Ijetcas14 493
Ijetcas14 493Ijetcas14 493
Ijetcas14 493
 
Ijetcas14 351
Ijetcas14 351Ijetcas14 351
Ijetcas14 351
 

Similar to Ijetcas14 325

Genetic Algorithm for optimization on IRIS Dataset REPORT pdf
Genetic Algorithm for optimization on IRIS Dataset REPORT pdfGenetic Algorithm for optimization on IRIS Dataset REPORT pdf
Genetic Algorithm for optimization on IRIS Dataset REPORT pdfSunil Rajput
 
IDENTIFICATION AND CLASSIFICATION OF RARE MEDICINAL PLANTS USING MACHINE LEAR...
IDENTIFICATION AND CLASSIFICATION OF RARE MEDICINAL PLANTS USING MACHINE LEAR...IDENTIFICATION AND CLASSIFICATION OF RARE MEDICINAL PLANTS USING MACHINE LEAR...
IDENTIFICATION AND CLASSIFICATION OF RARE MEDICINAL PLANTS USING MACHINE LEAR...IRJET Journal
 
AI BASED CROP IDENTIFICATION WEBAPP
AI BASED CROP IDENTIFICATION WEBAPPAI BASED CROP IDENTIFICATION WEBAPP
AI BASED CROP IDENTIFICATION WEBAPPIRJET Journal
 
Potato Leaf Disease Detection Using Machine Learning
Potato Leaf Disease Detection Using Machine LearningPotato Leaf Disease Detection Using Machine Learning
Potato Leaf Disease Detection Using Machine LearningIRJET Journal
 
Parkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVMParkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVMIRJET Journal
 
Plant Disease Prediction Using Image Processing
Plant Disease Prediction Using Image ProcessingPlant Disease Prediction Using Image Processing
Plant Disease Prediction Using Image ProcessingIRJET Journal
 
LIMS FOR MAIZE MAPPING PROJECT
LIMS FOR MAIZE MAPPING PROJECTLIMS FOR MAIZE MAPPING PROJECT
LIMS FOR MAIZE MAPPING PROJECTG2 APPS SA DE CV
 
A Study on Surf & Hog Descriptors for Alzheimer’s Disease Detection
A Study on Surf & Hog Descriptors for Alzheimer’s Disease DetectionA Study on Surf & Hog Descriptors for Alzheimer’s Disease Detection
A Study on Surf & Hog Descriptors for Alzheimer’s Disease DetectionIRJET Journal
 
Data Analysis and Prediction System for Meteorological Data
Data Analysis and Prediction System for Meteorological DataData Analysis and Prediction System for Meteorological Data
Data Analysis and Prediction System for Meteorological DataIRJET Journal
 
De novo transcriptome assembly of solid sequencing data in cucumis melo
De novo transcriptome assembly of solid sequencing data in cucumis meloDe novo transcriptome assembly of solid sequencing data in cucumis melo
De novo transcriptome assembly of solid sequencing data in cucumis melobioejjournal
 
DE NOVO TRANSCRIPTOME ASSEMBLY OF SOLID SEQUENCING DATA IN CUCUMIS MELO
DE NOVO TRANSCRIPTOME ASSEMBLY OF SOLID SEQUENCING DATA IN CUCUMIS MELODE NOVO TRANSCRIPTOME ASSEMBLY OF SOLID SEQUENCING DATA IN CUCUMIS MELO
DE NOVO TRANSCRIPTOME ASSEMBLY OF SOLID SEQUENCING DATA IN CUCUMIS MELObioejjournal
 
Data modelling and data processing generated by human eye movements
Data modelling and data processing generated by human eye movements  Data modelling and data processing generated by human eye movements
Data modelling and data processing generated by human eye movements IJECEIAES
 
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET Journal
 
Hyperparameters analysis of long short-term memory architecture for crop cla...
Hyperparameters analysis of long short-term memory  architecture for crop cla...Hyperparameters analysis of long short-term memory  architecture for crop cla...
Hyperparameters analysis of long short-term memory architecture for crop cla...IJECEIAES
 
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNINGPARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNINGIRJET Journal
 
Accelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsAccelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsPriscill Orue Esquivel
 
The comparison of the text classification methods to be used for the analysis...
The comparison of the text classification methods to be used for the analysis...The comparison of the text classification methods to be used for the analysis...
The comparison of the text classification methods to be used for the analysis...ijcsit
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...ijdms
 

Similar to Ijetcas14 325 (20)

Genetic Algorithm for optimization on IRIS Dataset REPORT pdf
Genetic Algorithm for optimization on IRIS Dataset REPORT pdfGenetic Algorithm for optimization on IRIS Dataset REPORT pdf
Genetic Algorithm for optimization on IRIS Dataset REPORT pdf
 
IDENTIFICATION AND CLASSIFICATION OF RARE MEDICINAL PLANTS USING MACHINE LEAR...
IDENTIFICATION AND CLASSIFICATION OF RARE MEDICINAL PLANTS USING MACHINE LEAR...IDENTIFICATION AND CLASSIFICATION OF RARE MEDICINAL PLANTS USING MACHINE LEAR...
IDENTIFICATION AND CLASSIFICATION OF RARE MEDICINAL PLANTS USING MACHINE LEAR...
 
AI BASED CROP IDENTIFICATION WEBAPP
AI BASED CROP IDENTIFICATION WEBAPPAI BASED CROP IDENTIFICATION WEBAPP
AI BASED CROP IDENTIFICATION WEBAPP
 
Potato Leaf Disease Detection Using Machine Learning
Potato Leaf Disease Detection Using Machine LearningPotato Leaf Disease Detection Using Machine Learning
Potato Leaf Disease Detection Using Machine Learning
 
Parkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVMParkinson Disease Detection Using XGBoost and SVM
Parkinson Disease Detection Using XGBoost and SVM
 
Plant Disease Prediction Using Image Processing
Plant Disease Prediction Using Image ProcessingPlant Disease Prediction Using Image Processing
Plant Disease Prediction Using Image Processing
 
LIMS for maize mapping project
LIMS for maize mapping projectLIMS for maize mapping project
LIMS for maize mapping project
 
LIMS FOR MAIZE MAPPING PROJECT
LIMS FOR MAIZE MAPPING PROJECTLIMS FOR MAIZE MAPPING PROJECT
LIMS FOR MAIZE MAPPING PROJECT
 
A Study on Surf & Hog Descriptors for Alzheimer’s Disease Detection
A Study on Surf & Hog Descriptors for Alzheimer’s Disease DetectionA Study on Surf & Hog Descriptors for Alzheimer’s Disease Detection
A Study on Surf & Hog Descriptors for Alzheimer’s Disease Detection
 
Data Analysis and Prediction System for Meteorological Data
Data Analysis and Prediction System for Meteorological DataData Analysis and Prediction System for Meteorological Data
Data Analysis and Prediction System for Meteorological Data
 
De novo transcriptome assembly of solid sequencing data in cucumis melo
De novo transcriptome assembly of solid sequencing data in cucumis meloDe novo transcriptome assembly of solid sequencing data in cucumis melo
De novo transcriptome assembly of solid sequencing data in cucumis melo
 
DE NOVO TRANSCRIPTOME ASSEMBLY OF SOLID SEQUENCING DATA IN CUCUMIS MELO
DE NOVO TRANSCRIPTOME ASSEMBLY OF SOLID SEQUENCING DATA IN CUCUMIS MELODE NOVO TRANSCRIPTOME ASSEMBLY OF SOLID SEQUENCING DATA IN CUCUMIS MELO
DE NOVO TRANSCRIPTOME ASSEMBLY OF SOLID SEQUENCING DATA IN CUCUMIS MELO
 
Data modelling and data processing generated by human eye movements
Data modelling and data processing generated by human eye movements  Data modelling and data processing generated by human eye movements
Data modelling and data processing generated by human eye movements
 
D1803012022
D1803012022D1803012022
D1803012022
 
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
IRJET- Gene Mutation Data using Multiplicative Adaptive Algorithm and Gene On...
 
Hyperparameters analysis of long short-term memory architecture for crop cla...
Hyperparameters analysis of long short-term memory  architecture for crop cla...Hyperparameters analysis of long short-term memory  architecture for crop cla...
Hyperparameters analysis of long short-term memory architecture for crop cla...
 
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNINGPARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
PARKINSON’S DISEASE DETECTION USING MACHINE LEARNING
 
Accelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methodsAccelerating GWAS epistatic interaction analysis methods
Accelerating GWAS epistatic interaction analysis methods
 
The comparison of the text classification methods to be used for the analysis...
The comparison of the text classification methods to be used for the analysis...The comparison of the text classification methods to be used for the analysis...
The comparison of the text classification methods to be used for the analysis...
 
Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...Delineation of techniques to implement on the enhanced proposed model using d...
Delineation of techniques to implement on the enhanced proposed model using d...
 

More from Iasir Journals (20)

ijetcas14 650
ijetcas14 650ijetcas14 650
ijetcas14 650
 
Ijetcas14 648
Ijetcas14 648Ijetcas14 648
Ijetcas14 648
 
Ijetcas14 647
Ijetcas14 647Ijetcas14 647
Ijetcas14 647
 
Ijetcas14 643
Ijetcas14 643Ijetcas14 643
Ijetcas14 643
 
Ijetcas14 641
Ijetcas14 641Ijetcas14 641
Ijetcas14 641
 
Ijetcas14 639
Ijetcas14 639Ijetcas14 639
Ijetcas14 639
 
Ijetcas14 632
Ijetcas14 632Ijetcas14 632
Ijetcas14 632
 
Ijetcas14 624
Ijetcas14 624Ijetcas14 624
Ijetcas14 624
 
Ijetcas14 619
Ijetcas14 619Ijetcas14 619
Ijetcas14 619
 
Ijetcas14 615
Ijetcas14 615Ijetcas14 615
Ijetcas14 615
 
Ijetcas14 608
Ijetcas14 608Ijetcas14 608
Ijetcas14 608
 
Ijetcas14 605
Ijetcas14 605Ijetcas14 605
Ijetcas14 605
 
Ijetcas14 604
Ijetcas14 604Ijetcas14 604
Ijetcas14 604
 
Ijetcas14 598
Ijetcas14 598Ijetcas14 598
Ijetcas14 598
 
Ijetcas14 594
Ijetcas14 594Ijetcas14 594
Ijetcas14 594
 
Ijetcas14 593
Ijetcas14 593Ijetcas14 593
Ijetcas14 593
 
Ijetcas14 591
Ijetcas14 591Ijetcas14 591
Ijetcas14 591
 
Ijetcas14 589
Ijetcas14 589Ijetcas14 589
Ijetcas14 589
 
Ijetcas14 585
Ijetcas14 585Ijetcas14 585
Ijetcas14 585
 
Ijetcas14 583
Ijetcas14 583Ijetcas14 583
Ijetcas14 583
 

Recently uploaded

internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerunnathinaik
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting DataJhengPantaleon
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Celine George
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsKarinaGenton
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxRaymartEstabillo3
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️9953056974 Low Rate Call Girls In Saket, Delhi NCR
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxSayali Powar
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfakmcokerachita
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,Virag Sontakke
 
Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonJericReyAuditor
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13Steve Thomason
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Educationpboyjonauth
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityGeoBlogs
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentInMediaRes1
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxiammrhaywood
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfSumit Tiwari
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxsocialsciencegdgrohi
 

Recently uploaded (20)

internship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developerinternship ppt on smartinternz platform as salesforce developer
internship ppt on smartinternz platform as salesforce developer
 
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data_Math 4-Q4 Week 5.pptx Steps in Collecting Data
_Math 4-Q4 Week 5.pptx Steps in Collecting Data
 
Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17Computed Fields and api Depends in the Odoo 17
Computed Fields and api Depends in the Odoo 17
 
Science 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its CharacteristicsScience 7 - LAND and SEA BREEZE and its Characteristics
Science 7 - LAND and SEA BREEZE and its Characteristics
 
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptxEPANDING THE CONTENT OF AN OUTLINE using notes.pptx
EPANDING THE CONTENT OF AN OUTLINE using notes.pptx
 
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
call girls in Kamla Market (DELHI) 🔝 >༒9953330565🔝 genuine Escort Service 🔝✔️✔️
 
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptxPOINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
POINT- BIOCHEMISTRY SEM 2 ENZYMES UNIT 5.pptx
 
Class 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdfClass 11 Legal Studies Ch-1 Concept of State .pdf
Class 11 Legal Studies Ch-1 Concept of State .pdf
 
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,भारत-रोम व्यापार.pptx, Indo-Roman Trade,
भारत-रोम व्यापार.pptx, Indo-Roman Trade,
 
Science lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lessonScience lesson Moon for 4th quarter lesson
Science lesson Moon for 4th quarter lesson
 
The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13The Most Excellent Way | 1 Corinthians 13
The Most Excellent Way | 1 Corinthians 13
 
Staff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSDStaff of Color (SOC) Retention Efforts DDSD
Staff of Color (SOC) Retention Efforts DDSD
 
Introduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher EducationIntroduction to ArtificiaI Intelligence in Higher Education
Introduction to ArtificiaI Intelligence in Higher Education
 
Paris 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activityParis 2024 Olympic Geographies - an activity
Paris 2024 Olympic Geographies - an activity
 
Alper Gobel In Media Res Media Component
Alper Gobel In Media Res Media ComponentAlper Gobel In Media Res Media Component
Alper Gobel In Media Res Media Component
 
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptxSOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
SOCIAL AND HISTORICAL CONTEXT - LFTVD.pptx
 
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdfEnzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
Enzyme, Pharmaceutical Aids, Miscellaneous Last Part of Chapter no 5th.pdf
 
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝Model Call Girl in Bikash Puri  Delhi reach out to us at 🔝9953056974🔝
Model Call Girl in Bikash Puri Delhi reach out to us at 🔝9953056974🔝
 
9953330565 Low Rate Call Girls In Rohini Delhi NCR
9953330565 Low Rate Call Girls In Rohini  Delhi NCR9953330565 Low Rate Call Girls In Rohini  Delhi NCR
9953330565 Low Rate Call Girls In Rohini Delhi NCR
 
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptxHistory Class XII Ch. 3 Kinship, Caste and Class (1).pptx
History Class XII Ch. 3 Kinship, Caste and Class (1).pptx
 

Ijetcas14 325

  • 1. International Association of Scientific Innovation and Research (IASIR) (An Association Unifying the Sciences, Engineering, and Applied Research) International Journal of Emerging Technologies in Computational and Applied Sciences (IJETCAS) www.iasir.net IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 99 ISSN (Print): 2279-0047 ISSN (Online): 2279-0055 SPGUP – Sequence analysis, Phylogenetic trees, Genome Diagrams Using Python - a Phylogenetic Tool 1 Prof.Dr. P. K. Srimani and 2 Mrs. Kumudavalli M.V 1 Director, R&D Division, B.U., DSI, Bangalore, Karnataka, INDIA 2 Dperatment of MCA (BU), DSI, Bangalore, Karnataka, INDIA and Research Scholar, SCSVMV University, Kanchipuram, Tamilnadu, INDIA _______________________________________________________________ Abstract: SPGUP is a software tool and a free GNU/Linux distribution developed by the authors that has the unique feature of providing a set of Phyloinformatic tools for retrieving data from NCBI, to perform Sequence analysis, Phylogenetic tree construction and to build the Genome diagrams. The operating system incorporates most commonly used Phylogenetic software, which has been pre-compiled and pre-configured, allowing the user for direct application of Phylogenetic methods in a stable Linux environment. Keywords: Sequence, Phylogenetics, Phyloinformatic, operating system __________________________________________________________________________________________ I. INTRODUCTION Phylogenetic methods are having exponential growth nearly in all fields of biology. As the biological data accumulation rate is increasing, the size of Phylogenetic analyses and the scope of inferences for which they are utilized have both increased enormously. To this end a diverse array of methods that have been developed by the researchers. But the availability of these methods, and the sheer number of software packages which are now regularly employed in Phylogenetic research force academicians, biologists and system administrators to spend huge amounts of time installing, configuring and maintaining them, rather than focusing on research. Phyloinformatic research, in particular, depends largely on processing capacity and highly parallel analyses that are distributed across many computers and processors. As Phylogenetic analysis has become more complex and advanced, careful calibration of operating systems across computers also becomes more complex. While this increasing need of computational power gives researchers the ability to be more creative in assaying complex and time-consuming analyses, installation and configuration of various software tools for these machines becomes highly repetitive, the process will be error-prone and time consuming. This situation or problem can be easily handled with development of a tailor made operating systems for each specific purposes of Phylogenetic analysis. Moreover, in recent days the Phylogenetic methods are experiencing rapid development; hence there is a need for a distribution specifically focusing on this area. SPGUP is one such software tool and an operating system by itself developed using Python ([1], [2]) which heads towards filling this need. II. RELATED WORK A thorough survey of the literature pertaining to the subject reveals that very sparse literature is available in this direction. Some recent works include ([1], [2], [3], [4], [5], [6], [7] and [8]). Absolutely no work is available with regard to the present work. Hence, the present innovative work is carried out. III. NEED AND IMPORTANCE OF THE PROBLEM As discussed earlier the Phylogenetic analysis has a major application in the field of Biology. Since there are very few operating systems/software packages [3] available in this direction which are aimed at generalizing the methods, and thus do not include many tools or packages essential for Phylogenetic analysis alone. SPGUP is a tool which incorporates methods which are essential for performing basic studies pertaining to sequences and Phylogenetic trees in addition it concentrates on Genome diagrams. A comparative analysis of SPGUP with other available Phylogenetic softwares is essential to ensure and highlight the easy and speedy approach used to obtain various results using SPGUP as compared to other tools. Since not much work in this area has been done, the present investigation is carried out to throw light on the qualitative aspects of the tool. IV. DATA SET DESCRIPTION Basically, there are two ways of data selection viz., general and through SPGUP. i. Data set selection (General Method) A set of 23 sequences belonging to plants kingdom which contain the protein coded gene ‘chalcone synthase’ was being selected from NCBI’s Nucleotide Database using BLAST as in Table1. The set of sequences under
  • 2. P. K. Srimani et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(1), March-May, 2014, pp. 99-103 IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 100 study was used in CLUSTALW ([4], [5]) to get the resulting tree. Later the tree file was used in Tree View to get the Phylogram and Rectangular cladogram. ii. Data set selection ( Using SPGUP ) The BLAST option in SPGUP tool is used to get set of (23) sequences based on the key-id of the primary sequence having ‘chalcone synthase’. Later it was used in ALIGNMNET option to produce ‘.dnd’ file which is later used to construct various forms of Phylogenetic trees. Table I: Contains the sequence details Sl.No LOCUS/ ACCESSION ORGANISM 1 JN830647.1 Pyrus pyrifolia 2 HQ853494.1 Malus toringoides 3 DQ286037.1 Sorbus aucuparia 4 AF400567.1 Rubus idaeus 5 HQ423171.1 chalcone synthase 6 AB201756.1 Fragaria x ananassa 7 JQ247184.1 Camellia sinensis 8 AM263200.1 Humulus lupulus 9 AJ413277.1 Rhododendron simsii 10 X94706.1 Juglans nigra 11 JN654702.1 Vaccinium corymbosum 12 JQ627646.1 Lonicera japonica 13 AB009350.1 Citrus sinensis 14 JF795272.1 Gossypium hirsutum 15 HQ127337.1 Phlox drummondii 16 EU430077.1 Senna tora 17 AY237728.1 Glycine max 18 L24517.1 Trifolium subterraneum 19 FJ705842.1 Capsicum annuum 20 AY170347.1 Arachis hypogaea 21 DQ208973.1 Cardamine maritime 22 AF112108.1 Barbarea vulgaris 23 AF144530.1 Rorippa amphibian V. METHODOLOGY The traditional methods which use the existing software tools require the following steps for Phylogenetic tree construction: Step1: The initial sequence having ‘chalcone synthase’ is being selected form NCBI’s data base, it is then used in BLAST program of NCBI to get the set of similar sequences [4]. Step2: The set of sequences under study is pasted onto the CLUSTALW tool window for alignment. The tool also gives the Tree output in NEWICK format, and as a Phylogram. Step3: The tree file in NEWICK format is later used in TREE VIEW software tool, which is to be pre- configured before use to get various types of pictorial representations of the tree. This approach does not predict anything about the different tree formats viz., ASCII tree, Unrooted and Neto format tree, but in case where SPGUP tool is used, exhaustive results pertaining to the study are obtained with minimum effort and maximum efficiency. These are illustrated in the forthcoming sections. However, in the present case only the following steps are needed for the results pertaining to Phylogenetic trees construction. Pictorial representation of the workflow of SPGUP Key sequence id is entered into the BLAST option screen of SPGUP A set of sequences of interest are selected from BLAST output and is used in ALIGN option to get tree file in NEWICK format('.dnd 'file) Phylogenetic tree is constructed using PHYLOGENETIC TREE option It produces 5 types of trees viz, ASCII (plain text) tree, Rooted tree, Rooted with colour, Unrooted tree and Neto tree.
  • 3. P. K. Srimani et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(1), March-May, 2014, pp. 99-103 IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 101 VI. EXPERIMENTS AND RESULTS In this section, the results of the present investigation using both the traditional method (Figures 1, 2 and 3) and the SPGUP tool (Figures 4, 5, 6, 7 and 8) are presented. The experiments are conducted as per the steps mentioned in section V. Figure1 shows the Phylogenetic tree as a cladogram which is obtained by using the CLUSTALW tool. A cladogram is a simple tree depicting only relationships between terminal nodes. A cladogram can also show inferred character changes [6]. Fig. 1 Cladogram using CLUSTALW The aligned sequences of the data set are further used in TREE VIEW tool which is exclusively used for tree drawing. It takes the input file in NEWICK format. Figures 2 and 3 represent the Phylogenetic tree in a Phylogram and Rectangular cladogram format. Fig.2 Phylogram Using TREE VIEW Fig. 3 Rectangular Cladogram using TREE VIEW
  • 4. P. K. Srimani et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(1), March-May, 2014, pp. 99-103 IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 102 Following are the results obtained from SPGUP tool. The SPGUP tool allows the user to construct various forms of Phylogenetic tree with minimum effort. Figure4 represents a basic Phylogram. A Phylogram or additive tree which has additional information, in the sense that edge lengths are drawn proportional to some attributes. The evolutionary tree for the group of 23 sequences under study is as below Fig. 4 ASCII tree (Phylogram) using SPGUP Fig.5 Rooted tree (Phylogram) with color using SPGUP A Phylogram can be rooted/unrooted , Figure 5 presents the Phylogram which is a rooted tree drawn with colored edges representing the node to which they belong. Figures 6 and 7 depict the rooted and unrooted trees respectively in the radial format [7] using SPGUP tool. Fig. 6 Rooted tree using SPGUP Fig. 7 Unrooted tree using SPGUP Figure 8 is a special format of Phylogenetic tree generally drawn for a very large data set. Fig. 8 Neto format tree (for very large data set) using SPGUP
  • 5. P. K. Srimani et al., International Journal of Emerging Technologies in Computational and Applied Sciences, 8(1), March-May, 2014, pp. 99-103 IJETCAS 14-325; © 2014, IJETCAS All Rights Reserved Page 103 VII. CONCLUSION The present investigation with the dataset of 23 sequences was carried out to throw some light on the existing traditional methods/software tools which require the knowledge of many tools with their setup and usage as compared to the tool SPGUP developed by the authors. In the traditional methods, the user has to follow a lot of steps in order to construct the Phylogenetic trees i.e. use of BLAST for finding the similar sequences, followed by CLUSTALW tool for aligning the sequences, in order to get the Newick format tree. Later the tree file in ‘.dnd’ form is used in another tool called TREE VIEW for getting different forms of Phylogenetic trees. These methods take a huge computational time with dependencies on various tools, which is of concern to the user. Compared to traditional methods SPGUP is a Phyloinformatic tool developed by the authors and is a pipeline of tools required for the tree construction, which has a GUI based approach. The special features of SPGUP tool are: (i) Different forms of trees can be constructed for the same data set. (ii) Less number of steps is involved in the computational process. (iii) It can be used for both Nucleotide (DNA, RNA) and Protein sequences. (iv) In order to increase the scope and applications an integrated approach is employed in the design of SPGUP, comprising of Python, Networkx, Graphviz, etc. (v) The SPGUP tool is first of its kind and has lot of scope for further research. These special features have made it very user friendly for both the biologists and computer/IT professionals. The versatility of this tool is that, it has better knowledge acquisition, assimilation and dissemination. ACKNOWLEDGEMENTS One of the authors Mrs. Kumudavalli M.V acknowledges Dayananda Sagar Institutions, Bangalore, Karnataka and SCSVMV University, Kanchipuram, Tamilnadu, India for providing the facilities for carrying out the research work. REFERENCES [1] http://www.python.org [2] http://www.open-bio.org [3] Stephen A. Smith1,* and Casey W. Dunn2, “Phyutility: a phyloinformatics tool for trees, alignments and molecular data”, BIOINFORMATICS, APPLICATIONS NOTE Vol. 24 no. 5 2008, pages 715–716 doi:10.1093/bioinformatics/btm619 [4] S. B. Needleman and C. D. Wunsch. “A general method applicable to the search for similarities in the amino acid sequence of two proteins”. Journal of Molecular Biology, 48:443-453, 1970. [5] Elodine Nedelec,1 Thomas Moncion,2,3 Elisabeth Gassiat,1 Bruno Bossard,2,4 Guillemette Duchateau-Nguyen,5 Alain Denise,2,5 and Michel Termier5 . “A Pair-wise Alignment Algorithm Which Favors Clusters of Blocks”. Journal of Computational Biology, 12, Number 1, 33-47, 2005. [6] Thompson J.D., Higgins, D.G., and Gibson, T.J. “Clustalw; improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice”. Nucl. Acids Res. 22, 4673- 4680, 1994. [7] Srimani P. K, Kumudavalli M.V., “A Computational Analysis of the Phylogenetic Trees of Some Eukaryotes Sequences,” International Journal of Current Research, Vol. 4,Issue, 05, pp. 206-210, May, 2012. [8] Valdimir Makarenkov1 and Bruno Leclerc2 . “Comparison of Additive Tree using Circular Orders”. Journal of Computational Biology. 7, 731-744, 2005.