GTfold is a scalable multicore program for RNA secondary structure prediction that can now fold real RNA sequences containing unidentified bases (N) and toggle between different energy models (Turner 1999 and Turner 2004). Experimental results show that GTfold is faster than other programs like UNAfold and more accurate in minimum free energy predictions. The addition of an internal loop speedup algorithm improves GTfold's performance significantly for larger sequences, though at the cost of longer running times.
Ekeeda Provides Online Civil Engineering Degree Subjects Courses, Video Lectures for All Engineering Universities. Video Tutorials Covers Subjects of Mechanical Engineering Degree. Visit us: https://ekeeda.com/
Power System Simulation Laboratory Manual Santhosh Kumar
Date:-(13-07-2016)
Hii friends
I Have Attached Our Power System Simulation Laboratory Manual Here for your Reference
Kindly download the Manual and Start Writing the Observation Note By Mr.G.Shivaraj-AP/EEE
Please follow it friends✌
With Happy,
Šαηтн๑zzζzz
Ekeeda Provides Online Civil Engineering Degree Subjects Courses, Video Lectures for All Engineering Universities. Video Tutorials Covers Subjects of Mechanical Engineering Degree. Visit us: https://ekeeda.com/
Power System Simulation Laboratory Manual Santhosh Kumar
Date:-(13-07-2016)
Hii friends
I Have Attached Our Power System Simulation Laboratory Manual Here for your Reference
Kindly download the Manual and Start Writing the Observation Note By Mr.G.Shivaraj-AP/EEE
Please follow it friends✌
With Happy,
Šαηтн๑zzζzz
The transmission overhead line is one of the vital elements in the power system for transmitting the electrical energy. In the transmission, the disturbances are often occurred. In the conventional algorithm, alpha and beta (mode) currents generated by Clarke’s transformation are utilized to convert the signal of Discrete Wavelet Transform (DWT) to obtain the Wavelet Transform Coefficient (WTC) and the Wavelet Coefficient Energy (WCE). This study introduces a new algorithm, called Modified Clarke for fault detection and classification using DWT and Back-Propagation Neural Network (BPNN) based on Clarke’s transformation on transmission overhead line by adding gamma current in the system. Daubechies4 (Db4) is used as a mother wavelet to decompose the high frequency components of the signal error. Simulation is performed using PSCAD / EMTDC transmission system modeling and carried out at different locations along the transmission line with different types of fault, fault resistances, fault locations and fault of the initial angle on a given power system model. The simulated fault types are in the study are the Single Line to Ground, the Line To Line, the Double Line to Ground and the Three Phases. There are four statistic methods utilized in the present study to determine the accuracy of detection and classification of faults. The result shows that the best and the worst structures of BPNN occurred on the configuration of 12-24-48-4 and 12-12-6-4, respectively. For instance, the error using Mean Square Error Method. The Error Of Clarke’s, Without Clarke’s and Modified Clarke’s are 0.05862, 0.05513 and 0.03721 which are the best, respectively, whereas, the worst are 0.06387, 0.0753 and 0.052, respectively. This indicates that the Modified Clarke’s result is in the lowest error. The method is successfully implement can be utilized in the detection and classification of fault in transmission line by utilities and power regulation in power system planning and operation.
INTRODUCTION BASIC TECHNIQUES TYPE OF BUSES
Y BUS MATRIX POWER SYSTEM COMPONENTS BUS ADMITTANCE MATRIX
Power (Load) flow study is the analysis of a power system in normal steady-state operation
This study will determine:
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Fault detection in power transformers using random neural networksIJECEIAES
This paper discuss the application of artificial neural network-based algorithms to identify different types of faults in a power transformer, particularly using DGA (Dissolved Gas Analysis) test. The analysis of Random Neural Network (RNN) using Levenberg-Marquardt (LM) and Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithms has been done using the data of dissolved gases of power transformers collected from Punjab State Transmission Corporation Ltd.(PSTCL), Ludhiana, India. Sorting of the preprocessed data have been done using dimensionality reduction technique, i.e., principal component analysis. The sorted data is used as inputs to the Random Neural Networks (RNN) classifier. It has been seen from the results obtained that BFGS has better performance for the diagnosis of fault in transformer as compared to LM.
Optimum Network Reconfiguration using Grey Wolf OptimizerTELKOMNIKA JOURNAL
Distribution system Reconfiguration is the process of changing the topology of the distribution
network by opening and closing switches to satisfy a specific objective. It is a complex, combinatorial
optimization problem involving a nonlinear objective function and constraints. Grey Wolf Optimizer (GWO)
is a recently developed metaheuristic search algorithm inspired by the leadership hierarchy and hunting
strategy of grey wolves in nature. The objective of this paper is to determine an optimal network
reconfiguration that presents the minimum power losses, considering network constraints, and using GWO
algorithm. The proposed algorithm was tested using some standard networks (33 bus, 69 bus, 84 bus and
118 bus), and the obtained results reveal the efficiency and effectiveness of the proposed approach.
The transmission overhead line is one of the vital elements in the power system for transmitting the electrical energy. In the transmission, the disturbances are often occurred. In the conventional algorithm, alpha and beta (mode) currents generated by Clarke’s transformation are utilized to convert the signal of Discrete Wavelet Transform (DWT) to obtain the Wavelet Transform Coefficient (WTC) and the Wavelet Coefficient Energy (WCE). This study introduces a new algorithm, called Modified Clarke for fault detection and classification using DWT and Back-Propagation Neural Network (BPNN) based on Clarke’s transformation on transmission overhead line by adding gamma current in the system. Daubechies4 (Db4) is used as a mother wavelet to decompose the high frequency components of the signal error. Simulation is performed using PSCAD / EMTDC transmission system modeling and carried out at different locations along the transmission line with different types of fault, fault resistances, fault locations and fault of the initial angle on a given power system model. The simulated fault types are in the study are the Single Line to Ground, the Line To Line, the Double Line to Ground and the Three Phases. There are four statistic methods utilized in the present study to determine the accuracy of detection and classification of faults. The result shows that the best and the worst structures of BPNN occurred on the configuration of 12-24-48-4 and 12-12-6-4, respectively. For instance, the error using Mean Square Error Method. The Error Of Clarke’s, Without Clarke’s and Modified Clarke’s are 0.05862, 0.05513 and 0.03721 which are the best, respectively, whereas, the worst are 0.06387, 0.0753 and 0.052, respectively. This indicates that the Modified Clarke’s result is in the lowest error. The method is successfully implement can be utilized in the detection and classification of fault in transmission line by utilities and power regulation in power system planning and operation.
INTRODUCTION BASIC TECHNIQUES TYPE OF BUSES
Y BUS MATRIX POWER SYSTEM COMPONENTS BUS ADMITTANCE MATRIX
Power (Load) flow study is the analysis of a power system in normal steady-state operation
This study will determine:
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
Fault detection in power transformers using random neural networksIJECEIAES
This paper discuss the application of artificial neural network-based algorithms to identify different types of faults in a power transformer, particularly using DGA (Dissolved Gas Analysis) test. The analysis of Random Neural Network (RNN) using Levenberg-Marquardt (LM) and Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithms has been done using the data of dissolved gases of power transformers collected from Punjab State Transmission Corporation Ltd.(PSTCL), Ludhiana, India. Sorting of the preprocessed data have been done using dimensionality reduction technique, i.e., principal component analysis. The sorted data is used as inputs to the Random Neural Networks (RNN) classifier. It has been seen from the results obtained that BFGS has better performance for the diagnosis of fault in transformer as compared to LM.
Optimum Network Reconfiguration using Grey Wolf OptimizerTELKOMNIKA JOURNAL
Distribution system Reconfiguration is the process of changing the topology of the distribution
network by opening and closing switches to satisfy a specific objective. It is a complex, combinatorial
optimization problem involving a nonlinear objective function and constraints. Grey Wolf Optimizer (GWO)
is a recently developed metaheuristic search algorithm inspired by the leadership hierarchy and hunting
strategy of grey wolves in nature. The objective of this paper is to determine an optimal network
reconfiguration that presents the minimum power losses, considering network constraints, and using GWO
algorithm. The proposed algorithm was tested using some standard networks (33 bus, 69 bus, 84 bus and
118 bus), and the obtained results reveal the efficiency and effectiveness of the proposed approach.
Fractal representation of the power demand based on topological properties of...IJECEIAES
In a power system, the load demand considers two components such as the real power (P) because of resistive elements, and the reactive power (Q) because inductive or capacitive elements. This paper presents a graphical representation of the electric power demand based on the topological properties of the Julia Sets, with the purpose of observing the different graphic patterns and relationship with the hourly load consumptions. An algorithm that iterates complex numbers related to power is used to represent each fractal diagram of the load demand. The results show some representative patterns related to each value of the power consumption and similar behaviour in the fractal diagrams, which allows to understand consumption behaviours from the different hours of the day. This study allows to make a relation among the different consumptions of the day to create relationships that lead to the prediction of different behaviour patterns of the curves.
PaperLoad following in a deregulated power system with Thyristor Controlled S...rajeshja
Load following is considered to be an ancillary service in a deregulated power system. This paper investigates
the effect of a Thyristor Controlled Series Compensator (TCSC) for load following in a deregulated
two area interconnected thermal system with two GENCOs and two DISCOs in either areas. Optimal
gain settings of the integral controllers in the control areas are obtained using Genetic Algorithm by
minimizing a quadratic performance index. Simulation studies carried out in MATLAB validates that a
Thyristor Controlled Series Compensator in series with tie-line can effectively improve the load following
performance of the power system in a deregulated environment.
International Journal of Engineering Research and Applications (IJERA) is an open access online peer reviewed international journal that publishes research and review articles in the fields of Computer Science, Neural Networks, Electrical Engineering, Software Engineering, Information Technology, Mechanical Engineering, Chemical Engineering, Plastic Engineering, Food Technology, Textile Engineering, Nano Technology & science, Power Electronics, Electronics & Communication Engineering, Computational mathematics, Image processing, Civil Engineering, Structural Engineering, Environmental Engineering, VLSI Testing & Low Power VLSI Design etc.
COMPARING OF SWITCHING FREQUENCY ON VECTOR CONTROLLED ASYNCHRONOUS MOTORijscai
Nowadays, asynchronous motors have wide range use in many industrial applications. Field oriented
control (FOC) and direct torque control (DTC) are commonly used methods in high performance vector
control for asynchronous motors. Therefore, it is very important to identify clearly advantages and
disadvantages of both systems in the selection of appropriate control methods for many industrial
applications. This paper aims to present a new and different perspective regarding the comparison of the
switching behaviours on the FOC and the DTC drivers. For this purpose, the experimental studies have
been carried out to compare the inverter switching frequencies and torque responses of the asynchronous
motor in the FOC and the DTC systems under different working conditions. The dSPACE 1103 controller
board was programmed with Matlab/Simulink software. As expected, the experimental studies showed that
the FOC controlled motors has a lessened torque ripple. On the other hand, the FOC controlled motor
switching frequency
Finite Element Method for Designing and Analysis of the Transformer – A Retro...idescitation
Finite Element Analysis (FEA) using Finite Element Method (FEM) was
developed over 70 years to solve the complex elasticity and structural analysis problem in
civil and aeronautical engineering. Application of FEA is being expanded to simulation in
electrical engineering also to solve the complex design problems. The circuit theory models
for designing transformers are not much accurate in determining the transformer
parameters such as winding impedance, leakage inductance, hot spot temperature etc. The
physical realization of these parameters is needed on a prototype unit. The finite element
method can play a vital role in deriving these parameters without any physical verification.
An effort has been made in this paper to show the effectiveness of finite element method in
determining the above said parameters while designing the transformers - both oil cooled as
well as dry type - for power and distribution sectors as well as to analyze and detect the
internal faults in the transformer.
The traction inverter is a crucial power device in the electric vehicle’s powertrain, and its failure is intolerable as it would considerably compromise the system’s safety. For more reliable driving, installing a traction inverter that is sufficiently resistant to electrical failure is inherent. Due to its compact size and the small number of switches incorporated in three-phase four-switch inverter, this modular topology was used to compensate for the open switch’s failure. However, it is known to have manifold weaknesses mainly distinguished in the low-frequency region. This paper introduces a new fault-tolerant indirect control that handles the IGBT’s failure constituting the traction inverter. The fault compensator is designed first based on the Proportional Integral regulator combined with the notch filter to mitigate the current imbalance and restore the DC voltage equilibrium.Furthermore, to conceive a comprehensive fault-tolerant control, there must therefore contain an accurate fault detector. In this regard, an uncomplicated fault diagnosis method based on the current spectral analysis has been performed. The effectiveness of the submitted controller was validated by simulation using Matlab.
Modeling Under MATLAB by ANFIS of Three-Phase Tetrahedral Transformer Using i...TELKOMNIKA JOURNAL
This work deals with the modeling of a new three-phase tetrahedral transformer of HV power
supply, which feeds three magnetrons per phase. The design of this new power supply is composed of
three single-phase with magnetic shunt transformers coupling in star; each one is size to feed voltagedoubling
cells, thereby feeds a magnetron. In order to validate the functionality of this power supply, we
simulate it under Matlab-Simulink environment. Thus, we modeled nonlinear inductance using a new
approach of neuro-fuzzy (ANFIS); this method based on the interpolation of the curve B(H) of
ferromagnetic material, the results obtained gives forms of both voltages and currents, which shows that
they are in accordance with those of experimental tests, respecting the conditions recommended by the
magnetron manufacturer
This paper deals with subsynchronous resonance (SSR) phenomena in a capacitive series-compensated DFIG-based wind farm. Using both modal analysis and time-domain simulation, it is shown that the DFIG wind farm is potentially unstable due to the SSR mode. In order to damp the SSR, the rotor-side converter (RSC) and grid-side converter (GSC) controllers of the DFIG are utilized. The objective is to design a simple proportional SSR damping controller (SSRDC) by properly choosing an optimum input control signal (ICS) to the SSRDC block, so that the SSR mode becomes stable without decreasing or destabilizing the other system modes. Moreover, an optimum point within the RSC and GSC controllers to insert the SSRDC is identified. Three different signals are tested as potential ICSs including rotor speed, line real power, and voltage across the series capacitor, and an optimum ICS is identified using residue-based analysis and root-locus method. Moreover, two methods are discussed in order to estimate the optimum ICS, without measuring it directly. The studied power system is a 100 MW DFIG-based wind farm connected to a series-compensated line whose parameters are taken from the IEEE first benchmark model (FBM) for computer simulation of the SSR. MATLAB/Simulink is used as a tool for modeling and designing the SSRDC, and power system computer aided design/electromagnetic transients including dc (PSCAD/EMTDC) is used to perform time-domain simulation for design process validation.
Convergence analysis of the triangular-based power flow method for AC distribu...IJECEIAES
This paper addresses the convergence analysis of the triangular-based power flow (PF) method in alternating current radial distribution networks. The PF formulation is made via upper-triangular matrices, which enables finding a general iterative PF formula that does not require admittance matrix calculations. The convergence analysis of this iterative formula is carried out by applying the Banach fixed-point theorem (BFPT), which allows demonstrating that under an adequate voltage profile the triangular-based PF always converges. Numerical validations are made, on the well-known 33 and 69 distribution networks test systems. Gauss-seidel, newton-raphson, and backward/forward PF methods are considered for the sake of comparison. All the simulations are carried out in MATLAB software.
Enhancement of the direct power control applied to DFIG-WECS IJECEIAES
This work is dedicated to the study of an improved direct control of powers of the doubly fed induction generator (DFIG) incorporated in a wind energy conversion system 'WECS'. The control method adopts direct power control 'DPC' because of its various advantages like the ease of implementation which allows decoupled regulation for active and reactive powers, as well as a good performance at transient and steady state without PI regulators and rotating coordinate transformations. To do this, the modeling of the turbine and generator is performed. Therefore, the Maximum Power Point Tracking (MPPT) technology is implemented to extract optimal power at variable wind speed conditions. Subsequently, an explanation of the said command is spread out as well as the principle of adjusting the active and reactive power according to the desired speed. Then, the estimation method of these two control variables will be presented as well as the adopted switching table of the hysteresis controller model used based on the model of the multilevel inverters. Finally, the robustness of the developed system will be analyzed with validation in Matlab/Simulink environment to illustrate the performance of this command.
Bank of Extended Kalman Filters for Faults Diagnosis in Wind Turbine Doubly F...TELKOMNIKA JOURNAL
In order to increase the efficiency, to ensure availability and to prevent unexpected failures of the doubly fed induction generator (DFIG), widely used in speed variable wind turbine (SVWT), a model based approach is proposed for diagnosing stator and rotor winding and current sensors faults in the generator. In this study, the Extended Kalman Filter (EKF) is used as state and parameter estimation method for this model based diagnosis approach. The generator windings faults and current instruments defects are modelled, detected and isolated with the use of the faults indicators called residuals, which are obtained based on the EKF observer. The mathematical model of DFIG for both healthy and faulty operating conditions is implemented in Matlab/Simulink software. The obtained simulation results demonstrate the effectiveness of the proposed technique for diagnosis and quantification of the faults under study.
Fault Ride-Through capability of DSTATCOM for Distributed Wind Generation SystemIJPEDS-IAES
In this paper, fault ride through analysis of a low voltage distribution system
augmented with distributed wind generation using squirrel cage induction
generator and distribution static compensator (DSTATCOM) is carried out
through modeling and simulation study in MATLAB. The impact of
unbalanced (single line to ground) fault in a low voltage distribution system
in normal and severe conditions is studied and analyzed in details. Analysis
on system instability is also shown in case of sever fault condition. A
distribution Static Compensator (DSTATCOM) is used to improve fault ride
through (FRT) capability of wind generation system by compensating
positive sequence voltage. A comparison of dynamic response of the system
with and without DSTATCOM and effects of DSTATCOM on voltage and
generator speed are presented. The simulation results shows that
DSTATCOM is capable of reducing the voltage dips and improving the
voltage profiles by providing reactive power support to distributed wind
generation system under unbalanced fault condition and enhances the fault
ride through capability of the wind generator.
Fault Ride-Through capability of DSTATCOM for Distributed Wind Generation System
Tech-Report
1. GTfold : A Scalable Multicore Code for RNA
Secondary Structure Prediction
Neha Jatav
Department of Computer Science and Engineering
Indian Institute of Technology, Bombay, Powai, Mumbai, India 400076
Email: nehajatav@cse.iitb.ac.in
Project Guide:
Dr. David Bader
College of Computing
Georgia Institute of Technology, Atlanta, GA,USA
Email: bader@cc.gatech.edu
Abstract—Accurate prediction of RNA secondary structure
from the RNA base sequence is an unsolved computational
challenge. The accuracy of predictions made by free energy
minimization is limited by the quality of the energy parameters
in the underlying free energy model. The energy model that
GTfold and the de facto standard programs have been using
is Turner99, the set of nearest neighbor parameters for RNA
folding compiled by the Turner group in 1999. However, there is
a new set of thermodynamic values Turner 2004 compiled by the
Turner group in 2004. Also, using real sequences directly with
GTfold and other RNA folding programs posed a problem as
real sequences contain unspecified bases.
In this project, a user enhanced option of toggling the different
energy models has been added to GTfold. GTfold can now fold
real RNA sequences containing unidentified base N.
I. INTRODUCTION
GTfold is a fast, scalable multi-core code for predicting
RNA secondary (Article (Mathuriya, Bader, Heitsch, & Har-
vey, 2009)). RNA molecules perform a variety of different
biological functions including the role of small RNAs (with
tens or a few hundred of nucleotides) in gene splicing, editing,
and regulation. At the other end of the size spectrum, the
genomes of numerous viruses are lengthy single-stranded
RNA sequences with many thousands of nucleotides. These
single-stranded RNA sequences base pair to form molecular
structures, and the secondary structure of viruses like dengue
[3], ebola [16], and HIV [17] is known to have functional
significance. Thus, disrupting functionally significant base
pairings in RNA viral genomes is one potential method for
treating or preventing the many RNA-related diseases.
According to the thermodynamic hypothesis, the structure
having the minimum free energy (MFE) is predicted as the
secondary structure of the molecule. The free energy of a
secondary structure is the independent sum of the free energies
of distinct substructures called loops. The optimization is
performed using the dynamic programming algorithm given
by Zuker and Stiegler in 1981 [21] which is similar to the
algorithm for sequence alignment but far more complex. The
algorithm explores all the possibilities when computing the
MFE structure. There are heuristics and approximations which
have been applied to satisfy the computational requirements
in the existing folding programs. The free energies of differ-
ent loops are evaluated using thermodynamic model of free
energy.
II. RELATED WORK
The Vienna RNA, developed by the Theoretical Biochem-
istry Group and has an option of implementing their program
on a different thermodynamic model, called the Andronescu
model (Andronescu, Condon, Hoos, Mathews, & Murphy,
2007) which gives a constraint generation (CG), the first com-
putational approach to RNA free energy parameter estimation
that can be efficiently trained on large sets of structural as well
as thermodynamic data. The CG approach employs a novel
iterative scheme, whereby the energy values are first computed
as the solution to a constrained optimization problem. Then
the newly computed energy parameters are used to update
the constraints on the optimization function, so as to better
optimize the energy parameters in the next iteration. Using
this method on biologically sound data, revised parameters
can be obtained for the Turner99 energy model which provides
significant improvements in prediction accuracy over current
state of-the-art methods.
In Mfold web server developed by Michael Zuker (Article
(Zuker, 2003)), for a sequence entered into the sequence text
area box all characters except for AZ and az are removed.
Lower case characters are converted to upper case. For RNA
folding, T or t are converted to U.In addition, the letters W,
X, Y and Z also refer to A, C, G and U/T, respectively. These
nucleotides, if they pair, should do so only at the end of a helix.
Thus, the mfold web server does not support the IUPAC (In-
ternational Union of Pure and Applied Chemistry) ambiguous
DNA character convention (Cornish-Bowden, 1985).
2. III. RESEARCH CONTRIBUTIONS
A. Toggling of Thermodynamic values
RNA molecules are made up of A, C, G, and U, nucleotides
which can pair up according to the rules in (A,U), (U,A),
(G,C), (C,G), (G,U), (U,G). Nested base pairings result into
2D structures called secondary structures. There are 3D inter-
actions among the elements of the secondary structures which
result into 3D structures called tertiary structures. Pairings
among bases form various kinds of loops, which can be
classified based on the number of branches present in them.
Nearest neighbor thermodynamic model (NNTM) provides
a set of functions and sequence dependent parameters to
calculate the energy of various kinds of loops. The free energy
of a secondary structure is calculated by adding up the energy
of all loops and stacking present in the structure. There are
two existing thermodynamic models compiled by the Turner
group in 1999 and 2004, known as the Turner 99 model and
Turner 2004 model respectively. The energy parameters can be
toggled between any of these two models and the free energy
can be calculated according to that model.
B. Unidentified base N
The letter N should be used for an unspecified base. It
is not allowed to pair. It is very common in the real RNA
sequences. The real RNA sequences can be processed by
putting constraints on these bases. The base N is prohibited
from pairing and hence finally the RNA sequence is folded
such that none of these unidentified bases are paired.
C. Constraints folding
GTfold allows the optional incorporation of folding con-
straints. Each constraint consists of a single line in the con-
straint file that must conform to a rigid format. The various
types of constraints are itemized below. Multiple constraints
of any form are allowed in any order.
• Force a specific base pair or helix to form. The command
F i j k (1)
will force the formation of the helix (single base pair if
k=1) The triple (i, j, k) refers to k consecutive base pairs,
where rirj is the exterior closing base pair. If any of these
base pairs cannot exist, then an error will be generated
and the job will fail. The usual result is an output page
that declares Job aborted! No Structure!.
• Prohibit a specific base pair or helix from forming. The
command
P i j k (2)
will prohibit every single base pair of the form r[i+h]r[j-
h],(h varying from 0 to k), from occurring.
• Prohibit a string of consecutive bases from pairing. The
command
P i 0 k (3)
(the second to last character is zero) will prevent nu-
cleotides r[i], r[i+1], r[i+2],..., r[i+k-1] from pairing. This
TABLE I
MFES IN KCAL/MOL CALCULATED BY UNAFOLD, GTFOLD AND
RNAFOLD
Sequence Length UNAfold GTfold RNAfold
16S/X54252 698 -138 -143 -143
16S/X54253 702 -141 -149 -149
16S/X98467 1296 -460 -487 -489
16S/X65063 1433 -572 -584 -584
16S/Z17210 1436 -744 -762 -763
16S/X52949 1453 -795 -804 -805
16S/K00421 1475 -682 -687 -687
16S/Z17224 1551 -553 -568 -569
16S/X00794 1963 -723 -742 -747
TABLE II
MFE SCORES IN KCAL/MOL FOR SAME STRUCTURES ON GTFOLD AND
UNAFOLD
Sequence Length UNAfold GTfold
16S/K00421 1475 -680.5 -682.4
16S/X00794 1963 -723.1 -726.4
16S/X52949 1453 -794.6 -794.4
16S/X54252 698 -137.5 -138.7
16S/X54253 702 -141.3 -142.7
16S/X65063 1433 -571.6 -575.5
16S/X98467 1296 -460 -461.2
16S/Z17210 1436 -744 -748.3
16S/Z17224 1551 -552.6 -556.1
is a single base when k=1. Forcing too many bases to be
single stranded can generate a fatal error.
IV. EXPERIMENTAL RESULTS
A. Accuracy
The table I gives the Minimum Free Energies calculated by
the GTfold and other de facto standard programs for predicting
RNA secondary structures. In most of the cases, the Free
Energy calculated by GTfold is the minimum.
The table II shows the energy calculated by GTfold and
UNAfold for the same structure predicted by both. UNAfold
uses a different thermodynamic model as compared to that
used by GTfold. The differences lies in the calculation of the
multiloop energies and the external energies.
The table III shows the implementation of different free
thermodynamic parameters i.e. Turner 99 and Turner 04 using
GTfold.
TABLE III
MFE IN KCAL/MOL CALCULATED USING THE TURNER 99 AND TURNER
04 MODEL ON GTFOLD
Sequence Length Turner04 Turner99
16S/K00421 1475 -636.57 -687
16S/X00794 1963 -691.37 -747
16S/X52949 1453 -768.26 -805
16S/X54252 698 -121.46 -143
16S/X54253 702 -125.55 -149
16S/X65063 1433 -536.93 -584
16S/X98467 1296 -449.16 -489
16S/Z17210 1436 -724.21 -763
16S/Z17224 1551 -521.58 -569
3. TABLE IV
RUNNING TIMES IN SECONDS FOR UNAFOLD AND GTFOLD
Sequence Length UNAfold GTfold
16S/X54252 698 12 1
16S/X54253 702 10 1
16S/X98467 1296 23 4
16S/X65063 1433 25 4
16S/Z17210 1436 28 5
16S/X52949 1453 29 5
16S/K00421 1475 23 5
16S/Z17224 1551 34 6
16S/X00794 1963 72 9
TABLE V
RUNNING TIMES IN SECONDS FOR GTFOLD RUNNING WITH AND
WITHOUT ILSA
Sequence Length GTfold GTfoldwithILSA
16S/X54252 698 1 20
16S/X54253 702 1 21
16S/X98467 1296 4 127
16S/X65063 1433 4 164
16S/Z17210 1436 5 166
16S/X52949 1453 5 177
16S/K00421 1475 5 181
16S/Z17224 1551 6 213
16S/X00794 1963 9 440
B. Performance Timing
The table IV gives a comparison of the runtimes of GTfold
and UNAfold and it can be seen that GTfold is faster even for
larger RNA sequences.
The table V shows the running time comparison of GTfold
with and without using the Internal Loop Speed-up Algorithm.
V. CONCLUSION
GTfold can be used with both the free energy thermody-
namic models: Turner 99 as well as Turner 04 models. Users
have an option to work with either of the models. GTfold
allows the unidentified base ’N’ and hence can be used directly
with real sequences without any pre-processing or errors.
R´EF ´ERENCES
Andronescu, M., Condon, A., Hoos, H. H., Mathews, D. H.,
& Murphy, K. P. (2007). Efficient parameter estimation
for rna secondary structure prediction. Bioinformatics.
Cornish-Bowden. (1985). Nomenclature for incompletely
specified bases in nucleic acid sequences: recommen-
dations 1984. Nucleic Acids Research.
Mathuriya, A., Bader, D., Heitsch, C., & Harvey, S. (2009).
Gtfold: A scalable multicore code for rna secondary
structure prediction. 24th Annual ACM Symposium
on Applied Computing (SAC), Computational Sciences
Track, Honolulu, HI.
Zuker, M. (2003). Mfold web server for nucleic acid folding
and hybridization prediction. Nucleic Acids Research.