SlideShare a Scribd company logo
1 of 15
Download to read offline
WHITE PAPER
www.insybio.com
Efficient and accurate
analysis of non-coding
RNAs with InSyBio
ncRNASeq
By
InSyBio Ltd
Aigli Korfiati
Computer Engineer, MSc, PhD candidate
InSyBio Product Development Manager
September 2015
InSyBio ncRNASeq v1.0
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
INTRODUCTION
This document is intended for researchers (molecular biologists, bioinformaticians,
biostatisticians, and so on...) working in academia or biopharma, biotechnology and health
industries who want to analyze non coding RNAs, predict miRNAs and analyze and predict
miRNA target sites.
NCRNASEQ IS A NON-CODING RNA ANALYSIS TOOL FOR THE PREDICTION
AND ANALYSIS OF
 non-coding RNAs, and
 miRNA target genes
Non-coding RNA genes are RNA sequences transcribed from DNA, but not translated to
proteins. Their identification as well as the identification of the genes they regulate is a
promising research area.
InSyBio ncRNASeq enables users to analyze non-coding RNAs. Users can search and
analyze the RNA sequence of their interest. They can also analyze a full sequences
dataset derived from online available databases, experimental sequencing techniques or
computational in silico techniques.
With InSyBio ncRNASeq you can predict and analyze RNA genes and miRNA target genes
combining a variety of sequential, structural and functional information, and using a high
performance machine learning technique. The RNA analysis is conducted by the
calculation of the 58 most informative features described in the literature, and the
miRNA-miRNA targets analysis is conducted by the calculation of the 124 most
informative ones. InSyBio ncRNASeq also provides results storage in its knowledge base,
equipped with information retrieval tools, to allow users produce and extract their own
datasets.
INSYBIO NCRNASEQ KEY PROPERTIES:
 Non coding RNA analysis with the calculation of 58 sequential, thermodynamical
and structural properties of the RNA sequence and miRNA-target site analysis
with the calculation of 124 sequential, thermodynamical, structural and motif
properties of the miRNA:mRNA pair.
 MiRNAs and miRNA-target sites predictions with high accuracy.
 Integrated information on stem-loop and mature miRNAs.
 Association of proteins to miRNAs participating to their regulatory mechanism.
 Batch computations are allowed.
 No need for personal super computers to perform difficult computing tasks: they
are now executed in our cloud infrastructure with minimum burden on the user’s
pc.
WITH INSYBIO NCRNASEQ YOU CAN:
a) Calculate 58 RNA genes-related features
b) Predict miRNAs
c) Calculate 124 miRNA target features
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
d) Predict miRNA targets
e) Search stem-loop and mature miRNAs
f) View integrated information about the miRNA of your interest
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
INSYBIO NCRNASEQ SCENARIO: MIRNA PREDICTION
In this scenario you have a set of sequences and want to predict which of them are pre-
miRNAs and discriminate them from other RNA stem-loops. Suppose you have a dataset
with 30 known pre-miRNAs, 15 snoRNAs and 30 pseudo hairpins which look like pre-
miRNAs but come from coding regions.
STEPS
1. You create a text file and write the sequences in fasta format in a file editor.
Note: Both lines of the fasta format (sequence description and sequence) are obligatory.
If you don’t know the sequence description, write something like >sequence_x.
2. After creating and saving your file you upload it to ncRNASeq through Data Store.
You visit the Data Store dashboard and select the category ncRNA sequences.
3. The file type is ncRNA sequences by default. So you just write a file title that
describes your file and select the file prepared in step 1. When the file is selected,
you upload it by clicking the respective button.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
4. The file is being verified automatically. When the uploading and verification
processes end you click the List all ncRNA sequences button.
5. In the action selection dropdown list you select miRNA Prediction at ncRNASeq.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
6. You are redirected to the miRNA prediction page and you click the Start
Calculation button.
7. The sequences are processed one by one and you can see in the resulting table or
download in a tab-delimited file: their fasta format, their prediction (miRNA
(green) / pseudo miRNA (red) / other (blue)), their prediction score and 58
computed features. These features include sequential, thermodynamical and
structural properties of the RNA sequence. The prediction score shows the
strength/confidence of the prediction. Prediction scores close to 0 indicate weak
predictions, while prediction scores far from 0 strong.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
DISCUSSION
Experimenting with a dataset with 30 known pre-miRNAs, 15 snoRNAs and 30 pseudo
hairpins which look like pre-miRNAs but come from coding regions led to the results
summarized in table 1.
total miRNAs other pseudo miRNAs
pre-miRNAs
30 28 2
snoRNAs
15 13 2
pseudo hairpins from coding regions
30 30
Table 1: Use case results
From the total of 30 pre-miRNAs, 28 were indeed predicted as miRNAs, while 2 were
wrongly predicted as pseudo miRNAs. However, the prediction scores of the two falsely
predicted pre-miRNAs were -0.0537 and -0.0383, values very close to the decision
boundary of 0, indicating the ambiguity of their prediction.
From the total of 15 snoRNAs, 13 were indeed predicted as other, while 2 were predicted
as pseudo miRNAs. Their prediction scores were -1.087 and -1.021.
The 30 pseudo hairpins from coding regions were all predicted as pseudo miRNAs.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
INSYBIO NCRNASEQ SCENARIO: MIRNA TARGET PREDICTION
In this scenario you have a set of miRNAs and mRNAs and want to predict the mRNA
target sites of the miRNAs. InSyBio ncRNASeq takes as input miRNAs and potential
mRNA target sites (not full mRNAs) and predicts which of them share a targeting
relation.
Suppose you want to predict the target sites of all miRNAs for the human mRNA genes:
CASC2, FECH, GKAP1 and TPR.
PREPROCESSING STEPS
1. You firstly have to run a target site prediction program, like miRanda [1] giving it
as input a list of all human miRNAs from miRBase [2] and the sequences of the
mRNA genes.
2. Or, you can directly download from miRanda (http://www.microrna.org/) the
human target site predictions for the mRNA genes: CASC2, FECH, GKAP1 and TPR.
NCRNASEQ STEPS
3. You create two text files and write in a file editor the miRNA sequences in the one
and the predicted mRNA target sites in the other, both in fasta format. Note that
the first miRNA will be tested against the first target site, the second miRNA
against the second target site and so on.
Note: Both lines of the fasta format (sequence description and sequence) are obligatory.
As a target site sequence description you are suggested to write the mRNA gene symbol
and/or the transcript id and as a miRNA sequence description its name and/or its
accession id.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
4. After creating and saving your files you upload them to ncRNASeq through Data
Store. You visit the Data Store dashboard and select the category miRNA
sequences for the miRNAs and mRNA sequences for the mRNA target sites.
5. You select the corresponding file type (miRNA/mRNA sequences), write a file title that
describes your file and select the files prepared in step 1. When a file is selected, you
upload it by clicking the respective button.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
6. The file is being verified automatically. When the uploading and verification processes
ends for both files you click the List all miRNA/mRNA sequences button.
7. In the action selection dropdown list you select miRNA Target Prediction at ncRNASeq.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
8. You are redirected to the miRNA target prediction page where you also have to select an
mRNA/miRNA file. Then you click the Start Calculation button.
9. The sequences are processed pair by pair and you can see in the resulting table or
download in a tab-delimited file: the examined miRNA fasta format, the examined mRNA
target site fasta format, their prediction (Target (green) / no Target (red)), their
prediction score and 124 computed features. These features include sequential,
thermodynamical, structural and motif properties of the miRNA-target site pairs. The fifth
column named Protein links the given miRNA to an Interact protein, indicating that the
specific miRNA regulates the expression of the respective protein. The link is supported if
the miRNA:mRNA pair shares a targeting relation and the mRNA is related to an InSyBio
Interact protein. Clicking on the protein, you are redirected to the Interact tool which
presents all the existing information in the Interact database for the specific protein. The
prediction score shows the strength/confidence of the prediction. Prediction scores close
to 0 indicate weak predictions, while prediction scores far from 0 strong.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
DISCUSSION
In this scenario we downloaded from miRanda the human target site predictions for the mRNA
genes: CASC2, FECH, GKAP1 and TPR. In miRanda downloads section the target site predictions
are separated in four categories based on mirSVR, a down-regulation score at the mRNA or
protein levels and the conservation of the miRNA:mRNA relationship across species:
 good mirSVR score, conserved miRNA,
 good mirSVR score, non-conserved miRNA,
 non-good mirSVR score, conserved miRNA,
 non-good mirSVR score, non-conserved miRNA.
This resulted in a dataset with 1805 entries as described in the following table:
good mirSVR
score,
conserved miRNA
good mirSVR score,
non-conserved
miRNA
non-good mirSVR
score,
conserved miRNA
non-good mirSVR
score,
non-conserved miRNA
sum
407 355 476 567 1805
We then created the respective files needed as input in ncRNASeq and performing the miRNA
target prediction task we came up with the following results:
good mirSVR
score,
conserved
miRNA
good mirSVR
score,
non-conserved
miRNA
non-good
mirSVR score,
conserved
miRNA
non-good mirSVR
score,
non-conserved
miRNA
total
samples 407 355 476 567 1805
predicted
targets
392 328 450 530 1700
predicted
non targets
15 27 26 37 105
accuracy 0.963145 0.923944 0.945378 0.934744 0.941828
mean
prediction
score
0.639238 0.623052 0.58483 0.502412 0.578726
score
standard
deviation
0.438878 0.482897 0.441147 0.391422 0.437944
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
A total of 1700 miRNA-target site pairs were predicted as true targets, forming a percentage over
94%. The ncRNASeq prediction score ranges from 0.58 to 0.64 for the first 3 categories: 1) good
mirSVR score, conserved miRNA, 2) good mirSVR score, non-conserved miRNA and 3) non-good
mirSVR score, conserved miRNA and is 0.5 for the category non-good mirSVR score, non-
conserved miRNA.
We additionally performed random permutations of the miRNA-target site pairs for the same
mRNA gene and giving again the respective files as input in ncRNASeq and performing the miRNA
target prediction task we came up with the following results:
good mirSVR
score,
conserved
miRNA
good mirSVR
score,
non-conserved
miRNA
non-good
mirSVR score,
conserved
miRNA
non-good mirSVR
score,
non-conserved
miRNA
total
samples 407 355 476 567 1805
predicted
targets
17 18 42 30 107
predicted
non targets
390 337 434 537 1698
accuracy 0.958231 0.949296 0.911765 0.94709 0.940720
mean
prediction
score
-0.87632 -0.85332 -0.77982 -0.87429 -0.84571
score
standard
deviation
0.3254926 0.366411 0.434336 0.349322 0.374164
As expected, only 107 out of the 1805 samples were predicted as true targets indicating the
specificity of ncRNASeq. The prediction score ranges from -0.77 to -0.88, indicating the high
confidence of the prediction results.
To prove the performance of ncRNASeq we investigated two more datasets. The first one is a
dataset with 462 experimentally verified miRNA target sites downloaded from TarBase version 5c
[3] and miRecords release April 2013 [4] databases. The second one is a dataset with 1848 pseudo
miRNA target sites generated giving as input to miRanda artificial miRNAs with A, C, G, U
frequencies that are not consistent with real miRNAs. They are not miRNA target sites, but they
resemble much to miRNA-target site pairs.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
experimentally verified miRNA
target sites
pseudo miRNA target sites
samples 462 1848
predicted targets 403 0
predicted non targets 59 1848
accuracy 0.872294 1
mean prediction score 0.606079 -0.99257
score standard deviation 0.585316 0.042742
From the results, we observe that only 59 experimentally verified miRNA target sites were
predicted as non-targets and none pseudo miRNA target site was predicted as true target.
The input datasets of all experiments can be found in InSyBio evaluation version as pre-
uploaded files.
REFERENCES
[1] mirSVR predicted target site scoring method: Comprehensive modeling of microRNA targets predicts
functional non-conserved and non-canonical sites. Betel D, Koppal A, Agius P, Sander C, Leslie C., Genome
Biology 2010 11:R90
[2] miRBase: annotating high confidence microRNAs using deep sequencing data. Kozomara A, Griffiths-Jones S.
NAR 2014 42:D68-D73
[3] Papadopoulos, G. L., Reczko, M., Simossis, V. A., Sethupathy, P., & Hatzigeorgiou, A. G. (2009). The database
of experimentally supported targets: a functional update of TarBase. Nucleic acids research, 37(suppl 1), D155-
D158.
[4] Xiao, F., Zuo, Z., Cai, G., Kang, S., Gao, X., & Li, T. (2009). miRecords: an integrated resource for microRNA–
target interactions. Nucleic acids research, 37(suppl 1), D105-D110.
WHITE PAPER
Analyze non-coding RNA molecules with InSyBio ncRNASeq
InSyBio – Intelligent Systems Biology – www.insybio.com
ABOUT US
InSyBio Ltd is a bioinformatics pioneer company (www.insybio.com) in personalized healthcare,
that focuses on developing computational frameworks and tools for the analysis of complex life-
science and biological data in order to develop predictive integrated biomarkers (biomarkers of
various categories) with increased prognostic and diagnostic aspects for the personalized
Healthcare Industry.
InSyBio Suite consists of tools for providing integrated biological information from various
sources, while at the same time it is empowered with robust, user-friendly and installation-free
bioinformatics tools based on intelligent algorithms and methods.
HOW TO GET INSYBIO NCRNASEQ?
A demo version of InSyBio ncRNASeq is freely available at http://demo.insybio.com.
To request a free one month full (evaluation) version of InSyBio ncRNASeq please email us at
info@insybio.com.
To purchase InSyBio ncRNASeq commercial version 1.0 please contact us at sales@insybio.com.
COPYRIGHT NOTICE
External Publication of InSyBio Ltd - Any InSyBio information that is to be used in advertising,
press releases, or promotional materials requires prior written approval from the InSyBio Ltd. A
draft of the proposed document should accompany any such request. InSyBio Ltd reserves the
right to deny approval of external usage for any reason.
Copyright 2015 InSyBio Ltd. Reproduction without written permission is completely forbidden.

More Related Content

Viewers also liked

Camilo Escobar: Búsqueda de lo apetente
Camilo Escobar: Búsqueda de lo apetenteCamilo Escobar: Búsqueda de lo apetente
Camilo Escobar: Búsqueda de lo apetenteLekkere Feijoa
 
PROYECTO EN BAILÉN-maria luisa fernández garcía
PROYECTO EN BAILÉN-maria luisa fernández garcíaPROYECTO EN BAILÉN-maria luisa fernández garcía
PROYECTO EN BAILÉN-maria luisa fernández garcíaMarisa Fernández García
 
Proceso produccion emergencia
Proceso produccion emergenciaProceso produccion emergencia
Proceso produccion emergenciaLekkere Feijoa
 
Camilo Escobar: La Trama, la Unidad Discreta, y Experiencia en Matrices
Camilo Escobar: La Trama, la Unidad Discreta, y Experiencia en MatricesCamilo Escobar: La Trama, la Unidad Discreta, y Experiencia en Matrices
Camilo Escobar: La Trama, la Unidad Discreta, y Experiencia en MatricesLekkere Feijoa
 
Nul op de meter: een kwestie van energie opslag of energie efficiency
Nul op de meter: een kwestie van energie opslag of energie efficiency Nul op de meter: een kwestie van energie opslag of energie efficiency
Nul op de meter: een kwestie van energie opslag of energie efficiency duurzame verhalen
 
Comunicar em rotary 1º Seminário Interdistrital de Imagem Pública
Comunicar em rotary   1º Seminário Interdistrital de Imagem PúblicaComunicar em rotary   1º Seminário Interdistrital de Imagem Pública
Comunicar em rotary 1º Seminário Interdistrital de Imagem PúblicaVirgílio Paulo Corporatecoach
 
Sistem transpotasi manusia
Sistem transpotasi manusiaSistem transpotasi manusia
Sistem transpotasi manusiaDinia Risti
 
Paleografia e indice onomástico
Paleografia e indice onomásticoPaleografia e indice onomástico
Paleografia e indice onomásticoSol Reyes
 
Customer Service Via Social Media (Social Media Utilities)
Customer Service Via Social Media (Social Media Utilities)Customer Service Via Social Media (Social Media Utilities)
Customer Service Via Social Media (Social Media Utilities)Quyên Đoàn
 
Derechos y deberes de los venezolanos.
Derechos y deberes de los venezolanos.Derechos y deberes de los venezolanos.
Derechos y deberes de los venezolanos.Mariapatino00
 

Viewers also liked (16)

Camilo Escobar: Búsqueda de lo apetente
Camilo Escobar: Búsqueda de lo apetenteCamilo Escobar: Búsqueda de lo apetente
Camilo Escobar: Búsqueda de lo apetente
 
Zelf_Introductie
Zelf_IntroductieZelf_Introductie
Zelf_Introductie
 
PROYECTO EN BAILÉN-maria luisa fernández garcía
PROYECTO EN BAILÉN-maria luisa fernández garcíaPROYECTO EN BAILÉN-maria luisa fernández garcía
PROYECTO EN BAILÉN-maria luisa fernández garcía
 
GBI 1
GBI 1GBI 1
GBI 1
 
Proceso produccion emergencia
Proceso produccion emergenciaProceso produccion emergencia
Proceso produccion emergencia
 
Camilo Escobar: La Trama, la Unidad Discreta, y Experiencia en Matrices
Camilo Escobar: La Trama, la Unidad Discreta, y Experiencia en MatricesCamilo Escobar: La Trama, la Unidad Discreta, y Experiencia en Matrices
Camilo Escobar: La Trama, la Unidad Discreta, y Experiencia en Matrices
 
Proposta de trabalho nº1
Proposta de trabalho nº1Proposta de trabalho nº1
Proposta de trabalho nº1
 
Nul op de meter: een kwestie van energie opslag of energie efficiency
Nul op de meter: een kwestie van energie opslag of energie efficiency Nul op de meter: een kwestie van energie opslag of energie efficiency
Nul op de meter: een kwestie van energie opslag of energie efficiency
 
Comunicar em rotary 1º Seminário Interdistrital de Imagem Pública
Comunicar em rotary   1º Seminário Interdistrital de Imagem PúblicaComunicar em rotary   1º Seminário Interdistrital de Imagem Pública
Comunicar em rotary 1º Seminário Interdistrital de Imagem Pública
 
Porque del bocado
Porque del bocadoPorque del bocado
Porque del bocado
 
Deberes y Derechos de los venezolanos
Deberes y Derechos de los venezolanosDeberes y Derechos de los venezolanos
Deberes y Derechos de los venezolanos
 
Sistem transpotasi manusia
Sistem transpotasi manusiaSistem transpotasi manusia
Sistem transpotasi manusia
 
Paleografia e indice onomástico
Paleografia e indice onomásticoPaleografia e indice onomástico
Paleografia e indice onomástico
 
Customer Service Via Social Media (Social Media Utilities)
Customer Service Via Social Media (Social Media Utilities)Customer Service Via Social Media (Social Media Utilities)
Customer Service Via Social Media (Social Media Utilities)
 
Guided Media
Guided MediaGuided Media
Guided Media
 
Derechos y deberes de los venezolanos.
Derechos y deberes de los venezolanos.Derechos y deberes de los venezolanos.
Derechos y deberes de los venezolanos.
 

Similar to Efficient and accurate analysis of non-coding RNAs with InSyBio ncRNASeq

Bro gef mi_rna_0212_lr
Bro gef mi_rna_0212_lrBro gef mi_rna_0212_lr
Bro gef mi_rna_0212_lrElsa von Licy
 
Principle and method of RNAi-Creative Biogene
Principle and method of RNAi-Creative BiogenePrinciple and method of RNAi-Creative Biogene
Principle and method of RNAi-Creative BiogeneDonglin Bao
 
MicroRNA extraction
MicroRNA extractionMicroRNA extraction
MicroRNA extractionseerat sidhu
 
A Study on Genome Editing Tools
A Study on Genome Editing ToolsA Study on Genome Editing Tools
A Study on Genome Editing ToolsSuneet Meena
 
miScript Single Cell Poster
miScript Single Cell PostermiScript Single Cell Poster
miScript Single Cell PosterQIAGEN
 
Insilico & Genomics Approaches for the Characterization of Abiotic Stress ...
Insilico & Genomics  Approaches  for the Characterization of  Abiotic Stress ...Insilico & Genomics  Approaches  for the Characterization of  Abiotic Stress ...
Insilico & Genomics Approaches for the Characterization of Abiotic Stress ...rishiraj1992
 
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic MethodsAnalytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic Methodscscpconf
 
Rna seq and chip seq
Rna seq and chip seqRna seq and chip seq
Rna seq and chip seqJyoti Singh
 
BiPday 2014 -- Tulipano Angelica
BiPday 2014 -- Tulipano AngelicaBiPday 2014 -- Tulipano Angelica
BiPday 2014 -- Tulipano Angelicaeventi-ITBbari
 
Meeting the challenges of miRNA research: miRNA and its Role in Human Disease...
Meeting the challenges of miRNA research: miRNA and its Role in Human Disease...Meeting the challenges of miRNA research: miRNA and its Role in Human Disease...
Meeting the challenges of miRNA research: miRNA and its Role in Human Disease...QIAGEN
 
The Trans-NIH RNAi Initiative : Informatics
The Trans-NIH RNAi Initiative: InformaticsThe Trans-NIH RNAi Initiative: Informatics
The Trans-NIH RNAi Initiative : InformaticsRajarshi Guha
 
CRISPR, a New Genome editor agent
CRISPR, a New Genome editor agentCRISPR, a New Genome editor agent
CRISPR, a New Genome editor agentAli Ahmadi
 

Similar to Efficient and accurate analysis of non-coding RNAs with InSyBio ncRNASeq (20)

Bro gef mi_rna_0212_lr
Bro gef mi_rna_0212_lrBro gef mi_rna_0212_lr
Bro gef mi_rna_0212_lr
 
Principle and method of RNAi-Creative Biogene
Principle and method of RNAi-Creative BiogenePrinciple and method of RNAi-Creative Biogene
Principle and method of RNAi-Creative Biogene
 
MicroRNA extraction
MicroRNA extractionMicroRNA extraction
MicroRNA extraction
 
Mi rna brochure_set
Mi rna brochure_setMi rna brochure_set
Mi rna brochure_set
 
Mi rna toss_set
Mi rna toss_setMi rna toss_set
Mi rna toss_set
 
A Study on Genome Editing Tools
A Study on Genome Editing ToolsA Study on Genome Editing Tools
A Study on Genome Editing Tools
 
miScript Single Cell Poster
miScript Single Cell PostermiScript Single Cell Poster
miScript Single Cell Poster
 
Bioinformatics.pptx
Bioinformatics.pptxBioinformatics.pptx
Bioinformatics.pptx
 
Insilico & Genomics Approaches for the Characterization of Abiotic Stress ...
Insilico & Genomics  Approaches  for the Characterization of  Abiotic Stress ...Insilico & Genomics  Approaches  for the Characterization of  Abiotic Stress ...
Insilico & Genomics Approaches for the Characterization of Abiotic Stress ...
 
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic MethodsAnalytical Study of Hexapod miRNAs using Phylogenetic Methods
Analytical Study of Hexapod miRNAs using Phylogenetic Methods
 
Mi rna array 2013
Mi rna array 2013Mi rna array 2013
Mi rna array 2013
 
Rna seq and chip seq
Rna seq and chip seqRna seq and chip seq
Rna seq and chip seq
 
BiPday 2014 -- Tulipano Angelica
BiPday 2014 -- Tulipano AngelicaBiPday 2014 -- Tulipano Angelica
BiPday 2014 -- Tulipano Angelica
 
Meeting the challenges of miRNA research: miRNA and its Role in Human Disease...
Meeting the challenges of miRNA research: miRNA and its Role in Human Disease...Meeting the challenges of miRNA research: miRNA and its Role in Human Disease...
Meeting the challenges of miRNA research: miRNA and its Role in Human Disease...
 
Mi rna assays 2012
Mi rna assays 2012Mi rna assays 2012
Mi rna assays 2012
 
The Trans-NIH RNAi Initiative : Informatics
The Trans-NIH RNAi Initiative: InformaticsThe Trans-NIH RNAi Initiative: Informatics
The Trans-NIH RNAi Initiative : Informatics
 
CRISPR, a New Genome editor agent
CRISPR, a New Genome editor agentCRISPR, a New Genome editor agent
CRISPR, a New Genome editor agent
 
Small rna
Small rnaSmall rna
Small rna
 
31931 31941
31931 3194131931 31941
31931 31941
 
Mirnapcrarray
MirnapcrarrayMirnapcrarray
Mirnapcrarray
 

Recently uploaded

Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service AvailableDipal Arora
 
❤️Call girls in Jalandhar ☎️9876848877☎️ Call Girl service in Jalandhar☎️ Jal...
❤️Call girls in Jalandhar ☎️9876848877☎️ Call Girl service in Jalandhar☎️ Jal...❤️Call girls in Jalandhar ☎️9876848877☎️ Call Girl service in Jalandhar☎️ Jal...
❤️Call girls in Jalandhar ☎️9876848877☎️ Call Girl service in Jalandhar☎️ Jal...chandigarhentertainm
 
Call Girl Raipur 📲 9999965857 ヅ10k NiGhT Call Girls In Raipur
Call Girl Raipur 📲 9999965857 ヅ10k NiGhT Call Girls In RaipurCall Girl Raipur 📲 9999965857 ヅ10k NiGhT Call Girls In Raipur
Call Girl Raipur 📲 9999965857 ヅ10k NiGhT Call Girls In Raipurgragmanisha42
 
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real MeetChandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meetpriyashah722354
 
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...Call Girls Noida
 
raisen Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
raisen Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meetraisen Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
raisen Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real MeetCall Girls Service
 
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋Sheetaleventcompany
 
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...gurkirankumar98700
 
Call Girl Price Amritsar ❤️🍑 9053900678 Call Girls in Amritsar Suman
Call Girl Price Amritsar ❤️🍑 9053900678 Call Girls in Amritsar SumanCall Girl Price Amritsar ❤️🍑 9053900678 Call Girls in Amritsar Suman
Call Girl Price Amritsar ❤️🍑 9053900678 Call Girls in Amritsar SumanCall Girls Service Chandigarh Ayushi
 
Hot Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
Hot  Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In ChandigarhHot  Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
Hot Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In ChandigarhVip call girls In Chandigarh
 
Krishnagiri call girls Tamil aunty 7877702510
Krishnagiri call girls Tamil aunty 7877702510Krishnagiri call girls Tamil aunty 7877702510
Krishnagiri call girls Tamil aunty 7877702510Vipesco
 
VIP Call Girl Sector 32 Noida Just Book Me 9711199171
VIP Call Girl Sector 32 Noida Just Book Me 9711199171VIP Call Girl Sector 32 Noida Just Book Me 9711199171
VIP Call Girl Sector 32 Noida Just Book Me 9711199171Call Girls Service Gurgaon
 
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591adityaroy0215
 
Call Girls Service Faridabad 📲 9999965857 ヅ10k NiGhT Call Girls In Faridabad
Call Girls Service Faridabad 📲 9999965857 ヅ10k NiGhT Call Girls In FaridabadCall Girls Service Faridabad 📲 9999965857 ヅ10k NiGhT Call Girls In Faridabad
Call Girls Service Faridabad 📲 9999965857 ヅ10k NiGhT Call Girls In Faridabadgragmanisha42
 
Udaipur Call Girls 📲 9999965857 Call Girl in Udaipur
Udaipur Call Girls 📲 9999965857 Call Girl in UdaipurUdaipur Call Girls 📲 9999965857 Call Girl in Udaipur
Udaipur Call Girls 📲 9999965857 Call Girl in Udaipurseemahedar019
 
Dehradun Call Girls Service 08854095900 Real Russian Girls Looking Models
Dehradun Call Girls Service 08854095900 Real Russian Girls Looking ModelsDehradun Call Girls Service 08854095900 Real Russian Girls Looking Models
Dehradun Call Girls Service 08854095900 Real Russian Girls Looking Modelsindiancallgirl4rent
 
(Sonam Bajaj) Call Girl in Jaipur- 09257276172 Escorts Service 50% Off with C...
(Sonam Bajaj) Call Girl in Jaipur- 09257276172 Escorts Service 50% Off with C...(Sonam Bajaj) Call Girl in Jaipur- 09257276172 Escorts Service 50% Off with C...
(Sonam Bajaj) Call Girl in Jaipur- 09257276172 Escorts Service 50% Off with C...indiancallgirl4rent
 
💚😋Chandigarh Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Chandigarh Escort Service Call Girls, ₹5000 To 25K With AC💚😋💚😋Chandigarh Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Chandigarh Escort Service Call Girls, ₹5000 To 25K With AC💚😋Sheetaleventcompany
 
Call Girl In Zirakpur ❤️♀️@ 9988299661 Zirakpur Call Girls Near Me ❤️♀️@ Sexy...
Call Girl In Zirakpur ❤️♀️@ 9988299661 Zirakpur Call Girls Near Me ❤️♀️@ Sexy...Call Girl In Zirakpur ❤️♀️@ 9988299661 Zirakpur Call Girls Near Me ❤️♀️@ Sexy...
Call Girl In Zirakpur ❤️♀️@ 9988299661 Zirakpur Call Girls Near Me ❤️♀️@ Sexy...Sheetaleventcompany
 
Nanded Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
Nanded Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real MeetNanded Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
Nanded Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real MeetCall Girls Service
 

Recently uploaded (20)

Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service AvailableCall Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
Call Girls Hyderabad Just Call 9907093804 Top Class Call Girl Service Available
 
❤️Call girls in Jalandhar ☎️9876848877☎️ Call Girl service in Jalandhar☎️ Jal...
❤️Call girls in Jalandhar ☎️9876848877☎️ Call Girl service in Jalandhar☎️ Jal...❤️Call girls in Jalandhar ☎️9876848877☎️ Call Girl service in Jalandhar☎️ Jal...
❤️Call girls in Jalandhar ☎️9876848877☎️ Call Girl service in Jalandhar☎️ Jal...
 
Call Girl Raipur 📲 9999965857 ヅ10k NiGhT Call Girls In Raipur
Call Girl Raipur 📲 9999965857 ヅ10k NiGhT Call Girls In RaipurCall Girl Raipur 📲 9999965857 ヅ10k NiGhT Call Girls In Raipur
Call Girl Raipur 📲 9999965857 ヅ10k NiGhT Call Girls In Raipur
 
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real MeetChandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
Chandigarh Call Girls 👙 7001035870 👙 Genuine WhatsApp Number for Real Meet
 
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
pOOJA sexy Call Girls In Sector 49,9999965857 Young Female Escorts Service In...
 
raisen Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
raisen Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meetraisen Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
raisen Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
 
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Mumbai Escort Service Call Girls, ₹5000 To 25K With AC💚😋
 
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
Russian Call Girls Lucknow ₹7.5k Pick Up & Drop With Cash Payment 8923113531 ...
 
Call Girl Price Amritsar ❤️🍑 9053900678 Call Girls in Amritsar Suman
Call Girl Price Amritsar ❤️🍑 9053900678 Call Girls in Amritsar SumanCall Girl Price Amritsar ❤️🍑 9053900678 Call Girls in Amritsar Suman
Call Girl Price Amritsar ❤️🍑 9053900678 Call Girls in Amritsar Suman
 
Hot Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
Hot  Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In ChandigarhHot  Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
Hot Call Girl In Chandigarh 👅🥵 9053'900678 Call Girls Service In Chandigarh
 
Krishnagiri call girls Tamil aunty 7877702510
Krishnagiri call girls Tamil aunty 7877702510Krishnagiri call girls Tamil aunty 7877702510
Krishnagiri call girls Tamil aunty 7877702510
 
VIP Call Girl Sector 32 Noida Just Book Me 9711199171
VIP Call Girl Sector 32 Noida Just Book Me 9711199171VIP Call Girl Sector 32 Noida Just Book Me 9711199171
VIP Call Girl Sector 32 Noida Just Book Me 9711199171
 
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
VIP Call Girl Sector 25 Gurgaon Just Call Me 9899900591
 
Call Girls Service Faridabad 📲 9999965857 ヅ10k NiGhT Call Girls In Faridabad
Call Girls Service Faridabad 📲 9999965857 ヅ10k NiGhT Call Girls In FaridabadCall Girls Service Faridabad 📲 9999965857 ヅ10k NiGhT Call Girls In Faridabad
Call Girls Service Faridabad 📲 9999965857 ヅ10k NiGhT Call Girls In Faridabad
 
Udaipur Call Girls 📲 9999965857 Call Girl in Udaipur
Udaipur Call Girls 📲 9999965857 Call Girl in UdaipurUdaipur Call Girls 📲 9999965857 Call Girl in Udaipur
Udaipur Call Girls 📲 9999965857 Call Girl in Udaipur
 
Dehradun Call Girls Service 08854095900 Real Russian Girls Looking Models
Dehradun Call Girls Service 08854095900 Real Russian Girls Looking ModelsDehradun Call Girls Service 08854095900 Real Russian Girls Looking Models
Dehradun Call Girls Service 08854095900 Real Russian Girls Looking Models
 
(Sonam Bajaj) Call Girl in Jaipur- 09257276172 Escorts Service 50% Off with C...
(Sonam Bajaj) Call Girl in Jaipur- 09257276172 Escorts Service 50% Off with C...(Sonam Bajaj) Call Girl in Jaipur- 09257276172 Escorts Service 50% Off with C...
(Sonam Bajaj) Call Girl in Jaipur- 09257276172 Escorts Service 50% Off with C...
 
💚😋Chandigarh Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Chandigarh Escort Service Call Girls, ₹5000 To 25K With AC💚😋💚😋Chandigarh Escort Service Call Girls, ₹5000 To 25K With AC💚😋
💚😋Chandigarh Escort Service Call Girls, ₹5000 To 25K With AC💚😋
 
Call Girl In Zirakpur ❤️♀️@ 9988299661 Zirakpur Call Girls Near Me ❤️♀️@ Sexy...
Call Girl In Zirakpur ❤️♀️@ 9988299661 Zirakpur Call Girls Near Me ❤️♀️@ Sexy...Call Girl In Zirakpur ❤️♀️@ 9988299661 Zirakpur Call Girls Near Me ❤️♀️@ Sexy...
Call Girl In Zirakpur ❤️♀️@ 9988299661 Zirakpur Call Girls Near Me ❤️♀️@ Sexy...
 
Nanded Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
Nanded Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real MeetNanded Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
Nanded Call Girls 👙 6297143586 👙 Genuine WhatsApp Number for Real Meet
 

Efficient and accurate analysis of non-coding RNAs with InSyBio ncRNASeq

  • 1. WHITE PAPER www.insybio.com Efficient and accurate analysis of non-coding RNAs with InSyBio ncRNASeq By InSyBio Ltd Aigli Korfiati Computer Engineer, MSc, PhD candidate InSyBio Product Development Manager September 2015 InSyBio ncRNASeq v1.0
  • 2. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com INTRODUCTION This document is intended for researchers (molecular biologists, bioinformaticians, biostatisticians, and so on...) working in academia or biopharma, biotechnology and health industries who want to analyze non coding RNAs, predict miRNAs and analyze and predict miRNA target sites. NCRNASEQ IS A NON-CODING RNA ANALYSIS TOOL FOR THE PREDICTION AND ANALYSIS OF  non-coding RNAs, and  miRNA target genes Non-coding RNA genes are RNA sequences transcribed from DNA, but not translated to proteins. Their identification as well as the identification of the genes they regulate is a promising research area. InSyBio ncRNASeq enables users to analyze non-coding RNAs. Users can search and analyze the RNA sequence of their interest. They can also analyze a full sequences dataset derived from online available databases, experimental sequencing techniques or computational in silico techniques. With InSyBio ncRNASeq you can predict and analyze RNA genes and miRNA target genes combining a variety of sequential, structural and functional information, and using a high performance machine learning technique. The RNA analysis is conducted by the calculation of the 58 most informative features described in the literature, and the miRNA-miRNA targets analysis is conducted by the calculation of the 124 most informative ones. InSyBio ncRNASeq also provides results storage in its knowledge base, equipped with information retrieval tools, to allow users produce and extract their own datasets. INSYBIO NCRNASEQ KEY PROPERTIES:  Non coding RNA analysis with the calculation of 58 sequential, thermodynamical and structural properties of the RNA sequence and miRNA-target site analysis with the calculation of 124 sequential, thermodynamical, structural and motif properties of the miRNA:mRNA pair.  MiRNAs and miRNA-target sites predictions with high accuracy.  Integrated information on stem-loop and mature miRNAs.  Association of proteins to miRNAs participating to their regulatory mechanism.  Batch computations are allowed.  No need for personal super computers to perform difficult computing tasks: they are now executed in our cloud infrastructure with minimum burden on the user’s pc. WITH INSYBIO NCRNASEQ YOU CAN: a) Calculate 58 RNA genes-related features b) Predict miRNAs c) Calculate 124 miRNA target features
  • 3. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com d) Predict miRNA targets e) Search stem-loop and mature miRNAs f) View integrated information about the miRNA of your interest
  • 4. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com INSYBIO NCRNASEQ SCENARIO: MIRNA PREDICTION In this scenario you have a set of sequences and want to predict which of them are pre- miRNAs and discriminate them from other RNA stem-loops. Suppose you have a dataset with 30 known pre-miRNAs, 15 snoRNAs and 30 pseudo hairpins which look like pre- miRNAs but come from coding regions. STEPS 1. You create a text file and write the sequences in fasta format in a file editor. Note: Both lines of the fasta format (sequence description and sequence) are obligatory. If you don’t know the sequence description, write something like >sequence_x. 2. After creating and saving your file you upload it to ncRNASeq through Data Store. You visit the Data Store dashboard and select the category ncRNA sequences. 3. The file type is ncRNA sequences by default. So you just write a file title that describes your file and select the file prepared in step 1. When the file is selected, you upload it by clicking the respective button.
  • 5. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com 4. The file is being verified automatically. When the uploading and verification processes end you click the List all ncRNA sequences button. 5. In the action selection dropdown list you select miRNA Prediction at ncRNASeq.
  • 6. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com 6. You are redirected to the miRNA prediction page and you click the Start Calculation button. 7. The sequences are processed one by one and you can see in the resulting table or download in a tab-delimited file: their fasta format, their prediction (miRNA (green) / pseudo miRNA (red) / other (blue)), their prediction score and 58 computed features. These features include sequential, thermodynamical and structural properties of the RNA sequence. The prediction score shows the strength/confidence of the prediction. Prediction scores close to 0 indicate weak predictions, while prediction scores far from 0 strong.
  • 7. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com DISCUSSION Experimenting with a dataset with 30 known pre-miRNAs, 15 snoRNAs and 30 pseudo hairpins which look like pre-miRNAs but come from coding regions led to the results summarized in table 1. total miRNAs other pseudo miRNAs pre-miRNAs 30 28 2 snoRNAs 15 13 2 pseudo hairpins from coding regions 30 30 Table 1: Use case results From the total of 30 pre-miRNAs, 28 were indeed predicted as miRNAs, while 2 were wrongly predicted as pseudo miRNAs. However, the prediction scores of the two falsely predicted pre-miRNAs were -0.0537 and -0.0383, values very close to the decision boundary of 0, indicating the ambiguity of their prediction. From the total of 15 snoRNAs, 13 were indeed predicted as other, while 2 were predicted as pseudo miRNAs. Their prediction scores were -1.087 and -1.021. The 30 pseudo hairpins from coding regions were all predicted as pseudo miRNAs.
  • 8. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com INSYBIO NCRNASEQ SCENARIO: MIRNA TARGET PREDICTION In this scenario you have a set of miRNAs and mRNAs and want to predict the mRNA target sites of the miRNAs. InSyBio ncRNASeq takes as input miRNAs and potential mRNA target sites (not full mRNAs) and predicts which of them share a targeting relation. Suppose you want to predict the target sites of all miRNAs for the human mRNA genes: CASC2, FECH, GKAP1 and TPR. PREPROCESSING STEPS 1. You firstly have to run a target site prediction program, like miRanda [1] giving it as input a list of all human miRNAs from miRBase [2] and the sequences of the mRNA genes. 2. Or, you can directly download from miRanda (http://www.microrna.org/) the human target site predictions for the mRNA genes: CASC2, FECH, GKAP1 and TPR. NCRNASEQ STEPS 3. You create two text files and write in a file editor the miRNA sequences in the one and the predicted mRNA target sites in the other, both in fasta format. Note that the first miRNA will be tested against the first target site, the second miRNA against the second target site and so on. Note: Both lines of the fasta format (sequence description and sequence) are obligatory. As a target site sequence description you are suggested to write the mRNA gene symbol and/or the transcript id and as a miRNA sequence description its name and/or its accession id.
  • 9. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com 4. After creating and saving your files you upload them to ncRNASeq through Data Store. You visit the Data Store dashboard and select the category miRNA sequences for the miRNAs and mRNA sequences for the mRNA target sites. 5. You select the corresponding file type (miRNA/mRNA sequences), write a file title that describes your file and select the files prepared in step 1. When a file is selected, you upload it by clicking the respective button.
  • 10. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com 6. The file is being verified automatically. When the uploading and verification processes ends for both files you click the List all miRNA/mRNA sequences button. 7. In the action selection dropdown list you select miRNA Target Prediction at ncRNASeq.
  • 11. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com 8. You are redirected to the miRNA target prediction page where you also have to select an mRNA/miRNA file. Then you click the Start Calculation button. 9. The sequences are processed pair by pair and you can see in the resulting table or download in a tab-delimited file: the examined miRNA fasta format, the examined mRNA target site fasta format, their prediction (Target (green) / no Target (red)), their prediction score and 124 computed features. These features include sequential, thermodynamical, structural and motif properties of the miRNA-target site pairs. The fifth column named Protein links the given miRNA to an Interact protein, indicating that the specific miRNA regulates the expression of the respective protein. The link is supported if the miRNA:mRNA pair shares a targeting relation and the mRNA is related to an InSyBio Interact protein. Clicking on the protein, you are redirected to the Interact tool which presents all the existing information in the Interact database for the specific protein. The prediction score shows the strength/confidence of the prediction. Prediction scores close to 0 indicate weak predictions, while prediction scores far from 0 strong.
  • 12. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com DISCUSSION In this scenario we downloaded from miRanda the human target site predictions for the mRNA genes: CASC2, FECH, GKAP1 and TPR. In miRanda downloads section the target site predictions are separated in four categories based on mirSVR, a down-regulation score at the mRNA or protein levels and the conservation of the miRNA:mRNA relationship across species:  good mirSVR score, conserved miRNA,  good mirSVR score, non-conserved miRNA,  non-good mirSVR score, conserved miRNA,  non-good mirSVR score, non-conserved miRNA. This resulted in a dataset with 1805 entries as described in the following table: good mirSVR score, conserved miRNA good mirSVR score, non-conserved miRNA non-good mirSVR score, conserved miRNA non-good mirSVR score, non-conserved miRNA sum 407 355 476 567 1805 We then created the respective files needed as input in ncRNASeq and performing the miRNA target prediction task we came up with the following results: good mirSVR score, conserved miRNA good mirSVR score, non-conserved miRNA non-good mirSVR score, conserved miRNA non-good mirSVR score, non-conserved miRNA total samples 407 355 476 567 1805 predicted targets 392 328 450 530 1700 predicted non targets 15 27 26 37 105 accuracy 0.963145 0.923944 0.945378 0.934744 0.941828 mean prediction score 0.639238 0.623052 0.58483 0.502412 0.578726 score standard deviation 0.438878 0.482897 0.441147 0.391422 0.437944
  • 13. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com A total of 1700 miRNA-target site pairs were predicted as true targets, forming a percentage over 94%. The ncRNASeq prediction score ranges from 0.58 to 0.64 for the first 3 categories: 1) good mirSVR score, conserved miRNA, 2) good mirSVR score, non-conserved miRNA and 3) non-good mirSVR score, conserved miRNA and is 0.5 for the category non-good mirSVR score, non- conserved miRNA. We additionally performed random permutations of the miRNA-target site pairs for the same mRNA gene and giving again the respective files as input in ncRNASeq and performing the miRNA target prediction task we came up with the following results: good mirSVR score, conserved miRNA good mirSVR score, non-conserved miRNA non-good mirSVR score, conserved miRNA non-good mirSVR score, non-conserved miRNA total samples 407 355 476 567 1805 predicted targets 17 18 42 30 107 predicted non targets 390 337 434 537 1698 accuracy 0.958231 0.949296 0.911765 0.94709 0.940720 mean prediction score -0.87632 -0.85332 -0.77982 -0.87429 -0.84571 score standard deviation 0.3254926 0.366411 0.434336 0.349322 0.374164 As expected, only 107 out of the 1805 samples were predicted as true targets indicating the specificity of ncRNASeq. The prediction score ranges from -0.77 to -0.88, indicating the high confidence of the prediction results. To prove the performance of ncRNASeq we investigated two more datasets. The first one is a dataset with 462 experimentally verified miRNA target sites downloaded from TarBase version 5c [3] and miRecords release April 2013 [4] databases. The second one is a dataset with 1848 pseudo miRNA target sites generated giving as input to miRanda artificial miRNAs with A, C, G, U frequencies that are not consistent with real miRNAs. They are not miRNA target sites, but they resemble much to miRNA-target site pairs.
  • 14. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com experimentally verified miRNA target sites pseudo miRNA target sites samples 462 1848 predicted targets 403 0 predicted non targets 59 1848 accuracy 0.872294 1 mean prediction score 0.606079 -0.99257 score standard deviation 0.585316 0.042742 From the results, we observe that only 59 experimentally verified miRNA target sites were predicted as non-targets and none pseudo miRNA target site was predicted as true target. The input datasets of all experiments can be found in InSyBio evaluation version as pre- uploaded files. REFERENCES [1] mirSVR predicted target site scoring method: Comprehensive modeling of microRNA targets predicts functional non-conserved and non-canonical sites. Betel D, Koppal A, Agius P, Sander C, Leslie C., Genome Biology 2010 11:R90 [2] miRBase: annotating high confidence microRNAs using deep sequencing data. Kozomara A, Griffiths-Jones S. NAR 2014 42:D68-D73 [3] Papadopoulos, G. L., Reczko, M., Simossis, V. A., Sethupathy, P., & Hatzigeorgiou, A. G. (2009). The database of experimentally supported targets: a functional update of TarBase. Nucleic acids research, 37(suppl 1), D155- D158. [4] Xiao, F., Zuo, Z., Cai, G., Kang, S., Gao, X., & Li, T. (2009). miRecords: an integrated resource for microRNA– target interactions. Nucleic acids research, 37(suppl 1), D105-D110.
  • 15. WHITE PAPER Analyze non-coding RNA molecules with InSyBio ncRNASeq InSyBio – Intelligent Systems Biology – www.insybio.com ABOUT US InSyBio Ltd is a bioinformatics pioneer company (www.insybio.com) in personalized healthcare, that focuses on developing computational frameworks and tools for the analysis of complex life- science and biological data in order to develop predictive integrated biomarkers (biomarkers of various categories) with increased prognostic and diagnostic aspects for the personalized Healthcare Industry. InSyBio Suite consists of tools for providing integrated biological information from various sources, while at the same time it is empowered with robust, user-friendly and installation-free bioinformatics tools based on intelligent algorithms and methods. HOW TO GET INSYBIO NCRNASEQ? A demo version of InSyBio ncRNASeq is freely available at http://demo.insybio.com. To request a free one month full (evaluation) version of InSyBio ncRNASeq please email us at info@insybio.com. To purchase InSyBio ncRNASeq commercial version 1.0 please contact us at sales@insybio.com. COPYRIGHT NOTICE External Publication of InSyBio Ltd - Any InSyBio information that is to be used in advertising, press releases, or promotional materials requires prior written approval from the InSyBio Ltd. A draft of the proposed document should accompany any such request. InSyBio Ltd reserves the right to deny approval of external usage for any reason. Copyright 2015 InSyBio Ltd. Reproduction without written permission is completely forbidden.