SlideShare a Scribd company logo
Institute	of Biochemical Engineering	(IBVT)
Allmandring 31,	70569	Stuttgart,	Germany
www.ibvt.uni-stuttgart.de An	accessibility-
incorporated	method for
accurate prediction of
RNA-RNA	interactions
from sequence data
Richard	Schäfer
29/11/2016
Literature
Seminar
29-Nov-162,	IBVT,	Stuttgart
Integer Programming (IP) with incorporation of precomputed accessibility
Prediction of RNA-RNA interactions
29-Nov-163,	IBVT,	Stuttgart
The function of ncRNAs are often determined by their structure
Motivation - Versatility of RNA
• RNA is able to fold into diverse
structures to be involved in a
number of biological processes
• Experimental determination of
RNA structure is tedious and time-
consuming (e.g., Cryo-EM, SHAPE)
• Computational methods1 are used
to predict structure models with
the focus on secondary structure
• Many ncRNAs interact with other
RNAs (miRNA, siRNA) following
the same base-pairing rules as
with intramolecular interactions
Washietl S, Will S, Hendrix D, Goff, L, Rinn J, Berger B and
Kellis M. (2012) Computational analysis of noncoding RNAs.
WIREs RNA. doi: 10.1002/wrna.1134
1
Zuker M and Stiegler P (1981) Optimal computer folding of
large RNA sequences using thermodynamics and auxiliary
information. Nucleic Acids Res., 9, 133-148
29-Nov-164,	IBVT,	Stuttgart
Classification
Computational Approaches for Prediction of RNA-RNA Interactions
• Concatenation Methods: Prediction of inter- and intramolecular base-pairs by
concatenating the input sequences and applying classical RNA secondary structure
algorithms1 (e.g., Mfold, RNAfold). PairFold and RNAcofold implement this strategy
• Favorable Hybridization Methods: Finding the most stable configuration of the
interaction complex ranging from simple interactions (e.g., RNAhybrid, UNAfold,
RNAduplex) to complex joint structures (e.g., IRIS, SRIG)
• Accessibility Methods: Prediction of intermolecular base-pairs, with consideration
of the accessibility of interaction sites (e.g., IntaRNA, RNAup, RNAplex)
• Experimentally verified interactions sites are significantly more accessible
29-Nov-165,	IBVT,	Stuttgart
Differences	between	functional	and	non-functional	interaction	sites
Accessibility
Richter and Backofen (2012) Accessibility and conservation: General features of bacterial small RNA–mRNA
interactions. RNA Biology, 9:7
29-Nov-166,	IBVT,	Stuttgart
Maximize	the	objective	function	under	some	linear	constraints
Prediction	Model	using	RactIPAce
Kato Y, Mori T, Sato K, Maegawa S, Hosokawa H and Akutsu T (2016) An accessibility-incorporated method for accurate prediction of
RNA-RNA interactions from sequence data. Bioinformatics, 2016, 1-8
• Linear Programming is a method for maximizing/minimizing a function
subject to linear constraints
• Maximize y
subject to -x + y ≤ 1
3x + 2y ≤ 12
2x + 3y ≤ 12
• Integer Programming expects all
variables to be integers
• Maximize to the sum of intra-
and intermolecular base-
pairing probabilities
29-Nov-167,	IBVT,	Stuttgart
Optimization	of	a	linear	objective	function
(Integer)	Linear	Programming	(ILP)	
Creative Commons Integer Programming license under CC BY 2.0
Query RNA sequence a = a1,…,an
Let x = (xij)i<j (1 ≤ i < j ≤ n)
=
x11 ⋯ 0
⋮ ⋱ ⋮
xn1 ⋯ xnn
Query RNA sequence b = b1,…,bm
Let y = (yij)i<j (1 ≤ i < j ≤ n)
=
y11 ⋯ 0
⋮ ⋱ ⋮
ym1 ⋯ ymm
29-Nov-168,	IBVT,	Stuttgart
Matrices	for	secondary	structures
Abstraction	of	inter- and	intramolecular	base-pairs	
Hybridization between a and b
Let z = (zik)
=
z11 ⋯ z1𝑚
⋮ ⋱ ⋮
zn1 ⋯ zn𝑚
where xij = 1 means that bases ai and
aj form an intramolecular base-pair.
where yij = 1 means that bases bi and
bj form an intramolecular base-pair.
where zik = 1 means that bases ai and
bk form an intermolecular base-pair.
29-Nov-169,	IBVT,	Stuttgart
Definition	of the Probabilities for the IP	Formulation
Base-Pairing	and Hybridization Probabilities
• Base-pairing probabilities:
pij
a =	 2 P x	 	a)
6∈89:(<)
pij
b =	 2 P y	 	b)
>∈89:(?)
• Hybridization probabilities:
	pi𝑘
a, b =	 2 P z	 	a, b)
B∈C9D(<,?)
• Probabilities can be calculated with
RNAfold and RNAcofold,
respectively, in the ViennaRNA
package
• Factorization of the probability distribution:
P x, y, z	 	a, b	)	≈ P x	 	a)	P(y	 	b 	P z	 	a, b)
set of all secondary
structures of a with
ai and aj paired
set of all hybridized
structures of a and b
with ai and bk paired
29-Nov-1610,	IBVT,	Stuttgart
Profile	of	an	interaction	site
Accessibility	Probabilities
uij
(a) =
1
Z
2 eI
J 6
KL
6∈89:M
<
where	Z = 2 eI
J 6
KL
6∈8 <
free energy of x
gas temperature
set of all secondary
structures whose
subsequence
ai,…,aj is unpaired
probability that
the sequence
interval ai,…,aj is
unpaired
set of all secondary
structures of a
and	uii(a) = 1 − 2 pij
(a)
:ST
base-pairing
probabilities
between bases ai
and aj in sequence a
29-Nov-1611,	IBVT,	Stuttgart
IP	Formulation
• Maximize
2 pij
a 	
− 	θ xij	 + 2 pij
b − θ yij	 + 	𝛼	
TXY
2 pij
a, b 	
− 	η zik		
T,TXY
subject to 13 constraints
• Each base can be paired with at most one base – (1),(2)
• No intramolecular pseudoknots and crossing interactions – (3),(4),(5)
• Bases in intramolecular interactions are not accessible - (6),(7)
• Regions in intermolecular interactions are always accessible – (8),(9)
• Maximum number of accessible regions and no overlaps – (10)-(13)
29-Nov-1612,	IBVT,	Stuttgart
sRNA-mRNA	pairs	with	experimentally	verified	interaction	sites	
Prediction	Results	with	Positive	Data
SEN PPV MCC time (s)
RactIPAce 0.619 0.842 0.709 14.794
RactIP 0.544 0.388 0.446 11.139
RNAplex-a 0.675 0.687 0.670 6.349
RNAplex 0.687 0.609 0.634 0.707
RNAup 0.325 0.307 0.311 29.901
IntaRNA 0.696 0.686 0.672 19.494
𝑆𝐸𝑁 =	
`a
`abcd
𝑃𝑃𝑉 =
`a
`abca
𝑀𝐶𝐶	 ≈	 𝑆𝐸𝑁	𝑥	𝑃𝑃𝑉
29-Nov-1613,	IBVT,	Stuttgart
New	dataset	of	seven	RNA	pairs
Prediction	Results	with	Positive	and	Negative	Data
RNA-mRNA ID (%) In vitro RactIPAce RactIP RNAplex-a RNAplex RNAup IntaRNA
CP003938-cirA 88 - -1.004 -0.516 -0.004 1.400 1.012 0.247
FR877557-ompT 87 - -1.939 0.314 -0.678 0.155 -0.376 -1.475
CU928163-manX 89 + -1.253 2.105 -1.527 -0.341 -2.286 -2.337
FR775191-ptsG 70 + -1.998 -0.350 -3.065 -0.283 -3.235 -3.589
CP000946-STM3216 88 - 0.476 0.529 -1.496 -0.246 -2.500 -1.310
BC109583-nmpC1 76 - 0.349 0.080 1.030 -0.701 1.454 1.247
AJ277528-nmpC2 75 - -0.205 1.109 -1.783 1.163 0.717 -1.301
Kato Y, Mori T, Sato K, Maegawa S, Hosokawa H and Akutsu T (2016) An accessibility-incorporated method for accurate prediction of
RNA-RNA interactions from sequence data. Bioinformatics, 2016, 1-8
Discriminative power	of each method
Receiver	Operator	Characteristic
29-Nov-1614,	IBVT,	Stuttgart
AUC
RactIPAce 0.697
RactIP 0.614
RNAplex-a 0.773
RNAplex 0.527
RNAup 0.743
IntaRNA 0.882
all 0.891
Kato Y, Mori T, Sato K, Maegawa S, Hosokawa H and Akutsu T (2016) An
accessibility-incorporated method for accurate prediction of RNA-RNA
interactions from sequence data. Bioinformatics, 2016, 1-8
Outlook
23-Jun-1615,	IBVT,	Stuttgart
• RNA-RNA interaction prediction (RIP) on the basis of Integer
Programming (IP) – incorporation of accessibility significantly improves
the prediction strength
• Accurate prediction of interaction sites but problems with discrimination
• Methods still need a range of novel ideas to address the challenges in RIP
(SEN ≈ 0.7) – limits of generalized algorithms? Approaches for specific
classes of interactions
• Techniques of machine learning for prediction algorithms
(e.g., Neural Networks, Support Vector Machines for supervised learning)

More Related Content

Viewers also liked

Quality Control of RNA Samples — For Gene Expression Results You Can Rely On
Quality Control of RNA Samples — For Gene Expression Results You Can Rely OnQuality Control of RNA Samples — For Gene Expression Results You Can Rely On
Quality Control of RNA Samples — For Gene Expression Results You Can Rely On
QIAGEN
 
Materi Genetik (DNA & RNA)
Materi Genetik (DNA & RNA)Materi Genetik (DNA & RNA)
Materi Genetik (DNA & RNA)
Rizki Nurul Zulda
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysis
mikaelhuss
 
NUCLEIC ACIDS: THE RNA
NUCLEIC ACIDS: THE RNANUCLEIC ACIDS: THE RNA
NUCLEIC ACIDS: THE RNA
lily rosemary masilang
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
VHIR Vall d’Hebron Institut de Recerca
 
RNA Interference technology
RNA Interference technologyRNA Interference technology
Biology - Chp 12 - DNA & RNA - PowerPoint
Biology - Chp 12 - DNA & RNA - PowerPointBiology - Chp 12 - DNA & RNA - PowerPoint
Biology - Chp 12 - DNA & RNA - PowerPoint
Mel Anthony Pepito
 
Biochemistry transcription (RNA biosynsthesis)
Biochemistry transcription (RNA biosynsthesis)Biochemistry transcription (RNA biosynsthesis)
Biochemistry transcription (RNA biosynsthesis)
Prabesh Raj Jamkatel
 
Antisense rna technology
Antisense rna technologyAntisense rna technology
Antisense rna technology
Saurav Das
 
5 DNA RNA Protein Synthesis
5 DNA RNA Protein Synthesis5 DNA RNA Protein Synthesis
5 DNA RNA Protein Synthesis
Louis Rosenfeld
 
Rna interfernce ppt
Rna interfernce pptRna interfernce ppt
Rna interfernce ppt
sworna kumari chithiraivelu
 
RNA interference
RNA interferenceRNA interference
RNA interference
Razavi Nader
 
Biology - Chp 12 - DNA & RNA - PowerPoint
Biology - Chp 12 - DNA & RNA - PowerPointBiology - Chp 12 - DNA & RNA - PowerPoint
Biology - Chp 12 - DNA & RNA - PowerPoint
Mr. Walajtys
 
structure types and function of RNA
structure types and function of RNAstructure types and function of RNA
structure types and function of RNA
adnandinmohammed
 
Structure of dna and rna
Structure of dna and rnaStructure of dna and rna
Structure of dna and rna
Himanshu Dev
 
RNA- Structure, Types and Functions
RNA- Structure, Types and FunctionsRNA- Structure, Types and Functions
RNA- Structure, Types and Functions
Namrata Chhabra
 

Viewers also liked (16)

Quality Control of RNA Samples — For Gene Expression Results You Can Rely On
Quality Control of RNA Samples — For Gene Expression Results You Can Rely OnQuality Control of RNA Samples — For Gene Expression Results You Can Rely On
Quality Control of RNA Samples — For Gene Expression Results You Can Rely On
 
Materi Genetik (DNA & RNA)
Materi Genetik (DNA & RNA)Materi Genetik (DNA & RNA)
Materi Genetik (DNA & RNA)
 
RNA-seq differential expression analysis
RNA-seq differential expression analysisRNA-seq differential expression analysis
RNA-seq differential expression analysis
 
NUCLEIC ACIDS: THE RNA
NUCLEIC ACIDS: THE RNANUCLEIC ACIDS: THE RNA
NUCLEIC ACIDS: THE RNA
 
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
Introduction to RNA-seq and RNA-seq Data Analysis (UEB-UAT Bioinformatics Cou...
 
RNA Interference technology
RNA Interference technologyRNA Interference technology
RNA Interference technology
 
Biology - Chp 12 - DNA & RNA - PowerPoint
Biology - Chp 12 - DNA & RNA - PowerPointBiology - Chp 12 - DNA & RNA - PowerPoint
Biology - Chp 12 - DNA & RNA - PowerPoint
 
Biochemistry transcription (RNA biosynsthesis)
Biochemistry transcription (RNA biosynsthesis)Biochemistry transcription (RNA biosynsthesis)
Biochemistry transcription (RNA biosynsthesis)
 
Antisense rna technology
Antisense rna technologyAntisense rna technology
Antisense rna technology
 
5 DNA RNA Protein Synthesis
5 DNA RNA Protein Synthesis5 DNA RNA Protein Synthesis
5 DNA RNA Protein Synthesis
 
Rna interfernce ppt
Rna interfernce pptRna interfernce ppt
Rna interfernce ppt
 
RNA interference
RNA interferenceRNA interference
RNA interference
 
Biology - Chp 12 - DNA & RNA - PowerPoint
Biology - Chp 12 - DNA & RNA - PowerPointBiology - Chp 12 - DNA & RNA - PowerPoint
Biology - Chp 12 - DNA & RNA - PowerPoint
 
structure types and function of RNA
structure types and function of RNAstructure types and function of RNA
structure types and function of RNA
 
Structure of dna and rna
Structure of dna and rnaStructure of dna and rna
Structure of dna and rna
 
RNA- Structure, Types and Functions
RNA- Structure, Types and FunctionsRNA- Structure, Types and Functions
RNA- Structure, Types and Functions
 

Similar to An accessibility-incorporated method for accurate prediction of RNA-RNA interactions from sequence data

FPGA Implementation of a GA
FPGA Implementation of a GAFPGA Implementation of a GA
FPGA Implementation of a GA
Hocine Merabti
 
Developmental Mega Sample: Exploring Inter-Individual Variation
Developmental Mega Sample: Exploring Inter-Individual VariationDevelopmental Mega Sample: Exploring Inter-Individual Variation
Developmental Mega Sample: Exploring Inter-Individual Variation
SaigeRutherford
 
Resume
ResumeResume
PCA-CompChem_seminar
PCA-CompChem_seminarPCA-CompChem_seminar
PCA-CompChem_seminar
Anne D'cruz
 
Q pcr poster-20070314
Q pcr poster-20070314Q pcr poster-20070314
Q pcr poster-20070314
Elsa von Licy
 
Prof. Ramez Daniel, Technion
Prof. Ramez Daniel, TechnionProf. Ramez Daniel, Technion
Prof. Ramez Daniel, Technion
chiportal
 
sasirekha_spectrum_sensing.ppt
sasirekha_spectrum_sensing.pptsasirekha_spectrum_sensing.ppt
sasirekha_spectrum_sensing.ppt
ChaeriahWael2
 
Designing Conservative Reversible N-Bit Binary Comparator for Emerging Quantu...
Designing Conservative Reversible N-Bit Binary Comparator for Emerging Quantu...Designing Conservative Reversible N-Bit Binary Comparator for Emerging Quantu...
Designing Conservative Reversible N-Bit Binary Comparator for Emerging Quantu...
VIT-AP University
 
RNA-Seq with R-Bioconductor
RNA-Seq with R-BioconductorRNA-Seq with R-Bioconductor
Artificial Neural Networks (ANNS) For Prediction of California Bearing Ratio ...
Artificial Neural Networks (ANNS) For Prediction of California Bearing Ratio ...Artificial Neural Networks (ANNS) For Prediction of California Bearing Ratio ...
Artificial Neural Networks (ANNS) For Prediction of California Bearing Ratio ...
IJMER
 
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
IJERD Editor
 
Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...
Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...
Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...
Md Kafiul Islam
 
2015. Pegadaraju Venkatramana. Array Tape Platform and its appliccation in ge...
2015. Pegadaraju Venkatramana. Array Tape Platform and its appliccation in ge...2015. Pegadaraju Venkatramana. Array Tape Platform and its appliccation in ge...
2015. Pegadaraju Venkatramana. Array Tape Platform and its appliccation in ge...
FOODCROPS
 
[2017-05-29] DNASmartTagger
[2017-05-29] DNASmartTagger [2017-05-29] DNASmartTagger
[2017-05-29] DNASmartTagger
Eli Kaminuma
 
Study on Adaptive PID Control Algorithm Based on RBF Neural Network
Study on Adaptive PID Control Algorithm Based on RBF Neural NetworkStudy on Adaptive PID Control Algorithm Based on RBF Neural Network
Study on Adaptive PID Control Algorithm Based on RBF Neural Network
Radita Apriana
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
IRJET Journal
 
Liquid Chromatography
Liquid ChromatographyLiquid Chromatography
Liquid Chromatography
Carmen Sanborn
 
High throughput sequencing with Thermostable Group II Intron Reverse Transcri...
High throughput sequencing with Thermostable Group II Intron Reverse Transcri...High throughput sequencing with Thermostable Group II Intron Reverse Transcri...
High throughput sequencing with Thermostable Group II Intron Reverse Transcri...
Douglas Wu
 
Signal Discrimination in Cells Through A Negative Feedback
Signal Discrimination in Cells Through A Negative FeedbackSignal Discrimination in Cells Through A Negative Feedback
Signal Discrimination in Cells Through A Negative Feedback
Vimalathithan Devaraj
 
Neural network based identification of multimachine power system
Neural network based identification of multimachine power systemNeural network based identification of multimachine power system
Neural network based identification of multimachine power system
csandit
 

Similar to An accessibility-incorporated method for accurate prediction of RNA-RNA interactions from sequence data (20)

FPGA Implementation of a GA
FPGA Implementation of a GAFPGA Implementation of a GA
FPGA Implementation of a GA
 
Developmental Mega Sample: Exploring Inter-Individual Variation
Developmental Mega Sample: Exploring Inter-Individual VariationDevelopmental Mega Sample: Exploring Inter-Individual Variation
Developmental Mega Sample: Exploring Inter-Individual Variation
 
Resume
ResumeResume
Resume
 
PCA-CompChem_seminar
PCA-CompChem_seminarPCA-CompChem_seminar
PCA-CompChem_seminar
 
Q pcr poster-20070314
Q pcr poster-20070314Q pcr poster-20070314
Q pcr poster-20070314
 
Prof. Ramez Daniel, Technion
Prof. Ramez Daniel, TechnionProf. Ramez Daniel, Technion
Prof. Ramez Daniel, Technion
 
sasirekha_spectrum_sensing.ppt
sasirekha_spectrum_sensing.pptsasirekha_spectrum_sensing.ppt
sasirekha_spectrum_sensing.ppt
 
Designing Conservative Reversible N-Bit Binary Comparator for Emerging Quantu...
Designing Conservative Reversible N-Bit Binary Comparator for Emerging Quantu...Designing Conservative Reversible N-Bit Binary Comparator for Emerging Quantu...
Designing Conservative Reversible N-Bit Binary Comparator for Emerging Quantu...
 
RNA-Seq with R-Bioconductor
RNA-Seq with R-BioconductorRNA-Seq with R-Bioconductor
RNA-Seq with R-Bioconductor
 
Artificial Neural Networks (ANNS) For Prediction of California Bearing Ratio ...
Artificial Neural Networks (ANNS) For Prediction of California Bearing Ratio ...Artificial Neural Networks (ANNS) For Prediction of California Bearing Ratio ...
Artificial Neural Networks (ANNS) For Prediction of California Bearing Ratio ...
 
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
“MS-Extractor: An Innovative Approach to Extract Microsatellites on „Y‟ Chrom...
 
Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...
Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...
Poster Presentation on "Artifact Characterization and Removal for In-Vivo Neu...
 
2015. Pegadaraju Venkatramana. Array Tape Platform and its appliccation in ge...
2015. Pegadaraju Venkatramana. Array Tape Platform and its appliccation in ge...2015. Pegadaraju Venkatramana. Array Tape Platform and its appliccation in ge...
2015. Pegadaraju Venkatramana. Array Tape Platform and its appliccation in ge...
 
[2017-05-29] DNASmartTagger
[2017-05-29] DNASmartTagger [2017-05-29] DNASmartTagger
[2017-05-29] DNASmartTagger
 
Study on Adaptive PID Control Algorithm Based on RBF Neural Network
Study on Adaptive PID Control Algorithm Based on RBF Neural NetworkStudy on Adaptive PID Control Algorithm Based on RBF Neural Network
Study on Adaptive PID Control Algorithm Based on RBF Neural Network
 
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
An Optimized Parallel Algorithm for Longest Common Subsequence Using Openmp –...
 
Liquid Chromatography
Liquid ChromatographyLiquid Chromatography
Liquid Chromatography
 
High throughput sequencing with Thermostable Group II Intron Reverse Transcri...
High throughput sequencing with Thermostable Group II Intron Reverse Transcri...High throughput sequencing with Thermostable Group II Intron Reverse Transcri...
High throughput sequencing with Thermostable Group II Intron Reverse Transcri...
 
Signal Discrimination in Cells Through A Negative Feedback
Signal Discrimination in Cells Through A Negative FeedbackSignal Discrimination in Cells Through A Negative Feedback
Signal Discrimination in Cells Through A Negative Feedback
 
Neural network based identification of multimachine power system
Neural network based identification of multimachine power systemNeural network based identification of multimachine power system
Neural network based identification of multimachine power system
 

Recently uploaded

132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
kandramariana6
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
VICTOR MAESTRE RAMIREZ
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
Madan Karki
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Sinan KOZAK
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
Madan Karki
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
KrishnaveniKrishnara1
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
RamonNovais6
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
insn4465
 
Applications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdfApplications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdf
Atif Razi
 
People as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimalaPeople as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimala
riddhimaagrawal986
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
Divyanshu
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
Mahmoud Morsy
 
Data Driven Maintenance | UReason Webinar
Data Driven Maintenance | UReason WebinarData Driven Maintenance | UReason Webinar
Data Driven Maintenance | UReason Webinar
UReason
 
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
Gino153088
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
Madan Karki
 
An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...
IJECEIAES
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
bijceesjournal
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
ecqow
 
Data Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptxData Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptx
ramrag33
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
bijceesjournal
 

Recently uploaded (20)

132/33KV substation case study Presentation
132/33KV substation case study Presentation132/33KV substation case study Presentation
132/33KV substation case study Presentation
 
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student MemberIEEE Aerospace and Electronic Systems Society as a Graduate Student Member
IEEE Aerospace and Electronic Systems Society as a Graduate Student Member
 
Manufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptxManufacturing Process of molasses based distillery ppt.pptx
Manufacturing Process of molasses based distillery ppt.pptx
 
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
Optimizing Gradle Builds - Gradle DPE Tour Berlin 2024
 
spirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptxspirit beverages ppt without graphics.pptx
spirit beverages ppt without graphics.pptx
 
22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt22CYT12-Unit-V-E Waste and its Management.ppt
22CYT12-Unit-V-E Waste and its Management.ppt
 
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURSCompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
CompEx~Manual~1210 (2).pdf COMPEX GAS AND VAPOURS
 
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
哪里办理(csu毕业证书)查尔斯特大学毕业证硕士学历原版一模一样
 
Applications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdfApplications of artificial Intelligence in Mechanical Engineering.pdf
Applications of artificial Intelligence in Mechanical Engineering.pdf
 
People as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimalaPeople as resource Grade IX.pdf minimala
People as resource Grade IX.pdf minimala
 
Null Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAMNull Bangalore | Pentesters Approach to AWS IAM
Null Bangalore | Pentesters Approach to AWS IAM
 
Certificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi AhmedCertificates - Mahmoud Mohamed Moursi Ahmed
Certificates - Mahmoud Mohamed Moursi Ahmed
 
Data Driven Maintenance | UReason Webinar
Data Driven Maintenance | UReason WebinarData Driven Maintenance | UReason Webinar
Data Driven Maintenance | UReason Webinar
 
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
4. Mosca vol I -Fisica-Tipler-5ta-Edicion-Vol-1.pdf
 
Seminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptxSeminar on Distillation study-mafia.pptx
Seminar on Distillation study-mafia.pptx
 
An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...An improved modulation technique suitable for a three level flying capacitor ...
An improved modulation technique suitable for a three level flying capacitor ...
 
Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...Rainfall intensity duration frequency curve statistical analysis and modeling...
Rainfall intensity duration frequency curve statistical analysis and modeling...
 
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
一比一原版(CalArts毕业证)加利福尼亚艺术学院毕业证如何办理
 
Data Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptxData Control Language.pptx Data Control Language.pptx
Data Control Language.pptx Data Control Language.pptx
 
Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...Comparative analysis between traditional aquaponics and reconstructed aquapon...
Comparative analysis between traditional aquaponics and reconstructed aquapon...
 

An accessibility-incorporated method for accurate prediction of RNA-RNA interactions from sequence data

  • 1. Institute of Biochemical Engineering (IBVT) Allmandring 31, 70569 Stuttgart, Germany www.ibvt.uni-stuttgart.de An accessibility- incorporated method for accurate prediction of RNA-RNA interactions from sequence data Richard Schäfer 29/11/2016 Literature Seminar
  • 2. 29-Nov-162, IBVT, Stuttgart Integer Programming (IP) with incorporation of precomputed accessibility Prediction of RNA-RNA interactions
  • 3. 29-Nov-163, IBVT, Stuttgart The function of ncRNAs are often determined by their structure Motivation - Versatility of RNA • RNA is able to fold into diverse structures to be involved in a number of biological processes • Experimental determination of RNA structure is tedious and time- consuming (e.g., Cryo-EM, SHAPE) • Computational methods1 are used to predict structure models with the focus on secondary structure • Many ncRNAs interact with other RNAs (miRNA, siRNA) following the same base-pairing rules as with intramolecular interactions Washietl S, Will S, Hendrix D, Goff, L, Rinn J, Berger B and Kellis M. (2012) Computational analysis of noncoding RNAs. WIREs RNA. doi: 10.1002/wrna.1134 1 Zuker M and Stiegler P (1981) Optimal computer folding of large RNA sequences using thermodynamics and auxiliary information. Nucleic Acids Res., 9, 133-148
  • 4. 29-Nov-164, IBVT, Stuttgart Classification Computational Approaches for Prediction of RNA-RNA Interactions • Concatenation Methods: Prediction of inter- and intramolecular base-pairs by concatenating the input sequences and applying classical RNA secondary structure algorithms1 (e.g., Mfold, RNAfold). PairFold and RNAcofold implement this strategy • Favorable Hybridization Methods: Finding the most stable configuration of the interaction complex ranging from simple interactions (e.g., RNAhybrid, UNAfold, RNAduplex) to complex joint structures (e.g., IRIS, SRIG) • Accessibility Methods: Prediction of intermolecular base-pairs, with consideration of the accessibility of interaction sites (e.g., IntaRNA, RNAup, RNAplex)
  • 5. • Experimentally verified interactions sites are significantly more accessible 29-Nov-165, IBVT, Stuttgart Differences between functional and non-functional interaction sites Accessibility Richter and Backofen (2012) Accessibility and conservation: General features of bacterial small RNA–mRNA interactions. RNA Biology, 9:7
  • 6. 29-Nov-166, IBVT, Stuttgart Maximize the objective function under some linear constraints Prediction Model using RactIPAce Kato Y, Mori T, Sato K, Maegawa S, Hosokawa H and Akutsu T (2016) An accessibility-incorporated method for accurate prediction of RNA-RNA interactions from sequence data. Bioinformatics, 2016, 1-8
  • 7. • Linear Programming is a method for maximizing/minimizing a function subject to linear constraints • Maximize y subject to -x + y ≤ 1 3x + 2y ≤ 12 2x + 3y ≤ 12 • Integer Programming expects all variables to be integers • Maximize to the sum of intra- and intermolecular base- pairing probabilities 29-Nov-167, IBVT, Stuttgart Optimization of a linear objective function (Integer) Linear Programming (ILP) Creative Commons Integer Programming license under CC BY 2.0
  • 8. Query RNA sequence a = a1,…,an Let x = (xij)i<j (1 ≤ i < j ≤ n) = x11 ⋯ 0 ⋮ ⋱ ⋮ xn1 ⋯ xnn Query RNA sequence b = b1,…,bm Let y = (yij)i<j (1 ≤ i < j ≤ n) = y11 ⋯ 0 ⋮ ⋱ ⋮ ym1 ⋯ ymm 29-Nov-168, IBVT, Stuttgart Matrices for secondary structures Abstraction of inter- and intramolecular base-pairs Hybridization between a and b Let z = (zik) = z11 ⋯ z1𝑚 ⋮ ⋱ ⋮ zn1 ⋯ zn𝑚 where xij = 1 means that bases ai and aj form an intramolecular base-pair. where yij = 1 means that bases bi and bj form an intramolecular base-pair. where zik = 1 means that bases ai and bk form an intermolecular base-pair.
  • 9. 29-Nov-169, IBVT, Stuttgart Definition of the Probabilities for the IP Formulation Base-Pairing and Hybridization Probabilities • Base-pairing probabilities: pij a = 2 P x a) 6∈89:(<) pij b = 2 P y b) >∈89:(?) • Hybridization probabilities: pi𝑘 a, b = 2 P z a, b) B∈C9D(<,?) • Probabilities can be calculated with RNAfold and RNAcofold, respectively, in the ViennaRNA package • Factorization of the probability distribution: P x, y, z a, b ) ≈ P x a) P(y b P z a, b) set of all secondary structures of a with ai and aj paired set of all hybridized structures of a and b with ai and bk paired
  • 10. 29-Nov-1610, IBVT, Stuttgart Profile of an interaction site Accessibility Probabilities uij (a) = 1 Z 2 eI J 6 KL 6∈89:M < where Z = 2 eI J 6 KL 6∈8 < free energy of x gas temperature set of all secondary structures whose subsequence ai,…,aj is unpaired probability that the sequence interval ai,…,aj is unpaired set of all secondary structures of a and uii(a) = 1 − 2 pij (a) :ST base-pairing probabilities between bases ai and aj in sequence a
  • 11. 29-Nov-1611, IBVT, Stuttgart IP Formulation • Maximize 2 pij a − θ xij + 2 pij b − θ yij + 𝛼 TXY 2 pij a, b − η zik T,TXY subject to 13 constraints • Each base can be paired with at most one base – (1),(2) • No intramolecular pseudoknots and crossing interactions – (3),(4),(5) • Bases in intramolecular interactions are not accessible - (6),(7) • Regions in intermolecular interactions are always accessible – (8),(9) • Maximum number of accessible regions and no overlaps – (10)-(13)
  • 12. 29-Nov-1612, IBVT, Stuttgart sRNA-mRNA pairs with experimentally verified interaction sites Prediction Results with Positive Data SEN PPV MCC time (s) RactIPAce 0.619 0.842 0.709 14.794 RactIP 0.544 0.388 0.446 11.139 RNAplex-a 0.675 0.687 0.670 6.349 RNAplex 0.687 0.609 0.634 0.707 RNAup 0.325 0.307 0.311 29.901 IntaRNA 0.696 0.686 0.672 19.494 𝑆𝐸𝑁 = `a `abcd 𝑃𝑃𝑉 = `a `abca 𝑀𝐶𝐶 ≈ 𝑆𝐸𝑁 𝑥 𝑃𝑃𝑉
  • 13. 29-Nov-1613, IBVT, Stuttgart New dataset of seven RNA pairs Prediction Results with Positive and Negative Data RNA-mRNA ID (%) In vitro RactIPAce RactIP RNAplex-a RNAplex RNAup IntaRNA CP003938-cirA 88 - -1.004 -0.516 -0.004 1.400 1.012 0.247 FR877557-ompT 87 - -1.939 0.314 -0.678 0.155 -0.376 -1.475 CU928163-manX 89 + -1.253 2.105 -1.527 -0.341 -2.286 -2.337 FR775191-ptsG 70 + -1.998 -0.350 -3.065 -0.283 -3.235 -3.589 CP000946-STM3216 88 - 0.476 0.529 -1.496 -0.246 -2.500 -1.310 BC109583-nmpC1 76 - 0.349 0.080 1.030 -0.701 1.454 1.247 AJ277528-nmpC2 75 - -0.205 1.109 -1.783 1.163 0.717 -1.301 Kato Y, Mori T, Sato K, Maegawa S, Hosokawa H and Akutsu T (2016) An accessibility-incorporated method for accurate prediction of RNA-RNA interactions from sequence data. Bioinformatics, 2016, 1-8
  • 14. Discriminative power of each method Receiver Operator Characteristic 29-Nov-1614, IBVT, Stuttgart AUC RactIPAce 0.697 RactIP 0.614 RNAplex-a 0.773 RNAplex 0.527 RNAup 0.743 IntaRNA 0.882 all 0.891 Kato Y, Mori T, Sato K, Maegawa S, Hosokawa H and Akutsu T (2016) An accessibility-incorporated method for accurate prediction of RNA-RNA interactions from sequence data. Bioinformatics, 2016, 1-8
  • 15. Outlook 23-Jun-1615, IBVT, Stuttgart • RNA-RNA interaction prediction (RIP) on the basis of Integer Programming (IP) – incorporation of accessibility significantly improves the prediction strength • Accurate prediction of interaction sites but problems with discrimination • Methods still need a range of novel ideas to address the challenges in RIP (SEN ≈ 0.7) – limits of generalized algorithms? Approaches for specific classes of interactions • Techniques of machine learning for prediction algorithms (e.g., Neural Networks, Support Vector Machines for supervised learning)