Chikungunya virus (CHIKV), a mosquito-borne alphavirus of the Togaviridae family, has recently emerged in the Americas from two originating continents, Asia and Africa. Historically, CHIKV circulated as at least four lineages worldwide with both enzootic and epidemic transmission cycles. To understand the recent re-emergence patterns, and the current status of the CHIKV spread, an updated phylogeny of the virus is needed. Here, we performed phylogenetic and comparative genomics screens of CHIKV genomes, taking advantage of the public availability of many recently sequenced isolates. Based on these new data and analyses, we derive a revised phylogeny from full open reading frame nucleotide sequences. Within this phylogeny, we propose the presence of several distinct lineages in Africa which were once considered a single lineage. In parallel, thermodynamic modeling of CHIKV untranslated regions (UTRs) reveals a cascade of evolutionarily conserved RNA structures, in particular in the 3' UTR. We provide evidence for the duplication of non-coding RNAs (ncRNAs) in recently emerged Carribean CHIKV isolates and propose the existence of lineage-specific duplicated ncRNAs in different CHIKV lineages.
1. Evolutionary traits in Chikungunya
virus untranslated regions
Michael T. Wolfinger
4th VDS Retreat, Mondsee
26 June 2019
Research Group Bioinformatics and Computational Biology
& Department of Theoretical Chemistry
University of Vienna
3. Chikungunya Virus (CHIKV)
•Family Togaviridae / genus Alphavirus; mosquito-borne (Aedes spp.)
•Single-stranded (+) sense RNA virus
•“Chikungunya” (Makonde language): “Disease that bend up the joints”
•Chikungunya fever: febrile illness, arthralgia, rash, rarely causes hemorrhagic complications
•Enzootic in tropical and subtropical regions of Africa
•First outbreak described 1952 in Tanzania
•No vaccine available
WeaverandForrester,Antivir.Res.(2015)
!3
WeaverandForrester,Antivir.Res.(2015)
4. Alphavirus Genome Organization
• Non-segmented, single-stranded, (+)-sense RNA genomes of 11-12kB length
• Capped and polyadenylated
• AV genomes appear to host cells as mRNA for immediate translation upon entry into the cytoplasm
Viralzone/ExPASy
!4
https://viralzone.expasy.org/
6. CHIKV Epidemic Spread
Frickmannetal.,Viruses(2019)
!6
often associated with temporary outbreak events [9], leading to temporally increased infection risks in
contrast to arbovirus infections such as dengue fever, with continuously increasing case numbers [7,8].
Viruses 2019, 11, x FOR PEER REVIEW 2 of 8
in contrast to arbovirus infections such as dengue fever, with continuously increasing case numbers
[7,8].
Figure 1. Countries or regions with chikungunya virus (CHIKV) endemicity and/or outbreaks
mentioned in this review in the order of appearance (black dots and numerals 1–11). 1. Réunion; 2.
Figure 1. Countries or regions with chikungunya virus (CHIKV) endemicity and/or outbreaks mentioned
12. Conserved RNAs in Viral 3’UTR: ZIKV
!12
5' 3'
ΨDBSL1 SL2 DB 3'-SL
EXCURSION
13. Conserved RNAs in Viral 3’UTR: ZIKV
!13
5' 3'
ΨDBSL1 SL2 DB 3'-SL
• Accumulation of short flavivirus RNA (sfRNA) upon infection
• Stable decay intermediates produced by partial exoribonuclease degradation
• Xrn1 is efficiently stalled at conserved xrRNA structures
EXCURSION
14. Conserved RNAs in Viral 3’UTR
!14
• Accumulation of short flavivirus RNA (sfRNA) upon infection
• Stable decay intermediates produced by partial exoribonuclease degradation
• Xrn1 is efficiently stalled at conserved xrRNA structures
5' 3'
3'
5' 3'CDS
SL1 SL2 DB1 DB2 3'-SLSLA SLB
EXCURSION
15. Conserved RNAs in Viral 3’UTR
!15
• Accumulation of short flavivirus RNA (sfRNA) upon infection
• Stable decay intermediates produced by partial exoribonuclease degradation
• Xrn1 is efficiently stalled at conserved xrRNA structures
5' 3'
3'
5' 3'CDS
SL1 SL2 DB1 DB2 3'-SLSLA SLB
sfRNA mediates
pathogenicity !
EXCURSION
16. Conserved RNAs in Viral 3’UTR
!16
• Accumulation of short flavivirus RNA (sfRNA) upon infection
• Stable decay intermediates produced by partial exoribonuclease degradation
• Xrn1 is efficiently stalled at conserved xrRNA structures
5' 3'
3'
5' 3'CDS
SL1 SL2 DB1 DB2 3'-SLSLA SLB
sfRNA mediates
pathogenicity !
EXCURSION
How to find them?
17. Conserved RNAs in Viral 3’UTR
!17
• Accumulation of short flavivirus RNA (sfRNA) upon infection
• Stable decay intermediates produced by partial exoribonuclease degradation
• Xrn1 is efficiently stalled at conserved xrRNA structures
5' 3'
3'
5' 3'CDS
SL1 SL2 DB1 DB2 3'-SLSLA SLB
sfRNA mediates
pathogenicity !
How to find them?
viruses
Article
Functional RNA Structures in the 30UTR of
Tick-Borne, Insect-Specific and No-Known-Vector
Flaviviruses
Roman Ochsenreiter 1 , Ivo L. Hofacker 1,2 and Michael T. Wolfinger 1,2,*
1 Department of Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090 Vienna, Austria;
romanoch@tbi.univie.ac.at (R.O.); ivo@tbi.univie.ac.at (I.L.H.)
2 Research Group BCB, Faculty of Computer Science, University of Vienna, Währingerstraße 29,
1090 Vienna, Austria
* Correspondence: michael.wolfinger@univie.ac.at
Received: 1 March 2019; Accepted: 20 March 2019; Published: 24 March 2019
Abstract: Untranslated regions (UTRs) of flaviviruses contain a large number of RNA structural
EXCURSION
https://doi.org/10.3390/v11030298
18. • High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
!18
RNA Covariation as Evolutionary Trait
((((((((...((((((((..............))))))))....))......((((......))))))))))
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
.........10........20........30........40........50........60........70..
C
C
A
C
G
G
CC
C
AA
GCGAACAG
A
C
GGU
G
A
U
G
C
G A _
A
C U G U U C G U
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
U
G
G
f
types of pairs:
1 2 3 4 5 6
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
19. • High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
!19
RNA Covariation as Evolutionary Trait
((((((((...((((((((..............))))))))....))......((((......))))))))))
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
.........10........20........30........40........50........60........70..
C
C
A
C
G
G
CC
C
AA
GCGAACAG
A
C
GGU
G
A
U
G
C
G A _
A
C U G U U C G U
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
U
G
G
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
20. !20
RNA Covariation as Evolutionary Trait
((((((((...((((((((..............))))))))....))......((((......))))))))))
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
C
C
A
C
G
G
CC
C
AA
GCGAACAG
A
C
GGU
G
A
U
G
C
G A _
A
C U G U U C G U
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
U
G
G
U
GC
G
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
21. !21
RNA Covariation as Evolutionary Trait
C
C
A
C
G
G
CC
C
AA
ACCAACAA
A
_
___
G
A
C
G
C
G A _
_
_ C C A U C G C
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
C
G
G
C
A_
A
((((((((...((((((((..............))))))))....))......((((......))))))))))
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
22. !22
RNA Covariation as Evolutionary Trait
C
C
A
C
G
G
CC
C
AA
GCCAUGAG
A
_
G_U
G
A
U
G
U
G A _
A
C U C A U G G C
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
U
G
G
C
GC
G
((((((((...((((((((..............))))))))....))......((((......))))))))))
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
23. !23
RNA Covariation as Evolutionary Trait
C
C
A
C
G
G
CC
C
AA
GCCAUGAG
G
_
GGU
G
A
U
G
C
G A _
A
C U C A U G G C
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
C
G
G
C
GU
A
C
G
((((((((...((((((((..............))))))))....))......((((......))))))))))
MVEV.08 CCGCAGCCCGGGCCGGGAGGAGGUGAUGCGA-AC-CCCGGC-GAAGGACUAGAGGUUAGAGGAGACCCUGCGG 70
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
24. !24
RNA Covariation as Evolutionary Trait
C
C
G
C
G
G
CC
C
AA
GCCAUGAG
G
_
GGU
G
A
U
G
U
G A _
A
C C C A U G G C
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
C
G
G
C
GC
A
C
G
((((((((...((((((((..............))))))))....))......((((......))))))))))
WNV.07 ACGCGGCCCAAAUCCUGGUGA-U-GGUGUUA-UGCCAGGGUGGAAGGACUAGAGGUUAGAGGAGACCCCGCGU 70
MVEV.08 CCGCAGCCCGGGCCGGGAGGAGGUGAUGCGA-AC-CCCGGC-GAAGGACUAGAGGUUAGAGGAGACCCUGCGG 70
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
25. !25
RNA Covariation as Evolutionary Trait
C
C
A
C
G
G
CC
C
AA
ACCAUGAG
G
_
_G_
G
A
U
G
C
G A _
A
C C C A U G G C
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
C
G
G
C
AC
A
C
G
((((((((...((((((((..............))))))))....))......((((......))))))))))
JEV.06 CCACGGCCCAAACCUCAUCUA-G-GAUGCAAUAGAUGAGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
WNV.07 ACGCGGCCCAAAUCCUGGUGA-U-GGUGUUA-UGCCAGGGUGGAAGGACUAGAGGUUAGAGGAGACCCCGCGU 70
MVEV.08 CCGCAGCCCGGGCCGGGAGGAGGUGAUGCGA-AC-CCCGGC-GAAGGACUAGAGGUUAGAGGAGACCCUGCGG 70
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
26. !26
RNA Covariation as Evolutionary Trait
C
C
A
C
G
G
CC
C
AA
GCCAUGAG
G
A
_G_
G
A
U
G
C
G A _
A
C C C A U G G C
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
U
G
G
C
GC
A
C
G
((((((((...((((((((..............))))))))....))......((((......))))))))))
JEV.05 CCACGGCCCAAGUCUCGUCCA-G-GAUGCAAUGGACGAGAUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.06 CCACGGCCCAAACCUCAUCUA-G-GAUGCAAUAGAUGAGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
WNV.07 ACGCGGCCCAAAUCCUGGUGA-U-GGUGUUA-UGCCAGGGUGGAAGGACUAGAGGUUAGAGGAGACCCCGCGU 70
MVEV.08 CCGCAGCCCGGGCCGGGAGGAGGUGAUGCGA-AC-CCCGGC-GAAGGACUAGAGGUUAGAGGAGACCCUGCGG 70
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
27. !27
RNA Covariation as Evolutionary Trait
C
C
A
C
G
G
CC
C
AA
GCCAUGAG
G
A
_G_
G
A
U
G
C
G A _
A
C A C A U G G C
G
G A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
U
G
G
C
GA
A
C
G
((((((((...((((((((..............))))))))....))......((((......))))))))))
JEV.04 CCACGGCCCAAGCUUCGUCUA-G-GAUGCAAUAGACGAGGUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.05 CCACGGCCCAAGUCUCGUCCA-G-GAUGCAAUGGACGAGAUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.06 CCACGGCCCAAACCUCAUCUA-G-GAUGCAAUAGAUGAGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
WNV.07 ACGCGGCCCAAAUCCUGGUGA-U-GGUGUUA-UGCCAGGGUGGAAGGACUAGAGGUUAGAGGAGACCCCGCGU 70
MVEV.08 CCGCAGCCCGGGCCGGGAGGAGGUGAUGCGA-AC-CCCGGC-GAAGGACUAGAGGUUAGAGGAGACCCUGCGG 70
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
28. !28
RNA Covariation as Evolutionary Trait
C
C
A
C
G
G
CC
C
AA
GCCAUGAG
G
A
_G_
G
A
U
G
C
G A _
A
C A C A U G G C
G
U A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
U
G
G
C
GA
A
C
G
((((((((...((((((((..............))))))))....))......((((......))))))))))
APCV.03 CCGCGGCCCAACCAGUUCAGACU-GAUGCUA--UGAACUGGGUAAGGACUAGAGGUUAGAGGAGACCCCGCGG 70
JEV.04 CCACGGCCCAAGCUUCGUCUA-G-GAUGCAAUAGACGAGGUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.05 CCACGGCCCAAGUCUCGUCCA-G-GAUGCAAUGGACGAGAUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.06 CCACGGCCCAAACCUCAUCUA-G-GAUGCAAUAGAUGAGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
WNV.07 ACGCGGCCCAAAUCCUGGUGA-U-GGUGUUA-UGCCAGGGUGGAAGGACUAGAGGUUAGAGGAGACCCCGCGU 70
MVEV.08 CCGCAGCCCGGGCCGGGAGGAGGUGAUGCGA-AC-CCCGGC-GAAGGACUAGAGGUUAGAGGAGACCCUGCGG 70
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
29. !29
RNA Covariation as Evolutionary Trait
C
C
A
C
G
G
CC
C
AA
GCCAUGAC
G
A
_G_
G
A
U
G
C
G A _
A
C C C A U G G U
G
U A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
C
G
G
U
GC
A
C
C
((((((((...((((((((..............))))))))....))......((((......))))))))))
KUNV.02 ACGCGGCCCUAGCUCUGGCAA--UGGUGUUA--ACCAGAGUGAAAGGACUAGAGGUUAGAGGAGACCCCGCGU 69
APCV.03 CCGCGGCCCAACCAGUUCAGACU-GAUGCUA--UGAACUGGGUAAGGACUAGAGGUUAGAGGAGACCCCGCGG 70
JEV.04 CCACGGCCCAAGCUUCGUCUA-G-GAUGCAAUAGACGAGGUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.05 CCACGGCCCAAGUCUCGUCCA-G-GAUGCAAUGGACGAGAUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.06 CCACGGCCCAAACCUCAUCUA-G-GAUGCAAUAGAUGAGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
WNV.07 ACGCGGCCCAAAUCCUGGUGA-U-GGUGUUA-UGCCAGGGUGGAAGGACUAGAGGUUAGAGGAGACCCCGCGU 70
MVEV.08 CCGCAGCCCGGGCCGGGAGGAGGUGAUGCGA-AC-CCCGGC-GAAGGACUAGAGGUUAGAGGAGACCCUGCGG 70
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
30. !30
RNA Covariation as Evolutionary Trait
((((((((...((((((((..............))))))))....))......((((......))))))))))
WNV.01 GCGCAGCCAUAACCUUGGUGA--AGGUGUUA--GCCAAGGAGAAGGGACUAGAGGUUAGCAGAGAUCCUGCGC 69
KUNV.02 ACGCGGCCCUAGCUCUGGCAA--UGGUGUUA--ACCAGAGUGAAAGGACUAGAGGUUAGAGGAGACCCCGCGU 69
APCV.03 CCGCGGCCCAACCAGUUCAGACU-GAUGCUA--UGAACUGGGUAAGGACUAGAGGUUAGAGGAGACCCCGCGG 70
JEV.04 CCACGGCCCAAGCUUCGUCUA-G-GAUGCAAUAGACGAGGUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.05 CCACGGCCCAAGUCUCGUCCA-G-GAUGCAAUGGACGAGAUGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
JEV.06 CCACGGCCCAAACCUCAUCUA-G-GAUGCAAUAGAUGAGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGUGG 71
WNV.07 ACGCGGCCCAAAUCCUGGUGA-U-GGUGUUA-UGCCAGGGUGGAAGGACUAGAGGUUAGAGGAGACCCCGCGU 70
MVEV.08 CCGCAGCCCGGGCCGGGAGGAGGUGAUGCGA-AC-CCCGGC-GAAGGACUAGAGGUUAGAGGAGACCCUGCGG 70
ALFV.09 CCACGGCCCGGGCCAUGAGU-GAUGAUGUUA-ACUCAUGGC-GAAGGACUAGAGGUUAGAGGAGACCCCGUGG 70
USUV.10 CCACGGCUCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
USUV.11 UCACGGCCCAAGCGAACAGACGGUGAUGCGA-ACUGUUCGUGGAAGGACUAGAGGUUAGAGGAGACCCCGUGG 72
SLEV.12 CGGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
SLEV.13 CAGCGGCCCAAACCAUGGAG----UGCGUGA---CCAUGGCGUAAGGACUAGAGGUUAGAGGAGACCCCGCUG 66
.........10........20........30........40........50........60........70..
C
C
G
C
G
G
CC
C
AA
GCCAUGGC
G
A
_G_
G
A
U
G
C
G A _
A
G C C A U G G U
G
U A A
G G
A
C
U A
G
A
G
G
U
U A
G
A
G
G
A
G
A
C
C
C
C
G
C
G
G
U
GC
G
G
C
• High mutation rate in RNA viruses due to error-prone RdRP
• For base pair (i,j): GC/CG/AU/UA/GU/UG
• Consistent mutation: different standard combinations
• Compensatory mutation: both positions are mutated
• Presence of both strongly supports predicted base pair (i,j)
LIV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
SGEV T.xrRNA1 T.xrRNA2 T.SL6 T.xrRNA2 Flavi_CRE
TBEV T.xrRNA1 T.SL6 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
OHFV T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
LGTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
DTV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
POWV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like Flavi_CRE
KSIV T.xrRNA1 T.xrRNA2 T.SL6 xrRNA-like
~ 300 nt
b
G
R
Y
R
C
G
G
C
A
G
C
A C
R
C
Y
Y
R
G
Y
G
A
C G G G A
A R
U
G
G
UCG
C
UCCCG
A
C
G
Y
A
R
R
Y
Y
5´
R
G
G
R
Y
R
Y
G
G
C
A
R
C
R
C
R
C
A Y
G R
GG
R
A
R
G
R
Y
G
C
C
C
R
R
R
Y
Y
Y
Y
5´ C
Y
Y
C
R
A
A
G G
Y
A
R
C
R
R R G5´
G
G
C
G
GY
U
C
U U
U
C
U
C
C C
R
A
G
C
Y
C
Y
Y
A
G
A
C
A
C
A G A
U
A
G
U
C
U
R
A C
A
G
RR
R
G
G
R
G
Y
U
Y
G
A
A
A
5´
R
R
R
C
C
C
C
G
R
R
Y
Y
G
G
A G
G
R
G
G
G
A
R
R
R
R
R
RA
A
A
U
U G G
C
A
R
Y
Y
Y
Y
Y
Y
U
C R G
G
A
U
UU
Y
Y
Y
Y
5´
T.xrRNA2 SL6 Y-shaped (Y1) Flavi_CRET.xrRNA1
5' 3'
.((((..((((((((...........))))))))......))))....
SGEV|261-306 CCCCCC-GCACCAUGACAAGGCCGAACAUGGUGCACCAAAGGGGAG-G 46
TBEV|532-577 CCCCCU-GCAUCAUGAUAAGGCCGAACAUGGUGCAUGAAAGGGGAG-G 46
LIV|268-312 CCCCCC-GCCCCAUGACAAGGCCGAACAUGGAGCAUUAAAGG-GAG-G 45
LGTV|339-381 CCCCUU-GCGUCCAGAGAAGGCCGAACUGGGCGU---UAUAAGGAG-G 43
OHFV|179-222 CCCCCC-GCACCAUG-GAAGGCCAAACAUGGUGCAUG-AAGGGAAA-G 44
KFDV|164-204 CCAAAA-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
ALKV|165-205 CCAAAG-CCUCCCAGAGAAGGCCGAACUGGGAGGCC-----AUGAA-G 41
DTV|248-294 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCAUUCUAGAG 47
POWV|249-293 CCAAAAGGCCUCCUG-GAAGGCUCACCAGGAGUUAGGCCGUU-UAG-G 45
TYUV|341-386 CUUUC--CAUGAGAGGAGACGGUCAACUCUCAUGGAACAAGAAGACCG 46
NEGV|129-174 CCCCCCCUGGCCAGAAAAAGGGGGGGCAAACAGGCC--AGGGGUGAAG 46
KSIV|233-270 CUGAC---CAUCCCU-CAAGGCCGAGUGGGAUGC------GUAUGAAG 38
.........10........20........30........40.......
Seq. Conservation:
*
*
c
d f
types of pairs:
1 2 3 4 5 6
SL6
e (((((..((((((((((....)))))))).))...((((((((...........)))))))).........)).)))
POWV|291-361 AGGAG--CCCCCGAGCAUAA---CUCGGGAGGAGGGAGGAAGA-AAAUUGGCAAUCUUCCUCGGGAUUUUUCCGCCU 71
DTV|290-354 UAGAG--CCCCCGGGCAUAA---CUCGGGAGGAGGGGGGAAGA-CAAUUGGCAAUCUUCCCCGGGAUU------UUU 65
LGTV|377-449 GGAGG--CCCCCAGGGGGAAACCCCUGGGAGGAGGGAAGAGAG-AAAUUGGCAACUCUCUUCAGGAUAUUUCC-UCC 73
TBEV|573-645 GGAGG--CCCCCGGAAGCACGCUUCCGGGAGGAGGGAAGAGAG-AAAUUGGCAGCUCUCUUCAGGAUUUUUCC-UCC 73
Y1
allowed pairs:
incompatible
pairs in column
0
1
2
A-U
U-A
G-C
C-G
G-U
U-G
31. CHIKV 3’UTR: Sequence Repeats
!31
IG. 2. Schematic representation of repeated sequence elements (RSE) and lengths of 39-NTRs of 35 alphaviruses (adapted from Strauss and
uss, 1994). Both the lengths of the repeats and the overall lengths of the 39-NTRs have been drawn to scale. The downward-pointing arrow
Pfeffer et al., Virology (1998)
Repeat Sequence Elements
32. CHIKV 3’UTR: Sequence Repeats
!32
IG. 2. Schematic representation of repeated sequence elements (RSE) and lengths of 39-NTRs of 35 alphaviruses (adapted from Strauss and
uss, 1994). Both the lengths of the repeats and the overall lengths of the 39-NTRs have been drawn to scale. The downward-pointing arrow
Chen et al., PLoS Pathog. (2013)
Lineage-specific Direct Repeats (DRs)
Pfeffer et al., Virology (1998)
Repeat Sequence Elements (RSEs)
33. CHIKV 3’UTR: Sequence Repeats
!33
Chen et al., PLoS Pathog.(2013)
Lineage-specific Direct Repeats (DRs)
Pfeffer et al., Virology (1998)
Repeat Sequence Elements (RSEs)
Do they form RNA structures ?
35. CHIKV 3’UTR: Conserved RNA Structures
!35
WA
ECSA-IOL
ECSA-MASA
AUL-Am
AUL-As
AUL-Cbn
CSE
CSE
CSE
CSE
CSE
CSE
G A G A U G U U A U U U U G U U U U U A A U A U U U C
G A G A U G U U A U U U U G U U U U U A A U A U U U C
GAGAUGUUAUUUUGUUUUUAAUAUUUC
GAGAUGUUAUUUUGUUUUUAAUAUUUC
G
A
G
A
U
G
U
U
A
U
U
U
U G U
U
U
U
U
A
A
U
A
U
U
U
C
UG
UA
AU
UG
(((((((((.........)))))))))
KF318729.1/688-714 GAGAUGUUAUUUUGUUUUUAAUAUUUC 27
KT327163.2/862-888 GAGAUGUUAUUUUGUUUUUAAUAUUUC 27
JF274082.1/472-498 GAGAUGUUAUUUUGUUUUUAAUAUUUC 27
KY038946.1/472-498 GAGAUGUUAUUUUGUUUUUAAUAUUUC 27
AY726732.1/540-566 GAGAUGUUAUUUUGUUUUUAAUAUUUC 27
MF001517.1/685-707 GAGAUGUUAUUUUGUUUUUAA----UC 23
.........10........20......
Conserved Sequence Element (CSE)
36. !36
WA
ECSA-IOL
ECSA-MASA
AUL-Am
AUL-As
AUL_Cbn
Y-a CSE
Y-a CSE
Y-a CSE
Y-a Y-b CSE
Y-a Y-b CSE
Y-a Y-b CSE
CHIKV 3’UTR: Conserved RNA Structures
C G G A A G A G A U G U A G G U A C U U A A G C U U C U U A A A A G C A G C C G A A C U C A C U U U G A G A U G U A G G C A U A G C A U A C C G A A C U C U U C C A
C G G A A G A G A U G U A G G U A C U U A A G C U U C U U A A A A G C A G C C G A A C U C A C U U U G A G A U G U A G G C A U A G C A U A C C G A A C U C U U C C A
CGGAAGAGAUGUAGGUACUUAAGCUUCUUAAAAGCAGCCGAACUCACUUUGAGAUGUAGGCAUAGCAUACCGAACUCUUCCA
CGGAAGAGAUGUAGGUACUUAAGCUUCUUAAAAGCAGCCGAACUCACUUUGAGAUGUAGGCAUAGCAUACCGAACUCUUCCA
C G
G
A
A
G
A
G
A
U
G
U
A G
G
U
A
C
U
U
A
A
G
C
U
U
C
U
U A
A
A
A
G
C A
G
C
C
G
A
A C
U
C
A
C U
U
U
G
A
G
A
U
G
UAG
G
C
A
U
A
G
C
AUA
C
C
G
A
A
C
U
C
U
U
C
C ACG
A
U
.(((((((.....(((......((((.....)))).(((...((((...)))).....))).......)))...))))))).
KY038946.1/363-444 CGGAAGAGAUGUAGGUACUUAAGCUUCCUAAAAGCAGCCGAACUCACUUUGAGAUGUAGGCAUAGCAUACCGAACUCUUCCA 82
KF318729.1/395-476 CGGAAGAGAUGUAGGUACUUAAGCUUCUUAAAAGCAGCCGAACUCGCUUUGAGAUGUAGGCGUAGCACACCGAACUCUUCCA 82
MF001517.1/392-473 CGGAAGAGAUGUAGGUACUUAAGCUUCUUAAAAGCAGCCGAACUCGCUUUGAGAUGUAGGCGUAGCACACCGAACUCUUCCA 82
KT327163.2/569-650 CGGAAGAGAUGUAGGUACUUAAGCUUCUUAAAAGCAGCCGAACUCGCUUUGAGAUGUAGGCGUAGCACACCGAACUCUUCCA 82
KF318729.1/580-660 UGGAAGAGACGUAGGUAC-UAAGCUUCUUAAAAGCAGCCGAACUCACUUUGAGAUGUAGGCAUAGCAUACCGAACUCUUCCA 81
MF001517.1/577-657 UGGAAGAGACGUAGGUAC-UAAGCUUCUUAAAAGCAGCCGAACUCACUUUGAGAUGUAGGCAUAGCAUACCGAACUCUUCCA 81
KT327163.2/754-834 UGGAAGAGACGUAGGUAC-UAAGCUUCUUAAAAGCAGCCGAACUCACUUUGAGAUGUAGGCAUAGCAUACCGAACUCUUCCA 81
JF274082.1/363-444 CGGAAGAGAUGUAGGCACUUAAGCUUCCUAAAAGCAGCCGAACUCACUUUGAGAAGUAGGCAUAGCAUACCGAACUCUUCCA 82
AY726732.1/431-512 GAGAAGAGACGUAGGUAUUUAAGCUUCCUAAAAGCAGCCGAACUCACUUUGAGACGUAGGCAUAGCAUACCGAACUCUUCCA 82
.........10........20........30........40........50........60........70........80.
Y-shaped Element (SL-Y)
37. !37
CHIKV 3’UTR: Conserved RNA Structures
Stem-loop a (SL-a)
WA
ECSA-IOL
ECSA-MASA
AUL-Am
AUL-As
AUL-Cbn
Y-a CSE
SL-a Y-a CSE
SL-a Y-a CSE
SL-a Y-a Y-b CSE
SL-a Y-a Y-b CSE
SL-a Y-a Y-b CSE
U G A C A A C U A G G U A U G A A G G C A U A C G C G U C C C U A A A G A G A C A C A C C G C A U A U A G C U A G G A A U C
U G A C A A C U A G G U A U G A A G G C A U A C G C G U C C C U A A A G A G A C A C A C C G C A U A U A G C U A G G A A U C
UGACAACUAGGUAUGAAGGCAUACGCGUCCCUAAAGAGACACACCGCAUAUAGCUAGGAAUC
UGACAACUAGGUAUGAAGGCAUACGCGUCCCUAAAGAGACACACCGCAUAUAGCUAGGAAUC
U G
A
C
A
A
C
U
A
G
G
U
A
U
G
A
A
G
G
C
A
U
A
C G
C G
U
C
C C
U A
A
A
G
AG
A
C
AC
A
C
C
G
C
A
U
A
U
A
G
C
U
A
G
G
A
A
U
C
GC
UA
CG
.((...((((.((((..((.....(.(((.(....).))).).))...)))).))))...))
KF318729.1/3-64 UGACAACUAGGUAUGAAGGCAUACGCGUCCCUAAAGAGACACACCGCAUAUAGCUAGGAAUC 62
MF001517.1/3-64 UGACAACUAGGUAUGAAGGCAUACGCGUCCCUAAAGAGACACACCGCAUAUAGCUAGGAAUC 62
KT327163.2/3-64 UGACAACUAGGUAUGAAGGCAUACGCGUCCCUAAAGAGACACACCGCAUAUAGCUAGGAAUC 62
JF274082.1/3-64 UGACAAUUAAGUAUGAAGGUAUAUGUGUCCCCUAAGAGACACACUGUACAUAGCAAAUAAUC 62
KY038946.1/3-64 CGACAACUAAGUAUGAAGGUAUAUGUGUCCCCUAAGAGACACACUGUACAUAGCUAAUAAUC 62
.........10........20........30........40........50........60.
38. !38
CHIKV 3’UTR: Conserved RNA Structures
Stem-loop b (SL-b)
A G G U A U A U G U G U C C C C U A A G A G A C A C A C C A U A U A U A G C U
A G G U A U A U G U G U C C C C U A A G A G A C A C A C C A U A U A U A G C U
AGGUAUAUGUGUCCCCUAAGAGACACACCAUAUAUAGCU
AGGUAUAUGUGUCCCCUAAGAGACACACCAUAUAUAGCU
A G G
U
A
U
A
U
G
U
G
U
CC
CC
U
A A
G A
G
A
C
A
C A
C
C
A
U
A
U
A
U A G C U
AU
..(((((((((((.(....).)))))...))))))....
KF318729.1/171-209 AGGUAUAUGUGUCCCCUAAGAGACACACCAUAUAUAGCU 39
MF001517.1/168-206 AGGUAUAUGUGUCCCCUAAGAGACACACCAUAUAUAGCU 39
KT327163.2/168-206 AGGUAUAUGUGUCCCCUAAGAGACACACCAUAUAUAGCU 39
KT327163.2/345-383 AGGUAUAUGUGUCCCCUAAGAGACACACCAUAUAUAGCU 39
JF274082.1/149-187 AGGUAUACGUGUCCCCUAAGAGACACAUUGUAUGUAGGU 39
KY038946.1/149-187 AGGUAUACGUGUCCCCUAAGAGACACAUUGUAUGUAGGU 39
AY726732.1/184-222 AGGUAUAAGUGUCCCCUAAGAGACACAUUGUAUGUAGGU 39
AY726732.1/54-92 AGGUAUAUGUGCCCCUUAAGAGGCACAAUAUAUAUAGCU 39
.........10........20........30........
WA
ECSA-IOL
ECSA-MASA
AUL-Am
AUL-As
AUL-Cbn
SL-b SL-b Y-a CSE
SL-a SL-b Y-a CSE
SL-a SL-b Y-a CSE
SL-a SL-b Y-a Y-b CSE
SL-a SL-b Y-a Y-b CSE
SL-a SL-b SL-b Y-a Y-b CSE
39. !39
CHIKV 3’UTR: Conserved RNA Elements
Unstructured repeat (UR)
5′
1
A
C
U
2
U
A
3
U
4
A
5
G
6
A
7
U
8
C
9
U
A
10
A
11
A
12
G
13
G
14
G
15
C
16
C
U
17
A
G
18
U
19
U
C
A
20
U
G
A
21
U
C
22
A
23
A
24
C
25
C
26
C
27
C
28
U29
G30
A31
A32U33
A34
A
G
35
U
36
A
37
A
38
C
39
A
40
A
41
A
42
A
43
C
U
44
A
45
C
U
46
G
A
47
A
48
A
49
A
50
A
51
C
U
52
C
53
A
54
C
A
55
C
U
56
A
57
A
58
A
59
A
60
A
61
C
U
62
U
C
63
A
64
C
U
65
A
66
A
67
A
68
A
69
A
U
70
C
71
G
A
72
A
G
73
A
74
A
75
A
76
A
3′
WA
ECSA-IOL
ECSA-MASA
AUL-Am
AUL-As
AUL-Cbn
SL-b UR-1 SL-b UR-2 UR-3 Y-a CSE
SL-a UR-1 SL-b UR-2 UR-3 Y-a CSE
SL-a UR-1 SL-b UR-2 UR-3 Y-a CSE
SL-a UR-1 SL-b UR-2 UR-3 Y-a UR-4 Y-b CSE
SL-a UR-1 SL-b UR-2 UR-3 Y-a UR-4 Y-b CSE
SL-a UR-1 SL-b UR-2 SL-b UR-3 UR-4 Y-a UR-5 Y-b CSE
41. CHIKV 3’UTR Overview
SL-b UR SL-b UR UR SL-Y CSE
SL-a UR SL-b UR UR SL-Y CSE
SL-a UR SL-b UR UR SL-Y CSE
SL-a UR SL-b UR SL-b UR UR SL-Y UR SL-Y CSE
SL-a UR SL-b UR UR SL-Y UR SL-Y CSE
SL-a UR SL-b UR UR SL-Y UR SL-Y CSE
WA
ECSA-IOL
ECSA-MASA
AUL-Cbn
AUL-Am
AUL-As
500 9007000 250
CHIKV 3'UTRs
AUL-Cbn specific duplication
!41
U G
A
C
A
A
C
U
A
G
G
U
A
U
G
A
A
G
G
C
A
U
A
C G
C G
U
C
C C
U A
A
A
G
AG
A
C
AC
A
C
C
G
C
A
U
A
U
A
G
C
U
A
G
G
A
A
U
C
GC
UA
CG
A G G
U
A
U
A
U
G
U
G
U
CC
CC
U
A A
G A
G
A
C
A
C A
C
C
A
U
A
U
A
U A G C U
AU
C G
G
A
A
G
A
G
A
U
G
U
A G
G
U
A
C
U
U
A
A
G
C
U
U
C
U
U A
A
A
A
G
C A
G
C
C
G
A
A C
U
C
A
C U
U
U
G
A
G
A
U
G
UAG
G
C
A
U
A
G
C
AUA
C
C
G
A
A
C
U
C
U
U
C
C ACG
A
U
C G
G
A
A
G
A
G
A
U
G
U
A G
G
U
A
C
U
U
A
A
G
C
U
U
C
U
U A
A
A
A
G
C A
G
C
C
G
A
A C
U
C
A
C U
U
U
G
A
G
A
U
G
UAG
G
C
A
U
A
G
C
AUA
C
C
G
A
A
C
U
C
U
U
C
C ACG
A
U
G
A
G
A
U
G
U
U
A
U
U
U
U G U
U
U
U
U
A
A
U
A
U
U
U
C
UG
UA
AU
UG
5′
1
A
C
U
2
U
A
3
U
4
A
5
G
6
A
7
U
8
C
9
U
A
10
A
11
A
12
G
13
G
14
G
15
C
16
C
U
17
A
G
18
U
19
U
C
A
20
U
G
A
21
U
C
22
A
23
A
24
C
25
C
26
C
27
C
28
U
29
G
30
A
31
A
32
U
33
A
34
A
G
35
U
36
A
37
A
38
C
39
A
40
A
41
A
42
A
43
C
U
44
A
45
C
U
46
G
A
47
A
48
A
49
A
50
A
51
C
U
52
C
53
A
54
C
A
55
C
U
56
A
57
A
58
A
59
A
60
A
61
C
U
62
U
C
63
A
64
C
U
65
A
66
A
67
A
68
A
69
A
U
70
C
71
G
A
72
A
G
73
A
74
A
75
A
76
A
3′ 5′
1
A
C
U
2
U
A
3
U
4
A
5
G
6
A
7
U
8
C
9
U
A
10
A
11
A
12
G
13
G
14
G
15
C
16
C
U
17
A
G
18
U
19
U
C
A
20
U
G
A
21
U
C
22
A
23
A
24
C
25
C
26
C
27
C
28
U
29
G
30
A
31
A
32
U
33
A
34
A
G
35
U
36
A
37
A
38
C
39
A
40
A
41
A
42
A
43
C
U
44
A
45
C
U
46
G
A
47
A
48
A
49
A
50
A
51
C
U
52
C
53
A
54
C
A
55
C
U
56
A
57
A
58
A
59
A
60
A
61
C
U
62
U
C
63
A
64
C
U
65
A
66
A
67
A
68
A
69
A
U
70
C
71
G
A
72
A
G
73
A
74
A
75
A
76
A
3′ 5′
1
A
C
U
2
U
A
3
U
4
A
5
G
6
A
7
U
8
C
9
U
A
10
A
11
A
12
G
13
G
14
G
15
C
16
C
U
17
A
G
18
U
19
U
C
A
20
U
G
A
21
U
C
22
A
23
A
24
C
25
C
26
C
27
C
28
U
29
G
30
A
31
A
32
U
33
A
34
A
G
35
U
36
A
37
A
38
C
39
A
40
A
41
A
42
A
43
C
U
44
A
45
C
U
46
G
A
47
A
48
A
49
A
50
A
51
C
U
52
C
53
A
54
C
A
55
C
U
56
A
57
A
58
A
59
A
60
A
61
C
U
62
U
C
63
A
64
C
U
65
A
66
A
67
A
68
A69
A
U70
C71
G
A72
A
G73
A74
A75
A
76
A
3′ 5′
1
A
C
U
2
U
A
3
U
4
A
5
G
6
A
7
U
8
C
9
U
A
10
A
11
A
12
G
13
G
14
G
15
C
16
C
U
17
A
G
18
U
19
U
C
A
20
U
G
A
21
U
C
22
A
23
A
24
C
25
C
26
C
27
C
28
U
29
G
30
A
31
A
32
U
33
A
34
A
G
35
U
36
A
37
A
38
C
39
A
40
A
41
A
42
A
43
C
U
44
A
45
C
U
46
G
A
47
A
48
A
49
A
50
A
51
C
U
52
C
53
A
54
C
A
55
C
U
56
A
57
A
58
A
59
A
60
A
61
C
U
62
U
C
63
A
64
C
U
65
A
66
A
67
A
68
A
69
A
U
70
C
71
G
A
72
A
G
73
A
74
A
75
A
76
A
3′
43. Acknowledgements
• Roman Ochsenreiter (TBI)
• Ivo L. Hofacker (TBI)
• Adriano de Bernardi Schneider (UCSD)
Updated phylogeny of Chikungunya virus
reveals lineage-specific non-coding RNAs
Adriano de Bernardi Schneider1,†
, Roman Ochsenreiter2,†
, Reilly Hostager1
,
Ivo L. Hofacker2,3
, Daniel Janies4
, and Michael T. Wolfinger2,3,
1
AntiViral Research Center, Department of Medicine, University of California San Diego, San Diego, 92103, United States of America
2
Department of Theoretical Chemistry, University of Vienna, Währingerstraße 17, 1090 Vienna, Austria
3
Research Group BCB, Faculty of Computer Science, University of Vienna, Währingerstraße 29, 1090 Vienna, Austria
4
Department of Bioinformatics and Genomics, University of North Carolina at Charlotte, Charlotte, 28223, United States of America
†
These authors have contributed equally to this work.
Chikungunya virus (CHIKV), a mosquito-borne alphavirus of paralysis, Guillain Barré Syndrome (GBS) and renal com-