SlideShare a Scribd company logo
1 of 10
Identity and geometry of a base triple in 16S
rRNA determined by comparative sequence
analysis and molecular modeling
PATRICIA BABIN,1
MICHAEL DOLAN,1
PAUL WOLLENZIEN,1
and ROBIN R. GUTELL2
1
Department of Biochemistry, North Carolina State University, Raleigh, North Carolina 27695-7622, USA
2
Institute for Cellular and Molecular Biology and School of Biological Sciences, University of Texas,
Austin, Texas 78712-1095, USA
ABSTRACT
Comparative sequence analysis complements experimental methods for the determination of RNA three-dimensional
structure. This approach is based on the concept that different sequences within the same gene family form similar
higher-order structures. The large number of rRNA sequences with sufficient variation, along with improved covari-
ation algorithms, are providing us with the opportunity to identify new base triples in 16S rRNA. The three-
dimensional conformations for one of our strongest candidates involving U121 (C124:G237) and/or U121 (U125:A236)
(Escherichia coli sequence and numbering) are analyzed here with different molecular modeling tools. Molecular
modeling shows that U121 interacts with C124 in the U121 (C124:G237) base triple. This arrangement maintains
isomorphic structures for the three most frequent sequence motifs (approximately 93% of known bacterial and
archaeal sequences), is consistent with chemical reactivity of U121 in E. coli ribosomes, and is geometrically favor-
able. Further, the restricted set of observed canonical (GU, AU, GC) base-pair types at positions 124:237 and 125:236
is consistent with the fact that the canonical base-pair sets (for both base pairs) that are not observed in nature
prevent the formation of the 121(124:237) base triple. The analysis described here serves as a general scheme for the
prediction of specific secondary and tertiary structure base pairing where there is a network of correlated base
changes.
Keywords: 16S rRNA; base triple; comparative sequence analysis; molecular modeling
INTRODUCTION
Current approaches to determine the three-dimensional
structure for large RNA molecules and ribonucleopro-
teins include electron microscopy (see Penczek et al+,
1994) and low-angle scattering experiments (Svergun
et al+, 1997) to directly investigate global structures+
Structure determination of large RNA molecules addi-
tionally is being guided by the approaches taken to
determine the high-resolution structures of smaller RNA
molecules+ This list includes most notably high-resolution
crystal structures of several tRNAs (Kim et al+, 1973;
see Basavappa & Sigler, 1991), crystal structures of
the P4–P6 domain (Cate et al+, 1996) and catalytic
domain (Golden et al+, 1998) of the Tetrahymena
Group I intron ribozyme, the crystal structure of the
hammerhead ribozyme (Pley et al+, 1994), the crystal
structure of the loop E region of the 5S rRNA (Correll
et al+, 1997), the crystal structure of the Hepatitis Delta
Virus ribozyme (Ferré-D’Amaré et al+, 1998), and NMR-
derived structures for a number of synthetic RNA ap-
tamers (see Uhlenbeck et al+, 1997)+
Crystals of ribosomes and ribosomal subunits with
good diffraction properties have been available for a
number of years (van Bohlen et al+, 1991), but it has
been difficult, because of the size and asymmetry of
the ribosome and the need to obtain isomorphic heavy
atom derivatives, to determine the diffraction phasing
necessary for electron-density maps and three-dimen-
sional structural determination+ The phasing problem
has been solved recently for the Haloarcula marismor-
tui 50S ribosomal subunit (Ban et al+, 1998) leading to
a 9-Å electron density map of the structure+ This should
provide critical information for the organization of the
whole subunit including the active site and its support-
ing structures when the fold of the rRNA in that struc-
ture is determined+ So far, it has been necessary to
infer details of rRNA structure from constraints deter-
mined by different experiments and to incorporate this
Reprint requests to: Paul Wollenzien, Department of Biochemistry,
Box 7622, North Carolina State University, Raleigh, North Carolina
27695-7622, USA; e-mail: wollenz@bchserver+bch+ncsu+edu+
RNA (1999), 5:1430–1439+ Cambridge University Press+ Printed in the USA+
Copyright © 1999 RNA Society+
1430
information into comprehensive models of the struc-
ture+ Chemical probing experiments to determine RNA
reactivity patterns, the determination of structures of
isolated rRNA regions by NMR and the determination
of three-dimensional arrangement by RNA cross-linking
are among the types of experiments that have been
employed in this approach (see Green & Noller, 1997)+
It is anticipated that RNA structure motifs recognized in
smaller RNAs can be applied to the larger rRNA struc-
ture based on chemical reactivity and patterns of se-
quence variance (Brion & Westhof, 1997; Leontis &
Westhof, 1998)+
Base triples, a prominent tertiary structure motif, are
important components in the tertiary structures of tRNAs
and the group I introns+ Several years ago, Gautheret
et al+ (1995) presented improvements in the prediction
of base triples with comparative sequence analysis+
This algorithm identifies the best covariations between
all unpaired nucleotides and known base pairs+ We
have used this method to predict several base triples in
16S and 23S rRNA (Gutell, 1996; R+R+ Gutell, unpubl+
data)+ Here we present one of the best base triple can-
didates in 16S rRNA involving nt 121 and bp 124:237
and/or 125:236 (Escherichia coli numbering)+ Chemi-
cal probing data and molecular modeling are used to
help ascertain which of the proposed base-triple inter-
actions occurs in the RNA, and the conformation of
these interactions+
RESULTS
Identification of base-triple candidates
by comparative sequence analysis
The 122–128/233–239 16S rRNA helix first was pro-
posed with comparative analysis based on the small
number of sequences available in 1980 (Woese et al+,
1980)+ (We will refer to this interaction as helix 122
here)+ With a significant increase in the number of 16S
rRNA sequences and the development of more pow-
erful covariation algorithms (Gutell et al+, 1992; Maidak
et al+, 1999; R+R+ Gutell, S+ Subrashchandran, M+
Schnare, Y+ Du, N+ Lin, L+ Madabusi, K+ Muller, N+ Pande,
N+ Yu, Z+ Shang, S+ Date, D+ Konings, V+ Schweiker, B+
Weiser, & J+ Cannone, in prep+), all of the positions
within the 122–128/233–239 helix have their strongest
statistically significant covariation with their previously
predicted base-pair partner, except for the 125:236 base
pair, where position 125 is nearly always a U in ap-
proximately 5,000 prokaryotic sequences, and position
236 is G in 75% and A in 25% of the sequences, (form-
ing UG and UA base pairs)+
With a smaller set of rRNA sequences, our earlier
analysis revealed several strong base-triple candi-
dates in 16S and 23S rRNA (Gutell, 1996), including
1072(1092:1099) in 23S rRNA, and 121(124:237)/
(125:236) in 16S rRNA+ Recently the putative 23S rRNA
base triple [1072(1092:1099)] has been substantiated
with experimental methods (Conn et al+, 1998, 1999)+
We have repeated this base-triple analysis utilizing the
same methods, but applied them to a larger pro-
karyotic 16S rRNA alignment (Maidak et al+, 1999)+ Sev-
eral strong candidates have been identified, including
595(596:644) and our previous 121(124:237)/(125:236)
base triple+ There is a mutual best covariation between
the unpaired position 595 and the base pair 596:644
with all three algorithms used to evaluate the signifi-
cance of sequence covariations—chi square, pseudo-
phylogenetic event counting (ec), and covary methods
(see Materials and Methods)+ The phylogenetic distri-
bution reveals that alternate versions of the triplets have
evolved independently several times, thus increasing
the likelihood that this comparatively inferred base tri-
ple is real+ Nuclear magnetic resonance analysis of this
region of the 16S rRNA (Kalurachchi & Nikonowicz,
1998) revealed a base triple at these positions+ The
experimental support for the 16S [595(596:644)] and
23S [1072(1092:1099)] rRNA interactions lends credi-
bility to the comparative methods employed here+
Our base-triple covariation analysis was performed
on the most recent (July 1998, version 7+0) release of
the Ribosomal Database Project (RDP) prokaryotic
alignment of 16S rRNA(6205 sequences, including com-
plete and partial sequences) (http://www+ cme+msu+edu/
RDP/download+html; Maidak et al+, 1999)+ The structure
in the 120–130/230–240 region of the 16S rRNA is
conserved in all prokaryotes and chloroplasts+ This
same region is slightly different in the Eucarya nuclear
and mitochondrial 16S-like rRNAs+ Because of this dif-
ference in structural homology, only the prokaryotic and
chloroplast 16S rRNAsequence sets are analyzed here+
The covariation signal between position 121 and the
base pairs 124:237 and 125:236 is very strong+ The
coordinated variation involves five positions+ The chi
square and ec methods identify mutual best covaria-
tions between position 121 and the base pair 125:236,
whereas the newer covary method identifies a mutual
best covariation between 121 and 124:237+ Approxi-
mately 5,000 sequences in the prokaryotic RDP align-
ment had nucleotide information at positions 121 and
124:237 and 125:236+ The dominant triplet sequences
at 121(124:237) are C(GC) [74%], U(AU) [10%], and
U(CG) [7%]+ At 121(125:236) the dominant sequences
are C(UG) [73%] and U(UA) [21%] (Table 1A,B)+
The nucleotide at position 121 determines the base
pairs and their arrangement at 124:237 and 125:236+
Note that position 121 is a pyrimidine in 98% of the
prokaryotic 16S rRNA sequences, with C occurring in
76% of these sequences, U in 22%, and G and A each
with 1% (Table 1A)+ When position 121 is a C, then
124:237 is a GC pair in 98% of the sequences, and
125:236 is a UG in 97% of the sequences (Table 1B)+
When position 121 is a U, then 124:237 is an AU (45%),
CG (30%), UA (15%), or GC (9%), and 125:236 is a UA
Geometry of a base triple in 16S rRNA 1431
(94%), or UG (5%)+ Taken all together, four sequence
motifs occur for these sequences (Table 2): when po-
sition 121 is a C, the 124:237 and 125:236 base pairs
are predominantly GC/UG (76%, motif A), and when
121 is a U, then the 124:237 and 125:236 base pairs
are predominantly AU/UA (10%, motif B), followed by
CG/UA (7%, motif C) or UA/UA (3%, motif D)+
Comparative analysis should reveal, in addition to
the nucleotide frequencies for the positions of interest,
an approximate number of events or times that the
nucleotides have changed coordinately during the evo-
lution of these rRNAs (Gutell et al+ 1986)+ These “phy-
logenetic events” are a gauge for the authenticity of
every base-pair and base-triple interaction predicted
with comparative methods+ Our confidence for each
proposed interaction is proportional to the number of
times a mutual change (covariation) occurs in the phy-
logenetic tree+
The four most frequent sequence combinations at
positions 121(124:237)/(125:236) were mapped onto
the RDP’s prokaryotic phylogenetic tree (Maidak et al+
1999), providing us with the number of occurrences for
each motif in each phylogenetic group (Table 3)+ For
this analysis, we mapped the sequence sets onto a
reduced phylogenetic tree that only contained the pri-
mary branches in the Archaea (i+e+, Crenarchaeota, Eu-
ryarchaeota) and the (eu)Bacteria (e+g+, Cyanobacteria,
Spirochetes, and Proteobacteria)+ The Purple Bacteria
(Proteobacteria) and Gram Positive branches were ex-
panded an additional level (i+e+,Alpha subdivision)+ There
are a few key observations from this analysis+ First, the
most frequent motif—(A), c/gc/ug (at 76%)—occurs in
all but two of the major prokaryotic phylogenetic groups+
Second, the second most abundant motif—(B), u/au/ua
(at 10%)—occurs in the two remaining major phylo-
TABLE 1+ Nucleotide frequencies at positions 121(124:237) (A)
and 121(125:236) (B)+
Aa
121 A C G U
—b
— — 10 AU
— — — 7 CG
— 74 — 2 GC
— — — 3 UA
124:237
Bc
121 A C G U
— 2 — 21 UA
— 73 — — UG
125:236
a
Entries are percentages of 5,056 sequences with nucleotide in-
formation at positions 121(124:237)+
b
All percentages less than 1+5 are shown with a dash+
c
Entries are percentages of 4,939 sequences with nucleotide in-
formation at positions 121(125:236)+
TABLE 2+ Distribution (in percentage of 5,000 prokaryotic
sequences analyzed) of nucleotide identity
at positions 121 (124:237)/(125:236)+
Sequence positions
121 (124:237)/(125:236)
Motif
Name
C GC/UG 76% A
U AU/UA 10% B
U CG/UA 7% C
U UA/UA 3% D
C AU/UA 1% E
U GC/UA 1% F
U GC/UG 1% G
C GC/UA 1% H
TABLE 3+ Distribution of sequence motifs A–D in bacteria
and Archaea+
Sequence motif
A B C D Phylogenetic group
Archaea
41 Crenarchaeota
136 Euryarchaeota
Bacteria
150 3 Cyanobacteria and chloroplasts
25 Fibrobacter phylum
213 Flexibacter-Cytophaga-Bacteroides phylum
3 Fls+sinusarabici assemblage
33 Fusobacteria and relatives
Gram-positive phylum
15 Anaerobic Halophiles
395 Bacillus-Lactobacillus-Streptococcus
subdivision
22 C+ Lituseburense group
32 C+ Purinolyticum group
160 Clostridium and relatives
75 Eubacterium and relatives
803 High GϩC subdivision
179 8 Mycoplasma and relatives
44 Sporomusa and relatives
37 Thermoanaerobacter and relatives
41 Green non-sulfur bacteria and relatives
2 4 Green sulfur bacteria
5 Nitrospina subdivision
7 6 Paraphyletic assemblage
17 10 6 Planctomyces and relatives
Purple bacteria
742 53 Alpha subdivision
247 Beta subdivision
102 Delta subdivision
2 Env+sar121 group
70 Epsilon subdivision
149 77 320 160 Gamma subdivision
11 Uncultured magnetotactic bacteria
151 8 Spirochetes and relatives
1 Thermophilic assemblage
5 Thermophilic oxygen reducers
7 Thermotogales
The distribution for the penta-sequence 121/124:237/125:236 is
subdivided by phylogenetic group+ The four most abundant penta-
sequences are shown where Aϭ c/gc/ug; B ϭ u/au/ua; C ϭ u/cg/ua;
D ϭ u/ua/ua+
1432 P. Babin et al.
genetic groups+ Third, motifs C and D do not occur
exclusively in any of these primary phylogenetic groups;
instead they occur in two groups (Bacteria-Plancto-
myces and Bacteria-Purple Bacteria-Gamma subdivi-
sion) that have other motifs (A and B and A, B, and C)+
Fourth, 23 of the phylogenetic groups have only one
motif, six phylogenetic groups have two motifs, one
group (Bacteria-Planctomyces and relatives) has three
motifs, and one group (Bacteria-Purple Bacteria-Gamma
subdivision) has all four motifs+ Last, the number of
phylogenetic events (times that each of these motifs
evolved) is estimated at approximately 10+ Here we
assume that the most abundant motif—(A), c/gc/ug—
is primordial, and thus any motif other than this one
has evolved [events have occurred in the Bacteria-
Cyanobacteria and Chloroplasts, Bacteria-Fibrobacter,
Bacteria-Flexibacter, Bacteria-Gram Positive-Phylum
Mycoplasma, Bacteria-Green Sulfur Bacteria, etc+ (see
Table 3)]+ Thus there are 13 such events+ However,
because some of the primary phylogenetic groups are
not necessarily monophyletic sister groups and the ac-
tual number of unrelated groups is subject to disagree-
ment, we estimate the number of phylogenetic changes
at these five positions to be no less than five and ap-
proximately 10+
The RDP Prokaryotic/Chloroplast 16S rRNA align-
ment was then analyzed to determine the most
frequently occurring nucleotide sequences in the (122–
129){(232–239) helix for motifs A, B, and C (Fig+ 1)+
The most frequently occurring sequence for each motif
was then used for the molecular modeling experi-
ments+ In addition, the E. coli sequence (which con-
tains motif C sequence at the base triple) was also
used for modeling experiments, as chemical reactivity
data was available for it+
Isomorphism and molecular modeling
in different sequence motifs
Molecular models for the helix 122 region were con-
structed using the constraint-satisfaction program MC-
SYM that constructs models using nucleotide units with
geometries extracted from known structures (Major
et al+, 1991, 1993; Gautheret et al+, 1993)+ This was
done to determine which base-triple interactions could
be incorporated into molecular models of the region+ In
the first exercise, the three most frequent sequence
motifs, A, B, and C, as well as the sequence of E. coli
(Fig+ 1) were used for modeling+
MC-SYM scripts to generate models for the se-
quences shown in Figure 1, A–D, were written with
three assumptions+ First, all of the base pairs in helix
122 contain normal Watson–Crick or wobble base pairs+
Second, the base triples for all the motifs should be
isomorphic+ Third, the uridine at position 121 in E. coli
has chemical reactivity (Moazed et al+, 1986; P+ Wol-
lenzien, unpubl+), indicating that the N3 position of U
121 is not used in the hydrogen bonding in the E. coli
sequence+
The ISOPAIR program (Gautheret & Gutell, 1997)
was used first to determine which conformations would
produce isomorphic base triples for the sequences in
motifs A, B, C, and E. coli (Table 4)+ All four of the
nucleotides in the base pairs 124:237 and 125:236 were
considered as possible hydrogen bonding partners for
121 in the ISOPAIR analysis+
The ISOPAIR analysis for a triple occurring between
nt 121 and the bp 124:237 revealed that all predicted
triples for these nucleotides would occur in the major
groove by means of a hydrogen bond between nt 121
and 124+ Triples with a hydrogen bond between nt 121
TABLE 4+ Summary of structures predicted by ISOPAIR-MCSYM analysis+
Motif
Hydrogen
bonding
patterna
MC-SYM
transformationsb
Hydrogen
bonds
formed
Improper
bond lengths
after minimization?
Most isomorphic
structure?
A 121–124 CG_31 C121(N4)–G124(N7) Yes
121–124 CG_32 C121(N4)–G124(N7)
121–236 CG_32 C121(N4)–G236(N7) N4–N7 H bond distance
B 121–124 AU_50 U121(O4)–A124(N6)
121–124 AU_52 U121(O4)–A124(N6) Yes
121–236 AU_50 U121(O4)–A236(N6) O4–N6 H bond distance
C 121–124 CU_101 U121(O4)–C124(N4)
121–124 CU_104 U121(O4)–C124(N4) Yes
121–236 AU_50 U121(O4)–A236(N6) O4–N6 H bond distance
E. coli 121–124 CU_101 U121(O4)–C124(N4)
121–124 CU_104 U121(O4)–C124(N4) Yes
121–236 AU_50 U121(O4)–A236(N6) O4–N6 H bond distance
a
Hydrogen-bonding patterns are listed for the base-triple interactions which were consistent with molecular models+
b
The “Transformation” designations were obtained in MC-SYM version 1+3 (see WWW+IRO+UMONTREAL+ CA/;MAJOR/
HTML/USERGUIDE+HTML) for the base pairs containing one-hydrogen-bond interactions between the indicated bases+
Geometry of a base triple in 16S rRNA 1433
and 237 were eliminated because they would have in-
volved use of the N3 position of nt 121+ For motif A, the
two isomorphic hydrogen-bonding patterns that were
predicted were C-121-N4 to G-124-N7 and C-121-N4
to G-124-O6+ The first hydrogen-bonding pattern has
the highest degree of isomorphism to the other motifs+
For motifs B and C, the hydrogen-bonding patterns,
which are isomorphic to the C-121-N4 to G-124-N7
pattern, are, respectively U-121-O4 to A-124-N6 and
U-121-O4 to C-124-N4+
The ISOPAIR analysis for a triple occurring between
nt 121 and the bp 125:236 revealed that all predicted tri-
ples for these nucleotides would occur in the major
groove via a hydrogen bond between 121 and 236+
Some isomorphic triples were eliminated because they
would have involved use of the N3 position of nt 121 in
E. coli, including one predicted triple between 121 and
125+ For motifA, one hydrogen-bonding pattern was pre-
dicted for an interaction between 121 and 236: C-121-N4
to G-236-N7+ For motifs B and C, the hydrogen-bond-
ing pattern, which is isomorphic to the C-121-N4 to
G-236-N7 pattern, is U-121-O4 to A-236-N6+
Even though ISOPAIR only predicted triples occur-
ring between positions 121 and 124 or 121 and 236,
additional MC-SYM scripts were written to attempt to
create triples utilizing a hydrogen bond between 121–
125 or 121–237+ Models were generated for a triple
formed by a hydrogen bond between 121 and 125 for
motif A, but isomorphic triples for the other motifs could
not be identified+ It was not possible to generate mod-
els for a triple formed by a hydrogen bond between 121
and 237+ Additionally, even though no isomorphic base
triples were predicted to occur in the minor groove,
MC-SYM scripts were written to attempt to generate a
hydrogen bond between 121 and each of the 4 nt in
their minor groove+ Even large amounts of conforma-
tional freedom were not sufficient to allow models to be
generated in which the hydrogen bond forming the base
triple occurred in the minor groove+
Therefore, the only models to be considered were
those that contained a base triple formed by a hydro-
gen bond between 121 and 124 (C-121-N4 to G-124-N7
for motif A) or 121 and 236 (C-121-N4 to G-236-N7 for
motif A)+ The models from MC-SYM that were most
consistent with A-form RNA geometry were then sub-
jected to energy minimization+ Only the model that uti-
lized a hydrogen bond between 121 and 124 (Fig+ 2A)
could be minimized to acceptable O39-P bond lengths
FIGURE 1. Sequences for helix 122 region used for modeling+ A–D: The most common sequence patterns for the three
major motifs and E. coli+ A: Motif A (121 ϭ C, 124:237 ϭ G:C, and 125:236 ϭ U:G); B: Motif B (121 ϭ U, 124:237 ϭ A:U,
and 125:236 ϭ U:A); C: Motif C (121 ϭ U, 124:237 ϭ C:G, and 125:236 ϭ U:A); D: The sequence for E. coli (Brosius et al+,
1981), the second most common pattern for motif C+ E–H contain “hybrid” sequences designed to examine the cause of the
strong neighbor effect between the adjacent base pairs 124:237 and 125:236+ E: Sequence motif A with the 125:236 ϭ U:G
base pair substituted with G:C+ F: Sequence motif A with substitution of 125:236 ϭ A:U+ G: Sequence motif A with the
124:235 ϭ G:C base pair substituted with C:G+ H: Sequence motif A with the 124:235 base pair substituted with U:A+
1434 P. Babin et al.
and hydrogen-bond lengths (Saenger, 1984)+ The model
containing a base triple formed by a hydrogen bond
between 121 and 236 could not be minimized to an
acceptable hydrogen bond length between 121 and
236 and maintain an acceptable O39-P bond length
between 121 and 122 (Fig+ 2B)+ This analysis was re-
peated with the same results for all motifs+ The models
for all motifs for helix 122 using this base triple were
superimposed using the backbone and ribose atoms
for alignment (Fig+ 3)+ The root mean square (RMS)
deviations between models are listed in Table 5+
ISOPAIR was used to analyze the possibility of isogeo-
metric triples in helix 122 for all motifs, A–H, shown in
Table 2+ Isomorphic triples for all of the motifs except
motif D could be found+ The conformations shown in
Figure 2 were predicted for either comparison of the
ABC or ABCEFGH motifs+ Motif E was modeled as an
example of one of the less frequent motifs+ An isomor-
phic structure for the region utilizing a hydrogen bond
between 121 and 124 was obtained and the structure
containing it could be refined by energy minimization
(data not shown)+ Comparison of the structure for motif
E versus motif A indicates a close similarity in the struc-
ture of the base triple, but a higher degree of differ-
ences in the overall structure (Table 5)+
Sequence bias at the base pair adjacent
to the base triple interaction
The strong correlation between the sequences at
124:237 and 125:236 was investigated to determine if
FIGURE 2. Models for the 121(124:237) triple and the 121(125:236)
triple in motif A+ Both of the molecular models are shown after sub-
jecting them to energy minimization+ A: Model for the 121(124:237)
triple containing a hydrogen bond between the N4 of nt 121 and the
N7 of nt 124+ B: Model for the 121(125:236) triple containing a hy-
drogen bond between the N4 of nt 121 and the N7 of nt 236+ Note
that a suitable hydrogen-bond length cannot be attained in the model
containing the 121(125:236) triple+ The measurements indicated in
both panels are between the hydrogen of the hydrogen-bond donor
and the heavy atom hydrogen-bond acceptor+ Distances between
heavy atoms in both panels also indicate an acceptable hydrogen-
bond distance (Saenger, 1984) in the structure of A but not in B+
FIGURE 3. Superposition of the models for the proposed helix 122 with the base triple 121(125:236)+ A: Superposition of
models of helix 122 for motifs A (red), B (green), C (blue), and E. coli (yellow)+ Alignment was done using the backbone and
ribose atoms+ The atoms are shown only for motif A; the backbones are shown for all four models+ B: Superposition of the
base triple for motifs A, B, C, and E. coli+ The geometry for the triple of the C (blue) and E. coli motifs are nearly identical
and share the same molecular structure in this figure+ See Table 5 for RMS deviation of the models from one another+
TABLE 5+ RMS deviations of models containing base triples+
Motif
Deviation from
ideal A-form
RNA structure of
same sequencea
Deviation of
entire model from
motif A modelb
Deviation of
base triple
from motif A
base tripleb
A 0+74 Å — —
B 0+40 Å 1+32 Å 1+29 Å
C 0+61 Å 1+77 Å 1+93 Å
E. coli 0+61 Å 1+80 Å 1+94 Å
Dc
— — —
E 2+72 Å 1+79 Å 2+02 Å
a
RMS deviation obtained for all atoms+
b
RMS deviation obtained for backbone and ribose atoms+
c
It was not possible to predict structures for motif D that would
contain acceptable covalent and hydrogen bond lengths+
Geometry of a base triple in 16S rRNA 1435
there was a geometrical connection between the bias
and the occurrence of the base triple at 121(124:237)+
When 121 is a C and 124:237 are GC, there are no
occurrences of GC or AU in prokaryotes at the base
pair 125:236+ Hybrid sequences 1 and 2 were created
to investigate this bias+ In these hybrids, the only change
made to the most common sequence for motif A was
the substitution of GC for UG at base pair 125:236
(Fig+ 1E) and the substitution of AU for UG at base pair
125:236 (Fig+ 1F)+ For hybrid 1 and 2 sequences, mod-
els could not be generated for base triples incorporat-
ing a hydrogen bond between nt 121 and 124 (results
not shown)+
Certain base-pair types are also absent at the 124:237
positions+ When 121 is a C and 125:236 are UG, there
are no CG or UA base pairs at 124:237+ To investigate
this bias, additional hybrid models, hybrid 3 and hybrid
4 were created+ Hybrid 3 (Fig+ 1G) is the most common
sequence for motif A with base pair 124:237 changed
from GC to CG+ Hybrid 4 (Fig+ 1H) contains a UA base
pair at base 124:237+ When these hybrid sequences
were modeled, it was possible to generate models for
a base triple incorporating a hydrogen bond between nt
121 and 124 (Fig+ 4)+ However, these models could not
be minimized to acceptable O39-P and hydrogen-bond
lengths (Saenger, 1984)+ Specifically, the O39-P bond
length between nt 121 and 122 was incompatible with
an acceptable bond length for the hydrogen bond be-
tween 121 and 124 to form the base triple+ Thus, the
strong-neighbor effect occurring between base pairs
124:237 and 125:236 can be explained by the require-
ment for a geometric arrangement needed to allow the
formation of the triple at 121(124:237)+
DISCUSSION
Base triples are inherently more difficult to predict
than base pairs (see Gautheret et al+, 1995)+ Al-
though the majority of the secondary structure base
pairs are conserved in all members of a given RNA
type (e+g+, tRNA), this is not the situation for base
triples+ Several base triples in tRNA and group I in-
trons (Michel et al+, 1990; Michel & Westhof, 1990)
are present in only a subset of their structures (e+g+,
in the type-2 tRNAs, or in the C1-2 subgroup of
group I introns)+ Second, although similar (base pair-
ing) conformations are only maintained with posi-
tional covariations, similar conformations in base triples
can be maintained with single, unmatched positional
variation (Klug et al+, 1974)+
The 121(124:237)/(125:236) putative base triple was
identified by three comparative sequence analysis ap-
proaches+ The identification of the base-triple inter-
action within this sequence was investigated further
with ISOPAIR, MC-SYM, and energy minimization to
determine the potential for having isomorphic struc-
tures and to determine the possibility of forming mo-
lecular models+ In the present case, it was possible to
take advantage of chemical reactivity data that was
relevant to the conformations of U121, as U121 was
reactive with CMCT (Moazed et al+, 1986)+ The MC-
SYM exercises resulted in models in which U121 was
hydrogen bonded with either C124 or A236+ However,
MC-SYM models typically need to be constructed ini-
tially with long O39-P bond lengths, and when energy
minimization was performed to determine which mod-
els could be refined to acceptable bond lengths, the
model with a U121-to-A124 hydrogen bond was the
only one in which that could be done+
The interaction that is predicted here involves an ex-
tra hydrogen bond between 16S rRNA nt 121 and 124
in the major groove of the helical region+ This is a dif-
ferent type of interaction than the recent examples of
same-strand near-neighbor interactions in the group I
ribozyme (Cate et al+, 1996) and in hepatitis delta virus
ribozyme (Ferré-D’Amaré et al+, 1998) that occur in the
minor groove of the helix+ However, the base triple in-
teraction in the 16S rRNA at 595(596:644) has been
shown to occur in the major groove (Kalurachchi &
Nikonowicz, 1998) with the hydrogen bond occurring
between nt 595 and 596+ There are also several ex-
amples in tRNA structures of base triple interactions
that involve major groove interactions+ Furthermore, we
are confident that the predictive modeling program MC-
SYM has the capability of constructing minor groove
interactions, because another base triple in 16S rRNA
between nt 494(440:497) is predicted to contain its
extra interaction in the minor groove+ Thus, in spite of
the notoriety of the narrowness of the RNAmajor groove
in regular helices, there is no clear rule about which
groove is used in these types of interactions+
FIGURE 4. Models demonstrating the cause of the strong neighbor
effect+ Both examples shown here have been refined from MC-SYM
structures by energy minimization and contain proper backbone bond
lengths, anti-base conformations and 39-endo ribose conformations+
A: Model for the base triple utilizing a hydrogen bond between C121
and C124 in hybrid 3+ The distance between the hydrogen of N4
(C121) and the O2 (C124) exceeds the maximum acceptable length
of 2+17 Å+ B: Model for the base triple utilizing a hydrogen bond
between C121 and U124 in hybrid 4+ The distance between the O4
(U124) and the hydrogen of N4 (C121) exceeds the maximum ac-
ceptable length of 2+17 Å+ Hydrogen-bond distances measured be-
tween heavy atoms also indicate unacceptably long distances in
both cases (Saenger, 1984)+
1436 P. Babin et al.
By modeling “hybrid” sequences that utilized these
nonoccurring base pairings, we demonstrated that a
base triple in which nt 121 is hydrogen bonded to 124
is consistent with the coordinated set of base pairings
at 124:237 and 125:236+ We conclude that specific base
pairings at 124:237 (e+g+, CG and UA when 121 is U)
and 125:236 (e+g+, CG and AU when 121 is U or C)
were not allowed due to structural constraints at
121(124:237)+ The neighbor effect has been widely ob-
served in tRNAs (Gautheret et al+ 1995), in Group I
introns (Michel & Westhof, 1990), and in 16S and 23S
rRNA (R+ Gutell, unpubl+ data); the work presented here
is a demonstration that it is based at least partly on
structural constraints+
Finally, by modeling the (122–129){(232–239) helix
with MC-SYM, we demonstrated that the conforma-
tions chosen for the triple 121(124:237) in molecules
containing the A, B, and C sequence motifs are part of
models for the entire helical region that are isomorphic
in spite of divergent sequences at many positions+ There
are two types of exception to this conclusion that occur
with the region containing the minor sequence motifs+
The first is that we have not been able to determine
isomorphic structures for motif D (U(UA/UA)) that oc-
curs in 3% of the sequences+ Furthermore the se-
quence motifs E–H that occur in approximately 1% of
our prokaryotic data set can be modeled into base-
triple geometries isomorphic with the major motifs A, B,
and C+ However, the formation of these base triples
requires some distortion in the positions of the riboses
and bases compared to the normal type-A geometry+
The exceptions in motifs D–H occur in a small fraction
(approximately 7% total) of the total number of se-
quences+ Thus the idea of strict isomorphism at this
region extends to about 93% of the prokaryotic 16S
rRNA data set+
In the absence of a base triple in this region, nt 121
would most likely be stacked on nt 122 because of the
favorable stacking interactions+ However the presence
of the base triple involving 121 causes a repositioning
and redirection of the phosphate backbone in the re-
gion of 121+ This may affect the interaction that helix
122 has with adjacent helix (240–242)/(284–286) as
well as the trajectory of the single-stranded region116–
121 in the ribosomal subunit+
MATERIALS AND METHODS
Comparative base-triple analysis
Two comparative methods (Gautheret et al+, 1995) that iden-
tify base triples were used to search for covariations between
the known secondary-structure base pairs and the unpaired
positions+ Method one (chi) calculates the expected and
observed frequencies of base triples and their chi-square
values for all possible base pair and unpaired nucleotide com-
binations+ The base triple candidates with the highest chi-
squared values are considered possible+ Method two (ec and
sec) calculates values for the number of pseudophylogenetic
events+ Because the sequences in our alignments are ar-
ranged in phylogenetic order, the sequences that are most
closely related are adjacent to one another+ The number of
coordinated base changes that have occurred throughout the
evolution of the RNA under study can be approximated by
counting the number of mutual events (or simultaneous
changes) in the two (or three) columns (positions) in the align-
ment+ This number is divided by the total number of changes
that have occurred at the positions under study+ Thus the
maximum score of 1 denotes that all of the positional changes
are involved in mutual events, while a score of 0+5 signifies
that only half of the changes are associated with a mutual
event+
More recently a third base-triple covariation algorithm called
“covary” has been established (R+ Gutell, J+ Cannone, V+
Schweitzer, unpubl+ program; Gutell et al+, in prep+)+ This new-
est method sums the frequencies of those triples that covary
from the most frequent triplet (counting the most frequent
triplet)+ We only consider those triplets where the percentage
of pure covariation (covariation without exceptions) from
the most frequent triplet is greater than 35% of those triplets
that vary from the most frequent triplet; we filter out those
triplets with less than 35% covariation+ For example, for the
(124:237)121 base triple, (GC)C ϭ 74%, (AU)U ϭ 10%,
(CG)U ϭ 7%, (UA)U ϭ 3%, (GC)U ϭ 2%, other ϭ 4%+ Here
the total covariation value ϭ +84, as the only pure triplet co-
variations are (GC)C and (AU)U, and the filter value is 38%
(10/26, where 10 ϭ %(AU)U, 26 ϭ %((AU)U ϩ (CG)U ϩ
(UA)U ϩ (GC)U) ϩ others)+ The base pair that covaries best
with the unpaired position 121 is 124:237 with the covary
score +84, and the unpaired position that covaries best with
the 124:237 base pair is 121, with the same +84 covary score+
This method will be formally presented elsewhere ( R+R+ Gutell,
S+ Subrashchandran, M+ Schnare, Y+ Du, N+ Lin, L+ Madabusi,
K+ Muller, N+ Pande, N+ Yu, Z+ Shang, S+ Date, D+ Konings, V+
Schweiker, B+ Weiser, & J+ Cannone, in prep)+
For all of these methods, putative base triples are consid-
ered possible when the covariation between the base pair
and the unpaired nucleotide is mutual (e+g+, base pair X co-
varies best with unpaired nucleotide Y, and Y covaries best
with X )+ These base-triple “mutual best” covariations are then
evaluated for other considerations, including weaker covari-
ations among the base pairs in the vicinity of the base-triple
base pair [neighbor effect, see Gautheret et al+, (1995)], the
number of times the covariation occurred in the evolution of
the rRNAs, the statistical significance of the triple covaria-
tions, and the exceptions (single and double variations)+ A
putative base triple with a mutual best covariation is consid-
ered more probable when there are neighbor effects flanking
the base pair (of the base triple), and when the base triple
candidate is identified as mutual best with all methods—ec
(phylogenetic events), chi (statistical methods), and covary
(logical method)+ The putative base triples U121(C124-G237)
and/or U121(U125-A236) are the highest-scoring triplet co-
variations with these three methods+
ISOPAIR
ISOPAIR (Gautheret & Gutell, 1997) was used to determine
the hydrogen-bonding patterns that would produce isomor-
Geometry of a base triple in 16S rRNA 1437
phic base triples across the major motifs and the E. coli se-
quence+ Additionally, in preparation for molecular modeling
of helix 122 for all the motifs and the E. coli sequence, the
RDP Prokaryotic/Chloroplast 16S rRNA alignment was then
searched for the most frequently occurring pattern of nucle-
otides in helix 122 for motifs A, B, and C+
Molecular modeling
MC-SYM 1+3 scripts (Major et al+, 1991) were written to de-
termine which of the base-paired nucleotides (124–237 or
125–236) were capable of forming a hydrogen bond to the
single-stranded nt 121+ In all MC-SYM scripts, standard glo-
bal constraints contained in the sample scripts for the pro-
gram were used to minimize van der Waals’ overlaps in the
generated models and all scripts were written to generate
models that were as close to A-form RNA as possible+ Be-
cause the formation of the base triple requires an exceptional
geometry for nt 121, the O39-P bonding distances between
adjacent nucleotides needed to be initially set at 6+5 Å to
allow for searching of conformational space that would result
in the formation of models+ This is reasonably consistent with
the default value of 6 Å used in the ADJACENCY section of
the MC-SYM program (http://www+iro+umontreal+ ca/;major/
HTML/mcsym+ug+html)+ This is well beyond the normal max-
imum of 1+62 Å for this bond; however, all the O39-P bond
lengths in the models are easily adjusted to the optimal length
of 1+58–1+62 Å by energy minimization+ For helix 122, very
few models (and thus very few conformations) were gener-
ated by MC-SYM for each script and therefore clustering of
models into families was not required+
Energy minimization
All structures considered isomorphic by the ISOPAIR analy-
sis at the base triple were minimized using Insight II (Molec-
ular Simulations, Inc+)+ Global constraints were used in the
MC-SYM scripts, so van der Waals’ overlaps were minimal
and the main deviation of the structures generated by MC-
SYM from acceptable nucleotide geometries was the O39-P
distances between adjacent nucleotides+ Ninety rounds of
steepest-descent minimization followed by ten steps of con-
jugate gradient minimization using the AMBER force field
were used+ The geometry of the helical regions of the mini-
mized structures (bp 122:239–127:234) were determined by
measuring the RMS deviation of the structures from an ideal
type-A RNA helix of the same sequence and the geometry of
nt 121 was determined by measuring its dihedral angles and
bond lengths+
ACKNOWLEDGMENTS
This work was supported by National Institutes of Health
(NIH) grants GM48207 to R+R+G+ and GM43237 to P+W+ Rob-
ert Cedergren initially suggested the use of MC-SYM in mod-
eling rRNA regions and we gratefully acknowledge his insight
into this problem+ Francois Major is thanked for his com-
ments and suggestions in the use of MC-SYM, Daniel Gauth-
eret for making the program ISOPAIR available, Vi Schweitzer,
Jamie Cannone, Sankarasubramanian Subashchandran for
developmental work on the covariation algorithms+
Received March 24, 1999; returned for revision May 5,
1999; revised manuscript received August 7, 1999
REFERENCES
Ban N, Freeborn B, Nissen P, Penczek P, Grassucci R, Sweet R,
Frank J, Moore P, Steitz T+ 1998+ A 9 Å resolution X-ray crystal-
lographic map of the large ribosomal subunit+ Cell 93:1105–1115+
Basavappa R, Sigler PB+ 1991+ The 3 Å crystal structure of yeast
initiator tRNA: Functional implications in initiator/elongator dis-
crimination+ EMBO J 10:3105–3111+
Brion P, Westhof E+ 1997+ Hierarchy and dynamics of RNA folding+
Annu Rev Biophys Biomol Struct 26:113–137+
Brosius J, Dull TJ, Sleeter DD, Noller HF+ 1981+ Gene organization
and primary structure of a ribosomal RNA operon from Esche-
richia coli+ J Mol Biol 148:107–127+
Cate JH, Gooding AR, Podell E, Zhou K, Golden BL, Kundrot CE,
Cech TR, Doudna JA+ 1996+ Crystal structure of a group I ribo-
zyme domain: Principles of RNApacking+ Science 273:1678–1685+
Conn GL, Draper DE, Lattman EE, Gittis AG+ 1999+ Crystal structure
of a conserved ribosomal protein–RNA complex+ Science 284:
1171–1174+
Conn GL, Gutell RR, Draper DE+ 1998+ A functional ribosomal RNA
tertiary structure involves a base triple interaction+ Biochemistry
37:11980–11988+
Correll CC, Freeborn B, Moore PB, Steitz TA+ 1997+ Metals, motifs,
and recognition in the crystal structure of a 5S rRNA domain+ Cell
91:705–712+
Ferré-D’Amaré AR, Zhou K, Doudna JA+ 1998+ Crystal structure of a
hepatitis delta virus ribozyme+ Nature 395:567–574+
Gautheret D, Damberger SH, Gutell RR+ 1995+ Identification of base-
triples in RNA using comparative sequence analysis+ J Mol Biol
248:27–43+
Gautheret D, Gutell RR+ 1997+ Inferring the conformation of RNA
base pairs and triples from patterns of sequence variation+ Nu-
cleic Acids Res 25:1559–1564+
Gautheret D, Major F, Cedergren R+ 1993+ Modeling the three-
dimensional structure of RNA using discrete nucleotide confor-
mational sets+ J Mol Biol 229:1049–1064+
Golden BL, Gooding AR, Podell ER, Cech TR+ 1998+ A preorganized
active site in the crystal structure of the Tetrahymena ribozyme+
Science 282:259–264+
Green R, Noller HF+ 1997+ Ribosomes and translation+ Annu Rev
Biochem 66:679–716+
Gutell RR+ 1996+ Comparative sequence analysis and the structure
of 16S and 23S rRNA+ In: Zimmermann RA, Dahlberg AE, eds+
Ribosomal RNA: Structure, evolution, processing, and function in
protein synthesis+ Boca Raton, Florida: CRC Press+ pp 111–128+
Gutell RR, Noller HF, Woese CR+ 1986+ Higher order structure in
ribosomal RNA+ EMBO J 5:1111–1113+
Gutell RR, Power A, Hertz G, Putt E, Stormo G+ 1992+ Identifying
constraints on the higher-order structure of RNA: Continued de-
velopment and application of comparative sequence analysis meth-
ods+ Nucleic Acids Res 20:5785–5795+
Kalurachchi K, Nikonowicz EP+ 1998+ NMR structure determination of
the binding site for ribosomal protein S8 from Escherichia coli 168
rRNA+ J Mol Biol 280:639–654+
Kim S-H, Quigley FL, Suddath FL, McPherson A, Sneden D, Rim JJ,
Weinzierl J, Rich A+ 1973+ Three-dimensional structure of yeast
phenylalanine transfer RNA: Folding of the polynucleotide chain+
Science 179:285–288+
Klug A, Ladner J, Robertus JD+ 1974+ The structural geometry of
co-ordinated base changes in transfer RNA+ J Mol Biol 89:511–516+
Leontis NB, Westhof E+ 1998+ The 5S rRNA loop E: Chemical probing
and phylogenetic data versus crystal structure+ RNA 4:1134–1153+
Maidak BL, Cole JR, Parker CT Jr, Garrity GM, Larsen N, Li B, Lilburn
TG, McCaughey MJ, Olsen GJ, Overbeek R, Pramanik S, Schmidt
TM, Tiedje JM, Woese CR+ 1999+ A new version of the RDP+
Nucleic Acids Res 27:171–173+
Major F, Gautheret D, Cedergren R+ 1993+ Reproducing the three-
dimensional structure of a tRNA molecule from structural con-
straints+ Proc Natl Acad Sci USA 90:9408–9412+
1438 P. Babin et al.
Major F, Turcotte M, Gautheret D, Lapalme G, Fillion E, Cedergren R+
1991+ The combination of symbolic and numerical computa-
tion for three-dimensional modeling of RNA+ Science 253:1255–
1260+
Michel F, Ellington AD, Couture S, Szostak JW+ 1990+ Phylogenetic
and genetic evidence for base-triples in the catalytic domain of
group I introns+ Nature 347:578–580+
Michel F, Westhof E+ 1990+ Modelling of the three-dimensional archi-
tecture of Group I catalytic introns base on comparative se-
quence analysis+ J Mol Biol 216:585–610+
Moazed D, Stern S, Noller HF+ 1986+ Rapid chemical probing of
conformation in 16S ribosomal RNA and 308 ribosomal subunits
using primer extension. J Mol Biol 187:399–416+
Penczek P, Brassucci RA, Frank J+ 1994+ The ribosome at improved
resolution: New techniques for merging and orientation refine-
ment in 3D cryo-electron microscopy of biological particles+ Ultra-
microscopy 53:251–270+
Pley HW, Flaherty KM, McKay DB+ 1994+ Three-dimensional struc-
ture of a hammerhead ribozyme+ Nature 372:68–74+
Saenger W+ 1984+ Principles of nucleic acid structure+ New York:
Springer-Verlag+
Svergun DI, Burkhardt N, Pedersen JS, Koch MHJ, Volkov VV, Kozin
MB, Meerwinck W, Stuhrmann HB, Diederich G, Nierhaus KH+
1997+ Structural analysis of the 70S E. coli ribosome and its RNA
by solution scattering: I+ Invariants and validation of electron mi-
croscopic models+ J Mol Biol 271:588–601+
Uhlenbeck OC, Pardi A, Feigon J+ 1997+ RNA structure comes of age+
Cell 90:833–840+
van Bohlen K, Makowski I, Hansen HAS, Bartesl H, Berkovitch-Yelin
Z, Zaytzev A, Meyer S, Paullre C, Franscheschi F, Yonath A+
1991+ Characterization and preliminary attempts for derivatiza-
tion of crystals of large ribosomal subunits from Haloarcula maris-
mortui diffracting to 3 Å resolution. J Mol Biol 222:11–15+
Woese CR, Magrum LJ, Gupta R, Siegel RB, Stahl DA, Kop J, Craw-
ford N, Brosius J, Gutell RR, Hogan JJ, Noller HF+ 1980+ Second-
ary structure model for bacterial 16S ribosomal RNA: Phylogenetic,
enzymatic and chemical evidence+ Nucleic Acids Res 8:2275–
2293+
Geometry of a base triple in 16S rRNA 1439

More Related Content

Viewers also liked

Gutell 069.mpe.2000.15.0083
Gutell 069.mpe.2000.15.0083Gutell 069.mpe.2000.15.0083
Gutell 069.mpe.2000.15.0083Robin Gutell
 
Document32 1-130422122353-phpapp01
Document32 1-130422122353-phpapp01Document32 1-130422122353-phpapp01
Document32 1-130422122353-phpapp01Carloz Rodriguez
 
Conclucion de la vida de san marcelino champagnat
Conclucion de la vida de san marcelino champagnatConclucion de la vida de san marcelino champagnat
Conclucion de la vida de san marcelino champagnatCesar Lopez Lopez
 
La enseñanza de estrategias de comprensión lectora
La enseñanza de estrategias de comprensión lectoraLa enseñanza de estrategias de comprensión lectora
La enseñanza de estrategias de comprensión lectoradeysi05diaz
 
Biog rafia shakira mebarak ripoll
Biog rafia   shakira mebarak ripollBiog rafia   shakira mebarak ripoll
Biog rafia shakira mebarak ripollAReli LuNa Lucas
 
Gutell 067.nass.1999.41.0048
Gutell 067.nass.1999.41.0048Gutell 067.nass.1999.41.0048
Gutell 067.nass.1999.41.0048Robin Gutell
 
First aid kit slideshare
First aid kit slideshareFirst aid kit slideshare
First aid kit slidesharerobinashley1
 
Gutell 066.biochemistry.1998.37.11980
Gutell 066.biochemistry.1998.37.11980Gutell 066.biochemistry.1998.37.11980
Gutell 066.biochemistry.1998.37.11980Robin Gutell
 
Emprendimiento de un plan de negocios
Emprendimiento de un plan de negociosEmprendimiento de un plan de negocios
Emprendimiento de un plan de negociosLiliana Beral
 
Expo taller de escritores
Expo taller de escritoresExpo taller de escritores
Expo taller de escritoresdeysi05diaz
 
Diarrea aguda y cronicas.
Diarrea aguda y cronicas.Diarrea aguda y cronicas.
Diarrea aguda y cronicas.marlemin
 
Xx mistrzostwa pierwszej pomocy pck1
Xx mistrzostwa pierwszej pomocy pck1Xx mistrzostwa pierwszej pomocy pck1
Xx mistrzostwa pierwszej pomocy pck1callieee
 
Expo taller de escritores
Expo taller de escritoresExpo taller de escritores
Expo taller de escritoreshilde121893
 
Gutell 070.dao.diseases.aquat.organ.2000.41.0031
Gutell 070.dao.diseases.aquat.organ.2000.41.0031Gutell 070.dao.diseases.aquat.organ.2000.41.0031
Gutell 070.dao.diseases.aquat.organ.2000.41.0031Robin Gutell
 
Curso de induccion 2013 33
Curso de induccion 2013 33Curso de induccion 2013 33
Curso de induccion 2013 33jesusUIN
 
Fedorenko_Denis iv hematology forum_2016_st.p
Fedorenko_Denis iv hematology forum_2016_st.pFedorenko_Denis iv hematology forum_2016_st.p
Fedorenko_Denis iv hematology forum_2016_st.pEAFO2014
 

Viewers also liked (20)

Gutell 069.mpe.2000.15.0083
Gutell 069.mpe.2000.15.0083Gutell 069.mpe.2000.15.0083
Gutell 069.mpe.2000.15.0083
 
Colunistas galo final
Colunistas galo finalColunistas galo final
Colunistas galo final
 
Document32 1-130422122353-phpapp01
Document32 1-130422122353-phpapp01Document32 1-130422122353-phpapp01
Document32 1-130422122353-phpapp01
 
Conclucion de la vida de san marcelino champagnat
Conclucion de la vida de san marcelino champagnatConclucion de la vida de san marcelino champagnat
Conclucion de la vida de san marcelino champagnat
 
La enseñanza de estrategias de comprensión lectora
La enseñanza de estrategias de comprensión lectoraLa enseñanza de estrategias de comprensión lectora
La enseñanza de estrategias de comprensión lectora
 
Biog rafia shakira mebarak ripoll
Biog rafia   shakira mebarak ripollBiog rafia   shakira mebarak ripoll
Biog rafia shakira mebarak ripoll
 
Gutell 067.nass.1999.41.0048
Gutell 067.nass.1999.41.0048Gutell 067.nass.1999.41.0048
Gutell 067.nass.1999.41.0048
 
First aid kit slideshare
First aid kit slideshareFirst aid kit slideshare
First aid kit slideshare
 
No aeroporto,
No aeroporto,No aeroporto,
No aeroporto,
 
Gutell 066.biochemistry.1998.37.11980
Gutell 066.biochemistry.1998.37.11980Gutell 066.biochemistry.1998.37.11980
Gutell 066.biochemistry.1998.37.11980
 
David histo
David histoDavid histo
David histo
 
Emprendimiento de un plan de negocios
Emprendimiento de un plan de negociosEmprendimiento de un plan de negocios
Emprendimiento de un plan de negocios
 
Expo taller de escritores
Expo taller de escritoresExpo taller de escritores
Expo taller de escritores
 
Curriculum berenice aguilar
Curriculum berenice aguilarCurriculum berenice aguilar
Curriculum berenice aguilar
 
Diarrea aguda y cronicas.
Diarrea aguda y cronicas.Diarrea aguda y cronicas.
Diarrea aguda y cronicas.
 
Xx mistrzostwa pierwszej pomocy pck1
Xx mistrzostwa pierwszej pomocy pck1Xx mistrzostwa pierwszej pomocy pck1
Xx mistrzostwa pierwszej pomocy pck1
 
Expo taller de escritores
Expo taller de escritoresExpo taller de escritores
Expo taller de escritores
 
Gutell 070.dao.diseases.aquat.organ.2000.41.0031
Gutell 070.dao.diseases.aquat.organ.2000.41.0031Gutell 070.dao.diseases.aquat.organ.2000.41.0031
Gutell 070.dao.diseases.aquat.organ.2000.41.0031
 
Curso de induccion 2013 33
Curso de induccion 2013 33Curso de induccion 2013 33
Curso de induccion 2013 33
 
Fedorenko_Denis iv hematology forum_2016_st.p
Fedorenko_Denis iv hematology forum_2016_st.pFedorenko_Denis iv hematology forum_2016_st.p
Fedorenko_Denis iv hematology forum_2016_st.p
 

Similar to Gutell 068.rna.1999.05.1430

Gutell 059.fold.design.01.0419
Gutell 059.fold.design.01.0419Gutell 059.fold.design.01.0419
Gutell 059.fold.design.01.0419Robin Gutell
 
Gutell 080.bmc.bioinformatics.2002.3.2
Gutell 080.bmc.bioinformatics.2002.3.2Gutell 080.bmc.bioinformatics.2002.3.2
Gutell 080.bmc.bioinformatics.2002.3.2Robin Gutell
 
Gutell 075.jmb.2001.310.0735
Gutell 075.jmb.2001.310.0735Gutell 075.jmb.2001.310.0735
Gutell 075.jmb.2001.310.0735Robin Gutell
 
Gutell 081.cosb.2002.12.0301
Gutell 081.cosb.2002.12.0301Gutell 081.cosb.2002.12.0301
Gutell 081.cosb.2002.12.0301Robin Gutell
 
Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105Robin Gutell
 
Gutell 028.cosb.1993.03.0313
Gutell 028.cosb.1993.03.0313Gutell 028.cosb.1993.03.0313
Gutell 028.cosb.1993.03.0313Robin Gutell
 
Gutell 002.nar.1981.09.06167
Gutell 002.nar.1981.09.06167Gutell 002.nar.1981.09.06167
Gutell 002.nar.1981.09.06167Robin Gutell
 
Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769Robin Gutell
 
Gutell 087.mpe.2003.29.0216
Gutell 087.mpe.2003.29.0216Gutell 087.mpe.2003.29.0216
Gutell 087.mpe.2003.29.0216Robin Gutell
 
Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473Robin Gutell
 
Gutell 101.physica.a.2007.386.0564.good
Gutell 101.physica.a.2007.386.0564.goodGutell 101.physica.a.2007.386.0564.good
Gutell 101.physica.a.2007.386.0564.goodRobin Gutell
 
Gutell 054.jmb.1996.256.0701
Gutell 054.jmb.1996.256.0701Gutell 054.jmb.1996.256.0701
Gutell 054.jmb.1996.256.0701Robin Gutell
 
Gutell 010.jbc.1984.259.05173
Gutell 010.jbc.1984.259.05173Gutell 010.jbc.1984.259.05173
Gutell 010.jbc.1984.259.05173Robin Gutell
 
Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497Robin Gutell
 
Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013Robin Gutell
 
Gutell 098.jmb.2006.360.0978
Gutell 098.jmb.2006.360.0978Gutell 098.jmb.2006.360.0978
Gutell 098.jmb.2006.360.0978Robin Gutell
 
Gutell 043.curg.1994.26.0321
Gutell 043.curg.1994.26.0321Gutell 043.curg.1994.26.0321
Gutell 043.curg.1994.26.0321Robin Gutell
 
Gutell 001.nar.1980.08.02275
Gutell 001.nar.1980.08.02275Gutell 001.nar.1980.08.02275
Gutell 001.nar.1980.08.02275Robin Gutell
 
Gutell 056.mpe.1996.05.0391
Gutell 056.mpe.1996.05.0391Gutell 056.mpe.1996.05.0391
Gutell 056.mpe.1996.05.0391Robin Gutell
 
Gutell 062.jmb.1997.267.1104
Gutell 062.jmb.1997.267.1104Gutell 062.jmb.1997.267.1104
Gutell 062.jmb.1997.267.1104Robin Gutell
 

Similar to Gutell 068.rna.1999.05.1430 (20)

Gutell 059.fold.design.01.0419
Gutell 059.fold.design.01.0419Gutell 059.fold.design.01.0419
Gutell 059.fold.design.01.0419
 
Gutell 080.bmc.bioinformatics.2002.3.2
Gutell 080.bmc.bioinformatics.2002.3.2Gutell 080.bmc.bioinformatics.2002.3.2
Gutell 080.bmc.bioinformatics.2002.3.2
 
Gutell 075.jmb.2001.310.0735
Gutell 075.jmb.2001.310.0735Gutell 075.jmb.2001.310.0735
Gutell 075.jmb.2001.310.0735
 
Gutell 081.cosb.2002.12.0301
Gutell 081.cosb.2002.12.0301Gutell 081.cosb.2002.12.0301
Gutell 081.cosb.2002.12.0301
 
Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105Gutell 090.bmc.bioinformatics.2004.5.105
Gutell 090.bmc.bioinformatics.2004.5.105
 
Gutell 028.cosb.1993.03.0313
Gutell 028.cosb.1993.03.0313Gutell 028.cosb.1993.03.0313
Gutell 028.cosb.1993.03.0313
 
Gutell 002.nar.1981.09.06167
Gutell 002.nar.1981.09.06167Gutell 002.nar.1981.09.06167
Gutell 002.nar.1981.09.06167
 
Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769Gutell 108.jmb.2009.391.769
Gutell 108.jmb.2009.391.769
 
Gutell 087.mpe.2003.29.0216
Gutell 087.mpe.2003.29.0216Gutell 087.mpe.2003.29.0216
Gutell 087.mpe.2003.29.0216
 
Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473Gutell 114.jmb.2011.413.0473
Gutell 114.jmb.2011.413.0473
 
Gutell 101.physica.a.2007.386.0564.good
Gutell 101.physica.a.2007.386.0564.goodGutell 101.physica.a.2007.386.0564.good
Gutell 101.physica.a.2007.386.0564.good
 
Gutell 054.jmb.1996.256.0701
Gutell 054.jmb.1996.256.0701Gutell 054.jmb.1996.256.0701
Gutell 054.jmb.1996.256.0701
 
Gutell 010.jbc.1984.259.05173
Gutell 010.jbc.1984.259.05173Gutell 010.jbc.1984.259.05173
Gutell 010.jbc.1984.259.05173
 
Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497Gutell 112.j.phys.chem.b.2010.114.13497
Gutell 112.j.phys.chem.b.2010.114.13497
 
Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013Gutell 122.chapter comparative analy_russell_2013
Gutell 122.chapter comparative analy_russell_2013
 
Gutell 098.jmb.2006.360.0978
Gutell 098.jmb.2006.360.0978Gutell 098.jmb.2006.360.0978
Gutell 098.jmb.2006.360.0978
 
Gutell 043.curg.1994.26.0321
Gutell 043.curg.1994.26.0321Gutell 043.curg.1994.26.0321
Gutell 043.curg.1994.26.0321
 
Gutell 001.nar.1980.08.02275
Gutell 001.nar.1980.08.02275Gutell 001.nar.1980.08.02275
Gutell 001.nar.1980.08.02275
 
Gutell 056.mpe.1996.05.0391
Gutell 056.mpe.1996.05.0391Gutell 056.mpe.1996.05.0391
Gutell 056.mpe.1996.05.0391
 
Gutell 062.jmb.1997.267.1104
Gutell 062.jmb.1997.267.1104Gutell 062.jmb.1997.267.1104
Gutell 062.jmb.1997.267.1104
 

More from Robin Gutell

Gutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xiGutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xiRobin Gutell
 
Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803Robin Gutell
 
Gutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfigGutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfigRobin Gutell
 
Gutell 117.rcad_e_science_stockholm_pp15-22
Gutell 117.rcad_e_science_stockholm_pp15-22Gutell 117.rcad_e_science_stockholm_pp15-22
Gutell 117.rcad_e_science_stockholm_pp15-22Robin Gutell
 
Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011Robin Gutell
 
Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011Robin Gutell
 
Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768Robin Gutell
 
Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485Robin Gutell
 
Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195Robin Gutell
 
Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277Robin Gutell
 
Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200Robin Gutell
 
Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2Robin Gutell
 
Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043Robin Gutell
 
Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016Robin Gutell
 
Gutell 103.structure.2008.16.0535
Gutell 103.structure.2008.16.0535Gutell 103.structure.2008.16.0535
Gutell 103.structure.2008.16.0535Robin Gutell
 
Gutell 102.bioinformatics.2007.23.3289
Gutell 102.bioinformatics.2007.23.3289Gutell 102.bioinformatics.2007.23.3289
Gutell 102.bioinformatics.2007.23.3289Robin Gutell
 
Gutell 100.imb.2006.15.533
Gutell 100.imb.2006.15.533Gutell 100.imb.2006.15.533
Gutell 100.imb.2006.15.533Robin Gutell
 
Gutell 099.nature.2006.443.0931
Gutell 099.nature.2006.443.0931Gutell 099.nature.2006.443.0931
Gutell 099.nature.2006.443.0931Robin Gutell
 
Gutell 097.jphy.2006.42.0655
Gutell 097.jphy.2006.42.0655Gutell 097.jphy.2006.42.0655
Gutell 097.jphy.2006.42.0655Robin Gutell
 
Gutell 096.jmb.2006.358.0193
Gutell 096.jmb.2006.358.0193Gutell 096.jmb.2006.358.0193
Gutell 096.jmb.2006.358.0193Robin Gutell
 

More from Robin Gutell (20)

Gutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xiGutell 124.rna 2013-woese-19-vii-xi
Gutell 124.rna 2013-woese-19-vii-xi
 
Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803Gutell 123.app environ micro_2013_79_1803
Gutell 123.app environ micro_2013_79_1803
 
Gutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfigGutell 118.plos_one_2012.7_e38203.supplementalfig
Gutell 118.plos_one_2012.7_e38203.supplementalfig
 
Gutell 117.rcad_e_science_stockholm_pp15-22
Gutell 117.rcad_e_science_stockholm_pp15-22Gutell 117.rcad_e_science_stockholm_pp15-22
Gutell 117.rcad_e_science_stockholm_pp15-22
 
Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011Gutell 116.rpass.bibm11.pp618-622.2011
Gutell 116.rpass.bibm11.pp618-622.2011
 
Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011Gutell 115.rna2dmap.bibm11.pp613-617.2011
Gutell 115.rna2dmap.bibm11.pp613-617.2011
 
Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768Gutell 113.ploso.2011.06.e18768
Gutell 113.ploso.2011.06.e18768
 
Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485Gutell 111.bmc.genomics.2010.11.485
Gutell 111.bmc.genomics.2010.11.485
 
Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195Gutell 110.ant.v.leeuwenhoek.2010.98.195
Gutell 110.ant.v.leeuwenhoek.2010.98.195
 
Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277Gutell 109.ejp.2009.44.277
Gutell 109.ejp.2009.44.277
 
Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200Gutell 107.ssdbm.2009.200
Gutell 107.ssdbm.2009.200
 
Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2Gutell 106.j.euk.microbio.2009.56.0142.2
Gutell 106.j.euk.microbio.2009.56.0142.2
 
Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043Gutell 105.zoologica.scripta.2009.38.0043
Gutell 105.zoologica.scripta.2009.38.0043
 
Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016Gutell 104.biology.direct.2008.03.016
Gutell 104.biology.direct.2008.03.016
 
Gutell 103.structure.2008.16.0535
Gutell 103.structure.2008.16.0535Gutell 103.structure.2008.16.0535
Gutell 103.structure.2008.16.0535
 
Gutell 102.bioinformatics.2007.23.3289
Gutell 102.bioinformatics.2007.23.3289Gutell 102.bioinformatics.2007.23.3289
Gutell 102.bioinformatics.2007.23.3289
 
Gutell 100.imb.2006.15.533
Gutell 100.imb.2006.15.533Gutell 100.imb.2006.15.533
Gutell 100.imb.2006.15.533
 
Gutell 099.nature.2006.443.0931
Gutell 099.nature.2006.443.0931Gutell 099.nature.2006.443.0931
Gutell 099.nature.2006.443.0931
 
Gutell 097.jphy.2006.42.0655
Gutell 097.jphy.2006.42.0655Gutell 097.jphy.2006.42.0655
Gutell 097.jphy.2006.42.0655
 
Gutell 096.jmb.2006.358.0193
Gutell 096.jmb.2006.358.0193Gutell 096.jmb.2006.358.0193
Gutell 096.jmb.2006.358.0193
 

Recently uploaded

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...apidays
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘RTylerCroy
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?Igalia
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityPrincipled Technologies
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024SynarionITSolutions
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...DianaGray10
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherRemote DBA Services
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingEdi Saputra
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024The Digital Insurer
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century educationjfdjdjcjdnsjd
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodJuan lago vázquez
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businesspanagenda
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerThousandEyes
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesBoston Institute of Analytics
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CVKhem
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...Neo4j
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024The Digital Insurer
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc
 

Recently uploaded (20)

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...
 
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data DiscoveryTrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
 
🐬 The future of MySQL is Postgres 🐘
🐬  The future of MySQL is Postgres   🐘🐬  The future of MySQL is Postgres   🐘
🐬 The future of MySQL is Postgres 🐘
 
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
 
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivityBoost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
 
Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024Top 10 Most Downloaded Games on Play Store in 2024
Top 10 Most Downloaded Games on Play Store in 2024
 
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
Connector Corner: Accelerate revenue generation using UiPath API-centric busi...
 
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a FresherStrategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
 
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost SavingRepurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
 
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
 
presentation ICT roal in 21st century education
presentation ICT roal in 21st century educationpresentation ICT roal in 21st century education
presentation ICT roal in 21st century education
 
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin WoodPolkadot JAM Slides - Token2049 - By Dr. Gavin Wood
Polkadot JAM Slides - Token2049 - By Dr. Gavin Wood
 
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire businessWhy Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
 
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
+971581248768>> SAFE AND ORIGINAL ABORTION PILLS FOR SALE IN DUBAI AND ABUDHA...
 
How to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected WorkerHow to Troubleshoot Apps for the Modern Connected Worker
How to Troubleshoot Apps for the Modern Connected Worker
 
HTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation StrategiesHTML Injection Attacks: Impact and Mitigation Strategies
HTML Injection Attacks: Impact and Mitigation Strategies
 
Real Time Object Detection Using Open CV
Real Time Object Detection Using Open CVReal Time Object Detection Using Open CV
Real Time Object Detection Using Open CV
 
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...Workshop - Best of Both Worlds_ Combine  KG and Vector search for  enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
 
Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024Manulife - Insurer Innovation Award 2024
Manulife - Insurer Innovation Award 2024
 
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law DevelopmentsTrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
 

Gutell 068.rna.1999.05.1430

  • 1. Identity and geometry of a base triple in 16S rRNA determined by comparative sequence analysis and molecular modeling PATRICIA BABIN,1 MICHAEL DOLAN,1 PAUL WOLLENZIEN,1 and ROBIN R. GUTELL2 1 Department of Biochemistry, North Carolina State University, Raleigh, North Carolina 27695-7622, USA 2 Institute for Cellular and Molecular Biology and School of Biological Sciences, University of Texas, Austin, Texas 78712-1095, USA ABSTRACT Comparative sequence analysis complements experimental methods for the determination of RNA three-dimensional structure. This approach is based on the concept that different sequences within the same gene family form similar higher-order structures. The large number of rRNA sequences with sufficient variation, along with improved covari- ation algorithms, are providing us with the opportunity to identify new base triples in 16S rRNA. The three- dimensional conformations for one of our strongest candidates involving U121 (C124:G237) and/or U121 (U125:A236) (Escherichia coli sequence and numbering) are analyzed here with different molecular modeling tools. Molecular modeling shows that U121 interacts with C124 in the U121 (C124:G237) base triple. This arrangement maintains isomorphic structures for the three most frequent sequence motifs (approximately 93% of known bacterial and archaeal sequences), is consistent with chemical reactivity of U121 in E. coli ribosomes, and is geometrically favor- able. Further, the restricted set of observed canonical (GU, AU, GC) base-pair types at positions 124:237 and 125:236 is consistent with the fact that the canonical base-pair sets (for both base pairs) that are not observed in nature prevent the formation of the 121(124:237) base triple. The analysis described here serves as a general scheme for the prediction of specific secondary and tertiary structure base pairing where there is a network of correlated base changes. Keywords: 16S rRNA; base triple; comparative sequence analysis; molecular modeling INTRODUCTION Current approaches to determine the three-dimensional structure for large RNA molecules and ribonucleopro- teins include electron microscopy (see Penczek et al+, 1994) and low-angle scattering experiments (Svergun et al+, 1997) to directly investigate global structures+ Structure determination of large RNA molecules addi- tionally is being guided by the approaches taken to determine the high-resolution structures of smaller RNA molecules+ This list includes most notably high-resolution crystal structures of several tRNAs (Kim et al+, 1973; see Basavappa & Sigler, 1991), crystal structures of the P4–P6 domain (Cate et al+, 1996) and catalytic domain (Golden et al+, 1998) of the Tetrahymena Group I intron ribozyme, the crystal structure of the hammerhead ribozyme (Pley et al+, 1994), the crystal structure of the loop E region of the 5S rRNA (Correll et al+, 1997), the crystal structure of the Hepatitis Delta Virus ribozyme (Ferré-D’Amaré et al+, 1998), and NMR- derived structures for a number of synthetic RNA ap- tamers (see Uhlenbeck et al+, 1997)+ Crystals of ribosomes and ribosomal subunits with good diffraction properties have been available for a number of years (van Bohlen et al+, 1991), but it has been difficult, because of the size and asymmetry of the ribosome and the need to obtain isomorphic heavy atom derivatives, to determine the diffraction phasing necessary for electron-density maps and three-dimen- sional structural determination+ The phasing problem has been solved recently for the Haloarcula marismor- tui 50S ribosomal subunit (Ban et al+, 1998) leading to a 9-Å electron density map of the structure+ This should provide critical information for the organization of the whole subunit including the active site and its support- ing structures when the fold of the rRNA in that struc- ture is determined+ So far, it has been necessary to infer details of rRNA structure from constraints deter- mined by different experiments and to incorporate this Reprint requests to: Paul Wollenzien, Department of Biochemistry, Box 7622, North Carolina State University, Raleigh, North Carolina 27695-7622, USA; e-mail: wollenz@bchserver+bch+ncsu+edu+ RNA (1999), 5:1430–1439+ Cambridge University Press+ Printed in the USA+ Copyright © 1999 RNA Society+ 1430
  • 2. information into comprehensive models of the struc- ture+ Chemical probing experiments to determine RNA reactivity patterns, the determination of structures of isolated rRNA regions by NMR and the determination of three-dimensional arrangement by RNA cross-linking are among the types of experiments that have been employed in this approach (see Green & Noller, 1997)+ It is anticipated that RNA structure motifs recognized in smaller RNAs can be applied to the larger rRNA struc- ture based on chemical reactivity and patterns of se- quence variance (Brion & Westhof, 1997; Leontis & Westhof, 1998)+ Base triples, a prominent tertiary structure motif, are important components in the tertiary structures of tRNAs and the group I introns+ Several years ago, Gautheret et al+ (1995) presented improvements in the prediction of base triples with comparative sequence analysis+ This algorithm identifies the best covariations between all unpaired nucleotides and known base pairs+ We have used this method to predict several base triples in 16S and 23S rRNA (Gutell, 1996; R+R+ Gutell, unpubl+ data)+ Here we present one of the best base triple can- didates in 16S rRNA involving nt 121 and bp 124:237 and/or 125:236 (Escherichia coli numbering)+ Chemi- cal probing data and molecular modeling are used to help ascertain which of the proposed base-triple inter- actions occurs in the RNA, and the conformation of these interactions+ RESULTS Identification of base-triple candidates by comparative sequence analysis The 122–128/233–239 16S rRNA helix first was pro- posed with comparative analysis based on the small number of sequences available in 1980 (Woese et al+, 1980)+ (We will refer to this interaction as helix 122 here)+ With a significant increase in the number of 16S rRNA sequences and the development of more pow- erful covariation algorithms (Gutell et al+, 1992; Maidak et al+, 1999; R+R+ Gutell, S+ Subrashchandran, M+ Schnare, Y+ Du, N+ Lin, L+ Madabusi, K+ Muller, N+ Pande, N+ Yu, Z+ Shang, S+ Date, D+ Konings, V+ Schweiker, B+ Weiser, & J+ Cannone, in prep+), all of the positions within the 122–128/233–239 helix have their strongest statistically significant covariation with their previously predicted base-pair partner, except for the 125:236 base pair, where position 125 is nearly always a U in ap- proximately 5,000 prokaryotic sequences, and position 236 is G in 75% and A in 25% of the sequences, (form- ing UG and UA base pairs)+ With a smaller set of rRNA sequences, our earlier analysis revealed several strong base-triple candi- dates in 16S and 23S rRNA (Gutell, 1996), including 1072(1092:1099) in 23S rRNA, and 121(124:237)/ (125:236) in 16S rRNA+ Recently the putative 23S rRNA base triple [1072(1092:1099)] has been substantiated with experimental methods (Conn et al+, 1998, 1999)+ We have repeated this base-triple analysis utilizing the same methods, but applied them to a larger pro- karyotic 16S rRNA alignment (Maidak et al+, 1999)+ Sev- eral strong candidates have been identified, including 595(596:644) and our previous 121(124:237)/(125:236) base triple+ There is a mutual best covariation between the unpaired position 595 and the base pair 596:644 with all three algorithms used to evaluate the signifi- cance of sequence covariations—chi square, pseudo- phylogenetic event counting (ec), and covary methods (see Materials and Methods)+ The phylogenetic distri- bution reveals that alternate versions of the triplets have evolved independently several times, thus increasing the likelihood that this comparatively inferred base tri- ple is real+ Nuclear magnetic resonance analysis of this region of the 16S rRNA (Kalurachchi & Nikonowicz, 1998) revealed a base triple at these positions+ The experimental support for the 16S [595(596:644)] and 23S [1072(1092:1099)] rRNA interactions lends credi- bility to the comparative methods employed here+ Our base-triple covariation analysis was performed on the most recent (July 1998, version 7+0) release of the Ribosomal Database Project (RDP) prokaryotic alignment of 16S rRNA(6205 sequences, including com- plete and partial sequences) (http://www+ cme+msu+edu/ RDP/download+html; Maidak et al+, 1999)+ The structure in the 120–130/230–240 region of the 16S rRNA is conserved in all prokaryotes and chloroplasts+ This same region is slightly different in the Eucarya nuclear and mitochondrial 16S-like rRNAs+ Because of this dif- ference in structural homology, only the prokaryotic and chloroplast 16S rRNAsequence sets are analyzed here+ The covariation signal between position 121 and the base pairs 124:237 and 125:236 is very strong+ The coordinated variation involves five positions+ The chi square and ec methods identify mutual best covaria- tions between position 121 and the base pair 125:236, whereas the newer covary method identifies a mutual best covariation between 121 and 124:237+ Approxi- mately 5,000 sequences in the prokaryotic RDP align- ment had nucleotide information at positions 121 and 124:237 and 125:236+ The dominant triplet sequences at 121(124:237) are C(GC) [74%], U(AU) [10%], and U(CG) [7%]+ At 121(125:236) the dominant sequences are C(UG) [73%] and U(UA) [21%] (Table 1A,B)+ The nucleotide at position 121 determines the base pairs and their arrangement at 124:237 and 125:236+ Note that position 121 is a pyrimidine in 98% of the prokaryotic 16S rRNA sequences, with C occurring in 76% of these sequences, U in 22%, and G and A each with 1% (Table 1A)+ When position 121 is a C, then 124:237 is a GC pair in 98% of the sequences, and 125:236 is a UG in 97% of the sequences (Table 1B)+ When position 121 is a U, then 124:237 is an AU (45%), CG (30%), UA (15%), or GC (9%), and 125:236 is a UA Geometry of a base triple in 16S rRNA 1431
  • 3. (94%), or UG (5%)+ Taken all together, four sequence motifs occur for these sequences (Table 2): when po- sition 121 is a C, the 124:237 and 125:236 base pairs are predominantly GC/UG (76%, motif A), and when 121 is a U, then the 124:237 and 125:236 base pairs are predominantly AU/UA (10%, motif B), followed by CG/UA (7%, motif C) or UA/UA (3%, motif D)+ Comparative analysis should reveal, in addition to the nucleotide frequencies for the positions of interest, an approximate number of events or times that the nucleotides have changed coordinately during the evo- lution of these rRNAs (Gutell et al+ 1986)+ These “phy- logenetic events” are a gauge for the authenticity of every base-pair and base-triple interaction predicted with comparative methods+ Our confidence for each proposed interaction is proportional to the number of times a mutual change (covariation) occurs in the phy- logenetic tree+ The four most frequent sequence combinations at positions 121(124:237)/(125:236) were mapped onto the RDP’s prokaryotic phylogenetic tree (Maidak et al+ 1999), providing us with the number of occurrences for each motif in each phylogenetic group (Table 3)+ For this analysis, we mapped the sequence sets onto a reduced phylogenetic tree that only contained the pri- mary branches in the Archaea (i+e+, Crenarchaeota, Eu- ryarchaeota) and the (eu)Bacteria (e+g+, Cyanobacteria, Spirochetes, and Proteobacteria)+ The Purple Bacteria (Proteobacteria) and Gram Positive branches were ex- panded an additional level (i+e+,Alpha subdivision)+ There are a few key observations from this analysis+ First, the most frequent motif—(A), c/gc/ug (at 76%)—occurs in all but two of the major prokaryotic phylogenetic groups+ Second, the second most abundant motif—(B), u/au/ua (at 10%)—occurs in the two remaining major phylo- TABLE 1+ Nucleotide frequencies at positions 121(124:237) (A) and 121(125:236) (B)+ Aa 121 A C G U —b — — 10 AU — — — 7 CG — 74 — 2 GC — — — 3 UA 124:237 Bc 121 A C G U — 2 — 21 UA — 73 — — UG 125:236 a Entries are percentages of 5,056 sequences with nucleotide in- formation at positions 121(124:237)+ b All percentages less than 1+5 are shown with a dash+ c Entries are percentages of 4,939 sequences with nucleotide in- formation at positions 121(125:236)+ TABLE 2+ Distribution (in percentage of 5,000 prokaryotic sequences analyzed) of nucleotide identity at positions 121 (124:237)/(125:236)+ Sequence positions 121 (124:237)/(125:236) Motif Name C GC/UG 76% A U AU/UA 10% B U CG/UA 7% C U UA/UA 3% D C AU/UA 1% E U GC/UA 1% F U GC/UG 1% G C GC/UA 1% H TABLE 3+ Distribution of sequence motifs A–D in bacteria and Archaea+ Sequence motif A B C D Phylogenetic group Archaea 41 Crenarchaeota 136 Euryarchaeota Bacteria 150 3 Cyanobacteria and chloroplasts 25 Fibrobacter phylum 213 Flexibacter-Cytophaga-Bacteroides phylum 3 Fls+sinusarabici assemblage 33 Fusobacteria and relatives Gram-positive phylum 15 Anaerobic Halophiles 395 Bacillus-Lactobacillus-Streptococcus subdivision 22 C+ Lituseburense group 32 C+ Purinolyticum group 160 Clostridium and relatives 75 Eubacterium and relatives 803 High GϩC subdivision 179 8 Mycoplasma and relatives 44 Sporomusa and relatives 37 Thermoanaerobacter and relatives 41 Green non-sulfur bacteria and relatives 2 4 Green sulfur bacteria 5 Nitrospina subdivision 7 6 Paraphyletic assemblage 17 10 6 Planctomyces and relatives Purple bacteria 742 53 Alpha subdivision 247 Beta subdivision 102 Delta subdivision 2 Env+sar121 group 70 Epsilon subdivision 149 77 320 160 Gamma subdivision 11 Uncultured magnetotactic bacteria 151 8 Spirochetes and relatives 1 Thermophilic assemblage 5 Thermophilic oxygen reducers 7 Thermotogales The distribution for the penta-sequence 121/124:237/125:236 is subdivided by phylogenetic group+ The four most abundant penta- sequences are shown where Aϭ c/gc/ug; B ϭ u/au/ua; C ϭ u/cg/ua; D ϭ u/ua/ua+ 1432 P. Babin et al.
  • 4. genetic groups+ Third, motifs C and D do not occur exclusively in any of these primary phylogenetic groups; instead they occur in two groups (Bacteria-Plancto- myces and Bacteria-Purple Bacteria-Gamma subdivi- sion) that have other motifs (A and B and A, B, and C)+ Fourth, 23 of the phylogenetic groups have only one motif, six phylogenetic groups have two motifs, one group (Bacteria-Planctomyces and relatives) has three motifs, and one group (Bacteria-Purple Bacteria-Gamma subdivision) has all four motifs+ Last, the number of phylogenetic events (times that each of these motifs evolved) is estimated at approximately 10+ Here we assume that the most abundant motif—(A), c/gc/ug— is primordial, and thus any motif other than this one has evolved [events have occurred in the Bacteria- Cyanobacteria and Chloroplasts, Bacteria-Fibrobacter, Bacteria-Flexibacter, Bacteria-Gram Positive-Phylum Mycoplasma, Bacteria-Green Sulfur Bacteria, etc+ (see Table 3)]+ Thus there are 13 such events+ However, because some of the primary phylogenetic groups are not necessarily monophyletic sister groups and the ac- tual number of unrelated groups is subject to disagree- ment, we estimate the number of phylogenetic changes at these five positions to be no less than five and ap- proximately 10+ The RDP Prokaryotic/Chloroplast 16S rRNA align- ment was then analyzed to determine the most frequently occurring nucleotide sequences in the (122– 129){(232–239) helix for motifs A, B, and C (Fig+ 1)+ The most frequently occurring sequence for each motif was then used for the molecular modeling experi- ments+ In addition, the E. coli sequence (which con- tains motif C sequence at the base triple) was also used for modeling experiments, as chemical reactivity data was available for it+ Isomorphism and molecular modeling in different sequence motifs Molecular models for the helix 122 region were con- structed using the constraint-satisfaction program MC- SYM that constructs models using nucleotide units with geometries extracted from known structures (Major et al+, 1991, 1993; Gautheret et al+, 1993)+ This was done to determine which base-triple interactions could be incorporated into molecular models of the region+ In the first exercise, the three most frequent sequence motifs, A, B, and C, as well as the sequence of E. coli (Fig+ 1) were used for modeling+ MC-SYM scripts to generate models for the se- quences shown in Figure 1, A–D, were written with three assumptions+ First, all of the base pairs in helix 122 contain normal Watson–Crick or wobble base pairs+ Second, the base triples for all the motifs should be isomorphic+ Third, the uridine at position 121 in E. coli has chemical reactivity (Moazed et al+, 1986; P+ Wol- lenzien, unpubl+), indicating that the N3 position of U 121 is not used in the hydrogen bonding in the E. coli sequence+ The ISOPAIR program (Gautheret & Gutell, 1997) was used first to determine which conformations would produce isomorphic base triples for the sequences in motifs A, B, C, and E. coli (Table 4)+ All four of the nucleotides in the base pairs 124:237 and 125:236 were considered as possible hydrogen bonding partners for 121 in the ISOPAIR analysis+ The ISOPAIR analysis for a triple occurring between nt 121 and the bp 124:237 revealed that all predicted triples for these nucleotides would occur in the major groove by means of a hydrogen bond between nt 121 and 124+ Triples with a hydrogen bond between nt 121 TABLE 4+ Summary of structures predicted by ISOPAIR-MCSYM analysis+ Motif Hydrogen bonding patterna MC-SYM transformationsb Hydrogen bonds formed Improper bond lengths after minimization? Most isomorphic structure? A 121–124 CG_31 C121(N4)–G124(N7) Yes 121–124 CG_32 C121(N4)–G124(N7) 121–236 CG_32 C121(N4)–G236(N7) N4–N7 H bond distance B 121–124 AU_50 U121(O4)–A124(N6) 121–124 AU_52 U121(O4)–A124(N6) Yes 121–236 AU_50 U121(O4)–A236(N6) O4–N6 H bond distance C 121–124 CU_101 U121(O4)–C124(N4) 121–124 CU_104 U121(O4)–C124(N4) Yes 121–236 AU_50 U121(O4)–A236(N6) O4–N6 H bond distance E. coli 121–124 CU_101 U121(O4)–C124(N4) 121–124 CU_104 U121(O4)–C124(N4) Yes 121–236 AU_50 U121(O4)–A236(N6) O4–N6 H bond distance a Hydrogen-bonding patterns are listed for the base-triple interactions which were consistent with molecular models+ b The “Transformation” designations were obtained in MC-SYM version 1+3 (see WWW+IRO+UMONTREAL+ CA/;MAJOR/ HTML/USERGUIDE+HTML) for the base pairs containing one-hydrogen-bond interactions between the indicated bases+ Geometry of a base triple in 16S rRNA 1433
  • 5. and 237 were eliminated because they would have in- volved use of the N3 position of nt 121+ For motif A, the two isomorphic hydrogen-bonding patterns that were predicted were C-121-N4 to G-124-N7 and C-121-N4 to G-124-O6+ The first hydrogen-bonding pattern has the highest degree of isomorphism to the other motifs+ For motifs B and C, the hydrogen-bonding patterns, which are isomorphic to the C-121-N4 to G-124-N7 pattern, are, respectively U-121-O4 to A-124-N6 and U-121-O4 to C-124-N4+ The ISOPAIR analysis for a triple occurring between nt 121 and the bp 125:236 revealed that all predicted tri- ples for these nucleotides would occur in the major groove via a hydrogen bond between 121 and 236+ Some isomorphic triples were eliminated because they would have involved use of the N3 position of nt 121 in E. coli, including one predicted triple between 121 and 125+ For motifA, one hydrogen-bonding pattern was pre- dicted for an interaction between 121 and 236: C-121-N4 to G-236-N7+ For motifs B and C, the hydrogen-bond- ing pattern, which is isomorphic to the C-121-N4 to G-236-N7 pattern, is U-121-O4 to A-236-N6+ Even though ISOPAIR only predicted triples occur- ring between positions 121 and 124 or 121 and 236, additional MC-SYM scripts were written to attempt to create triples utilizing a hydrogen bond between 121– 125 or 121–237+ Models were generated for a triple formed by a hydrogen bond between 121 and 125 for motif A, but isomorphic triples for the other motifs could not be identified+ It was not possible to generate mod- els for a triple formed by a hydrogen bond between 121 and 237+ Additionally, even though no isomorphic base triples were predicted to occur in the minor groove, MC-SYM scripts were written to attempt to generate a hydrogen bond between 121 and each of the 4 nt in their minor groove+ Even large amounts of conforma- tional freedom were not sufficient to allow models to be generated in which the hydrogen bond forming the base triple occurred in the minor groove+ Therefore, the only models to be considered were those that contained a base triple formed by a hydro- gen bond between 121 and 124 (C-121-N4 to G-124-N7 for motif A) or 121 and 236 (C-121-N4 to G-236-N7 for motif A)+ The models from MC-SYM that were most consistent with A-form RNA geometry were then sub- jected to energy minimization+ Only the model that uti- lized a hydrogen bond between 121 and 124 (Fig+ 2A) could be minimized to acceptable O39-P bond lengths FIGURE 1. Sequences for helix 122 region used for modeling+ A–D: The most common sequence patterns for the three major motifs and E. coli+ A: Motif A (121 ϭ C, 124:237 ϭ G:C, and 125:236 ϭ U:G); B: Motif B (121 ϭ U, 124:237 ϭ A:U, and 125:236 ϭ U:A); C: Motif C (121 ϭ U, 124:237 ϭ C:G, and 125:236 ϭ U:A); D: The sequence for E. coli (Brosius et al+, 1981), the second most common pattern for motif C+ E–H contain “hybrid” sequences designed to examine the cause of the strong neighbor effect between the adjacent base pairs 124:237 and 125:236+ E: Sequence motif A with the 125:236 ϭ U:G base pair substituted with G:C+ F: Sequence motif A with substitution of 125:236 ϭ A:U+ G: Sequence motif A with the 124:235 ϭ G:C base pair substituted with C:G+ H: Sequence motif A with the 124:235 base pair substituted with U:A+ 1434 P. Babin et al.
  • 6. and hydrogen-bond lengths (Saenger, 1984)+ The model containing a base triple formed by a hydrogen bond between 121 and 236 could not be minimized to an acceptable hydrogen bond length between 121 and 236 and maintain an acceptable O39-P bond length between 121 and 122 (Fig+ 2B)+ This analysis was re- peated with the same results for all motifs+ The models for all motifs for helix 122 using this base triple were superimposed using the backbone and ribose atoms for alignment (Fig+ 3)+ The root mean square (RMS) deviations between models are listed in Table 5+ ISOPAIR was used to analyze the possibility of isogeo- metric triples in helix 122 for all motifs, A–H, shown in Table 2+ Isomorphic triples for all of the motifs except motif D could be found+ The conformations shown in Figure 2 were predicted for either comparison of the ABC or ABCEFGH motifs+ Motif E was modeled as an example of one of the less frequent motifs+ An isomor- phic structure for the region utilizing a hydrogen bond between 121 and 124 was obtained and the structure containing it could be refined by energy minimization (data not shown)+ Comparison of the structure for motif E versus motif A indicates a close similarity in the struc- ture of the base triple, but a higher degree of differ- ences in the overall structure (Table 5)+ Sequence bias at the base pair adjacent to the base triple interaction The strong correlation between the sequences at 124:237 and 125:236 was investigated to determine if FIGURE 2. Models for the 121(124:237) triple and the 121(125:236) triple in motif A+ Both of the molecular models are shown after sub- jecting them to energy minimization+ A: Model for the 121(124:237) triple containing a hydrogen bond between the N4 of nt 121 and the N7 of nt 124+ B: Model for the 121(125:236) triple containing a hy- drogen bond between the N4 of nt 121 and the N7 of nt 236+ Note that a suitable hydrogen-bond length cannot be attained in the model containing the 121(125:236) triple+ The measurements indicated in both panels are between the hydrogen of the hydrogen-bond donor and the heavy atom hydrogen-bond acceptor+ Distances between heavy atoms in both panels also indicate an acceptable hydrogen- bond distance (Saenger, 1984) in the structure of A but not in B+ FIGURE 3. Superposition of the models for the proposed helix 122 with the base triple 121(125:236)+ A: Superposition of models of helix 122 for motifs A (red), B (green), C (blue), and E. coli (yellow)+ Alignment was done using the backbone and ribose atoms+ The atoms are shown only for motif A; the backbones are shown for all four models+ B: Superposition of the base triple for motifs A, B, C, and E. coli+ The geometry for the triple of the C (blue) and E. coli motifs are nearly identical and share the same molecular structure in this figure+ See Table 5 for RMS deviation of the models from one another+ TABLE 5+ RMS deviations of models containing base triples+ Motif Deviation from ideal A-form RNA structure of same sequencea Deviation of entire model from motif A modelb Deviation of base triple from motif A base tripleb A 0+74 Å — — B 0+40 Å 1+32 Å 1+29 Å C 0+61 Å 1+77 Å 1+93 Å E. coli 0+61 Å 1+80 Å 1+94 Å Dc — — — E 2+72 Å 1+79 Å 2+02 Å a RMS deviation obtained for all atoms+ b RMS deviation obtained for backbone and ribose atoms+ c It was not possible to predict structures for motif D that would contain acceptable covalent and hydrogen bond lengths+ Geometry of a base triple in 16S rRNA 1435
  • 7. there was a geometrical connection between the bias and the occurrence of the base triple at 121(124:237)+ When 121 is a C and 124:237 are GC, there are no occurrences of GC or AU in prokaryotes at the base pair 125:236+ Hybrid sequences 1 and 2 were created to investigate this bias+ In these hybrids, the only change made to the most common sequence for motif A was the substitution of GC for UG at base pair 125:236 (Fig+ 1E) and the substitution of AU for UG at base pair 125:236 (Fig+ 1F)+ For hybrid 1 and 2 sequences, mod- els could not be generated for base triples incorporat- ing a hydrogen bond between nt 121 and 124 (results not shown)+ Certain base-pair types are also absent at the 124:237 positions+ When 121 is a C and 125:236 are UG, there are no CG or UA base pairs at 124:237+ To investigate this bias, additional hybrid models, hybrid 3 and hybrid 4 were created+ Hybrid 3 (Fig+ 1G) is the most common sequence for motif A with base pair 124:237 changed from GC to CG+ Hybrid 4 (Fig+ 1H) contains a UA base pair at base 124:237+ When these hybrid sequences were modeled, it was possible to generate models for a base triple incorporating a hydrogen bond between nt 121 and 124 (Fig+ 4)+ However, these models could not be minimized to acceptable O39-P and hydrogen-bond lengths (Saenger, 1984)+ Specifically, the O39-P bond length between nt 121 and 122 was incompatible with an acceptable bond length for the hydrogen bond be- tween 121 and 124 to form the base triple+ Thus, the strong-neighbor effect occurring between base pairs 124:237 and 125:236 can be explained by the require- ment for a geometric arrangement needed to allow the formation of the triple at 121(124:237)+ DISCUSSION Base triples are inherently more difficult to predict than base pairs (see Gautheret et al+, 1995)+ Al- though the majority of the secondary structure base pairs are conserved in all members of a given RNA type (e+g+, tRNA), this is not the situation for base triples+ Several base triples in tRNA and group I in- trons (Michel et al+, 1990; Michel & Westhof, 1990) are present in only a subset of their structures (e+g+, in the type-2 tRNAs, or in the C1-2 subgroup of group I introns)+ Second, although similar (base pair- ing) conformations are only maintained with posi- tional covariations, similar conformations in base triples can be maintained with single, unmatched positional variation (Klug et al+, 1974)+ The 121(124:237)/(125:236) putative base triple was identified by three comparative sequence analysis ap- proaches+ The identification of the base-triple inter- action within this sequence was investigated further with ISOPAIR, MC-SYM, and energy minimization to determine the potential for having isomorphic struc- tures and to determine the possibility of forming mo- lecular models+ In the present case, it was possible to take advantage of chemical reactivity data that was relevant to the conformations of U121, as U121 was reactive with CMCT (Moazed et al+, 1986)+ The MC- SYM exercises resulted in models in which U121 was hydrogen bonded with either C124 or A236+ However, MC-SYM models typically need to be constructed ini- tially with long O39-P bond lengths, and when energy minimization was performed to determine which mod- els could be refined to acceptable bond lengths, the model with a U121-to-A124 hydrogen bond was the only one in which that could be done+ The interaction that is predicted here involves an ex- tra hydrogen bond between 16S rRNA nt 121 and 124 in the major groove of the helical region+ This is a dif- ferent type of interaction than the recent examples of same-strand near-neighbor interactions in the group I ribozyme (Cate et al+, 1996) and in hepatitis delta virus ribozyme (Ferré-D’Amaré et al+, 1998) that occur in the minor groove of the helix+ However, the base triple in- teraction in the 16S rRNA at 595(596:644) has been shown to occur in the major groove (Kalurachchi & Nikonowicz, 1998) with the hydrogen bond occurring between nt 595 and 596+ There are also several ex- amples in tRNA structures of base triple interactions that involve major groove interactions+ Furthermore, we are confident that the predictive modeling program MC- SYM has the capability of constructing minor groove interactions, because another base triple in 16S rRNA between nt 494(440:497) is predicted to contain its extra interaction in the minor groove+ Thus, in spite of the notoriety of the narrowness of the RNAmajor groove in regular helices, there is no clear rule about which groove is used in these types of interactions+ FIGURE 4. Models demonstrating the cause of the strong neighbor effect+ Both examples shown here have been refined from MC-SYM structures by energy minimization and contain proper backbone bond lengths, anti-base conformations and 39-endo ribose conformations+ A: Model for the base triple utilizing a hydrogen bond between C121 and C124 in hybrid 3+ The distance between the hydrogen of N4 (C121) and the O2 (C124) exceeds the maximum acceptable length of 2+17 Å+ B: Model for the base triple utilizing a hydrogen bond between C121 and U124 in hybrid 4+ The distance between the O4 (U124) and the hydrogen of N4 (C121) exceeds the maximum ac- ceptable length of 2+17 Å+ Hydrogen-bond distances measured be- tween heavy atoms also indicate unacceptably long distances in both cases (Saenger, 1984)+ 1436 P. Babin et al.
  • 8. By modeling “hybrid” sequences that utilized these nonoccurring base pairings, we demonstrated that a base triple in which nt 121 is hydrogen bonded to 124 is consistent with the coordinated set of base pairings at 124:237 and 125:236+ We conclude that specific base pairings at 124:237 (e+g+, CG and UA when 121 is U) and 125:236 (e+g+, CG and AU when 121 is U or C) were not allowed due to structural constraints at 121(124:237)+ The neighbor effect has been widely ob- served in tRNAs (Gautheret et al+ 1995), in Group I introns (Michel & Westhof, 1990), and in 16S and 23S rRNA (R+ Gutell, unpubl+ data); the work presented here is a demonstration that it is based at least partly on structural constraints+ Finally, by modeling the (122–129){(232–239) helix with MC-SYM, we demonstrated that the conforma- tions chosen for the triple 121(124:237) in molecules containing the A, B, and C sequence motifs are part of models for the entire helical region that are isomorphic in spite of divergent sequences at many positions+ There are two types of exception to this conclusion that occur with the region containing the minor sequence motifs+ The first is that we have not been able to determine isomorphic structures for motif D (U(UA/UA)) that oc- curs in 3% of the sequences+ Furthermore the se- quence motifs E–H that occur in approximately 1% of our prokaryotic data set can be modeled into base- triple geometries isomorphic with the major motifs A, B, and C+ However, the formation of these base triples requires some distortion in the positions of the riboses and bases compared to the normal type-A geometry+ The exceptions in motifs D–H occur in a small fraction (approximately 7% total) of the total number of se- quences+ Thus the idea of strict isomorphism at this region extends to about 93% of the prokaryotic 16S rRNA data set+ In the absence of a base triple in this region, nt 121 would most likely be stacked on nt 122 because of the favorable stacking interactions+ However the presence of the base triple involving 121 causes a repositioning and redirection of the phosphate backbone in the re- gion of 121+ This may affect the interaction that helix 122 has with adjacent helix (240–242)/(284–286) as well as the trajectory of the single-stranded region116– 121 in the ribosomal subunit+ MATERIALS AND METHODS Comparative base-triple analysis Two comparative methods (Gautheret et al+, 1995) that iden- tify base triples were used to search for covariations between the known secondary-structure base pairs and the unpaired positions+ Method one (chi) calculates the expected and observed frequencies of base triples and their chi-square values for all possible base pair and unpaired nucleotide com- binations+ The base triple candidates with the highest chi- squared values are considered possible+ Method two (ec and sec) calculates values for the number of pseudophylogenetic events+ Because the sequences in our alignments are ar- ranged in phylogenetic order, the sequences that are most closely related are adjacent to one another+ The number of coordinated base changes that have occurred throughout the evolution of the RNA under study can be approximated by counting the number of mutual events (or simultaneous changes) in the two (or three) columns (positions) in the align- ment+ This number is divided by the total number of changes that have occurred at the positions under study+ Thus the maximum score of 1 denotes that all of the positional changes are involved in mutual events, while a score of 0+5 signifies that only half of the changes are associated with a mutual event+ More recently a third base-triple covariation algorithm called “covary” has been established (R+ Gutell, J+ Cannone, V+ Schweitzer, unpubl+ program; Gutell et al+, in prep+)+ This new- est method sums the frequencies of those triples that covary from the most frequent triplet (counting the most frequent triplet)+ We only consider those triplets where the percentage of pure covariation (covariation without exceptions) from the most frequent triplet is greater than 35% of those triplets that vary from the most frequent triplet; we filter out those triplets with less than 35% covariation+ For example, for the (124:237)121 base triple, (GC)C ϭ 74%, (AU)U ϭ 10%, (CG)U ϭ 7%, (UA)U ϭ 3%, (GC)U ϭ 2%, other ϭ 4%+ Here the total covariation value ϭ +84, as the only pure triplet co- variations are (GC)C and (AU)U, and the filter value is 38% (10/26, where 10 ϭ %(AU)U, 26 ϭ %((AU)U ϩ (CG)U ϩ (UA)U ϩ (GC)U) ϩ others)+ The base pair that covaries best with the unpaired position 121 is 124:237 with the covary score +84, and the unpaired position that covaries best with the 124:237 base pair is 121, with the same +84 covary score+ This method will be formally presented elsewhere ( R+R+ Gutell, S+ Subrashchandran, M+ Schnare, Y+ Du, N+ Lin, L+ Madabusi, K+ Muller, N+ Pande, N+ Yu, Z+ Shang, S+ Date, D+ Konings, V+ Schweiker, B+ Weiser, & J+ Cannone, in prep)+ For all of these methods, putative base triples are consid- ered possible when the covariation between the base pair and the unpaired nucleotide is mutual (e+g+, base pair X co- varies best with unpaired nucleotide Y, and Y covaries best with X )+ These base-triple “mutual best” covariations are then evaluated for other considerations, including weaker covari- ations among the base pairs in the vicinity of the base-triple base pair [neighbor effect, see Gautheret et al+, (1995)], the number of times the covariation occurred in the evolution of the rRNAs, the statistical significance of the triple covaria- tions, and the exceptions (single and double variations)+ A putative base triple with a mutual best covariation is consid- ered more probable when there are neighbor effects flanking the base pair (of the base triple), and when the base triple candidate is identified as mutual best with all methods—ec (phylogenetic events), chi (statistical methods), and covary (logical method)+ The putative base triples U121(C124-G237) and/or U121(U125-A236) are the highest-scoring triplet co- variations with these three methods+ ISOPAIR ISOPAIR (Gautheret & Gutell, 1997) was used to determine the hydrogen-bonding patterns that would produce isomor- Geometry of a base triple in 16S rRNA 1437
  • 9. phic base triples across the major motifs and the E. coli se- quence+ Additionally, in preparation for molecular modeling of helix 122 for all the motifs and the E. coli sequence, the RDP Prokaryotic/Chloroplast 16S rRNA alignment was then searched for the most frequently occurring pattern of nucle- otides in helix 122 for motifs A, B, and C+ Molecular modeling MC-SYM 1+3 scripts (Major et al+, 1991) were written to de- termine which of the base-paired nucleotides (124–237 or 125–236) were capable of forming a hydrogen bond to the single-stranded nt 121+ In all MC-SYM scripts, standard glo- bal constraints contained in the sample scripts for the pro- gram were used to minimize van der Waals’ overlaps in the generated models and all scripts were written to generate models that were as close to A-form RNA as possible+ Be- cause the formation of the base triple requires an exceptional geometry for nt 121, the O39-P bonding distances between adjacent nucleotides needed to be initially set at 6+5 Å to allow for searching of conformational space that would result in the formation of models+ This is reasonably consistent with the default value of 6 Å used in the ADJACENCY section of the MC-SYM program (http://www+iro+umontreal+ ca/;major/ HTML/mcsym+ug+html)+ This is well beyond the normal max- imum of 1+62 Å for this bond; however, all the O39-P bond lengths in the models are easily adjusted to the optimal length of 1+58–1+62 Å by energy minimization+ For helix 122, very few models (and thus very few conformations) were gener- ated by MC-SYM for each script and therefore clustering of models into families was not required+ Energy minimization All structures considered isomorphic by the ISOPAIR analy- sis at the base triple were minimized using Insight II (Molec- ular Simulations, Inc+)+ Global constraints were used in the MC-SYM scripts, so van der Waals’ overlaps were minimal and the main deviation of the structures generated by MC- SYM from acceptable nucleotide geometries was the O39-P distances between adjacent nucleotides+ Ninety rounds of steepest-descent minimization followed by ten steps of con- jugate gradient minimization using the AMBER force field were used+ The geometry of the helical regions of the mini- mized structures (bp 122:239–127:234) were determined by measuring the RMS deviation of the structures from an ideal type-A RNA helix of the same sequence and the geometry of nt 121 was determined by measuring its dihedral angles and bond lengths+ ACKNOWLEDGMENTS This work was supported by National Institutes of Health (NIH) grants GM48207 to R+R+G+ and GM43237 to P+W+ Rob- ert Cedergren initially suggested the use of MC-SYM in mod- eling rRNA regions and we gratefully acknowledge his insight into this problem+ Francois Major is thanked for his com- ments and suggestions in the use of MC-SYM, Daniel Gauth- eret for making the program ISOPAIR available, Vi Schweitzer, Jamie Cannone, Sankarasubramanian Subashchandran for developmental work on the covariation algorithms+ Received March 24, 1999; returned for revision May 5, 1999; revised manuscript received August 7, 1999 REFERENCES Ban N, Freeborn B, Nissen P, Penczek P, Grassucci R, Sweet R, Frank J, Moore P, Steitz T+ 1998+ A 9 Å resolution X-ray crystal- lographic map of the large ribosomal subunit+ Cell 93:1105–1115+ Basavappa R, Sigler PB+ 1991+ The 3 Å crystal structure of yeast initiator tRNA: Functional implications in initiator/elongator dis- crimination+ EMBO J 10:3105–3111+ Brion P, Westhof E+ 1997+ Hierarchy and dynamics of RNA folding+ Annu Rev Biophys Biomol Struct 26:113–137+ Brosius J, Dull TJ, Sleeter DD, Noller HF+ 1981+ Gene organization and primary structure of a ribosomal RNA operon from Esche- richia coli+ J Mol Biol 148:107–127+ Cate JH, Gooding AR, Podell E, Zhou K, Golden BL, Kundrot CE, Cech TR, Doudna JA+ 1996+ Crystal structure of a group I ribo- zyme domain: Principles of RNApacking+ Science 273:1678–1685+ Conn GL, Draper DE, Lattman EE, Gittis AG+ 1999+ Crystal structure of a conserved ribosomal protein–RNA complex+ Science 284: 1171–1174+ Conn GL, Gutell RR, Draper DE+ 1998+ A functional ribosomal RNA tertiary structure involves a base triple interaction+ Biochemistry 37:11980–11988+ Correll CC, Freeborn B, Moore PB, Steitz TA+ 1997+ Metals, motifs, and recognition in the crystal structure of a 5S rRNA domain+ Cell 91:705–712+ Ferré-D’Amaré AR, Zhou K, Doudna JA+ 1998+ Crystal structure of a hepatitis delta virus ribozyme+ Nature 395:567–574+ Gautheret D, Damberger SH, Gutell RR+ 1995+ Identification of base- triples in RNA using comparative sequence analysis+ J Mol Biol 248:27–43+ Gautheret D, Gutell RR+ 1997+ Inferring the conformation of RNA base pairs and triples from patterns of sequence variation+ Nu- cleic Acids Res 25:1559–1564+ Gautheret D, Major F, Cedergren R+ 1993+ Modeling the three- dimensional structure of RNA using discrete nucleotide confor- mational sets+ J Mol Biol 229:1049–1064+ Golden BL, Gooding AR, Podell ER, Cech TR+ 1998+ A preorganized active site in the crystal structure of the Tetrahymena ribozyme+ Science 282:259–264+ Green R, Noller HF+ 1997+ Ribosomes and translation+ Annu Rev Biochem 66:679–716+ Gutell RR+ 1996+ Comparative sequence analysis and the structure of 16S and 23S rRNA+ In: Zimmermann RA, Dahlberg AE, eds+ Ribosomal RNA: Structure, evolution, processing, and function in protein synthesis+ Boca Raton, Florida: CRC Press+ pp 111–128+ Gutell RR, Noller HF, Woese CR+ 1986+ Higher order structure in ribosomal RNA+ EMBO J 5:1111–1113+ Gutell RR, Power A, Hertz G, Putt E, Stormo G+ 1992+ Identifying constraints on the higher-order structure of RNA: Continued de- velopment and application of comparative sequence analysis meth- ods+ Nucleic Acids Res 20:5785–5795+ Kalurachchi K, Nikonowicz EP+ 1998+ NMR structure determination of the binding site for ribosomal protein S8 from Escherichia coli 168 rRNA+ J Mol Biol 280:639–654+ Kim S-H, Quigley FL, Suddath FL, McPherson A, Sneden D, Rim JJ, Weinzierl J, Rich A+ 1973+ Three-dimensional structure of yeast phenylalanine transfer RNA: Folding of the polynucleotide chain+ Science 179:285–288+ Klug A, Ladner J, Robertus JD+ 1974+ The structural geometry of co-ordinated base changes in transfer RNA+ J Mol Biol 89:511–516+ Leontis NB, Westhof E+ 1998+ The 5S rRNA loop E: Chemical probing and phylogenetic data versus crystal structure+ RNA 4:1134–1153+ Maidak BL, Cole JR, Parker CT Jr, Garrity GM, Larsen N, Li B, Lilburn TG, McCaughey MJ, Olsen GJ, Overbeek R, Pramanik S, Schmidt TM, Tiedje JM, Woese CR+ 1999+ A new version of the RDP+ Nucleic Acids Res 27:171–173+ Major F, Gautheret D, Cedergren R+ 1993+ Reproducing the three- dimensional structure of a tRNA molecule from structural con- straints+ Proc Natl Acad Sci USA 90:9408–9412+ 1438 P. Babin et al.
  • 10. Major F, Turcotte M, Gautheret D, Lapalme G, Fillion E, Cedergren R+ 1991+ The combination of symbolic and numerical computa- tion for three-dimensional modeling of RNA+ Science 253:1255– 1260+ Michel F, Ellington AD, Couture S, Szostak JW+ 1990+ Phylogenetic and genetic evidence for base-triples in the catalytic domain of group I introns+ Nature 347:578–580+ Michel F, Westhof E+ 1990+ Modelling of the three-dimensional archi- tecture of Group I catalytic introns base on comparative se- quence analysis+ J Mol Biol 216:585–610+ Moazed D, Stern S, Noller HF+ 1986+ Rapid chemical probing of conformation in 16S ribosomal RNA and 308 ribosomal subunits using primer extension. J Mol Biol 187:399–416+ Penczek P, Brassucci RA, Frank J+ 1994+ The ribosome at improved resolution: New techniques for merging and orientation refine- ment in 3D cryo-electron microscopy of biological particles+ Ultra- microscopy 53:251–270+ Pley HW, Flaherty KM, McKay DB+ 1994+ Three-dimensional struc- ture of a hammerhead ribozyme+ Nature 372:68–74+ Saenger W+ 1984+ Principles of nucleic acid structure+ New York: Springer-Verlag+ Svergun DI, Burkhardt N, Pedersen JS, Koch MHJ, Volkov VV, Kozin MB, Meerwinck W, Stuhrmann HB, Diederich G, Nierhaus KH+ 1997+ Structural analysis of the 70S E. coli ribosome and its RNA by solution scattering: I+ Invariants and validation of electron mi- croscopic models+ J Mol Biol 271:588–601+ Uhlenbeck OC, Pardi A, Feigon J+ 1997+ RNA structure comes of age+ Cell 90:833–840+ van Bohlen K, Makowski I, Hansen HAS, Bartesl H, Berkovitch-Yelin Z, Zaytzev A, Meyer S, Paullre C, Franscheschi F, Yonath A+ 1991+ Characterization and preliminary attempts for derivatiza- tion of crystals of large ribosomal subunits from Haloarcula maris- mortui diffracting to 3 Å resolution. J Mol Biol 222:11–15+ Woese CR, Magrum LJ, Gupta R, Siegel RB, Stahl DA, Kop J, Craw- ford N, Brosius J, Gutell RR, Hogan JJ, Noller HF+ 1980+ Second- ary structure model for bacterial 16S ribosomal RNA: Phylogenetic, enzymatic and chemical evidence+ Nucleic Acids Res 8:2275– 2293+ Geometry of a base triple in 16S rRNA 1439