Physica A 386 (2007) 564–572Networks of interactions in the secondary and tertiary structureof ribosomal RNAChang-Yong Lee...
Complex networks are often classified, according to research fields, as social [5,6], technological [7–10],and biological ne...
2. Conceptualization of rRNA molecules as networksA biologically active RNA structure is composed of a specific sequence of...
the normalized shortest path length for the constructed rRNA networks have the same standard Gaussiandistribution, despite...
As shown in Fig. 2, the plotted mass functions for all rRNA networks reveal that the tertiary interactionsreduce the maxim...
interactions while maintaining the same number of tertiary interactions. In both of the T16S-3D and H23S-3Dnetworks, Lreal...
implying the absence of any characteristic scale of the helix betweenness. Similar power-law behavior arefound for the T16...
important helices that maintain the rRNA structure networks have minimal variation in bacterial rRNAs.A similar result is ...
References[1] For review network theory, see, for example, M. Newman, SIAM Rev. 45 (2003) 167;R. Albert, A.-L. Baraba´ si,...
Upcoming SlideShare
Loading in …5
×

Gutell 101.physica.a.2007.386.0564.good

191 views
150 views

Published on

Lee C.-Y., Lee J.C., and Gutell R.R. (2007).
Networks of interactions in the secondary and tertiary structure of ribosomal RNA.
Physica A, 386(1):564-572.

Published in: Technology
0 Comments
0 Likes
Statistics
Notes
  • Be the first to comment

  • Be the first to like this

No Downloads
Views
Total views
191
On SlideShare
0
From Embeds
0
Number of Embeds
1
Actions
Shares
0
Downloads
0
Comments
0
Likes
0
Embeds 0
No embeds

No notes for slide

Gutell 101.physica.a.2007.386.0564.good

  1. 1. Physica A 386 (2007) 564–572Networks of interactions in the secondary and tertiary structureof ribosomal RNAChang-Yong Leea,Ã, Jung C. Leeb, Robin R. GutellbaThe Department of Industrial Information, Kongju National University, Chungnam 340-702, South KoreabThe Institute for Cellular and Molecular Biology, The University of Texas at Austin, 1 University Station A4800, Austin, TX 78712, USAReceived 13 March 2007; received in revised form 12 July 2007Available online 28 August 2007AbstractWe construct four different structural networks for both the secondary and tertiary structures of the 16S and 23Sribosomal RNAs (rRNAs) in the high-resolution crystal structures of the Thermus thermophilus 30S and Haloarculamarismortui 50S ribosomal subunits, and investigate topological characteristics of the rRNA structures by determiningrelevant measures, such as the characteristic path length, the clustering coefficient, and the helix betweenness. This studyreveals that the 23S rRNA network is more compact than the 16S rRNA networks, reflecting the more globular overallstructure of the 23S rRNA relative to the 16S rRNA. In particular, the large number of tertiary interactions in the 23SrRNA tends to cluster, accounting for its small-world network properties. In addition, although the rRNA networks arenot the scale-free network, their helix betweenness has a power-law distribution and is correlated with the phylogeneticconservation of helices. The higher the helix betweenness, the more conserved the helix. These results suggest a potentialrole of the rRNA network as a new quantitative approach in rRNA research.r 2007 Elsevier B.V. All rights reserved.PACS: 87.14.Gg; 87.15.Àv; 89.75.HcKeywords: Ribosomal RNA; Complex networks; rRNA structure; Nucleotide conservation; Small-world1. IntroductionThe network (or graph) theory [1], originated from the Ko¨ nigsberg’s seven bridges problem formulated byEuler, was systematically studied in terms of the random network theory developed by Erdo¨ s and Re´ nyi [2].Significant advance in the network theory was recently made by the discovery of some distinctive features thatmany complex networks have in common, including the small-world [3] and the scale-free [4] properties. Theseuncovered characteristics distinguish complex networks from the random and the regular networks.Subsequent researches on the complex networks of various systems have made considerable progress in theunderstanding of these systems, and studies on the complex networks have become more active across manydisciplines.ARTICLE IN PRESSwww.elsevier.com/locate/physa0378-4371/$ - see front matter r 2007 Elsevier B.V. All rights reserved.doi:10.1016/j.physa.2007.08.030ÃCorresponding author. Tel.: +82 041 330 1423; fax: +82 041 330 1429.E-mail address: clee@kongju.ac.kr (C.-Y. Lee).
  2. 2. Complex networks are often classified, according to research fields, as social [5,6], technological [7–10],and biological networks [3,10–14], to name just a few. Recently, the research community has begun tostudy various types of biological networks: neural networks [3], food networks [10], metabolic networks [11],genetic regulatory networks [12], and protein interaction networks [13,14]. Most biological networksare composed of molecules (or substrates) and their interactions that are represented as vertices andedges, respectively. This research on biological networks is mainly focused on the investigation of theirconnectivity by determining such statistical measures as the degree distribution, the characteristic path length,and the clustering coefficient; many biological networks are found to be a scale-free and/or small-worldnetwork.Besides these biological networks made of independent molecules interacting with one another, a largebiological molecule itself can be represented as a network. Proteins and RNA molecules, which are composedof a long chain of amino acids or nucleotides with multiple interactions between them, can be constructed intoappropriate networks which represent structural features. Protein structures, which have been traditionallyviewed as molecules that catalyze essential functions in the cell, have been studied as the network of aminoacids [15–17]. Furthermore, it was shown that protein structures can be characterized as the small-worldnetwork with which key residues for their folding process can be identified. In addition to the conventionalanalysis methods, the quantitative network-based methods can reveal hidden functional as well as structuralcharacteristics of proteins.The structure of the rRNA is important because it is believed that the structure dictates itsbiological function. Since the rRNA, a tightly packed asymmetric macromolecule, has been consideredtoo large for a high-resolution structural analysis, quantitative studies on the structure proved difficultuntil recent progress in the high resolution X-ray crystallography has been made. The 2:4 ˚A resolutionof the 50S subunit from the H. marismortui [18] and the 3:05 ˚A resolution of the 30S subunit fromthe T. thermophilus [19] provided the first detailed views of the structure at the atomic level. Theseenable us to study not only the sequence and structure, but the function of the molecules in a greatdetail.Two-thirds of the mass of the ribosome [20], the site of protein synthesis of a living cell, is RNA and theremainder is protein. While an older and conventional paradigm dictated that the functional sites in theribosome is composed of protein, a long series of experiments [21] that culminated recently with the high-resolution crystal structures of the 30S and 50S ribosomal subunits [18,19] revealed and verified that RNA(16S rRNA in the 30S subunit; 23S and 5S rRNAs in the 50S subunit) is the active participant in proteinsynthesis. Based on the simple concept that different RNA sequences with evolutionary related and similarbiological functions fold into very similar secondary and tertiary structures, the secondary structures of therRNAs were determined with comparative sequence analysis [22–24]. In particular, approximately 97–98% ofthe basepairs predicted in these structure models are present in the high-resolution crystal structures of the T.thermophilus 30S and H. marismortui 50S subunits [25].Early efforts to quantitatively analyze the RNA secondary structure (including rRNA) were based onsimplified representations, including the fine- and coarse-grained tree representations. In the find-grainedtree representation [26], both basepairs and unpaired nucleotides are considered as vertices, while adjacentbase-pairs and/or unpaired nucleotides are considered as edges. This approach has been used for com-parison of two structures, including structure alignment, motif-based searches, and quantitative measurementof the ‘‘tree distance’’ between two structures [27]. In the coarse-grained tree representation that involvesa RNA chain of up to about 100 nucleotides [28,29], double stranded helices are represented as edges,while single stranded loops (hairpin, internal, and multi-stem loops) are represented as vertices. Thisrepresentation is used for the study of the algebraic connectivity based on the spectral decomposition[30]. More recently, the interaction networks of RNAs is studied for the relationships between helicaldomains [31]. This study uncovers not only structural similarity but the conserved pattern and distancesbetween motifs.In this paper, we study the characteristics of the rRNA structure from the biological network perspective. Inparticular, we construct four rRNA networks employing both the secondary and tertiary structures of the T.thermophilus 16S and H. marismortui 23S rRNAs by identifying each nucleotide as a vertex and chemicalbonds (either the hydrogen or the covalent bond) between nucleotides as an edge [25,32].ARTICLE IN PRESSC.-Y. Lee et al. / Physica A 386 (2007) 564–572 565
  3. 3. 2. Conceptualization of rRNA molecules as networksA biologically active RNA structure is composed of a specific sequence of four nucleotides [adenine (A),cytosine (C), guanine (G), and uracil (U)] that is folded into its secondary structure and then into its tertiarystructure. Procaryotic ribosomes are molecules of about 250 ˚A in diameter and contain the 30S (small) and 50S(large) subunits. The 30S subunit contains about 20 proteins and the 16S rRNA which plays a crucial role inthe decoding process of mRNA; the 50S subunit contains about 30 proteins and the 23S and 5S rRNAs whichcatalyze the chemical reaction of peptide bond formation [33]. Whereas the 5S rRNA contains relatively smallnumber of about 120 nucleotides, the 16S and 23S rRNAs are large polymers of approximately 1500 and 3000nucleotides, respectively. Their secondary structures comprised of double-stranded helices and single-strandedloops are divided into secondary structural domains (four domains in the 16S rRNA and six in the 23S rRNA)[34]. These domains range in size from approximately 150–550 and 270–840 nucleotides in the 16S and 23SrRNAs, respectively.In unfolded state, the rRNA is a single-stranded linear polymer of nucleotides, in which the backbone ofadjacent nucleotides is connected via the covalent bond. Thus, the order of nucleotides linked by the covalentbond determines the sequence of the rRNA. In order for the rRNA to function biologically, the linear polymerfolds onto itself to form helices of various sizes which are main components in the 3D structure. A helix isnothing but a pair of consecutive sequence segments that form base pairs via the hydrogen bond. Due to theformation of helices, various types of single-stranded region (or unpaired region) occur between helices.Similar to the terminology devised for describing protein structures, the architecture of the rRNA istraditionally described hierarchically by the secondary and tertiary structures. The RNA secondary structureis a 2D diagram consisting of many secondary structure elements including double-stranded helices andunpaired regions. The RNA tertiary structure, in contrast, is a 3D structure in which secondary structureelements are strategically and topologically arranged with each other to make a large number of tertiarycontacts between secondary structure elements. Thus, tertiary interactions contain secondary interactions,pseudoknot interactions, and all other hydrogen-bond-mediated contacts comprising base–base, base–backbone, and backbone–backbone interactions.A detailed mapping of the secondary and tertiary structure interactions in the high-resolution 16S and23S rRNAs crystal structures [19,18] revealed that, while the secondary structure interactions of the 16S and23S rRNAs occur within domains, their tertiary structure interactions occur both within and between domains[25]. In particular, the 23S rRNA contains many more tertiary interactions than the 16S rRNA.Approximately 15% and 45% of the 180 and 460 tertiary interactions in the 16S and 23S rRNAs,respectively, occur between domains (data not shown) suggesting that the 23S rRNA is more globular in shapecompared to the 16S rRNA.Here we introduce and conceptualize rRNA structural networks by representing nucleotides as vertices andtheir covalent sugar-phosphate backbone and hydrogen-bonds interactions as edges. We construct four rRNAnetworks for the secondary structure only and secondary and tertiary structure in the high-resolutionT. thermophilus 16S and H. marismortui 23S rRNA crystal structures [19,18,25]. Moreover, multiple bondsbetween a pair of nucleotides are represented as a single edge while multiple edges between a pair of verticesare, in general, not allowed in the network theory. The four networks are denoted as T16S-2D, T16S-3D,H23S-2D, and H23S-3D, where 2D and 3D represent secondary structure only and secondary and tertiarystructure, respectively. The four constructed rRNA networks are then analyzed by such commonly adoptedmeasures as the shortest path length, the clustering coefficient, and the vertex betweenness to understandtopological characteristics of the rRNA structures from a network perspective.3. Results3.1. Compactness and small-world propertyOne measure to quantify the network topology is the shortest path length lij connecting a pair of verticesi and j. This measure is simply the minimum number of edges along the shortest path between the pair. Fig. 1shows frequencies of the normalized shortest path length ~l for the four different networks. The frequencies ofARTICLE IN PRESSC.-Y. Lee et al. / Physica A 386 (2007) 564–572566
  4. 4. the normalized shortest path length for the constructed rRNA networks have the same standard Gaussiandistribution, despite the differences in their detailed structural characteristics.Although the four networks have the same distribution in their mean shortest paths, their means andstandard deviations differ from one another, as shown in Table 1. For a linear polymer of N nucleotides, themean of the shortest path length ml % N=3$OðNÞ for Nb1. Thus, if both rRNAs were to have similarstructural characteristics, ml of the 23S rRNA network should be about twice that of the 16S rRNA networksince the 23S rRNA has about twice as many nucleotides as the 16S rRNA. Table 1, however, shows that boththe 16S and 23S rRNA networks have about the same mean shortest path length considering the 2D and 3Dnetworks separately. In addition, the folding of the secondary structure to its tertiary structure leads to anapproximately 2- and 3-fold reduction in the 16S and 23S rRNA networks, respectively, in both the meanshortest path and its standard deviation. This suggests quantitatively that the 23S rRNA is structurally morecompact than the 16S rRNA.To elaborate on these findings, we utilize the mass function MðdÞ using the cumulative density function ofthe frequency distribution. That is,MðdÞ ¼ NXlpdPðlÞ, (1)where N is the number of nucleotides, and PðlÞ is the frequency of l. The mass function is a measure of theaverage number of nucleotides within a distance less than or equal to d, and it is equivalent to the average‘‘mass’’ of the network. Incidentally, it is similar to the hop plot of the Internet diameter [35].ARTICLE IN PRESS-2 0 2 40.00.10.20.30.40.50.6frequencynormalized shortest path lengthFig. 1. The frequency distribution of the normalized shortest path length ~l ¼ ðl À ^mlÞ=^sl for the four rRNA networks, where ^ml and ^sl arethe estimated mean and standard deviation of the shortest path length: T16S-2D ð&Þ, T16S-3D ðÞ, H23S-2D ðnÞ, and H23S-3D ð,Þ.Frequencies are normalized such that, for each network, the sum of frequencies over all shortest path lengths is unity. By the scaling, alldistributions collapse to the standard Gaussian distribution, Nð0; 1Þ (solid line).Table 1The estimated mean (^ml) and its standard deviation (^sl) of the shortest path length for four rRNA networksT16S-2D H23S-2D T16S-3D H23S-3D^ml 66.2 72.0 30.5 24.4^sl 82.6 65.4 36.5 21.0C.-Y. Lee et al. / Physica A 386 (2007) 564–572 567
  5. 5. As shown in Fig. 2, the plotted mass functions for all rRNA networks reveal that the tertiary interactionsreduce the maximum shortest path length lmax in both the 16S and 23S rRNAs, where lmax satisfiesMðlmaxÞ % N. More importantly, the 23S rRNA has a slope more than twice the 16S rRNA in both thesecondary and tertiary structures, indicating that nucleotides in the 23S rRNA are more densely packed thanthose in the 16S rRNA. Since the 23S rRNA contains about twice as many nucleotides as the 16S rRNA,the 23S rRNA network would contain about twice as many nucleotides as the 16S rRNA within the samedistance if the packing density is comparable for both rRNAs. A similar finding was established from the massfractal dimension analysis of rRNA molecules. While the mass fractal dimension of the 16S rRNA molecule isless than three, that of the 23S rRNA is close to three, implying that the 23S rRNA is a more compact 3Dobject [36].The characteristic compactness of the H23-3D network is due to an increase in the number of domain-domain tertiary interactions in 23S rRNA, relative to 16S rRNA. These extra tertiary interactions betweendifferent domains reduce the overall simple sequence distance between the nucleotides in different domains.While the more compact 23S rRNA itself is responsible for the peptide bond formation [37,18], the lesscompact 16S rRNA might be related to the higher degree of structural flexibility of the 30S subunit duringtranslocation of mRNA and tRNAs, including the rotational rigid-body motion between the head and the restof the 30S subunit [38]. In particular, it has been reported that the 30S subunit might undergo the ratchet-likemovement relative to the large 50S subunit [39]. In contrast to the 30S subunit, the 50S subunit might not beassociated with any significant movements in its core region during protein synthesis because of its muchcompact structure except peripheral regions.We further address the shortest path length from the perspective of the small-world network [3]. Nucleotidesin rRNA secondary structures contain only two or three interactions, including basepairing interactions withtheir basepairing partners and/or covalent interactions with their neighboring nucleotides in sequence, so thatthey are not clustered and hardly show an appreciable cliquishness. Thus, the rRNA networks based only onthe secondary structure interactions (2D) do not form a small-world network. In contrast, the networks basedon the secondary and tertiary structure interactions (3D) contain many tertiary interactions that result in aperceptible cliquishness characteristic of the small-world network.As shown in Table 2, we calculate the characteristic path length L and the clustering coefficient C [3] in bothof the T16S-3D and H23S-3D networks, and compare these values with networks of randomly rewired tertiaryARTICLE IN PRESS0 50 100 150 200050010001500200025003000b=100.7b=32.6b=33.0b=14.9massfunction,M(d)distance, dFig. 2. Plots of the mass function MðdÞ versus the distance d for four rRNA networks: T16S-2D ðÞ, T16S-3D ðÞ, H23S-2D ðÞ, andH23S-3D ð’Þ. Dotted lines are estimated slopes b’s which are 14.9, 33.0, 32.6, and 100.7 for T16S-2D, T16S-3D, H23-2D, and H23S-3D,respectively.C.-Y. Lee et al. / Physica A 386 (2007) 564–572568
  6. 6. interactions while maintaining the same number of tertiary interactions. In both of the T16S-3D and H23S-3Dnetworks, Lreal$Lregular and CrealbCrandom$NÀ1, suggesting that both networks are far from the randomnetwork and resemble the regular grid network. Interestingly, Lreal$Lregular and CrealbCrewired for the 23SrRNA tertiary network, featuring the small-world property. The H23S-3D network is highly clustered like aregular lattice, yet has a small characteristic path length like a random network. The same is not true for theT16S-3D network where Creal is just an order of magnitude larger than Crewired. This finding can be accountedby the fact that the 23S rRNA has many more tertiary structure interactions than the 16S rRNA. Experimentshave shown that a RNA sequence form its secondary structure first, and then folds into its tertiary structure[40,41]. This hierarchical folding of RNA structure suggests that the formation of tertiary structureinteractions, especially in the 23S rRNA, results in the clustering of nucleotides.3.2. Betweenness and nucleotide conservationComparative sequence analysis of rRNA sequences for organisms that span across the three primarydivisions of life (Archaea, Bacteria, and Eukaryotes) revealed that many of the highly conserved nucleotidesare found clustered in a few specific regions of the rRNA structure [34,42]. The relative conservation ofnucleotides in rRNA structure is related to their prominence or importance within a network environment.This prominence of a network is called the centrality in the network, since it measures which vertex(nucleotide) is best connected to other vertices or the most influential in the formation of the network.Although the relative importance of a vertex is quantified by various measures such as the vertex betweenness,the degree, and eigenvector centrality [43], here we employ the vertex betweenness as a measure and investigateits correlation with the nucleotide conservation.The vertex betweenness of a vertex vi, bðviÞ, is defined as the number of shortest paths between all pairs ofvertices passing through the vertex vi [43,44] and is usually normalized by the maximum possible value, so thatit takes values between 0 and 1. Since some vertices can be visited more frequently than others in a network,the betweenness of a vertex can be also used as a measure for the amount of ‘‘traffic’’ that runs through thevertex [45]. In this regard, the vertex betweenness can be used as an indicator for which vertex (nucleotide inrRNA) is the most influential to hold a network.Similarly, the helix betweenness can be defined by treating rRNA secondary structure helices as ‘‘vertices’’to measure for which helix in the rRNAs is the most influential helix to hold the rRNA structure networks.The helix betweenness of a helix j, Bj, is defined as the average vertex betweenness for all vertices in a helix.That is,Bj ¼1njXvi2AjbðviÞ, (2)where nj is the number of vertices (or nucleotides) in a helix j and Aj is the set of vertices in the helix j.Fig. 3 shows distributions of the helix betweenness of the H23-2D and H23-3D networks, both of whichapproximately have power-law distributions of the formPðBÞ / BÀa, (3)ARTICLE IN PRESSTable 2The characteristic path length ðLrealÞ and the clustering coefficient ðCrealÞ for T16S-3D and H23S-3D, which are compared to those withthe randomly rewired tertiary interactions (Lrewired and Crewired )Network N Lreal Lrewired Lregular %ffiffiffiffiffiNpCreal Crewired Crandom % NÀ1T16S-3D 1521 33.1 12.0 39.0 0.010 0.0031 0.00066H23S-3D 2922 27.7 11.4 54.1 0.014 0.0006 0.00034Lregular and Crandom represent the characteristic path length of a regular grid network and the clustering coefficient of a random network,respectively.ffiffiffiffiffiNpand NÀ1represent the typical characteristic path length of a regular grid network and clustering coefficient of a randomnetwork, respectively, where N is the number of nucleotides (vertices) in rRNA sequence comprising the 16S and 23S rRNA networks.C.-Y. Lee et al. / Physica A 386 (2007) 564–572 569
  7. 7. implying the absence of any characteristic scale of the helix betweenness. Similar power-law behavior arefound for the T16S-2D and T16S-3D networks (data not shown). These results indicate that the power-lawbehavior of the betweenness that was first observed in the scale-free networks [45] also occurs in non-scale-freenetworks. Some other non-scale-free networks also follow a power-law behavior, as reported in thecollaboration network and the neural network of Caenorhabditis elegans [46].The power-law distribution tells that high values of helix betweenness is not just statistically forbidden orrare, but are as equally important in the organization of a network as those with low values. The power-lawalso indicates that while most of helices in the rRNA structure networks have low helix betweenness values, afew of the helices have high helix betweenness values that are statistically significant. This implies that only afew helices of high betweenness dictate the global structure, while most of helices are participated in clusteringlocal structures. Interestingly, tertiary interactions slightly alter the power-law distribution as shown in Fig. 3.Since the tertiary structure necessarily has more edges linked by the tertiary interactions, the tertiary structuremay yield a new shortest path that is not possible in the secondary structure. Moreover, the number of verticesalong the shortest path in the tertiary structure is always less than that in the secondary structure. Therefore,tertiary interactions reduce the extent of the betweenness, and thus the number of helices with higher helixbetweenness values are reduced.We also introduce the term helix conservation as a measure of the average of the nucleotide conservationsfor all nucleotides in a helix. The nucleotide conservation values can be computed for each nucleotide positionin a sequence alignment using the modified Shannon equation [42],CONS ¼XiPi log2 ð4PiÞ þ PD log2 ðPDÞ, (4)where Pi is the frequency of nucleotide i at a given position and PD is the frequency of deletions at thatposition. The computed nucleotide conservation values range between À1 and 2 ðÀ1pCONSp2Þ.The plots of the helix betweenness versus the helix conservation computed for the bacterial alignment forthe T16S-2D and H23S-2D networks are shown in Fig. 4. When a nucleotide with CONS41:3 is arbitraryconsidered highly conserved, helices with high helix betweenness values are referred to as highly conservedhelices in both the T16S-2D and H23S-2D networks. The converse, however, is not true. In particular, vastmajority of helices with their betweenness greater than 0.1 is highly conserved, suggesting that the mostARTICLE IN PRESS1E-3 0.01 0.11E-41E-30.010.1probabilityhelix betweennessFig. 3. Log–log plots of the helix betweenness versus its probability for H23S-2D ð’Þ and H23S-3D ðÞ. The solid and the dottedlines represent estimated slopes, which are À1:13 Æ 0:07 and À1:11 Æ 0:08 for H23S-2D and H23S-3D, respectively. The distribution ofH23S-3D is shifted vertically for the display purpose.C.-Y. Lee et al. / Physica A 386 (2007) 564–572570
  8. 8. important helices that maintain the rRNA structure networks have minimal variation in bacterial rRNAs.A similar result is also obtained for the networks of tertiary structures.4. Summary and conclusionIn this paper, the four 16S and 23S rRNA structural networks were constructed using the secondary andtertiary interactions mapped in the high-resolution crystal structures of the T. thermophilus 30S andH. marismortui 50S subunits, by treating nucleotides and their interactions as vertices and edges, respectively.Subsequently, their topological characteristics were investigated quantitatively with various measures and thenrelated to the biological and structural implications of the rRNAs.The mass functions for the rRNA networks revealed that the 23S rRNA is more compact than the 16SrRNA in both the secondary-only (2D) and secondary and tertiary (3D) networks. The tertiary inter-actions, especially in the H23-3D network, cluster nucleotides in a way that increase the cliquishness of thenetwork. In addition, the helix betweenness follows a power-law distribution. Only a few helices havehigh centrality in the formation of the global structure of rRNA, while the rest are associated with theclustering of local structures. Furthermore, helices with higher betweenness are usually highly conserved inBacteria.These results could uncover the characteristics of the rRNA that are not discernible with other qualitativemethod, and suggest a potential role of rRNA networks as a new quantitative approach for RNA research. Inparticular, network analysis of the 16S and 23S rRNA structures may give some insights into RNA structureand function by providing some useful quantitative measures to describe the topological characteristics ofRNA structure and folding.This work was supported by the Korea Research Foundation Grant funded by the Korean Government(MOEHRD) (KRF-2005-041-H00052 to CYL), the Welch Foundation (F-1427 to RRG), and the NationalInstitutes of Health (GM067317 to RRG). The main calculations were performed by using thesupercomputing resource of the Korea Institute of Science and Technology Information(KISTI).ARTICLE IN PRESS0.00 0.05 0.10 0.15 0.20 0.25 0.30-1.0-0.50.00.51.01.52.0helixconservation0.00 0.05 0.10 0.15 0.20-0.50.00.51.01.52.0helixconservationhelix betweennessFig. 4. Plots of the helix betweenness versus the helix conservation in the bacterial domain for T16S-2D (A) and H23S-2D (B). Thequantities are dimensionless.C.-Y. Lee et al. / Physica A 386 (2007) 564–572 571
  9. 9. References[1] For review network theory, see, for example, M. Newman, SIAM Rev. 45 (2003) 167;R. Albert, A.-L. Baraba´ si, Rev. Mod. Phys. 74 (2002) 47.[2] P. Erdo¨ s, A. Re´ nyi, Pulb. Math. Inst. Hung. Acad. Sci. 5 (1960) 17;B. Bolloba´ s, Random Graphs, Academic Press, London, 1985.[3] D. Watts, S. Strogatz, Nature (London) 393 (1998) 440.[4] A.-L. Baraba´ si, R. Albert, Science 286 (1999) 509.[5] L. Amaral, A. Scala, M. Barthelemy, H. Stanley, Proc. Natl. Acad. Sci. USA 97 (2000) 12149.[6] M. Newman, Proc. Natl. Acad. Sci. USA 98 (2001) 404.[7] M. Faloutsos, P. Faloutsos, C. Faloutsos, Comput. Commun. Rev. 29 (1999) 251.[8] R. Albert, H. Jeong, A.-L. Baraba´ si, Nature (London) 401 (1999) 130.[9] A. Broder, R. Kumar, F. Maghoul, P. Raghaven, S. Rajagopalan, R. Stata, A. Tomkins, J. Wiener, Comput Networks 33 (2000) 309.[10] R. Williams, N. Martinez, Nature (London) 404 (2000) 180.[11] D. Fell, A. Wagner, Nat. Biotechnol. 18 (2000) 1121.[12] N. Guelzim, S. Bottani, P. Bourgine, F. Kepes, Nature Genetics 31 (2002) 60.[13] H. Jeong, B. Tombor, R. Albert, Z. Litvai, A.-L. Baraba´ si, Nature (London) 407 (2000) 651.[14] For a recent review in the biological network, see for example, a special section in Science 301 (2003) 1863, and references therein.[15] M. Vendruscolo, N.V. Dokholyan, E. Paci, M. Karplus, Phys. Rev. E 65 (2002) 061910.[16] L.H. Greene, V.A. Higman, J. Mol. Biol. 334 (2003) 781.[17] A.R. Atilgan, P. Akan, C. Baysal, Biophys. J. 86 (2004) 85.[18] N. Ban, P. Nissen, J. Hansen, P. Moore, T. Steitz, Science 289 (2000) 905.[19] B. Wimberly, et al., Nature (London) 407 (2000) 327.[20] For a general reference of the ribosome and its function, see, for example, K.H. Nierhaus, D.N. Wilson (Eds.), Protein Synthesis andRibosome Structure: Translating the Genome, Wiley-VCH, Weinheim, 2004; R. Garrett, S. Douthwaite, A. Liljas, A. Matheson, P.Moore, H. Noller (Eds.), The Ribosome: Structure, Function, Antibiotics, and Cellular Interactions, ASM Press, Washington DC,2000.[21] H.F. Noller, Annu. Rev. Biochem. 53 (1984) 119.[22] G.W. Fox, C.R. Woese, Nature (London) 256 (1975) 505.[23] H.F. Noller, et al., Nucl. Acids Res. 9 (1981) 6167.[24] C.R. Woese, R. Gutell, R. Gupta, H.F. Noller, Microbiol. Rev. 47 (1983) 621.[25] R. Gutell, J. Lee, J. Cannone, Curr. Opin. Struct. Biol. 12 (2002) 301 and references therein.[26] W. Fontana, D.A.M. Konings, P.F. Stadler, P. Schuster, Biopolymers 33 (1993) 1389.[27] I.L. Hofacker, W. Fontana, P.F. Stadler, L.S. Bonhoeffer, M. Tacker, P. Schuster, Monatsh. Chem. 125 (1994) 167.[28] B.A. Shapiro, Comput. Appl. Biosci. 4 (1988) 387.[29] S.Y. Le, R. Nussinov, J.V. Maizel, Comput. Biomed. Res. 22 (1989) 461.[30] D. Barash, J. Comp. Biol. 11 (2004) 1169;D. Barash, Bioinformatics 22 (2004) 1861.[31] A. Lescoute, E. Westhof, Nucl. Acids Res. 34 (2006) 6587.[32] The structural information of both rRNAs can be found in hhttp://www.rcsb.org/pdb/i. The access numbers are 1JJF and 1JJ2 for theThermus thermophilus 30S and Haloarcula marismortui 50S, respectively.[33] V. Ramakrishnan, Cell 108 (2002) 557.[34] J.J. Cannone, et al., BMC Bioinformatics 3 (2002) 2.[35] R. Pastor-Satorras, A. Vespignani, Evolution and Structure of the Internet, Cambridge University Press, Cambridge, UK, 2004(Chapter 4).[36] C.-Y. Lee, Phys. Rev. E 73 (2006) 042901.[37] H. Noller, Science 309 (2005) 1508;P. Nissen, J. Hansen, N. Ban, P.B. Moore, T.A. Steitz, Science 289 (2000) 920.[38] J. Frank, R. Agrawal, Nature (London) 406 (2000) 318.[39] B. Schuwirth, et al., Science 310 (2005) 827.[40] I. Tinoco, C. Bustamante, J. Mol. Biol. 293 (1999) 271.[41] P. Higgs, Q. Rev. Biophys. 33 (2000) 199.[42] R. Gutell, B. Weiser, C. Woese, H. Noller, Progr. Nucl. Acid Res. Mol. Biol. 32 (1985) 155.[43] L. Freeman, Sociometry 40 (1977) 35;L. Freeman, Soc. Network. 1 (1979) 215.[44] M.E.J. Newman, Phys. Rev. E 64 (2001) 016132.[45] K.-I. Goh, B. Kahng, D. Kim, Phys. Rev. Lett. 87 (2001) 278701.[46] C.-Y. Lee, (arXiv:physics/0605220).ARTICLE IN PRESSC.-Y. Lee et al. / Physica A 386 (2007) 564–572572

×