Empirical models for substitution in ribosomal RNA.

Mol Biol Evol

Department of Medical Biophysics, University of Toronto, and Ontario Cancer Institute, University Health Network, Toronto, Ontario, Canada.

Published: March 2004

Empirical models of substitution are often used in protein sequence analysis because the large alphabet of amino acids requires that many parameters be estimated in all but the simplest parametric models. When information about structure is used in the analysis of substitutions in structured RNA, a similar situation occurs. The number of parameters necessary to adequately describe the substitution process increases in order to model the substitution of paired bases. We have developed a method to obtain substitution rate matrices empirically from RNA alignments that include structural information in the form of base pairs. Our data consisted of alignments from the European Ribosomal RNA Database of Bacterial and Eukaryotic Small Subunit and Large Subunit Ribosomal RNA (Wuyts et al. 2001. Nucleic Acids Res. 29:175-177; Wuyts et al. 2002. Nucleic Acids Res. 30:183-185). Using secondary structural information, we converted each sequence in the alignments into a sequence over a 20-symbol code: one symbol for each of the four individual bases, and one symbol for each of the 16 ordered pairs. Substitutions in the coded sequences are defined in the natural way, as observed changes between two sequences at any particular site. For given ranges (windows) of sequence divergence, we obtained substitution frequency matrices for the coded sequences. Using a technique originally developed for modeling amino acid substitutions (Veerassamy, Smith, and Tillier. 2003. J. Comput. Biol. 10:997-1010), we were able to estimate the actual evolutionary distance for each window. The actual evolutionary distances were used to derive instantaneous rate matrices, and from these we selected a universal rate matrix. The universal rate matrices were incorporated into the PHYLIP software package (Felsenstein 2002. http://evolution.genetics.washington.edu/phylip.html), and we analyzed the ribosomal RNA alignments using both distance and maximum likelihood methods.
The empirical substitution models performed well on simulated data, and produced reasonable evolutionary trees for 16S ribosomal RNA sequences from sequenced Bacterial genomes. Empirical models have the advantage of being easily implemented, and the fact that the code consists of 20 symbols makes the models easily incorporated into existing programs for protein sequence analysis. In addition, the models are useful for simulating the evolution of RNA sequence and structure simultaneously.
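The 20-symbol coding and the pairwise substitution counts described above can be illustrated with a minimal Python sketch. It is not the authors' implementation. It assumes ungapped sequences that share one secondary structure given in dot-bracket notation, emits each base pair once (at its 5' position), and reuses the 20 amino-acid letters as symbols. The symbol assignment and the function names `pair_table`, `encode`, `substitution_counts`, and `divergence` are hypothetical choices made for illustration.

```python
from collections import Counter

BASES = "ACGU"
# Hypothetical symbol assignment: the 20 amino-acid letters, in alphabetical order.
SYMBOLS = "ACDEFGHIKLMNPQRSTVWY"

# 20 states: the four unpaired bases followed by the 16 ordered pairs.
STATES = list(BASES) + [a + b for a in BASES for b in BASES]
CODE = dict(zip(STATES, SYMBOLS))


def pair_table(structure):
    """Map every paired position to its partner, from dot-bracket notation."""
    partner, stack = {}, []
    for i, ch in enumerate(structure):
        if ch == "(":
            stack.append(i)
        elif ch == ")":
            if not stack:
                raise ValueError("unbalanced structure")
            j = stack.pop()
            partner[i], partner[j] = j, i
    if stack:
        raise ValueError("unbalanced structure")
    return partner


def encode(seq, structure):
    """Recode an RNA sequence: one symbol per unpaired base, one per base pair."""
    if len(seq) != len(structure):
        raise ValueError("sequence and structure lengths differ")
    partner = pair_table(structure)
    out = []
    for i, base in enumerate(seq):
        if i not in partner:
            out.append(CODE[base])                     # unpaired base
        elif i < partner[i]:
            out.append(CODE[base + seq[partner[i]]])   # ordered pair, emitted once
    return "".join(out)


def substitution_counts(coded_a, coded_b):
    """Count observed (symbol in a, symbol in b) combinations site by site."""
    return Counter(zip(coded_a, coded_b))


def divergence(coded_a, coded_b):
    """Fraction of coded sites at which the two sequences differ."""
    diffs = sum(x != y for x, y in zip(coded_a, coded_b))
    return diffs / len(coded_a)


structure = "(((...)))"
a = encode("GGGAAACCC", structure)   # three G-C pairs, three unpaired A
b = encode("GGUAAAACC", structure)   # innermost pair changed from G-C to U-A
counts = substitution_counts(a, b)
```

Under this symbol assignment the two example sequences code to `QQQAAA` and `QQTAAA`, so the compensatory change from a G-C pair to a U-A pair is recorded as a single substitution (`Q` to `T`) at one coded site. Counts accumulated over many sequence pairs that fall within one divergence window would give a substitution frequency matrix of the kind the abstract describes.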

Source
http://dx.doi.org/10.1093/molbev/msh029
