Internal correspondence analysis of codon and amino-acid usage in thermophilic bacteria.

J Appl Genet

Laboratoire de Biométrie, Biologie Evolutive, Centre National de la Recherche Scientifique -UMR-5558, Université Claude Bernard, LYON I, 43 Bd 11/11/1918, F-69622 Villeurbanne Cedex, France.

Published: July 2003

AI Article Synopsis

  • The study analyzed codon usage in mesophilic and thermophilic bacteria using two datasets, one with complete genomes and another with partial genomes.
  • Results showed that the amino-acid composition in thermophilic proteins is influenced by nucleic acid-level pressures rather than selection at the protein level.
  • Additionally, complexities at the mRNA level and the presence of outliers in the data caution against directly attributing temperature as the sole factor affecting codon trends in thermophiles.

Article Abstract

Starting from two datasets of codon usage in coding sequences from mesophilic and thermophilic bacteria, we used internal correspondence analysis to study the variability of codon usage within and between species, and within and between amino acids. The first dataset included 18,958,458 codons from 58,482 coding sequences from completely sequenced genomes of 25 species, along with 6,793,581 dinucleotides from 21,876 intergenic spaces. The second dataset, with partially sequenced genomes, included 97,095,873 codons from 293 bacterial species. Results were consistent between the two datasets. The trend for the amino-acid composition of thermophilic proteins was found to be under the control of a pressure at the nucleic acid level, not a selection at the protein level. This effect was not present in intergenic spaces, ruling out a pressure at the DNA level. The pattern at the mRNA level was more complex than a simple purine enrichment of the sense strand of coding sequences. Outliers in the partial genome dataset introduced a note of caution about the interpretation of temperature as the direct determinant of the trend observed in thermophiles. The surprising lack of selection on the amino-acid content of thermophilic proteins suggests that the amino-acid repertoire was set up in a hot environment.

Download full-text PDF

Source

Publication Analysis

Top Keywords

coding sequences
12
internal correspondence
8
correspondence analysis
8
thermophilic bacteria
8
codon usage
8
sequenced genomes
8
intergenic spaces
8
thermophilic proteins
8
analysis codon
4
amino-acid
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!