The Purine Bias of Coding Sequences is Determined by Physicochemical Constraints on Proteins.

Bioinform Biol Insights

Fundação Oswaldo Cruz (FIOCRUZ), Instituto Oswaldo Cruz (IOC), Laboratório de Genômica Funcional e Bioinformática, Rio de Janeiro, RJ, Brazil.

Published: June 2014

In this report, we analyzed protein secondary structures in relation to nucleotide statistics at the three codon positions. The aim was to determine which properties at the ribosome, tRNA, or protein level could explain the purine bias (Rrr) observed in coding DNA. We found that the Rrr pattern is the consequence of a regularity (the codon structure) resulting from physicochemical constraints on proteins and thermodynamic constraints on the ribosomal machinery. The physicochemical constraints on proteins come mainly from the hydropathy and molecular weight (MW) of secondary structures, as well as from the energy cost of amino acid synthesis. These constraints appear through a network of statistical correlations: (i) the cost of amino acid synthesis, which favors a higher level of guanine in the first codon position; (ii) the constructive contribution of hydropathy alternation in proteins; (iii) the spatial organization of secondary structure in proteins according to solvent accessibility; (iv) the spatial organization of secondary structure according to amino acid hydropathy; (v) the statistical correlation of MW with protein secondary structures and their overall hydropathy; (vi) the statistical correlation of thymine in the second codon position with hydropathy and the energy cost of amino acid synthesis; and (vii) the statistical correlation of adenine in the second codon position with amino acid complexity and the MW of protein secondary structures. Amino acid physicochemical properties and functional constraints on proteins constitute a code that is translated into a purine bias within coding DNA via tRNAs. In that sense, the Rrr pattern within coding DNA is the effect of information transfer on nucleotide composition from protein to DNA, by selection according to codon position. Thus, coding DNA structure and ribosomal machinery co-evolved to minimize the energy cost of protein coding given the functional constraints on proteins.
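To make the position-specific statistic concrete, the following minimal sketch computes the fraction of purines (A or G) at each of the three codon positions of an in-frame coding sequence. It is an illustration only, not the authors' pipeline: the function name `purine_fraction_by_codon_position` and the toy sequence are assumptions introduced here, and the abstract does not specify how the statistics were actually computed.

```python
PURINES = {"A", "G"}
VALID_BASES = {"A", "C", "G", "T"}


def purine_fraction_by_codon_position(cds: str) -> list[float]:
    """Return the purine fraction at codon positions 1, 2 and 3.

    Assumes `cds` is already in frame (starts at codon position 1).
    Trailing bases that do not complete a codon are ignored, and
    ambiguous symbols such as N are skipped.
    """
    seq = cds.upper().replace("U", "T")  # accept RNA input as well
    usable = len(seq) - len(seq) % 3     # drop an incomplete final codon
    purines = [0, 0, 0]
    totals = [0, 0, 0]
    for i in range(usable):
        base = seq[i]
        if base not in VALID_BASES:
            continue
        pos = i % 3                      # 0, 1, 2 -> codon positions 1, 2, 3
        totals[pos] += 1
        if base in PURINES:
            purines[pos] += 1
    return [p / t if t else 0.0 for p, t in zip(purines, totals)]


# Hypothetical toy sequence (codons ATG GCT GAA AAG), not data from the paper.
toy_cds = "ATGGCTGAAAAG"
fractions = purine_fraction_by_codon_position(toy_cds)
print(fractions)  # [1.0, 0.5, 0.75]
```

On real coding sequences, a purine fraction at the first position that exceeds the fractions at the other positions would be consistent with the bias described above; a twelve-base toy example says nothing about actual genomes.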

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4039185
DOI: http://dx.doi.org/10.4137/BBI.S13161
