Understanding the relative contributions of various evolutionary processes-purifying selection, neutral drift, and adaptation-is fundamental to evolutionary biology. A common metric to distinguish these processes is the ratio of nonsynonymous to synonymous substitutions (i.e., dN/dS) interpreted from the neutral theory as a null model. However, from biophysical considerations, mutations have non-negligible effects on the biophysical properties of proteins such as folding stability. In this work, we investigated how stability affects the rate of protein evolution in phylogenetic trees by using simulations that combine explicit protein sequences with associated stability changes. We first simulated myoglobin evolution in phylogenetic trees with a biophysically realistic approach that accounts for 3D structural information and estimates of changes in stability upon mutation. We then compared evolutionary rates inferred directly from simulation to those estimated using maximum-likelihood (ML) methods. We found that the dN/dS estimated by ML methods (ωML) is highly predictive of the per gene dN/dS inferred from the simulated phylogenetic trees. This agreement is strong in the regime of high stability where protein evolution is neutral. At low folding stabilities and under mutation-selection balance, we observe deviations from neutrality (per gene dN/dS > 1 and dN/dS < 1). We showed that although per gene dN/dS is robust to these deviations, ML tests for positive selection detect statistically significant per site dN/dS > 1. Altogether, we show how protein biophysics affects the dN/dS estimations and its subsequent interpretation. These results are important for improving the current approaches for detecting positive selection.
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4224349 | PMC |
http://dx.doi.org/10.1093/gbe/evu223 | DOI Listing |
Taxon Rep Int Lepid Surv
October 2023
Department of Biophysics, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75390-9050, USA.
Genomic sequencing of worldwide butterfly fauna followed by phylogenetic analysis of protein-coding genes informs butterfly classification throughout the taxonomic hierarchy, from families to species. As a rule, we attribute the same taxonomic rank to more prominent clades of comparable divergence (i.e.
View Article and Find Full Text PDFMycoKeys
January 2025
The Key Laboratory for Silviculture and Conservation of the Ministry of Education, Beijing Forestry University, Beijing 100083, China Beijing Forestry University Beijing China.
The species complex (FFSC) encompasses a diverse array of more than 80 phylogenetic species with both phytopathological and clinical importance. A stable taxonomy is crucial for species in the FFSC due to their economical relevance. Fungal strains used in this study were obtained from and , collected from Beijing and Shaanxi Province.
View Article and Find Full Text PDFEvolutionary sparse learning (ESL) uses a supervised machine learning approach, Least Absolute Shrinkage and Selection Operator (LASSO), to build models explaining the relationship between a hypothesis and the variation across genomic features (e.g., sites) in sequence alignments.
View Article and Find Full Text PDFTaxon Rep Int Lepid Surv
November 2024
Department of Biophysics, University of Texas Southwestern Medical Center, 5323 Harry Hines Blvd., Dallas, TX 75390-9050, USA.
Continuing with the genomic analysis of butterflies, we present a taxonomic update. As a result of this work, 3 genera, 6 subgenera, 16 species, and 2 subspecies are described as new. New genera and subgenera are (type species in parenthesis): Grishin, ( W.
View Article and Find Full Text PDFISME Commun
January 2025
State Key Laboratory of Lake Science and Environment, Nanjing Institute of Geography and Limnology, Chinese Academy of Sciences, Nanjing 210008, China.
Antarctic snow harbors diverse microorganisms, including pigmented algae and bacteria, which create colored snow patches and influence global climate and biogeochemical cycles. However, the genomic diversity and metabolic potential of colored snow remain poorly understood. We conducted a genome-resolved study of microbiomes in colored snow from 13 patches (7 green and 6 red) on the Fildes Peninsula, Antarctica.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!