Inferring functional constraints and divergence in protein families using 3D mapping of phylogenetic information.

Nucleic Acids Res

Canadian Institute for Advanced Research, Program in Evolutionary Biology, Genome Atlantic, Department of Biochemistry and Molecular Biology, Dalhousie University, Halifax, NS B3H 4H7, Canada.

Published: January 2003

Comparative sequence analysis has been used to study specific questions about the structure and function of proteins for many years. Here we propose a knowledge-based framework in which the maximum likelihood rate of evolution is used to quantify the level of constraint on the identity of a site. We demonstrate that site-rate mapping on 3D structures using datasets of rhodopsin-like G-protein receptors and alpha- and beta-tubulins provides an excellent tool for pinpointing the functional features shared between orthologous and paralogous proteins. In addition, functional divergence within protein families can be inferred by examining the differences in the site rates, the differences in the chemical properties of the side chains or amino acid usage between aligned sites. Two novel analytical methods are introduced to characterize rate- independent functional divergence. These are tested using a dataset of two classes of HMG-CoA reductases for which only one class can perform both the forward and reverse reaction. We show that functionally divergent sites occur in a cluster of sites interacting with the catalytic residues and that this information should facilitate the design of experimental strategies to directly test functional properties of residues.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC140515PMC
http://dx.doi.org/10.1093/nar/gkg151DOI Listing

Publication Analysis

Top Keywords

divergence protein
8
protein families
8
functional divergence
8
inferring functional
4
functional constraints
4
constraints divergence
4
families mapping
4
mapping phylogenetic
4
phylogenetic comparative
4
comparative sequence
4

Similar Publications

In most Eukaryota, telomeres are protected by the CST complex, composed of CTC1, STN1 and TEN1. In Drosophila, instead, another complex is present, composed of Modigliani, Tea and Verrocchio. We performed a search for STN1 orthologs in Arthropoda, in order to verify if Verrocchio can be considered as such.

View Article and Find Full Text PDF

Intercellular mRNA transfer alters the human pluripotent stem cell state.

Proc Natl Acad Sci U S A

January 2025

Human Biology Research Unit, Institute of Integrated Research, Institute of Science Tokyo, Bunkyo-ku, Tokyo 113-8510, Japan.

Intercellular transmission of messenger RNA (mRNA) is being explored in mammalian species using immortal cell lines. Here, we uncover an intercellular mRNA transfer phenomenon that allows for the adaptation and reprogramming of human primed pluripotent stem cells (hPSCs). This process is induced by the direct cell contact-mediated coculture with mouse embryonic stem cells under the condition impermissible for primed hPSC culture.

View Article and Find Full Text PDF

Advancements in high-throughput sequencing and associated bioinformatics methods have significantly expanded the RNA virus repertoire, including novel viruses with highly divergent genomes encoding "orphan" proteins that apparently lack homologous sequences. This absence of homologs in routine sequence similarity search complicates their taxonomic classification and raises a fundamental question: Do these orphan viral genomes represent viruses? In 2022, an orphan viral genome encoding a large polyprotein was identified in alfalfa () and thrips (), and named Snake River alfalfa virus (SRAV). SRAV was initially proposed as an uncommon flavi-like virus identified in a plant host distantly related to family .

View Article and Find Full Text PDF

GOPhage: protein function annotation for bacteriophages by integrating the genomic context.

Brief Bioinform

November 2024

Department of Electrical Engineering, City University of Hong Kong, 83 Tat Chee Ave, Kowloon Tong, Hong Kong (SAR), China.

Bacteriophages are viruses that target bacteria, playing a crucial role in microbial ecology. Phage proteins are important in understanding phage biology, such as virus infection, replication, and evolution. Although a large number of new phages have been identified via metagenomic sequencing, many of them have limited protein function annotation.

View Article and Find Full Text PDF

Deciphering Plant NLR Genomic Evolution: Synteny-Informed Classification Unveils Insights into TNL Gene Loss.

Mol Biol Evol

January 2025

State Key Laboratory for Crop Stress Resistance and High-Efficiency Production/Shaanxi Key Laboratory of Apple, College of Horticulture, Northwest A&F University, Yangling 712100, China.

Nucleotide-binding leucine-rich repeat receptor (NLR) genes encode a pivotal class of plant immune receptors. However, their rampant duplication and loss have made inferring their genomic evolutionary trajectory difficult, exemplified by the loss of TNL family genes in monocots. In this study, we introduce a novel classification system for angiosperm NLR genes, grounded in network analysis of micro-synteny information.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!