Structural Phylogenetics with Confidence.

Mol Biol Evol

Bioinformatics Institute, School of Biological Sciences, University of Auckland, Auckland, New Zealand.

Published: September 2020

For evaluating the deepest evolutionary relationships among proteins, sequence similarity is too low for application of sequence-based homology search or phylogenetic methods. In such cases, comparison of protein structures, which are often better conserved than sequences, may provide an alternative means of uncovering deep evolutionary signal. Although major protein structure databases such as SCOP and CATH hierarchically group protein structures, they do not describe the specific evolutionary relationships within a hierarchical level. Structural phylogenies have the potential to fill this gap. However, it is difficult to assess evolutionary relationships derived from structural phylogenies without some means of assessing confidence in such trees. We therefore address two shortcomings in the application of structural data to deep phylogeny. First, we examine whether phylogenies derived from pairwise structural comparisons are sensitive to differences in protein length and shape. We find that structural phylogenetics is best employed where structures have very similar lengths, and that shape fluctuations generated during molecular dynamics simulations impact pairwise comparisons, but not so drastically as to eliminate evolutionary signal. Second, we address the absence of statistical support for structural phylogeny. We present a method for assessing confidence in a structural phylogeny using shape fluctuations generated via molecular dynamics or Monte Carlo simulations of proteins. Our approach will aid the evolutionary reconstruction of relationships across structurally defined protein superfamilies. With the Protein Data Bank now containing in excess of 158,000 entries (December 2019), we predict that structural phylogenetics will become a useful tool for ordering the protein universe.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7475046PMC
http://dx.doi.org/10.1093/molbev/msaa100DOI Listing

Publication Analysis

Top Keywords

structural phylogenetics
12
evolutionary relationships
12
structural
9
protein structures
8
evolutionary signal
8
structural phylogenies
8
assessing confidence
8
shape fluctuations
8
fluctuations generated
8
generated molecular
8

Similar Publications

Glyceraldehyde-3-phosphate dehydrogenase (GAPDH) is an enzyme widely involved in glycolysis in animal cells and in non-metabolic processes, including apoptosis and the regulation of gene expression. GAPDH is a ubiquitous protein that plays a pivotal role in plant metabolism and handling of stress responses. However, its function in plant stress resistance remains unknown.

View Article and Find Full Text PDF

The Hypericaceae family, comprising nine genera and over seven hundred species, includes plants traditionally used for medicinal purposes. In this study, we performed high-throughput sequencing on three species: , , and , and conducted comparative genomic analyses with related species. The chloroplast genome sizes were 152,654 bp, 122,570 bp, and 137,652 bp, respectively, with an average GC content of 37.

View Article and Find Full Text PDF

Genome-Wide Analysis of the Hsf Gene Family in and Function in Thermotolerance.

Int J Mol Sci

December 2024

Institute of Forestry and Pomology, Beijing Academy of Agriculture and Forestry Sciences, Beijing 100093, China.

Heat shock transcription factors (Hsfs) play an important role in response to high temperatures by binding to the promoter of the heat shock protein gene to promote its expression. As an important ornamental plant, the rose often encounters heat stress during the flowering process. However, there are few studies on the family in roses ().

View Article and Find Full Text PDF

Identification of Kunitz-Type Inhibitor Gene Family of Reveals a Stress Tolerance Function in Inverted Cuttings.

Int J Mol Sci

December 2024

Key Laboratory for Forest Genetics and Tree Improvement and Propagation in University of Yunnan Province, Southwest Forestry University, Kunming 650224, China.

Plant protease inhibitors are a ubiquitous feature of plant species and exert a substantial influence on plant stress responses. However, the (Kunitz trypsin inhibitor) family responding to abiotic stress has not been fully characterized in . In this study, we conducted a genome-wide study of the family and analyzed their gene structure, gene duplication, conserved motifs, cis-acting elements, and response to stress treatment.

View Article and Find Full Text PDF

is a member of the Styracaceae family, which is well-known for its remarkable ornamental and medicinal properties. In this research, we conducted comparative analysis of the chloroplast genomes from four samples of representing . The results demonstrated that the chloroplast genome of four samples ranging from 157,103 bp to 158,357 bp exhibited a typical quadripartite structure, including one large single-copy (LSC) region (90,131 bp to 90,342 bp), one small single-copy (SSC) region (18,467 bp to 18,785 bp), and two inverted repeat regions (IRs) (24,115 bp to 24,261 bp).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!