Graph identification of proteins in tomograms (GRIP-Tomo).

Protein Sci

Environmental Molecular Sciences Laboratory, Pacific Northwest National Laboratory, Richland, Washington, USA.

Published: January 2023

In this study, we present a method of pattern mining based on network theory that enables the identification of protein structures or complexes from synthetic volume densities, without the knowledge of predefined templates or human biases for refinement. We hypothesized that the topological connectivity of protein structures is invariant, and they are distinctive for the purpose of protein identification from distorted data presented in volume densities. Three-dimensional densities of a protein or a complex from simulated tomographic volumes were transformed into mathematical graphs as observables. We systematically introduced data distortion or defects such as missing fullness of data, the tumbling effect, and the missing wedge effect into the simulated volumes, and varied the distance cutoffs in pixels to capture the varying connectivity between the density cluster centroids in the presence of defects. A similarity score between the graphs from the simulated volumes and the graphs transformed from the physical protein structures in point data was calculated by comparing their network theory order parameters including node degrees, betweenness centrality, and graph densities. By capturing the essential topological features defining the heterogeneous morphologies of a network, we were able to accurately identify proteins and homo-multimeric complexes from 10 topologically distinctive samples without realistic noise added. Our approach empowers future developments of tomogram processing by providing pattern mining with interpretability, to enable the classification of single-domain protein native topologies as well as distinct single-domain proteins from multimeric complexes within noisy volumes.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9798246PMC
http://dx.doi.org/10.1002/pro.4538DOI Listing

Publication Analysis

Top Keywords

protein structures
12
pattern mining
8
network theory
8
volume densities
8
simulated volumes
8
protein
6
graph identification
4
identification proteins
4
proteins tomograms
4
tomograms grip-tomo
4

Similar Publications

Calprotectin's Protein Structure Shields Ni-N(His) Bonds from Competing Agents.

J Phys Chem Lett

January 2025

State Key Laboratory of Coordination Chemistry, Chemistry and Biomedicine Innovation Center (ChemBIC), School of Chemistry and Chemical Engineering, Nanjing University, Nanjing, 210023, China.

The Ni-N(His) coordination bond, formed between the nickel ion and histidine residues, is essential for recombinant protein purification, especially in Ni-NTA-based systems for selectively binding polyhistidine-tagged (Histag) proteins. While previous studies have explored its bond strength in a synthetic Ni-NTA-Histag system, the influence of the surrounding protein structure remains less understood. In this study, we used atomic force microscopy-based single-molecule force spectroscopy (AFM-SMFS) to quantify the Ni-N(His) bond strength in calprotectin, a biologically relevant protein system.

View Article and Find Full Text PDF

Enzymatic asymmetric synthesis of l-phenylglycine by amino acid dehydrogenases has potential for industrial applications; however, this is hindered by their low catalytic efficiency toward high-concentration substrates. We identified and characterized a novel leucine dehydrogenase (LeuDH) with a high catalytic efficiency for benzoylformic acid via directed metagenomic approaches. Further, we obtained a triple-point mutant LeuDH-EER (D332E/G333E/L334R) with improved stability and catalytic efficiency through the rational design of distal loop 13.

View Article and Find Full Text PDF

Due to the emergence of drug resistance, androgen receptor (AR)-targeted drugs still pose great challenges in the treatment of prostate cancer, and it is urgent to explore an innovative therapeutic strategy. MK-1775, a highly selective WEE1 inhibitor, is shown to have favorable therapeutic benefits in several solid tumor models. Recent evidence suggests that the combination of MK-1775 with DNA-damaging agents could lead to enhanced antitumor efficacy.

View Article and Find Full Text PDF

The role of canopy family proteins: biological mechanism and disease function.

Mol Biol Rep

January 2025

Department of Gastroenterology, Beijing Friendship Hospital, Capital Medical University, No. 95, Yong An Road, Xi Cheng District, Beijing, 100050, China.

Canopy family proteins are highly sequence-conserved proteins with an N-terminal hydrophobic signal sequence, a unique pattern of six cysteine residues characteristic of the saposin-like proteins, and a C-terminal putative endoplasmic reticulum retention signal sequence. At present, the known canopy family proteins are canopy fibroblast growth factor signaling regulator 1 (CNPY1), CNPY2, CNPY3, and CNPY4. Despite similar structures, canopy family proteins regulate complex signal networks to participate in various biological processes.

View Article and Find Full Text PDF

The Effective Fragment Potential (EFP) method, a polarizable quantum mechanics-based force field for describing non-covalent interactions, is utilized to calculate protein-ligand interactions in seven inactive cyclin-dependent kinase 2-ligand complexes, employing structural data from molecular dynamics simulations to assess dynamic and solvent effects. Our results reveal high correlations between experimental binding affinities and EFP interaction energies across all the structural data considered. Using representative structures found by clustering analysis and excluding water molecules yields the highest correlation (R2 of 0.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!