Structure motifs are amino acid packing patterns that occur frequently within a set of protein structures. We define a labeled graph representation of protein structure in which vertices correspond to amino acid residues and edges connect pairs of residues and are labeled by (1) the Euclidian distance between the C(alpha) atoms of the two residues and (2) a boolean indicating whether the two residues are in physical/chemical contact. Using this representation, a structure motif corresponds to a labeled clique that occurs frequently among the graphs representing the protein structures. The pairwise distance constraints on each edge in a clique serve to limit the variation in geometry among different occurrences of a structure motif. We present an efficient constrained subgraph mining algorithm to discover structure motifs in this setting. Compared with contact graph representations, the number of spurious structure motifs is greatly reduced. Using this algorithm, structure motifs were located for several SCOP families including the Eukaryotic Serine Proteases, Nuclear Binding Domains, Papain-like Cysteine Proteases, and FAD/NAD-linked Reductases. For each family, we typically obtain a handful of motifs within seconds of processing time. The occurrences of these motifs throughout the PDB were strongly associated with the original SCOP family, as measured using a hyper-geometric distribution. The motifs were found to cover functionally important sites like the catalytic triad for Serine Proteases and co-factor binding sites for Nuclear Binding Domains. The fact that many motifs are highly family-specific can be used to classify new proteins or to provide functional annotation in Structural Genomics Projects.

Download full-text PDF

Source

Publication Analysis

Top Keywords

structure motifs
20
motifs
9
structure
8
subgraph mining
8
amino acid
8
protein structures
8
structure motif
8
serine proteases
8
nuclear binding
8
binding domains
8

Similar Publications

Polycyclic aromatic compounds (PACs) are pervasive environmental contaminants derived from diverse sources including pyrogenic (e.g., combustion processes), petrogenic (e.

View Article and Find Full Text PDF

The hybrid magnetic heterostructures and superlattices, composed of organic and inorganic materials, have shown great potential for quantum computing and next-generation information technology. Organic materials generally possess designable structural motifs and versatile optical, electronic, and magnetic properties, but are too delicate for robust integration into solid-state devices. In contrast, inorganic systems provide robust solid-state interface and excellent electronic properties but with limited customization space.

View Article and Find Full Text PDF

Saccharomyces cerevisiae, a model eukaryotic organism with a rich history in research and industry, has become a pivotal tool for studying Adenosine Deaminase Acting on RNA (ADAR) enzymes despite lacking these enzymes endogenously. This chapter reviews the diverse methodologies harnessed using yeast to elucidate ADAR structure and function, emphasizing its role in advancing our understanding of RNA editing. Initially, Saccharomyces cerevisiae was instrumental in the high-yield purification of ADARs, addressing challenges associated with enzyme stability and activity in other systems.

View Article and Find Full Text PDF

Kinetic and structural investigation of the 4-allyl syringol oxidase from Streptomyces cavernae.

Arch Biochem Biophys

January 2025

Microbial Biotechnology, Faculty of Biology and Biotechnology, Ruhr University Bochum, Universitaetsstrasse 150, 44780 Bochum, Germany.

4-Phenol oxidases are proposed to be involved in the utilization of lignin-derived aromatic compounds. While enzymes with selectivity towards 4-hydroxyphenyl and guaiacyl motifs are well described, we identified the first syringyl-specific oxidase from Streptomyces cavernae (Sc4ASO) only very recently. Here, in-depth studies were conducted to unravel the molecular origins of the outstanding selectivity of Sc4ASO.

View Article and Find Full Text PDF

The actin cytoskeleton is a dynamic mesh of filaments that provide structural support for cells and respond to external deformation forces. Active sensing of these forces is crucial for the function of the actin cytoskeleton, and some actin crosslinkers accomplish it. One such crosslinker is filamin, a highly conserved actin crosslinker dimeric protein with an elastic region capable of responding to mechanical changes in the actin cytoskeleton.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!