Bioinformatics
December 2024
Motivation: Beta turns are the most common type of secondary structure in proteins after alpha helices and beta sheets and play many key structural and functional roles. Turn backbone (BB) geometry has been classified at multiple levels of precision, but the current picture of side chain (SC) structure and interaction in turns is incomplete, because the distribution of SC conformations associated with each sequence motif has commonly been represented only by a static image of a single, typical structure for each turn BB geometry, and only motifs which specify a single amino acid (e.g.
View Article and Find Full Text PDFBeta turns, in which the protein backbone abruptly changes direction over four amino acid residues, are the most common type of protein secondary structure after alpha helices and beta sheets and play key structural and functional roles. Previous work has produced classification systems for turn geometry at multiple levels of precision, but these operate in backbone dihedral-angle (Ramachandran) space, and the absence of a local Euclidean-space coordinate system and structural alignment for turns, or of any systematic Euclidean-space characterization of turn backbone shape, presents challenges for the visualization, comparison and analysis of the wide range of turn conformations and the design of turns and the structures that incorporate them. This work derives a turn-local coordinate system that implicitly aligns turns, together with a set of geometric descriptors that characterize the bulk BB shapes of turns and describe modes of structural variation not explicitly captured by existing systems.
View Article and Find Full Text PDFBackground: Interactions that involve one or more amino acid side chains near the ends of protein helices stabilize helix termini and shape the geometry of the adjacent loops, making a substantial contribution to overall protein structure. Previous work has identified key helix-terminal motifs, such as Asx/ST N-caps, the capping box, and hydrophobic and electrostatic interactions, but important questions remain, including: 1) What loop backbone geometries are favoured by each motif? 2) To what extent are multi-amino acid motifs likely to represent genuine cooperative interactions? 3) Can new motifs be identified in a large, recent dataset using the latest bioinformatics tools?
Results: Three analytical tools are applied here to answer these questions. First, helix-terminal structures are partitioned by loop backbone geometry using a new 3D clustering algorithm.
Motivation: The extraction of the set of features most relevant to function from classified biological sequence sets is still a challenging problem. A central issue is the determination of expected counts for higher order features so that artifact features may be screened.
Results: Cascade detection (CD), a new algorithm for the extraction of localized features from sequence sets, is introduced.