conSSert: Consensus SVM Model for Accurate Prediction of Ordered Secondary Structure.

J Chem Inf Model

Department of Chemical and Biological Engineering, Princeton University, Princeton, New Jersey 08544, United States.

Published: March 2016

Accurate prediction of protein secondary structure remains a crucial step in most approaches to the protein-folding problem, yet the prediction of ordered secondary structure, specifically beta-strands, remains a challenge. We developed a consensus secondary structure prediction method, conSSert, which is based on support vector machines (SVM) and provides exceptional accuracy for the prediction of beta-strands with QE accuracy of over 0.82 and a Q2-EH of 0.86. conSSert uses as input probabilities for the three types of secondary structure (helix, strand, and coil) that are predicted by four top performing methods: PSSpred, PSIPRED, SPINE-X, and RAPTOR. conSSert was trained/tested using 4261 protein chains from PDBSelect25, and 8632 chains from PISCES. Further validation was performed using targets from CASP9, CASP10, and CASP11. Our data suggest that poor performance in strand prediction is likely a result of training bias and not solely due to the nonlocal nature of beta-sheet contacts. conSSert is freely available for noncommercial use as a webservice: http://ares.tamu.edu/conSSert/.

Download full-text PDF

Source
http://dx.doi.org/10.1021/acs.jcim.5b00566DOI Listing

Publication Analysis

Top Keywords

secondary structure
20
accurate prediction
8
prediction ordered
8
ordered secondary
8
prediction
6
conssert
5
secondary
5
structure
5
conssert consensus
4
consensus svm
4

Similar Publications

tRNA gene content, structure, and organization in the flowering plant lineage.

Front Plant Sci

December 2024

National Institute of Molecular Biology and Biotechnology, College of Science, University of the Philippines Diliman, Quezon City, Philippines.

Transfer RNAs (tRNAs) are noncoding RNAs involved in protein biosynthesis and have noncanonical roles in cellular metabolism, such as RNA silencing and the generation of transposable elements. Extensive tRNA gene duplications, modifications to mature tRNAs, and complex secondary and tertiary structures impede tRNA sequencing. As such, a comparative genomic analysis of complete tRNA sets is an alternative to understanding the evolutionary processes that gave rise to the extant tRNA sets.

View Article and Find Full Text PDF

Unlabelled: Structural RNAs exhibit a vast array of recurrent short 3D elements involving non-Watson-Crick interactions that help arrange canonical double helices into tertiary structures. We present CaCoFold-R3D, a probabilistic grammar that predicts these RNA 3D motifs (also termed modules) jointly with RNA secondary structure over a sequence or alignment. CaCoFold-R3D uses evolutionary information present in an RNA alignment to reliably identify canonical helices (including pseudoknots) by covariation.

View Article and Find Full Text PDF

The 1.7 kb DRAIC long noncoding RNA inhibits tumor growth, inhibits cancer cell invasion, migration, colony formation and interacts with IKK (IκB kinase) subunits, inhibiting the phosphorylation and degradation of the NF-κB inhibitor, IκB, to suppress the activation of NF-κB. Whether these activities are all linked is unclear.

View Article and Find Full Text PDF

Manganese (Mn)-sensing riboswitches protect bacteria from Mn toxicity by upregulating expression of Mn exporters. The Mn aptamers share key features but diverge in other important elements, including within the metal-binding core. Although X-ray crystal structures of isolated aptamers exist, these structural snapshots lack crucial details about how the aptamer communicates the presence or absence of ligand to the expression platform.

View Article and Find Full Text PDF

Biological data visualization is challenged by the growing complexity of datasets. Traditional single-data plots or simple juxtapositions often fail to fully capture dataset intricacies and interrelations. To address this, we introduce "cross-layout," a novel visualization paradigm that integrates multiple plot types in a cross-like structure, with a central main plot surrounded by secondary plots for enhanced contextualization and interrelation insights.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!