FastRNABindR: Fast and Accurate Prediction of Protein-RNA Interface Residues.

PLoS One

College of Information Sciences and Technology, Pennsylvania State University, University Park, PA, United States of America.

Published: August 2017

A wide range of biological processes, including regulation of gene expression, protein synthesis, and replication and assembly of many viruses are mediated by RNA-protein interactions. However, experimental determination of the structures of protein-RNA complexes is expensive and technically challenging. Hence, a number of computational tools have been developed for predicting protein-RNA interfaces. Some of the state-of-the-art protein-RNA interface predictors rely on position-specific scoring matrix (PSSM)-based encoding of the protein sequences. The computational efforts needed for generating PSSMs severely limits the practical utility of protein-RNA interface prediction servers. In this work, we experiment with two approaches, random sampling and sequence similarity reduction, for extracting a representative reference database of protein sequences from more than 50 million protein sequences in UniRef100. Our results suggest that random sampled databases produce better PSSM profiles (in terms of the number of hits used to generate the profile and the distance of the generated profile to the corresponding profile generated using the entire UniRef100 data as well as the accuracy of the machine learning classifier trained using these profiles). Based on our results, we developed FastRNABindR, an improved version of RNABindR for predicting protein-RNA interface residues using PSSM profiles generated using 1% of the UniRef100 sequences sampled uniformly at random. To the best of our knowledge, FastRNABindR is the only protein-RNA interface residue prediction online server that requires generation of PSSM profiles for query sequences and accepts hundreds of protein sequences per submission. Our approach for determining the optimal BLAST database for a protein-RNA interface residue classification task has the potential of substantially speeding up, and hence increasing the practical utility of, other amino acid sequence based predictors of protein-protein and protein-DNA interfaces.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4934694PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0158445PLOS

Publication Analysis

Top Keywords

protein-rna interface
24
protein sequences
16
pssm profiles
12
protein-rna
8
interface residues
8
predicting protein-rna
8
practical utility
8
interface residue
8
interface
6
sequences
6

Similar Publications

Dimethyl sulfoxide (DMSO) is a widely used solvent in drug research. However, recent studies indicate that even at low concentration DMSO might cause structural changes of proteins and RNA. The pyrazolopyrimidine antiviral OBR-5-340 dissolved in DMSO inhibits rhinovirus-B5 infection yet is inactive against RV-A89.

View Article and Find Full Text PDF

Protein-RNA interactions play a critical role in many cellular processes and pathologies. However, experimental determination of protein-RNA structures is still challenging, therefore computational tools are needed for the prediction of protein-RNA interfaces. Although evolutionary pressures can be exploited for structural prediction of protein-protein interfaces, and recent deep learning methods using protein multiple sequence alignments have radically improved the performance of protein-protein interface structural prediction, protein-RNA structural prediction is lagging behind, due to the scarcity of structural data and the flexibility involved in these complexes.

View Article and Find Full Text PDF

Investigating the binding between proteins and aptamers, such as peptides or RNA molecules, is of crucial importance both for understanding the molecular mechanisms that regulate cellular activities and for therapeutic applications in several pathologies. Here, a new computational procedure, employing mainly docking, clustering analysis, and molecular dynamics simulations, was designed to estimate the binding affinities between a protein and some RNA aptamers, through the investigation of the dynamical behavior of the predicted molecular complex. Using the state-of-the-art software catRAPID, we computationally designed a set of RNA aptamers interacting with the TAR DNA-binding protein 43 (TDP-43), a protein associated with several neurodegenerative diseases, including amyotrophic lateral sclerosis (ALS).

View Article and Find Full Text PDF

RNA-binding proteins (RBPs) are key regulators of gene expression. Here, we introduce EuPRI (Eukaryotic Protein-RNA Interactions) - a freely available resource of RNA motifs for 34,736 RBPs from 690 eukaryotes. EuPRI includes binding data for 504 RBPs, including newly collected RNAcompete data for 174 RBPs, along with thousands of reconstructed motifs.

View Article and Find Full Text PDF

Investigation of RNA-binding protein NOVA1 in silico: Comparison of the modern human V197 with the archaic I197 variant present in Neanderthals.

Comput Biol Med

December 2024

Epigenetics in Human Health and Disease Program, Baker Heart and Diabetes Institute, 75 Commercial Road, Prahran, VIC, 3004, Australia; yΘμ Study Group, ProspED Polytechnic, Carlton, VIC, 3053, Australia; Baker Department of Cardiometabolic Health, The University of Melbourne, Parkville, VIC 3010, Australia; Department of Clinical Pathology, The University of Melbourne, Parkville, VIC, 3010, Australia. Electronic address:

Article Synopsis
  • Researchers have identified specific genetic differences between archaic and modern human genomes, notably a switch in the NOVA1 gene linked to RNA regulation in neurons, changing an amino acid from isoleucine to valine.
  • Using advanced modeling techniques, they compared the structures and binding interactions of the archaic and modern NOVA1 variants, finding similar binding free energies for protein-RNA complexes.
  • The study suggests that although structural changes are modest, more research is needed to understand how mutations in NOVA1 affect alternative splicing and contribute to diseases, especially considering the roles of multiple mutations.
View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!