RNA-protein interactions (RPIs) have critical roles in numerous fundamental biological processes, such as post-transcriptional gene regulation, viral assembly, cellular defence and protein synthesis. As the amount of available experimental RNA-protein binding data has increased rapidly owing to high-throughput sequencing methods, it is now possible to measure and understand RNA-protein interactions by computational methods. In this study, we integrate a sequence-based derived kernel with regularized least squares to perform prediction. The derived kernel exploits the contextual information around an amino acid or a nucleic acid as well as the repetitive conserved motif information. We propose a novel machine learning method, called RPiRLS, to predict the interaction between any RNA and protein of known sequences. For the RPiRLS classifier, each protein sequence is represented over the full alphabet of 20 amino acids, whereas for the RPiRLS-7G classifier, each protein sequence is represented using a 7-letter reduced alphabet based on the physicochemical properties of the amino acids. We evaluated both methods on a number of benchmark data sets and compared their performance with two recently developed state-of-the-art methods, RPI-Pred and IPMiner. On the non-redundant benchmark test sets extracted from the PRIDB, the RPiRLS method outperformed RPI-Pred and IPMiner in terms of accuracy, specificity and sensitivity. Further, RPiRLS achieved an accuracy of 92% on the prediction of lncRNA-protein interactions. The proposed method can also be extended to construct RNA-protein interaction networks. The RPiRLS web server is freely available at http://bmc.med.stu.edu.cn/RPiRLS.
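The general recipe described above (a sequence kernel combined with regularized least squares, optionally over a 7-letter reduced protein alphabet) can be illustrated with a minimal sketch. This is not the RPiRLS implementation: the abstract does not specify the derived kernel, so a normalized k-mer spectrum kernel is swapped in as a stand-in; the 7-group amino acid partition is an assumed conjoint-triad-style grouping; and all function names, the product pair kernel, and the parameters `k_rna`, `k_prot`, and `lam` are hypothetical.

```python
from collections import Counter

# ASSUMPTION: a conjoint-triad-style 7-group partition by physicochemical class.
# The abstract does not state which grouping RPiRLS-7G uses.
GROUPS = ["AGV", "ILFP", "YMTS", "HNQW", "RK", "DE", "C"]
REDUCE = {aa: str(i) for i, group in enumerate(GROUPS) for aa in group}


def reduce_protein(seq):
    """Map a protein sequence onto the 7-letter reduced alphabet."""
    return "".join(REDUCE[aa] for aa in seq.upper() if aa in REDUCE)


def kmer_counts(seq, k):
    """Count every overlapping k-mer in a sequence."""
    return Counter(seq[i:i + k] for i in range(len(seq) - k + 1))


def spectrum_kernel(a, b, k):
    """Normalized k-mer spectrum kernel (a stand-in, NOT the paper's derived kernel)."""
    ca, cb = kmer_counts(a, k), kmer_counts(b, k)
    dot = sum(v * cb[key] for key, v in ca.items())
    na = sum(v * v for v in ca.values()) ** 0.5
    nb = sum(v * v for v in cb.values()) ** 0.5
    return dot / (na * nb) if na and nb else 0.0


def pair_kernel(p, q, k_rna=2, k_prot=2):
    """Kernel on (rna, protein) pairs: product of the RNA and protein kernels."""
    rna = spectrum_kernel(p[0], q[0], k_rna)
    prot = spectrum_kernel(reduce_protein(p[1]), reduce_protein(q[1]), k_prot)
    return rna * prot


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        piv = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[piv] = M[piv], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def rls_fit(pairs, labels, lam=0.1):
    """Kernel regularized least squares: solve (K + lam * I) c = y."""
    n = len(pairs)
    K = [[pair_kernel(pairs[i], pairs[j]) + (lam if i == j else 0.0)
          for j in range(n)] for i in range(n)]
    return solve(K, labels)


def rls_predict(pairs, coef, query):
    """Score a query pair; a positive score suggests an interaction."""
    return sum(c * pair_kernel(p, query) for c, p in zip(coef, pairs))


# Toy data (invented for illustration): +1 = interacting, -1 = non-interacting.
train = [("AAAA", "KKKK"), ("CCCC", "DDDD")]
coef = rls_fit(train, [1.0, -1.0])
score = rls_predict(train, coef, ("AAAA", "RRRR"))
```

In this toy setup the query protein `RRRR` shares no residues with the positive training protein `KKKK`, yet both reduce to the same 7-letter string, so the query inherits the positive score. That is the kind of generalization a reduced alphabet is meant to provide, at the cost of discarding residue-level detail.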


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6017498
DOI: http://dx.doi.org/10.3390/molecules23030540


