Nucleotide variants can cause functional changes by altering protein-RNA binding in various ways that are not easy to predict. This can affect processes such as splicing, nuclear shuttling, and stability of the transcript. Therefore, correct modeling of protein-RNA binding is critical when predicting the effects of sequence variations. Many RNA-binding proteins recognize a diverse set of motifs and binding is typically also dependent on the genomic context, making this task particularly challenging. Here, we present DeepCLIP, the first method for context-aware modeling and predicting protein binding to RNA nucleic acids using exclusively sequence data as input. We show that DeepCLIP outperforms existing methods for modeling RNA-protein binding. Importantly, we demonstrate that DeepCLIP predictions correlate with the functional outcomes of nucleotide variants in independent wet lab experiments. Furthermore, we show how DeepCLIP binding profiles can be used in the design of therapeutically relevant antisense oligonucleotides, and to uncover possible position-dependent regulation in a tissue-specific manner. DeepCLIP is freely available as a stand-alone application and as a webtool at http://deepclip.compbio.sdu.dk.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7367176PMC
http://dx.doi.org/10.1093/nar/gkaa530DOI Listing

Publication Analysis

Top Keywords

protein-rna binding
12
nucleotide variants
8
binding
7
deepclip
6
deepclip predicting
4
predicting mutations
4
mutations protein-rna
4
binding deep
4
deep learning
4
learning nucleotide
4

Similar Publications

Small proteins (≤100 amino acids) play important roles across all life forms, ranging from unicellular bacteria to higher organisms. In this study, we have developed SProtFP which is a machine learning-based method for functional annotation of prokaryotic small proteins into selected functional categories. SProtFP uses independent artificial neural networks (ANNs) trained using a combination of physicochemical descriptors for classifying small proteins into antitoxin type 2, bacteriocin, DNA-binding, metal-binding, ribosomal protein, RNA-binding, type 1 toxin and type 2 toxin proteins.

View Article and Find Full Text PDF

Insulin-like growth factor II mRNA-binding proteins (IGF2BPs), a family of RNA-binding proteins, are pivotal in regulating RNA dynamics, encompassing processes such as localization, metabolism, stability, and translation through the formation of ribonucleoprotein complexes. First identified in 1999 for their affinity to insulin-like growth factor II mRNA, IGF2BPs have been implicated in promoting tumor malignancy behaviors, including proliferation, metastasis, and the maintenance of stemness, which are associated with unfavorable outcomes in various cancers. Additionally, non-coding RNAs (ncRNAs), particularly long non-coding RNAs, circular RNAs, and microRNAs, play critical roles in cancer progression through intricate protein-RNA interactions.

View Article and Find Full Text PDF

Rationalizing the effects of RNA modifications on protein interactions.

Mol Ther Nucleic Acids

December 2024

Centre for Human Technologies (CHT), RNA System Biology Lab, Istituto Italiano di Tecnologia (IIT), Via Enrico Melen, 83, 16152 Genova, Italy.

RNA modifications play a crucial role in regulating gene expression by altering RNA structure and modulating interactions with RNA-binding proteins (RBPs). In this study, we explore the impact of specific RNA chemical modifications-N-methyladenosine (m⁶A), A-to-I editing, and pseudouridine (Ψ)-on RNA secondary structure and protein-RNA interactions. Utilizing genome-wide data, including RNA secondary structure predictions and protein-RNA interaction datasets, we classify proteins into distinct categories based on their binding behaviors: modification specific and structure independent, or modification unspecific and structure dependent.

View Article and Find Full Text PDF

FAST MS: Software for the Automated Analysis of Top-Down Mass Spectra of Polymeric Molecules Including RNA, DNA, and Proteins.

J Am Soc Mass Spectrom

December 2024

Institute of Organic Chemistry and Center for Molecular Biosciences Innsbruck (CMBI), University of Innsbruck, 6020 Innsbruck, Austria.

Top-down mass spectrometry (MS) enables comprehensive characterization of modified proteins and nucleic acids and, when native electrospray ionization (ESI) is used, binding site mapping of their complexes with native or therapeutic ligands. However, the high complexity of top-down MS spectra poses a serious challenge to both manual and automated data interpretation, even when the protein, RNA, or DNA sequence and the type of modification or the ligand are known. Here, we introduce FAST MS, a user-friendly software that identifies, assigns and relatively quantifies signals of molecular and fragment ions in MS and MS/MS spectra of biopolymers with known sequence and provides a toolbox for statistical analysis.

View Article and Find Full Text PDF

More than 4,000 single nucleotide polymorphisms (SNP) variants have been identified in the human gene, however only a few have been studied in the context of protein function. The tandem zinc finger domain of ZFP36L2, an RNA binding protein, is the functional domain that binds to its target mRNAs. This protein/RNA interaction triggers mRNA degradation, controlling gene expression.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!