RNA-binding proteins (RBPs) control the regulation of gene expression in eukaryotic genomes at post-transcriptional level by binding to their cognate RNAs. Although several variants of CLIP (crosslinking and immunoprecipitation) protocols are currently available to study the global protein-RNA interaction landscape at single-nucleotide resolution in a cell, currently there are very few tools that can facilitate understanding and dissecting the functional associations of RBPs from the resulting binding maps. Here, we present Seten, a web-based and command line tool, which can identify and compare processes, phenotypes, and diseases associated with RBPs from condition-specific CLIP-seq profiles. Seten uses BED files resulting from most peak calling algorithms, which include scores reflecting the extent of binding of an RBP on the target transcript, to provide both traditional functional enrichment as well as gene set enrichment results for a number of gene set collections including BioCarta, KEGG, Reactome, Gene Ontology (GO), Human Phenotype Ontology (HPO), and MalaCards Disease Ontology for several organisms including fruit fly, human, mouse, rat, worm, and yeast. It also provides an option to dynamically compare the associated gene sets across data sets as bubble charts, to facilitate comparative analysis. Benchmarking of Seten using eCLIP data for IGF2BP1, SRSF7, and PTBP1 against their corresponding CRISPR RNA-seq in K562 cells as well as randomized negative controls, demonstrated that its gene set enrichment method outperforms functional enrichment, with scores significantly contributing to the discovery of true annotations. Comparative performance analysis using these CRISPR control data sets revealed significantly higher precision and comparable recall to that observed using ChIP-Enrich. Seten's web interface currently provides precomputed results for about 200 CLIP-seq data sets and both command line as well as web interfaces can be used to analyze CLIP-seq data sets. We highlight several examples to show the utility of Seten for rapid profiling of various CLIP-seq data sets. Seten is available on http://www.iupui.edu/∼sysbio/seten/.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5435856PMC
http://dx.doi.org/10.1261/rna.059089.116DOI Listing

Publication Analysis

Top Keywords

data sets
20
gene set
12
clip-seq data
12
processes phenotypes
8
phenotypes diseases
8
diseases associated
8
rna-binding proteins
8
condition-specific clip-seq
8
clip-seq profiles
8
functional enrichment
8

Similar Publications

Fully Synthetic Data for Complex Surveys.

Surv Methodol

December 2024

Department of Statistical Science, 214a Old Chemistry Building, Duke University, Durham, NC 27708-0251.

When seeking to release public use files for confidential data, statistical agencies can generate fully synthetic data. We propose an approach for making fully synthetic data from surveys collected with complex sampling designs. Our approach adheres to the general strategy proposed by Rubin (1993).

View Article and Find Full Text PDF

Blood carries some of the most valuable biomarkers for disease screening as it interacts with various tissues and organs in the body. Human blood serum is a reservoir of high molecular weight fraction (HMWF) and low molecular weight fraction (LMWF) proteins. The LMWF proteins are considered disease marker proteins and are often suppressed by HMWF proteins during analysis.

View Article and Find Full Text PDF

Accurate drug-target binding affinity (DTA) prediction is crucial in drug discovery. Recently, deep learning methods for DTA prediction have made significant progress. However, there are still two challenges: (1) recent models always ignore the correlations in drug and target data in the drug/target representation process and (2) the interaction learning of drug-target pairs always is by simple concatenation, which is insufficient to explore their fusion.

View Article and Find Full Text PDF

Here, we apply SuperResNET network analysis of dSTORM single-molecule localization microscopy (SMLM) to determine how the clathrin endocytosis inhibitors pitstop 2, dynasore and Latrunculin A alter the morphology of clathrin-coated pits. SuperResNET analysis of HeLa and Cos7 cells identifies: small oligomers (Class I); pits and vesicles (Class II); and larger clusters corresponding to fused pits or clathrin plaques (Class III). Pitstop 2 and dynasore induce distinct homogeneous populations of Class II structures in HeLa cells suggesting that they arrest endocytosis at different stages.

View Article and Find Full Text PDF

Background: Endogenous Alu RNAs form double-stranded RNAs recognized by double-stranded RNA sensors and activate IRF and NF-kB transcriptional paths and innate immunity. Deamination of adenosines to inosines by the ADAR family of enzymes, a process termed A-to-I editing, disrupts double-stranded RNA structure and prevents innate immune activation. Innate immune activation is observed in Alzheimer's disease, the most common form of dementia.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!