RNA-binding proteins (RBPs) may play a critical role in gene regulation in various diseases or biological processes by controlling post-transcriptional events such as polyadenylation, splicing and mRNA stabilization via binding activities to RNA molecules. Owing to the importance of RBPs in gene regulation, a great number of studies have been conducted, resulting in a large amount of RNA-Seq datasets. However, these datasets usually do not have structured organization of metadata, which limits their potentially wide use. To bridge this gap, the metadata of a comprehensive set of publicly available mouse RNA-Seq datasets with perturbed RBPs were collected and integrated into a database called RBPMetaDB. This database contains 292 mouse RNA-Seq datasets for a comprehensive list of 187 RBPs. These RBPs account for only ∼10% of all known RBPs annotated in Gene Ontology, indicating that most are still unexplored using high-throughput sequencing. This negative information provides a great pool of candidate RBPs for biologists to conduct future experimental studies. In addition, we found that DNA-binding activities are significantly enriched among RBPs in RBPMetaDB, suggesting that prior studies of these DNA- and RNA-binding factors focus more on DNA-binding activities instead of RNA-binding activities. This result reveals the opportunity to efficiently reuse these data for investigation of the roles of their RNA-binding activities. A web application has also been implemented to enable easy access and wide use of RBPMetaDB. It is expected that RBPMetaDB will be a great resource for improving understanding of the biological roles of RBPs.Database URL: http://rbpmetadb.yubiolab.org.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6009576PMC
http://dx.doi.org/10.1093/database/bay054DOI Listing

Publication Analysis

Top Keywords

rna-seq datasets
16
mouse rna-seq
12
rna-binding proteins
8
rbps
8
gene regulation
8
dna-binding activities
8
rna-binding activities
8
rbpmetadb
5
datasets
5
rna-binding
5

Similar Publications

Systematical identification of regulatory GPCRs by single-cell trajectory inference reveals the role of ADGRD1 and GPR39 in adipogenesis.

Sci China Life Sci

January 2025

Department of Physiology and Pathophysiology, School of Basic Medical Sciences, Peking University, Beijing, 100191, China.

Adipogenesis is the healthy expansion of white adipose tissue (WAT), serving as a compensatory response to maintain metabolic homeostasis in the presence of excess energy in the body. Therefore, the identification of novel regulatory molecules in adipogenesis, specifically membrane receptors such as G protein-coupled receptors (GPCRs), holds significant clinical promise. These receptors can serve as viable targets for pharmaceuticals, offering potential for restoring metabolic homeostasis in individuals with obesity.

View Article and Find Full Text PDF

The rapid advance of large-scale atlas-level single cell RNA sequences and single-cell chromatin accessibility data provide extraordinary avenues to broad and deep insight into complex biological mechanism. Leveraging the datasets and transfering labels from scRNA-seq to scATAC-seq will empower the exploration of single-cell omics data. However, the current label transfer methods have limited performance, largely due to the lower capable of preserving fine-grained cell populations and intrinsic or extrinsic heterogeneity between datasets.

View Article and Find Full Text PDF

scGO: interpretable deep neural network for cell status annotation and disease diagnosis.

Brief Bioinform

November 2024

School of Life Sciences and Biotechnology, Shanghai Jiao Tong University, No. 800 Dong Chuan Road, Shanghai 200240, China.

Machine learning has emerged as a transformative tool for elucidating cellular heterogeneity in single-cell RNA sequencing. However, a significant challenge lies in the "black box" nature of deep learning models, which obscures the decision-making process and limits interpretability in cell status annotation. In this study, we introduced scGO, a Gene Ontology (GO)-inspired deep learning framework designed to provide interpretable cell status annotation for scRNA-seq data.

View Article and Find Full Text PDF

Dysregulated microglia activation, leading to neuroinflammation, is crucial in neurodegenerative disease development and progression. We constructed an atlas of human brain immune cells by integrating nineteen single-nucleus RNA-seq and single-cell RNA-seq datasets from multiple neurodegenerative conditions, comprising 241 samples from patients with Alzheimer's disease, autism spectrum disorder, epilepsy, multiple sclerosis, Lewy body diseases, COVID-19, and healthy controls. The integrated Human Microglia Atlas (HuMicA) included 90,716 nuclei/cells and revealed nine populations distributed across all conditions.

View Article and Find Full Text PDF

Background: Regulatory T cells (Tregs) play a pivotal role in the development, prognosis, and treatment of breast cancer. This study aimed to develop a Treg-associated gene signature that contributes to predict prognosis and therapy benefits in breast cancer.

Methods: Treg-associated genes were screened based on single-cell RNA-sequencing (RNA-seq) in TISCH2 database and the bulk RNA-seq in The Cancer Genome Atlas (TCGA) database.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!