BioVDB: biological vector database for high-throughput gene expression meta-analysis.

Front Artif Intell

Genes and Human Disease Research Program, Oklahoma Medical Research Foundation, Oklahoma City, OK, United States.

Published: March 2024

High-throughput sequencing has created an exponential increase in the amount of gene expression data, much of which is freely, publicly available in repositories such as NCBI's Gene Expression Omnibus (GEO). Querying this data for patterns such as similarity and distance, however, becomes increasingly challenging as the total amount of data increases. Furthermore, vectorization of the data is commonly required in Artificial Intelligence and Machine Learning (AI/ML) approaches. We present BioVDB, a vector database for storage and analysis of gene expression data, which enhances the potential for integrating biological studies with AI/ML tools. We used a previously developed approach called Automatic Label Extraction (ALE) to extract sample labels from metadata, including age, sex, and tissue/cell-line. BioVDB stores 438,562 samples from eight microarray GEO platforms. We show that it allows for efficient querying of data using similarity search, which can also be useful for identifying and inferring missing labels of samples, and for rapid similarity analysis.
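The similarity search and label inference described in the abstract can be illustrated with a minimal sketch. It is not BioVDB's implementation or API: the function names (`cosine_similarity`, `top_k_similar`, `infer_label`), the use of cosine similarity, the majority vote over neighbours, and the toy four-gene expression vectors are all assumptions chosen for illustration. A real vector database would use an index rather than the brute-force scan shown here.

```python
import math
from collections import Counter


def cosine_similarity(a, b):
    """Cosine similarity between two equal-length expression vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def top_k_similar(query, samples, k=3):
    """Return the k (sample_id, score) pairs most similar to the query.

    Brute-force scan over every stored vector (hypothetical helper).
    """
    scored = [(sid, cosine_similarity(query, vec)) for sid, vec in samples.items()]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:k]


def infer_label(query, samples, labels, k=3):
    """Guess a missing label by majority vote among the k nearest samples."""
    neighbours = top_k_similar(query, samples, k)
    votes = Counter(labels[sid] for sid, _ in neighbours if sid in labels)
    if not votes:
        return None
    return votes.most_common(1)[0][0]


# Toy data: invented expression values for four genes per sample.
samples = {
    "s1": [9.1, 2.0, 0.5, 7.8],
    "s2": [8.7, 2.3, 0.4, 8.1],
    "s3": [1.2, 9.5, 6.3, 0.9],
    "s4": [0.8, 9.9, 5.8, 1.1],
    "s5": [8.9, 1.8, 0.7, 7.5],
}
labels = {"s1": "liver", "s2": "liver", "s3": "brain", "s4": "brain", "s5": "liver"}

# A sample whose tissue label is missing from its metadata.
query = [9.0, 2.1, 0.6, 7.9]
print(infer_label(query, samples, labels, k=3))
```

With these toy vectors the three nearest neighbours of `query` all carry the same tissue label, so the vote is unanimous. On real data the choice of `k` and of the distance metric would need validation against samples with known labels.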


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10957786
DOI: http://dx.doi.org/10.3389/frai.2024.1366273

Similar Publications

Therapeutic Effects of GDF6-Overexpressing Mesenchymal Stem Cells through Upregulation of the GDF15/SIRT1 Axis in Age-Related Hearing Loss.

Front Biosci (Landmark Ed)

January 2025

Department of Otolaryngology, Head and Neck Surgery, The Second Affiliated Hospital, Jiangxi Medical College, Nanchang University, 330006 Nanchang, Jiangxi, China.

Background: Mesenchymal stem cells (MSCs) have been reported to have therapeutic effects on hearing loss. This study explored the therapeutic effects of MSCs overexpressing growth differentiation factor 6 (GDF6) (MSCs-GDF6) on age-related hearing loss (ARHL) and the underlying mechanisms.

Methods: Reverse transcription-quantitative PCR and western blotting were used to evaluate gene expression.


Background: This study investigates the role of small ubiquitin-like modifier (SUMO)-specific peptidase 5 (SENP5), a key regulator of SUMOylation, in esophageal squamous cell carcinoma (ESCC), a lethal disease, and its underlying molecular mechanisms.

Methods: Differentially expressed genes between mouse ESCC tissues and normal tissues were analysed via RNA-seq; SENP5 was among the upregulated genes and was selected for further analysis. Immunohistochemistry and western blotting were then used to validate the increased protein level of SENP5 in both mouse and human ESCC samples.


Background: The inheritance of the short allele, encoding the serotonin transporter (SERT) in humans, increases susceptibility to neuropsychiatric and metabolic disorders, with aging and female sex further exacerbating these conditions. Both central and peripheral mechanisms of the compromised serotonin (5-HT) system play crucial roles in this context. Previous studies on SERT-deficient (Sert) mice, which model human SERT deficiency, have demonstrated emotional and metabolic disturbances, exacerbated by exposure to a high-fat Western diet (WD).


The Role of NF-κB/MIR155HG in Regulating the Stemness and Radioresistance in Breast Cancer Stem Cells.

Front Biosci (Landmark Ed)

January 2025

Department of Chemoradiotherapy, Ningbo No 2 Hospital, 315000 Ningbo, Zhejiang, China.

Background: Breast cancer stem cells (BCSCs) are instrumental in treatment resistance, recurrence, and metastasis. Long non-coding RNA (lncRNA) is closely linked to the development of breast cancer and to radiation sensitivity. This work investigates how lncRNA affects the stemness and radioresistance of BCSCs.


Context: The decline in ovarian reserve is a major concern in female reproductive health, often associated with oxidative stress and mitochondrial dysfunction. Although ginsenoside Rg1 is known to modulate mitophagy, its effectiveness in mitigating ovarian reserve decline remains unclear.

Objective: To investigate the role of ginsenoside Rg1 in promoting mitophagy to preserve ovarian reserve.

