Gene expression knowledge graph for patient representation and diabetes prediction.

J Biomed Semantics

Data and Web Science Group, University of Mannheim, 68159, Mannheim, Germany.

Published: March 2025

Diabetes is a worldwide health issue affecting millions of people. Machine learning methods have shown promising results in improving diabetes prediction, particularly through the analysis of gene expression data. While gene expression data can provide valuable insights, challenges arise from the fact that the number of patients in expression datasets is usually limited, and the data from different datasets with different gene expressions cannot be easily combined. This work proposes a novel approach to address these challenges by integrating multiple gene expression datasets and domain-specific knowledge using knowledge graphs, a unique tool for biomedical data integration, and to learn uniform patient representations for subjects contained in different incompatible datasets. Different strategies and KG embedding methods are explored to generate vector representations, serving as inputs for a classifier. Extensive experiments demonstrate the efficacy of our approach, revealing weighted F1-score improvements in diabetes prediction up to 13% when integrating multiple gene expression datasets and domain-specific knowledge about protein functions and interactions.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11889825PMC
http://dx.doi.org/10.1186/s13326-025-00325-6DOI Listing

Publication Analysis

Top Keywords

gene expression
20
diabetes prediction
12
expression datasets
12
expression data
8
integrating multiple
8
multiple gene
8
datasets domain-specific
8
domain-specific knowledge
8
gene
6
expression
5

Similar Publications

Joubert syndrome (JS) is a rare neurodevelopmental disorder associated with mutations in genes involved in ciliary function. Germline variants in CPLANE1 have been implicated in JS. In this study, we investigated a family with three adverse pregnancies characterised by fetal malformations consistent with JS.

View Article and Find Full Text PDF

Zinc is an essential trace element for plant growth and development. Zinc transporters play an important role in regulating zinc homeostasis in plants. In this study, the potato cultivar 'Atlantic' was used as experimental material to analyze the expression characteristics of the StZIP2 gene in different potato tissues under zinc deficiency stress.

View Article and Find Full Text PDF

Multi-omics analysis of druggable genes to facilitate Alzheimer's disease therapy: A multi-cohort machine learning study.

J Prev Alzheimers Dis

March 2025

Department of Pathophysiology School of Basic Medicine Key Laboratory of Education Ministry/Hubei Province of China for Neurological Disorders Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China. Electronic address:

Background: The swift rise in the prevalence of Alzheimer's disease (AD) alongside its significant societal and economic impact has created a pressing demand for effective interventions and treatments. However, there are no available treatments that can modify the progression of the disease.

Methods: Eight AD brain tissues datasets and three blood datasets were obtained.

View Article and Find Full Text PDF

Advancing Recombinant Protein Expression in Komagataella phaffii: Opportunities and Challenges.

FEMS Yeast Res

March 2025

State Key Laboratory of Bioreactor Engineering, East China University of Science and Technology, 130 Meilong Road, Shanghai 200237, China.

Komagataella phaffii has gained recognition as a versatile platform for recombinant protein production, with applications covering biopharmaceuticals, industrial enzymes, food additives, etc. Its advantages include high-level protein expression, moderate post-translational modifications, high-density cultivation, and cost-effective methanol utilization. Nevertheless, it still faces challenges for the improvement of production efficiency and extension of applicability.

View Article and Find Full Text PDF

Integrating Genomic, Transcriptomic, and Phenotypic information to Explore Drug Resistance in Mycobacterium tuberculosis sub-lineage 4.2.2.2.

J Appl Microbiol

March 2025

Department of Pharmacology and Clinical Pharmacy, School of Pharmacy, College of Health Science, Addis Ababa University, P.O.Box 9086, Addis Ababa, Ethiopia.

Aims: Mycobacterium tuberculosis (Mtb) remains a major global health challenge, particularly due to increasing drug resistance. Beyond the well-characterized mutations, the mechanisms involved in driving resistance appear to be more complex. This study investigated the differential gene expression of Ethiopian drug-resistant Mtb sub-lineage 4.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!