Deep Clustering of Electronic Health Records Tabular Data for Clinical Interpretation.

IEEE Int Conf Telecommun Photonics

Department of Computer Science, Tennessee State University, Nashville, TN, United States.

Published: December 2023

Machine learning applications are widespread due to straightforward supervised learning of known data labels. Many data samples in real-world scenarios, including medicine, are unlabeled because data annotation can be time-consuming and error-prone. The application and evaluation of unsupervised clustering methods are not trivial and are limited to traditional methods (e.g., k-means) when clinicians demand deeper insights into patient data beyond classification accuracy. The contribution of this paper is three-fold: 1) to introduce a patient stratification strategy based on a clinical variable instead of a diagnostic label, 2) to evaluate clustering performance using within-cluster homogeneity and between-cluster statistical difference, and 3) to compare widely used traditional clustering algorithms (e.g., k-means) with a state-of-the-art deep learning solution for clustering tabular data. The deep clustering method achieves superior within-cluster homogeneity and between-cluster separation compared to k-means and identifies three statistically distinct and clinically interpretable high blood pressure patient clusters. The proposed clustering strategy and evaluation metrics will facilitate the stratification of large patient cohorts in health science research without requiring explicit diagnostic labels.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11255553PMC
http://dx.doi.org/10.1109/ictp60248.2023.10490723DOI Listing

Publication Analysis

Top Keywords

deep clustering
8
tabular data
8
within-cluster homogeneity
8
homogeneity between-cluster
8
data
6
clustering
6
clustering electronic
4
electronic health
4
health records
4
records tabular
4

Similar Publications

Transcriptional regulation of the piRNA pathway by Ovo in animal ovarian germ cells.

Genes Dev

December 2024

Cancer Research UK Cambridge Institute, Li Ka Shing Centre, University of Cambridge, Cambridge CB2 0RE, United Kingdom

The gene-regulatory mechanisms controlling the expression of the germline PIWI-interacting RNA (piRNA) pathway components within the gonads of metazoan species remain largely unexplored. In contrast to the male germline piRNA pathway, which in mice is known to be activated by the testis-specific transcription factor A-MYB, the nature of the ovary-specific gene-regulatory network driving the female germline piRNA pathway remains a mystery. Here, using as a model, we combined multiple genomics approaches to reveal the transcription factor Ovo as regulator of the germline piRNA pathway in ovarian germ cells.

View Article and Find Full Text PDF

Task Offloading with LLM-Enhanced Multi-Agent Reinforcement Learning in UAV-Assisted Edge Computing.

Sensors (Basel)

December 2024

School of Microelectronics and Communication Engineering, Chongqing University, Chongqing 400044, China.

Unmanned aerial vehicles (UAVs) furnished with computational servers enable user equipment (UE) to offload complex computational tasks, thereby addressing the limitations of edge computing in remote or resource-constrained environments. The application of value decomposition algorithms for UAV trajectory planning has drawn considerable research attention. However, existing value decomposition algorithms commonly encounter obstacles in effectively associating local observations with the global state of UAV clusters, which hinders their task-solving capabilities and gives rise to reduced task completion rates and prolonged convergence times.

View Article and Find Full Text PDF

Alzheimer's disease (AD) prevention is a critical challenge for aging societies, necessitating the exploration of food ingredients and whole foods as potential therapeutic agents. This study aimed to identify natural compounds (NCs) with therapeutic potential in AD using an innovative bioinformatics-integrated deep neural analysis approach, combining computational predictions with molecular docking and in vitro experiments for comprehensive evaluation. We employed the bioinformatics-integrated deep neural analysis of NCs for Disease Discovery (BioDeepNat) application in the data collected from chemical databases.

View Article and Find Full Text PDF

Establishment of a CRISPR-Based Lentiviral Activation Library for Transcription Factor Screening in Porcine Cells.

Animals (Basel)

December 2024

Key Laboratory of Animal Cellular and Genetic Engineering of Heilongjiang Province, Northeast Agricultural University, Harbin 150030, China.

Transcription factors play important roles in the growth and development of various tissues in pigs, such as muscle, fat, and bone. A transcription-factor-scale activation library based on the clustered, regularly interspaced, short palindromic repeat (CRISPR)/CRISPR-associated endonuclease Cas9 (Cas9) system could facilitate the discovery and functional characterization of the transcription genes involved in a specific gene network. Here, we have designed and constructed a CRISPR activation (CRISPRa) sgRNA library, containing 5056 sgRNAs targeting the promoter region of 1264 transcription factors in pigs.

View Article and Find Full Text PDF

Understanding transcriptional heterogeneity in cancer cells and its implication for treatment response is critical to identify how resistance occurs and may be targeted. Such heterogeneity can be captured by in vitro studies through clonal barcoding methods. We present TraCSED (Transformer-based modeling of Clonal Selection and Expression Dynamics), a dynamic deep learning approach for modeling clonal selection.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!