ADCoC: Adaptive Distribution Modeling Based Collaborative Clustering for Disentangling Disease Heterogeneity from Neuroimaging Data.

IEEE Trans Emerg Top Comput Intell

Neuroimage Analytics Laboratory (NAL) and Biggs Institute Neuroimaging Core, Glenn Biggs Institute for Neurodegenerative Disorders, University of Texas Health Science Center at San Antonio, San Antonio, Texas, USA; Center for Biomedical Image Computing and Analytics, University of Pennsylvania, Philadelphia, PA 19104, USA.

Published: April 2023

AI Article Synopsis

  • Conventional clustering methods for neuroimaging usually focus on differences between subjects but often ignore the variations among features and issues with data quality, leading to clustering errors.
  • This study proposes a novel approach that clusters both subjects and features simultaneously using nonnegative matrix tri-factorization, which enhances clustering accuracy by leveraging weak supervision from heterogeneous feature clusters.
  • The introduced adaptive regularization, which considers the distribution of coefficients, is shown to be more effective in reducing noise, with the method outperforming traditional techniques in experiments and revealing distinct patient clusters in MRI data from individuals with Parkinson's disease.

Article Abstract

Conventional clustering techniques for neuroimaging applications usually focus on capturing differences between given subjects, while neglecting arising differences between features and the potential bias caused by degraded data quality. In practice, collected neuroimaging data are often inevitably contaminated by noise, which may lead to errors in clustering and clinical interpretation. Additionally, most methods ignore the importance of feature grouping towards optimal clustering. In this paper, we exploit the underlying heterogeneous clusters of features to serve as weak supervision for improved clustering of subjects, which is achieved by simultaneously clustering subjects and features via nonnegative matrix tri-factorization. In order to suppress noise, we further introduce adaptive regularization based on coefficient distribution modeling. Particularly, unlike conventional sparsity regularization techniques that assume zero mean of the coefficients, we form the distributions using the data of interest so that they could better fit the non-negative coefficients. In this manner, the proposed approach is expected to be more effective and robust against noise. We compared the proposed method with standard techniques and recently published methods demonstrating superior clustering performance on synthetic data with known ground truth labels. Furthermore, when applying our proposed technique to magnetic resonance imaging (MRI) data from a cohort of patients with Parkinson's disease, we identified two stable and highly reproducible patient clusters characterized by frontal and posterior cortical/medial temporal atrophy patterns, respectively, which also showed corresponding differences in cognitive characteristics.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10038331PMC
http://dx.doi.org/10.1109/tetci.2021.3136587DOI Listing

Publication Analysis

Top Keywords

distribution modeling
8
neuroimaging data
8
clustering subjects
8
clustering
7
data
6
adcoc adaptive
4
adaptive distribution
4
modeling based
4
based collaborative
4
collaborative clustering
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!