Using the Kriging Correlation for unsupervised feature selection problems.

Sci Rep

Department of Applied Mathematics, National University of Kaohsiung, Kaohsiung, 811, Taiwan, ROC.

Published: July 2022

This paper proposes a KC Score to measure feature importance in clustering analysis of high-dimensional data. The KC Score evaluates the contribution of features based on the correlation between the original features and the reconstructed features in the low dimensional latent space. A KC Score-based feature selection strategy is further developed for clustering analysis. We investigate the performance of the proposed strategy by conducting a study of four single-cell RNA sequencing (scRNA-seq) datasets. The results show that our strategy effectively selects important features for clustering. In particular, in three datasets, our proposed strategy selected less than 5% of the features and achieved the same or better clustering performance than when using all of the features.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9263137PMC
http://dx.doi.org/10.1038/s41598-022-15529-4DOI Listing

Publication Analysis

Top Keywords

feature selection
8
clustering analysis
8
proposed strategy
8
features
6
kriging correlation
4
correlation unsupervised
4
unsupervised feature
4
selection problems
4
problems paper
4
paper proposes
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!