Estimating the number of clusters via a corrected clustering instability.

Comput Stat

Center for Cognitive and Decision Science, University of Basel, Basel, Switzerland.

Published: May 2020

We improve instability-based methods for the selection of the number of clusters in cluster analysis by developing a corrected clustering distance that corrects for the unwanted influence of the distribution of cluster sizes on cluster instability. We show that our corrected instability measure outperforms current instability-based measures across the whole sequence of possible , overcoming limitations of current insability-based methods for large . We also compare, for the first time, model-based and model-free approaches to determining cluster-instability and find their performance to be comparable. We make our method available in the R-package cstab.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7550318PMC
http://dx.doi.org/10.1007/s00180-020-00981-5DOI Listing

Publication Analysis

Top Keywords

number clusters
8
corrected clustering
8
estimating number
4
clusters corrected
4
clustering instability
4
instability improve
4
improve instability-based
4
instability-based methods
4
methods selection
4
selection number
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!