Clustering with varying risks of false assignments in discrete latent variable model.

Stat Methods Med Res

Department of Statistics, Seoul National University, Seoul, Republic of Korea.

Published: October 2020

In clustering problems, to model the intrinsic structure of unlabeled data, the latent variable models are frequently used. These model-based clustering methods often provide a clustering rule minimizing the total false assignment error. However, in many clustering applications, it is desirable to treat false assignment errors for a certain cluster differently. In this paper, we introduce the false assignment rate for clustering and estimate it by using the extended likelihood approach. We propose VRclust, a novel clustering rule that controls various errors differently across clusters. Real data examples illustrate the usage of estimation of false assignment rate and a simulation study shows that error controls are consistent as the sample size increases.

Download full-text PDF

Source
http://dx.doi.org/10.1177/0962280220913067DOI Listing

Publication Analysis

Top Keywords

false assignment
16
latent variable
8
clustering rule
8
assignment rate
8
clustering
7
false
5
clustering varying
4
varying risks
4
risks false
4
false assignments
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!