Parsimonious mixtures of multivariate contaminated normal distributions.

Biom J

Department of Mathematics and Statistics, McMaster University, Hamilton, Canada.

Published: November 2016

A mixture of multivariate contaminated normal distributions is developed for model-based clustering. In addition to the parameters of the classical normal mixture, our contaminated mixture has, for each cluster, a parameter controlling the proportion of mild outliers and one specifying the degree of contamination. Crucially, these parameters do not have to be specified a priori, which adds flexibility to our approach. Parsimony is introduced via eigen-decomposition of the component covariance matrices, and sufficient conditions for the identifiability of all the members of the resulting family are provided. An expectation-conditional maximization algorithm is outlined for parameter estimation, and various implementation issues are discussed. Using a large-scale simulation study, the behavior of the proposed approach is investigated and a comparison with well-established finite mixtures is provided. The performance of this novel family of models is also illustrated on artificial and real data.
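To make the two extra per-cluster parameters concrete, the following is a minimal sketch of a single contaminated normal component, not the authors' implementation. It assumes the common parametrization in which `alpha` is the proportion of typical points (so `1 - alpha` is the proportion of mild outliers) and `eta > 1` inflates the covariance matrix of the outlier part. The function names, the pure-Python Cholesky helper, and the `prob_good` posterior are illustrative choices, not taken from the paper.

```python
import math

def cholesky(a):
    """Lower-triangular Cholesky factor of a symmetric positive-definite matrix."""
    n = len(a)
    l = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(l[i][k] * l[j][k] for k in range(j))
            if i == j:
                l[i][j] = math.sqrt(a[i][i] - s)
            else:
                l[i][j] = (a[i][j] - s) / l[j][j]
    return l

def normal_pdf(x, mu, sigma):
    """Multivariate normal density at x with mean mu and covariance sigma."""
    n = len(x)
    l = cholesky(sigma)
    # Forward substitution solves L z = (x - mu); z.z is the squared
    # Mahalanobis distance of x from mu.
    z = []
    for i in range(n):
        s = sum(l[i][k] * z[k] for k in range(i))
        z.append((x[i] - mu[i] - s) / l[i][i])
    maha = sum(v * v for v in z)
    log_det = 2.0 * sum(math.log(l[i][i]) for i in range(n))
    return math.exp(-0.5 * (n * math.log(2.0 * math.pi) + log_det + maha))

def contaminated_normal_pdf(x, mu, sigma, alpha, eta):
    """Assumed form: alpha * N(mu, sigma) + (1 - alpha) * N(mu, eta * sigma)."""
    inflated = [[eta * v for v in row] for row in sigma]
    return (alpha * normal_pdf(x, mu, sigma)
            + (1.0 - alpha) * normal_pdf(x, mu, inflated))

def prob_good(x, mu, sigma, alpha, eta):
    """Posterior probability that x comes from the non-inflated (typical) part."""
    return (alpha * normal_pdf(x, mu, sigma)
            / contaminated_normal_pdf(x, mu, sigma, alpha, eta))

# Illustrative values only: a 2-D component with identity covariance.
mu = [0.0, 0.0]
sigma = [[1.0, 0.0], [0.0, 1.0]]
alpha, eta = 0.9, 4.0

center_density = contaminated_normal_pdf(mu, mu, sigma, alpha, eta)
center_good = prob_good(mu, mu, sigma, alpha, eta)
far_good = prob_good([5.0, 5.0], mu, sigma, alpha, eta)
```

Under these assumptions, a point at the component mean is almost certainly typical, while a point far from the mean is attributed to the inflated part, which is how a fitted model of this kind can flag mild outliers. In a full mixture, each cluster would carry its own `alpha` and `eta`, estimated from the data rather than fixed in advance.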


Source
http://dx.doi.org/10.1002/bimj.201500144

