Finite mixture models have been used to model population heterogeneity and to relax distributional assumptions. These models are also convenient tools for clustering and classification of complex data such as, for example, repeated-measurements data. The performance of model-based clustering algorithms is sensitive to influential and outlying observations. Methods for identifying outliers in a finite mixture model have been described in the literature. Approaches to identify influential observations are less common. In this paper, we apply local-influence diagnostics to a finite mixture model with known number of components. The methodology is illustrated on real-life data.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1177/0962280216634112 | DOI Listing |
Sensors (Basel)
December 2024
The State Key Laboratory for the Safety, Long-Life, Health Operation and Maintenance of Long-Span Bridges, Jiangsu Provincial Institute of Traffic Science (JSTI Group), Nanjing 210098, China.
The strain data acquired from structural health monitoring (SHM) systems of large-span bridges are often contaminated by a mixture of temperature-induced and vehicle-induced strain components, thereby complicating the assessment of bridge health. Existing approaches for isolating temperature-induced strains predominantly rely on statistical temperature-strain models, which can be significantly influenced by arbitrarily chosen parameters, thereby undermining the accuracy of the results. Additionally, signal processing techniques, including empirical mode decomposition (EMD) and others, frequently yield unstable outcomes when confronted with nonlinear strain signals.
View Article and Find Full Text PDFBiomech Model Mechanobiol
January 2025
CNRS, LaMCoS, UMR5259, INSA Lyon, 69621, Villeurbanne, France.
Sci Rep
January 2025
College of Pharmacy, The Islamic University, Najaf, Iraq.
In the current years, gas-liquid membrane contactors (GLMCs) have been introduced as a promising, versatile and easy-to-operate technology for mitigating the emission of major greenhouse contaminants (i.e., CO and HS) to the ecosystem.
View Article and Find Full Text PDFPLoS One
January 2025
Department of Statistics, Western Michigan University, Kalamazoo, MI, United States of America.
Graphical models have been widely used to explicitly capture the statistical relationships among the variables of interest in the form of a graph. The central question in these models is to infer significant conditional dependencies or independencies from high-dimensional data. In the current literature, it is common to assume that the high-dimensional data come from a homogeneous source and follow a parametric graphical model.
View Article and Find Full Text PDFIntell Based Med
July 2024
School of Industrial Engineering and Management, Oklahoma State University, Stillwater, OK, USA.
Objective: The paper aims to address the problem of massive unlabeled patients in electronic health records (EHR) who potentially have undiagnosed diabetic retinopathy (DR). It is desired to estimate the actual DR prevalence in EHR with 96 % missing labels.
Materials And Methods: The Cerner Health Facts data are used in the study, with 3749 labeled DR patients and 97,876 unlabeled diabetic patients.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!