Advantages of model-based clustering methods over heuristic alternatives have been widely demonstrated in the literature. Most model-based clustering algorithms assume that the data are either discrete or continuous, possibly allowing both types to be present in separate features. In this paper, we introduce a model-based approach for clustering feature vectors of mixed type, allowing each feature to simultaneously take on both categorical and real values. Such data may be encountered, for instance, in chemical and biological analyses, in the analysis of survey data, as well as in image analysis. Our model is formulated within a Bayesian predictive framework, where clustering solutions correspond to random partitions of the data. Using conjugate analysis, the posterior probability for each possible partition can be determined analytically, enabling the utilization of efficient computational search strategies for finding the posterior optimal partition. The derived model is illustrated using several synthetic and real datasets.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TPAMI.2014.2359431 | DOI Listing |
Vet Anim Sci
March 2025
University of Dar es Salaam, P.O. Box 35091, Dar es Salaam, Tanzania.
This study aimed to evaluate and compare Bayesian predictive models to identify and quantify the key household inputs affecting cattle milk production in Tanzania. A sample of 1,266 households with at least one milked cow was extracted from the National Panel Survey datasets, the data were collected in 2012/2013 (wave 3), 2014/2015 (wave 4), and 2020/2021 (wave 5). Two generalized linear and generalized additive mixed models were fitted using the Integrated Nested Laplace Approximation.
View Article and Find Full Text PDFiScience
January 2025
School of Materials Science and Engineering, Nanyang Technological University, Singapore 639798, Singapore.
Developing high-performance alloys is essential for applications in advanced electromagnetic energy conversion devices. In this study, we assess Fe-Co-Ni alloy compositions identified in our previous work through a machine learning (ML) framework, which used both multi-property ML models and multi-objective Bayesian optimization to design compositions with predicted high values of saturation magnetization, Curie temperature, and Vickers hardness. Experimental validation was conducted on two promising compositions synthesized using three different methods: arc melting, ball milling followed by spark plasma sintering (SPS), and chemical synthesis followed by SPS.
View Article and Find Full Text PDFLancet Reg Health Southeast Asia
January 2025
Department of Tropical Hygiene, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand.
Background: In highly measles immunized countries, immunity gaps in adolescents and young adults are a key issue posing an obstacle to measles elimination. This study aims to identify the gaps by estimating the age-stratified probability of seropositivity, and to ascertain a suitable age for the administration of a third dose of a measles-containing vaccine (MCV3) to effectively fill these gaps.
Methods: We retrospectively obtained measles serological results from hospital setting among among individuals aged 13-39 years and developed a serocatalytic dynamic probability model, stratifying seropositivity due to vaccination or natural infection.
J Appl Stat
May 2024
Department of Biostatistics, College of Public Health, University of Iowa, Iowa City, IA, USA.
Ischemic stroke is responsible for significant morbidity and mortality in the United States and worldwide. Stroke treatment optimization requires emergency medical personnel to make rapid triage decisions concerning destination hospitals that may differ in their ability to provide highly time-sensitive pharmaceutical and surgical interventions. These decisions are particularly crucial in rural areas, where transport decisions can have a large impact on treatment times - often involving a trade-off between delay in pharmaceutical therapy or a delay in endovascular thrombectomy.
View Article and Find Full Text PDFJ R Stat Soc Ser A Stat Soc
January 2025
Biostatistics, University of Michigan, 1415 Washington Heights, Michigan 48109, USA.
Model integration refers to the process of incorporating a fitted historical model into the estimation of a current study to increase statistical efficiency. Integration can be challenging when the current model includes new covariates, leading to potential model misspecification. We present and evaluate seven existing and novel model integration techniques, which employ both likelihood constraints and Bayesian informative priors.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!