Coordination of cluster ensembles via exact methods.

IEEE Trans Pattern Anal Mach Intell

Athens Information Technology, Paiania, Greece.

Published: February 2011

We present a novel optimization-based method for the combination of cluster ensembles for the class of problems with intracluster criteria, such as Minimum-Sum-of-Squares-Clustering (MSSC). We propose a simple and efficient algorithm-called EXAMCE-for this class of problems that is inspired from a Set-Partitioning formulation of the original clustering problem. We prove some theoretical properties of the solutions produced by our algorithm, and in particular that, under general assumptions, though the algorithm recombines solution fragments so as to find the solution of a Set-Covering relaxation of the original formulation, it is guaranteed to find better solutions than the ones in the ensemble. For the MSSC problem in particular, a prototype implementation of our algorithm found a new better solution than the previously best known for 21 of the test instances of the 40-instance TSPLIB benchmark data sets used in [1], [2], and [3], and found a worse-quality solution than the best known only five times. For other published benchmark data sets where the optimal MSSC solution is known, we match them. The algorithm is particularly effective when the number of clusters is large, in which case it is able to escape the local minima found by K-means type algorithms by recombining the solutions in a Set-Covering context. We also establish the stability of the algorithm with extensive computational experiments, by showing that multiple runs of EXAMCE for the same clustering problem instance produce high-quality solutions whose Adjusted Rand Index is consistently above 0.95. Finally, in experiments utilizing external criteria to compute the validity of clustering, EXAMCE is capable of producing high-quality results that are comparable in quality to those of the best known clustering algorithms.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TPAMI.2010.85DOI Listing

Publication Analysis

Top Keywords

cluster ensembles
8
class problems
8
clustering problem
8
solution best
8
benchmark data
8
data sets
8
algorithm
5
solution
5
coordination cluster
4
ensembles exact
4

Similar Publications

Unraveling EEG correlates of unimanual finger movements: insights from non-repetitive flexion and extension tasks.

J Neuroeng Rehabil

December 2024

Laboratory for Neuro- & Psychophysiology, Department of Neurosciences, KU Leuven, Leuven, Belgium.

Background: The loss of finger control in individuals with neuromuscular disorders significantly impacts their quality of life. Electroencephalography (EEG)-based brain-computer interfaces that actuate neuroprostheses directly via decoded motor intentions can help restore lost finger mobility. However, the extent to which finger movements exhibit distinct and decodable EEG correlates remains unresolved.

View Article and Find Full Text PDF

MorphoGlia, an interactive method to identify and map microglia morphologies, demonstrates differences in hippocampal subregions of an Alzheimer's disease mouse model.

Front Cell Neurosci

December 2024

Departamento de Neurobiología del Desarrollo y Neurofisiología, Instituto de Neurobiología, Santiago de Querétaro, Mexico.

Microglia are dynamic central nervous system cells crucial for maintaining homeostasis and responding to neuroinflammation, as evidenced by their varied morphologies. Existing morphology analysis often fails to detect subtle variations within the full spectrum of microglial morphologies due to their reliance on predefined categories. Here, we present MorphoGlia, an interactive, user-friendly pipeline that objectively characterizes microglial morphologies.

View Article and Find Full Text PDF

GOCIA: a grand canonical global optimizer for clusters, interfaces, and adsorbates.

Phys Chem Chem Phys

December 2024

Department of Chemistry and Biochemistry, University of California, Los Angeles, California, 90095-1569, USA.

Restructuring of surfaces and interfaces plays a key role in the activation and/or deactivation of a wide spectrum of heterogeneous catalysts and functional materials. The statistical ensemble representation can provide unique atomistic insights into this fluxional and metastable realm, but constructing the ensemble is very challenging, especially for the systems with off-stoichiometric reconstruction and varying coverage of mixed adsorbates. Here, we report GOCIA, a versatile global optimizer for exploring the chemical space of these systems.

View Article and Find Full Text PDF

Mass spectral identification (in particular, in metabolomics) can be refined by comparing the observed and predicted properties of molecules, such as chromatographic retention. Significant advancements have been made in predicting these values using machine learning and deep learning. Usually, model predictions do not contain any indication of the possible error (uncertainty) or only one criterion is used for this purpose.

View Article and Find Full Text PDF

Background/objectives: This study develops machine learning (ML) models to predict hypoxemia severity during emergency triage, particularly in Chemical, Biological, Radiological, Nuclear, and Explosive (CBRNE) scenarios, using physiological data from medical-grade sensors.

Methods: Tree-based models (TBMs) such as XGBoost, LightGBM, CatBoost, Random Forests (RFs), Voting Classifier ensembles, and sequential models (LSTM, GRU) were trained on the MIMIC-III and IV datasets. A preprocessing pipeline addressed missing data, class imbalances, and synthetic data flagged with masks.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!