The global kernel k-means algorithm for clustering in feature space.

IEEE Trans Neural Netw

Department of Computer Science, University of Ioannina, GR 45110 Ioannina, Greece.

Published: July 2009

Kernel k-means is an extension of the standard k -means clustering algorithm that identifies nonlinearly separable clusters. In order to overcome the cluster initialization problem associated with this method, we propose the global kernel k-means algorithm, a deterministic and incremental approach to kernel-based clustering. Our method adds one cluster at each stage, through a global search procedure consisting of several executions of kernel k-means from suitable initializations. This algorithm does not depend on cluster initialization, identifies nonlinearly separable clusters, and, due to its incremental nature and search procedure, locates near-optimal solutions avoiding poor local minima. Furthermore, two modifications are developed to reduce the computational cost that do not significantly affect the solution quality. The proposed methods are extended to handle weighted data points, which enables their application to graph partitioning. We experiment with several data sets and the proposed approach compares favorably to kernel k -means with random restarts.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNN.2009.2019722DOI Listing

Publication Analysis

Top Keywords

kernel k-means
16
global kernel
8
k-means algorithm
8
identifies nonlinearly
8
nonlinearly separable
8
separable clusters
8
cluster initialization
8
search procedure
8
k-means
4
algorithm
4

Similar Publications

Learning model combined with data clustering and dimensionality reduction for short-term electricity load forecasting.

Sci Rep

January 2025

Department of Industrial Engineering/Graduate School of Data Science/Research Center for Electrical and Information Science, Seoul National University of Science and Technology, Seoul, South Korea.

Electric load forecasting is crucial in the planning and operating electric power companies. It has evolved from statistical methods to artificial intelligence-based techniques that use machine learning models. In this study, we investigate short-term load forecasting (STLF) for large-scale electricity usage datasets.

View Article and Find Full Text PDF

Weakly Supervised Nuclei Segmentation with Point-Guided Attention and Self-Supervised Pseudo-Labeling.

Bioengineering (Basel)

January 2025

Institute of Electronic Information Engineering, Beihang University, 37 Xueyuan Road, Haidian District, Beijing 100191, China.

Due to the labor-intensive manual annotations for nuclei segmentation, point-supervised segmentation based on nuclei coordinate supervision has gained recognition in recent years. Despite great progress, two challenges hinder the performance of weakly supervised nuclei segmentation methods: (1) The stable and effective segmentation of adjacent cell nuclei remains an unresolved challenge. (2) Existing approaches rely solely on initial pseudo-labels generated from point annotations for training, and inaccurate labels may lead the model to assimilate a considerable amount of noise information, thereby diminishing performance.

View Article and Find Full Text PDF

Sparse kernel -means clustering.

J Appl Stat

June 2024

Graduate School, Department of Urban Big Data Convergence, University of Seoul, Seoul, South Korea.

Clustering is an essential technique that groups similar data points to uncover the underlying structure and features of the data. Although traditional clustering methods such as -means are widely utilized, they have limitations in identifying nonlinear clusters. Thus, alternative techniques, such as kernel -means and spectral clustering, have been developed to address this issue.

View Article and Find Full Text PDF

Background: Wheat landraces represent a reservoir of genetic diversity that can support wheat improvement through breeding. A core panel of 300 Watkins wheat landraces, as well as 16 non-Watkins landraces and elite wheat cultivars, was grown during the 2020-2021 and 2021-2022 seasons at four Agricultural Research Stations in Egypt, Gemmiza, Nubaria, Sakha, and Sids, to evaluate the core panel for agromorphological and yield-related traits. The genetic population structure within these genotypes were assessed using 35,143 single nucleotide polymorphisms (SNPs).

View Article and Find Full Text PDF

Serum concentrations of different or multiple vitamins and Sarcopenia risk among US adults: insights from NHANES.

BMC Public Health

December 2024

Department of Preventive Medicine, School of Medicine, Shihezi University, North 2th Road, Shihezi, 832000, Xinjiang, People's Republic of China.

Background: The relationship between serum concentrations of different or multiple vitamins and sarcopenia remains underexplored. This investigation evaluates potential links between serum concentrations of different or multiple vitamins and sarcopenia prevalence among adults in the United States.

Methods: Utilizing a cross-sectional design, this research draws from the National Health and Nutrition Examination Survey (NHANES) dataset of 2003-2006, encompassing 5,060 participants with comprehensive serum vitamin A, E, B9, B12, C, and D concentrations, alongside sarcopenia and covariate measurements.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!