Kernel k-means is an extension of the standard k -means clustering algorithm that identifies nonlinearly separable clusters. In order to overcome the cluster initialization problem associated with this method, we propose the global kernel k-means algorithm, a deterministic and incremental approach to kernel-based clustering. Our method adds one cluster at each stage, through a global search procedure consisting of several executions of kernel k-means from suitable initializations. This algorithm does not depend on cluster initialization, identifies nonlinearly separable clusters, and, due to its incremental nature and search procedure, locates near-optimal solutions avoiding poor local minima. Furthermore, two modifications are developed to reduce the computational cost that do not significantly affect the solution quality. The proposed methods are extended to handle weighted data points, which enables their application to graph partitioning. We experiment with several data sets and the proposed approach compares favorably to kernel k -means with random restarts.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TNN.2009.2019722 | DOI Listing |
Sci Rep
January 2025
Department of Industrial Engineering/Graduate School of Data Science/Research Center for Electrical and Information Science, Seoul National University of Science and Technology, Seoul, South Korea.
Electric load forecasting is crucial in the planning and operating electric power companies. It has evolved from statistical methods to artificial intelligence-based techniques that use machine learning models. In this study, we investigate short-term load forecasting (STLF) for large-scale electricity usage datasets.
View Article and Find Full Text PDFBioengineering (Basel)
January 2025
Institute of Electronic Information Engineering, Beihang University, 37 Xueyuan Road, Haidian District, Beijing 100191, China.
Due to the labor-intensive manual annotations for nuclei segmentation, point-supervised segmentation based on nuclei coordinate supervision has gained recognition in recent years. Despite great progress, two challenges hinder the performance of weakly supervised nuclei segmentation methods: (1) The stable and effective segmentation of adjacent cell nuclei remains an unresolved challenge. (2) Existing approaches rely solely on initial pseudo-labels generated from point annotations for training, and inaccurate labels may lead the model to assimilate a considerable amount of noise information, thereby diminishing performance.
View Article and Find Full Text PDFJ Appl Stat
June 2024
Graduate School, Department of Urban Big Data Convergence, University of Seoul, Seoul, South Korea.
Clustering is an essential technique that groups similar data points to uncover the underlying structure and features of the data. Although traditional clustering methods such as -means are widely utilized, they have limitations in identifying nonlinear clusters. Thus, alternative techniques, such as kernel -means and spectral clustering, have been developed to address this issue.
View Article and Find Full Text PDFBackground: Wheat landraces represent a reservoir of genetic diversity that can support wheat improvement through breeding. A core panel of 300 Watkins wheat landraces, as well as 16 non-Watkins landraces and elite wheat cultivars, was grown during the 2020-2021 and 2021-2022 seasons at four Agricultural Research Stations in Egypt, Gemmiza, Nubaria, Sakha, and Sids, to evaluate the core panel for agromorphological and yield-related traits. The genetic population structure within these genotypes were assessed using 35,143 single nucleotide polymorphisms (SNPs).
View Article and Find Full Text PDFBMC Public Health
December 2024
Department of Preventive Medicine, School of Medicine, Shihezi University, North 2th Road, Shihezi, 832000, Xinjiang, People's Republic of China.
Background: The relationship between serum concentrations of different or multiple vitamins and sarcopenia remains underexplored. This investigation evaluates potential links between serum concentrations of different or multiple vitamins and sarcopenia prevalence among adults in the United States.
Methods: Utilizing a cross-sectional design, this research draws from the National Health and Nutrition Examination Survey (NHANES) dataset of 2003-2006, encompassing 5,060 participants with comprehensive serum vitamin A, E, B9, B12, C, and D concentrations, alongside sarcopenia and covariate measurements.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!