Traditional clustering algorithms often focus on the most fine-grained information and achieve clustering by calculating the distance between each pair of data points or implementing other calculations based on points. This way is not inconsistent with the cognitive mechanism of "global precedence" in the human brain, resulting in those methodsbad performance in efficiency, generalization ability, and robustness. To address this problem, we propose a new clustering algorithm called granular-ball clustering via granular-ball computing. First, clustering algorithm based on granular-ball (GBCT) generates a smaller number of granular-balls to represent the original data and forms clusters according to the relationship between granular-balls, instead of the traditional point relationship. At the same time, its coarse-grained characteristics are not susceptible to noise, and the algorithm is efficient and robust; besides, as granular-balls can fit various complex data, GBCT performs much better in nonspherical datasets than other traditional clustering methods. The completely new coarse granularity representation method of GBCT and cluster formation mode can also be used to improve other traditional methods. All codes can be available at https://github.com/wylbdthxbw/GBC.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TNNLS.2024.3497174 | DOI Listing |
IEEE Trans Cybern
February 2025
Efficient and robust data clustering remains a challenging task in data analysis. Recent efforts have explored the integration of granular-ball (GB) computing with clustering algorithms to address this challenge, yielding promising results. However, existing methods for generating GBs often rely on single indicators to measure GB quality and employ threshold-based or greedy strategies, potentially leading to GBs that do not accurately capture the underlying data distribution.
View Article and Find Full Text PDFIEEE Trans Neural Netw Learn Syst
January 2025
Traditional clustering algorithms often focus on the most fine-grained information and achieve clustering by calculating the distance between each pair of data points or implementing other calculations based on points. This way is not inconsistent with the cognitive mechanism of "global precedence" in the human brain, resulting in those methodsbad performance in efficiency, generalization ability, and robustness. To address this problem, we propose a new clustering algorithm called granular-ball clustering via granular-ball computing.
View Article and Find Full Text PDFIEEE Trans Pattern Anal Mach Intell
December 2024
In the real world, data distributions often exhibit multiple granularities. However, the majority of existing neighbor-based machine-learning methods rely on manually setting a single-granularity for neighbor relationships. These methods typically handle each data point using a single-granularity approach, which severely affects their accuracy and efficiency.
View Article and Find Full Text PDFPLoS One
February 2024
College of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing, China.
Hydrological and water quality datasets usually encompass a large number of characteristic variables, but not all of these significantly influence analytical outcomes. Therefore, by wisely selecting feature variables with rich information content and removing redundant features, it not only can the analysis efficiency be improved, but the model complexity can also be simplified. This paper considers introducing the granular-ball rough set algorithm for feature variable selection and combining it with the k-nearest neighbor method and back propagation network to analyze hydrological and water quality data, thus promoting overall and fused inspection.
View Article and Find Full Text PDFIEEE Trans Neural Netw Learn Syst
December 2024
Density peaks clustering algorithm (DP) has difficulty in clustering large-scale data, because it requires the distance matrix to compute the density and -distance for each object, which has time complexity. Granular ball (GB) is a coarse-grained representation of data. It is based on the fact that an object and its local neighbors have similar distribution and they have high possibility of belonging to the same class.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!