Large amounts of high-dimensional unlabeled data typically contain only a small portion of truly effective information. Consequently, the issue of unsupervised feature selection methods has gained significant attention in research. However, current unsupervised feature selection approaches face limitations when dealing with datasets that exhibit uneven density, and they also require substantial computational time. To address this problem, this research article proposes a feature extraction technique that combines the Fuzzy C-Means (FCM) and k -nearest neighbor rough sets. FCM is a clustering algorithm grounded in fuzzy theory, which takes into account the inherent data structure and the correlations between different features. Consequently, FCM is particularly well-suited for datasets with uneven density. Our proposed method consists of three steps. First, the FCM algorithm is used to cluster the unlabeled data. Second, a measure that evaluates the importance of features is defined and sorted based on the clustering results. Finally, redundant features are filtered using k -nearest neighbor rough sets while retaining important features, significantly reducing the running time. In addition, we designed the feature selection algorithm (KND-UFS) and conducted experiments on 12 public datasets. We compared KND-UFS with eight existing algorithms in terms of running time, classification accuracy, and the number of selected features. The experimental results provided strong evidence supporting the superior performance of the KND-UFS algorithm.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2024.3460796DOI Listing

Publication Analysis

Top Keywords

feature selection
16
unsupervised feature
12
-nearest neighbor
12
neighbor rough
12
rough sets
12
fcm -nearest
8
unlabeled data
8
uneven density
8
running time
8
feature
5

Similar Publications

Modulation of singlet and triplet energy transfer from excited semiconductor nanocrystals to attached dye molecules remains an important criterion for the design of light-harvesting assemblies. Whereas one can consider the selection of donor and acceptor with favorable energetics, spectral overlap, and kinetics of energy transfer as a means to direct the singlet and triplet energy transfer pathways, it is not obvious how to control the singlet and triplet characteristics of the donor semiconductor nanocrystal itself. By doping CsPb(ClBr) nanocrystals with Mn, we have now succeeded in increasing the triplet characteristics of semiconductor nanocrystals.

View Article and Find Full Text PDF

An external control arm based on health registry data can serve as an alternative comparator in single-arm drug development studies that lack a benchmark for comparison to the experimental treatment. However, accessing such observational healthcare data involves a lengthy and intricate application process, delaying drug approval studies and access to novel treatments. Clinical trials typically comprise only a few hundred patients usually with high-cardinality features, which makes individual data instances more exposed to re-identification attacks.

View Article and Find Full Text PDF

Data-driven discovery and parameter estimation of mathematical models in biological pattern formation.

PLoS Comput Biol

January 2025

Department of Anatomy and Cell Biology, Graduate School of Medical Sciences, Kyushu University, Fukuoka, Fukuoka, Japan.

Mathematical modeling has been utilized to explain biological pattern formation, but the selections of models and parameters have been made empirically. In the present study, we propose a data-driven approach to validate the applicability of mathematical models. Specifically, we developed methods to automatically select the appropriate mathematical models based on the patterns of interest and to estimate the model parameters.

View Article and Find Full Text PDF

Objectives: To evaluate 18F-DCFPyL-PET/MRI whole-gland-derived radiomics for detecting clinically significant (cs) prostate cancer (PCa) and predicting metastasis.

Methods: Therapy-naïve PCa patients who underwent 18F-DCFPyL PET/MRI were included. Whole-prostate-segmentation was performed.

View Article and Find Full Text PDF

Superhydrophobic and Self-Healing Porous Organic Macrocycle Crystals for Methane Purification under Humid Conditions.

J Am Chem Soc

January 2025

Stoddart Institute of Molecular Science, Department of Chemistry, Zhejiang University, Hangzhou 310058, P. R. China.

Purifying methane from natural gas using adsorbents not only requires the adsorbents to possess excellent separation performance but also to overcome additional daunting challenges such as humidity interference and durability requirements for sustainable use. Herein, porous organic crystals of a new macrocycle () with superhydrophobic and self-healing features are prepared and employed for the purification of methane (>99.99% purity) from ternary methane/ethane/propane mixtures under 97% relative humidity.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!