AI Article Synopsis

  • Feature selection helps improve learning performance by removing irrelevant or redundant features, making data easier to understand and interpret.
  • Heterogeneous data, which includes both numerical and categorical representations, is common in real-world applications and can be effectively managed using the Neighborhood Rough Set (NRS) model.
  • The article introduces a feature selection method using a new concept called conditional neighborhood combination entropy (cNCE) and presents an algorithm (FScNCE) that demonstrates superior effectiveness through experimental results.

Article Abstract

Feature selection aims to remove irrelevant or redundant features and thereby remain relevant or informative features so that it is often preferred for alleviating the dimensionality curse, enhancing learning performance, providing better readability and interpretability, and so on. Data that contain numerical and categorical representations are called heterogeneous data, and they exist widely in many real-world applications. Neighborhood rough set (NRS) can effectively deal with heterogeneous data by using neighborhood binary relation, which has been successfully applied to heterogeneous feature selection. In this article, the NRS model as a unified framework is used to design a feature selection method to handle categorical, numerical, and heterogeneous data. First, the concept of neighborhood combination entropy (NCE) is presented. It can reflect the probability of pairs of the neighborhood granules that are probably distinguishable from each other. Then, the conditional neighborhood combination entropy (cNCE) based on NCE is proposed under the condition of considering decision attributes. Moreover, some properties and relationships between cNCE and NCE are derived. Finally, the functions of inner and outer significances are constructed to design a feature selection algorithm based on cNCE (FScNCE). The experimental results show the effectiveness and superiority of the proposed algorithm.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2022.3193929DOI Listing

Publication Analysis

Top Keywords

feature selection
20
neighborhood combination
12
combination entropy
12
heterogeneous data
12
heterogeneous feature
8
design feature
8
neighborhood
6
heterogeneous
5
selection
5
selection based
4

Similar Publications

Background: Due to advances in treatment, HIV is now a chronic condition with near-normal life expectancy. However, people with HIV continue to have a higher burden of mental and physical health conditions and are impacted by wider socioeconomic issues. Positive Voices is a nationally representative series of surveys of people with HIV in the United Kingdom.

View Article and Find Full Text PDF

Stock trend prediction is a significant challenge due to the inherent uncertainty and complexity of stock market time series. In this study, we introduce an innovative dual-branch network model designed to effectively address this challenge. The first branch constructs recurrence plots (RPs) to capture the nonlinear relationships between time points from historical closing price sequences and computes the corresponding recurrence quantifification analysis measures.

View Article and Find Full Text PDF

Binuclear silver(I) and copper(I) complexes, and , with bridging diphenylphosphine ligands were prepared. In , the silver(I) center is located inside a trigonal plane composed of three phosphorus donors from three separate and bridging dppm ligands. The fourth coordination site is filled with neighboring silver(I) ions.

View Article and Find Full Text PDF

Establishing a living biobank of pediatric high-grade glioma and ependymoma suitable for cancer pharmacology.

Neuro Oncol

January 2025

Childhood Cancer & Cell Death team (C3 team), Consortium South-ROCK, LabEx DEVweCAN, Institut Convergence Plascan, Centre Léon Bérard, Centre de Recherche en Cancérologie de Lyon (CRCL), Université Claude Bernard Lyon 1, INSERM 1052, CNRS 5286, 69008 Lyon, France.

Background: Brain tumors are the deadliest solid tumors in children and adolescents. Most of these tumors are glial in origin and exhibit strong heterogeneity, hampering the development of effective therapeutic strategies. In the past decades, patient-derived tumor organoids (PDT-O) have emerged as powerful tools for modeling tumoral cell diversity and dynamics, and they could then help defining new therapeutic options for pediatric brain tumors.

View Article and Find Full Text PDF

Quantum mechanics has proved to be suitable for the study of molecular systems. In particular, the Born-Oppenheimer approximation enables one to separate the motions of electrons and nuclei. In the case of diatomic molecules, this approximation leads to the so-called potential-energy function that provides the interaction between the two nuclei.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!