Incremental forward feature selection with application to microarray gene expression data.

J Biopharm Stat

Department of Computer Science and Information Engineering, National Taiwan University of Science and Technology, Taipei, Taiwan.

Published: November 2008

In this study, the authors propose a new feature selection scheme, the incremental forward feature selection, which is inspired by incremental reduced support vector machines. In their method, a new feature is added into the current selected feature subset if it will bring in the most extra information. This information is measured by using the distance between the new feature vector and the column space spanned by current feature subset. The incremental forward feature selection scheme can exclude highly linear correlated features that provide redundant information and might degrade the efficiency of learning algorithms. The method is compared with the weight score approach and the 1-norm support vector machine on two well-known microarray gene expression data sets, the acute leukemia and colon cancer data sets. These two data sets have a very few observations but huge number of genes. The linear smooth support vector machine was applied to the feature subsets selected by these three schemes respectively and obtained a slightly better classification results in the 1-norm support vector machine and incremental forward feature selection. Finally, the authors claim that the rest of genes still contain some useful information. The previous selected features are iteratively removed from the data sets and the feature selection and classification steps are repeated for four rounds. The results show that there are many distinct feature subsets that can provide enough information for classification tasks in these two microarray gene expression data sets.

Download full-text PDF

Source
http://dx.doi.org/10.1080/10543400802277868DOI Listing

Publication Analysis

Top Keywords

feature selection
24
data sets
20
incremental forward
16
forward feature
16
support vector
16
feature
12
microarray gene
12
gene expression
12
expression data
12
vector machine
12

Similar Publications

Background: This study investigated the oral microbiome signatures associated with upper gastrointestinal (GI) and pancreaticobiliary cancers.

Methods: Saliva samples from cancer patients and age- and sex-matched healthy controls were analyzed using 16S rRNA-targeted sequencing, followed by comprehensive bioinformatics analysis.

Results: Significant dissimilarities in microbial composition were observed between cancer patients and controls across esophageal cancer (EC), gastric cancer (GC), biliary tract cancer (BC), and pancreatic cancer (PC) groups (R = 0.

View Article and Find Full Text PDF

Background: The diagnosis and treatment of epilepsy continue to face numerous challenges, highlighting the urgent need for the development of rapid, accurate, and non-invasive methods for seizure detection. In recent years, advancements in the analysis of electroencephalogram (EEG) signals have garnered widespread attention, particularly in the area of seizure recognition.

Methods: A novel hybrid deep learning approach that combines feature fusion for efficient seizure detection is proposed in this study.

View Article and Find Full Text PDF

Background: Rectal cancer is a highly heterogeneous gastrointestinal tumor, and the prognosis for patients with treatment-resistant and metastatic rectal cancer remains poor. Mitophagy, a type of selective autophagy that targets mitochondria, plays a role in promoting or inhibiting tumors; however, the importance of mitophagy-related genes (MRGs) in the prognosis and treatment of rectal cancer is unclear.

Methods: In this study, we used the differentially expressed genes (DEGs) and MRGs from the TCGA-READ dataset to identify differentially expressed mitophagy-related genes (MRDEGs).

View Article and Find Full Text PDF

Habitat-based MRI radiomics to predict the origin of brain metastasis.

Med Phys

January 2025

Department of Scientific Research and Academic, Cancer Hospital of China Medical University, Liaoning Cancer Hospital and Institute, Shenyang, Liaoning, P. R. China.

Background: This study aims to explore the value of habitat-based magnetic resonance imaging (MRI) radiomics for predicting the origin of brain metastasis (BM).

Purpose: To investigate whether habitat-based radiomics can identify the metastatic tumor type of BM and whether an imaging-based model that integrates the volume of peritumoral edema (VPE) can enhance predictive performance.

Methods: A primary cohort was developed with 384 patients from two centers, which comprises 734 BM lesions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!