Background: Gene expression data are often used to classify cancer genes. In such high-dimensional datasets, however, only a few feature genes are closely related to tumors. Therefore, it is important to accurately select a subset of feature genes with high contributions to cancer classification.

Methods: In this article, a new three-stage hybrid gene selection method is proposed that combines a variance filter, extremely randomized tree and Harris Hawks (VEH). In the first stage, we evaluated each gene in the dataset through the variance filter and selected the feature genes that meet the variance threshold. In the second stage, we use extremely randomized tree to further eliminate irrelevant genes. Finally, we used the Harris Hawks algorithm to select the gene subset from the previous two stages to obtain the optimal feature gene subset.

Results: We evaluated the proposed method using three different classifiers on eight published microarray gene expression datasets. The results showed a 100% classification accuracy for VEH in gastric cancer, acute lymphoblastic leukemia and ovarian cancer, and an average classification accuracy of 95.33% across a variety of other cancers. Compared with other advanced feature selection algorithms, VEH has obvious advantages when measured by many evaluation criteria.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10280456PMC
http://dx.doi.org/10.7717/peerj-cs.1229DOI Listing

Publication Analysis

Top Keywords

harris hawks
12
feature genes
12
feature gene
8
gene selection
8
gene expression
8
variance filter
8
extremely randomized
8
randomized tree
8
classification accuracy
8
gene
7

Similar Publications

This paper proposes a hybridized model for air quality forecasting that combines the Support Vector Regression (SVR) method with Harris Hawks Optimization (HHO) called (HHO-SVR). The proposed HHO-SVR model utilizes five datasets from the environmental protection agency's Downscaler Model (DS) to predict Particulate Matter ([Formula: see text]) levels. In order to assess the efficacy of the suggested HHO-SVR forecasting model, we employ metrics such as Mean Absolute Percentage Error (MAPE), Average, Standard Deviation (SD), Best Fit, Worst Fit, and CPU time.

View Article and Find Full Text PDF

Health monitoring and analysis of photovoltaic (PV) systems are critical for optimizing energy efficiency, improving reliability, and extending the operational lifespan of PV power plants. Effective fault detection and monitoring are vital for ensuring the proper functioning and maintenance of these systems. PV power plants operating under fault conditions show significant deviations in current-voltage (I-V) characteristics compared to those under normal conditions.

View Article and Find Full Text PDF

Maintaining stable voltage and frequency regulation is critical for modern power systems, particularly with the integration of renewable energy sources. This study proposes a coordinated control strategy for voltage and frequency in a deregulated power system comprising six Generation Companies (GENCOs) and six Distribution Companies (DISCOs). The system integrates thermal, diesel, wind, solar photovoltaic (PV), and hydroelectric sources.

View Article and Find Full Text PDF

Parasitic survey of birds of prey used for falconry in Poland.

Pol J Vet Sci

December 2024

University of Warmia and Mazury in Olsztyn, Faculty of Veterinary Medicine, Department of Parasitology and Invasive Diseases, Oczapowskiego 13, 10-718 Olsztyn, Poland.

Birds of prey raised in captivity have direct contact with the environment and are fed raw meat various animals, which increases the risk of infections caused by parasites, including endoparasites. The aim of this study was to evaluate the prevalence of endoparasites in predatory birds of the orders Accipitriformes and Falconiformes that are used in falconry in Poland. Fresh feces were sampled from 52 birds, including 16 saker falcons (Falco cherrug), 8 lanner falcons (Falco biarmicus), 7 peregrine falcons (Falco peregrinus), 8 Harris's hawks (Parabuteo unicinctus), 7 Eurasian goshawks (Accipiter gentilis), 3 common kestrels (Falco tinnunculus), 1 Eurasian sparrowhawk (Accipiter nisus), 1 red-tailed hawk (Buteo jamaicensis), and 1 common buzzard (Buteo buteo).

View Article and Find Full Text PDF

A novel case-based reasoning system for explainable lung cancer diagnosis.

Comput Biol Med

December 2024

Department of Industrial Engineering & Management Systems, Amirkabir University of Technology (Tehran Polytechnic), Tehran, Iran. Electronic address:

Lung cancer is a leading cause of cancer death worldwide. The survival rate is generally higher when this disease is detected in its early stages. Advances in artificial intelligence (AI) have enabled the development of decision support systems that help physicians diagnose diseases.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!