Genetic programming based ensemble system for microarray data classification.

Comput Math Methods Med

Department of Computing, The Hong Kong Polytechnic University, Hung Hom, Kowloon 999077, Hong Kong.

Published: December 2015

Recently, more and more machine learning techniques have been applied to microarray data analysis. The aim of this study is to propose a genetic programming (GP) based new ensemble system (named GPES), which can be used to effectively classify different types of cancers. Decision trees are deployed as base classifiers in this ensemble framework with three operators: Min, Max, and Average. Each individual of the GP is an ensemble system, and they become more and more accurate in the evolutionary process. The feature selection technique and balanced subsampling technique are applied to increase the diversity in each ensemble system. The final ensemble committee is selected by a forward search algorithm, which is shown to be capable of fitting data automatically. The performance of GPES is evaluated using five binary class and six multiclass microarray datasets, and results show that the algorithm can achieve better results in most cases compared with some other ensemble systems. By using elaborate base classifiers or applying other sampling techniques, the performance of GPES may be further improved.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4355811PMC
http://dx.doi.org/10.1155/2015/193406DOI Listing

Publication Analysis

Top Keywords

ensemble system
16
genetic programming
8
programming based
8
based ensemble
8
microarray data
8
base classifiers
8
performance gpes
8
ensemble
7
system
4
system microarray
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!