Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection.

Jun Chin Ang, Andri Mirzal, Habibollah Haron, Haza Nuzly Abdull Hamed

IEEE/ACM Trans Comput Biol Bioinform

Published: October 2017

Feature selection and dimensionality reduction have recently become fundamental tools for many data mining tasks, especially for processing high-dimensional data such as gene expression microarray data. Gene expression microarray data comprise up to hundreds of thousands of features with a relatively small sample size. Because learning algorithms usually do not work well with this kind of data, reducing the data dimensionality becomes a challenge. A large number of gene selection methods have been applied to select a subset of relevant features for model construction and to seek better cancer classification performance. This paper presents the basic taxonomy of feature selection and reviews the state-of-the-art gene selection methods by grouping the literature into three categories: supervised, unsupervised, and semi-supervised. A comparison of experimental results on the top 5 representative gene expression datasets indicates that the classification accuracy of unsupervised and semi-supervised feature selection is competitive with that of supervised feature selection.

DOI: http://dx.doi.org/10.1109/TCBB.2015.2478454

