Discretization acts as a variable selection method in addition to transforming the continuous values of the variable to discrete ones. Machine learning algorithms such as Support Vector Machines and Random Forests have been used for classification in high-dimensional genomic and proteomic data due to their robustness to the dimensionality of the data. We show that discretization can help improve significantly the classification performance of these algorithms as well as algorithms like Naïve Bayes that are sensitive to the dimensionality of the data.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC2656082PMC

Publication Analysis

Top Keywords

classification performance
8
dimensionality data
8
improving classification
4
performance discretization
4
discretization biomedical
4
biomedical datasets
4
datasets discretization
4
discretization acts
4
acts variable
4
variable selection
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!