AI Article Synopsis

  • Feature subset selection helps identify the most relevant variables in large data sets but struggles with high dimensionality and resource limitations.
  • The proposed solution involves extending online feature selection (OFS) by creating an ensemble of online linear models that utilize partial feature information for predictions.
  • This ensemble approach not only reduces error rates compared to individual linear models but also maintains the same level of sparsity and complexity during testing.

Article Abstract

Feature subset selection can be used to sieve through large volumes of data and discover the most informative subset of variables for a particular learning problem. Yet, due to memory and other resource constraints (e.g., CPU availability), many of the state-of-the-art feature subset selection methods cannot be extended to high dimensional data, or data sets with an extremely large volume of instances. In this brief, we extend online feature selection (OFS), a recently introduced approach that uses partial feature information, by developing an ensemble of online linear models to make predictions. The OFS approach employs a linear model as the base classifier, which allows the $l_{0}$ -norm of the parameter vector to be constrained to perform feature selection leading to sparse linear models. We demonstrate that the proposed ensemble model typically yields a smaller error rate than any single linear model, while maintaining the same level of sparsity and complexity at the time of testing.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2017.2746107DOI Listing

Publication Analysis

Top Keywords

feature selection
12
online feature
8
feature subset
8
subset selection
8
linear models
8
linear model
8
feature
6
selection
5
extensions online
4
selection bagging
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!