A Sequential Learning Approach for Scaling Up Filter-Based Feature Subset Selection.

Gregory Ditzler Robi Polikar Gail Rosen Gregory Ditzler Robi Polikar Gail Rosen Gregory Ditzler Gail Rosen Robi Polikar

IEEE Trans Neural Netw Learn Syst

Published: June 2018

Increasingly, many machine learning applications are now associated with very large data sets whose sizes were almost unimaginable just a short time ago. As a result, many of the current algorithms cannot handle, or do not scale to, today's extremely large volumes of data. Fortunately, not all features that make up a typical data set carry information that is relevant or useful for prediction, and identifying and removing such irrelevant features can significantly reduce the total data size. The unfortunate dilemma, however, is that some of the current data sets are so large that common feature selection algorithms-whose very goal is to reduce the dimensionality-cannot handle such large data sets, creating a vicious cycle. We describe a sequential learning framework for feature subset selection (SLSS) that can scale with both the number of features and the number of observations. The proposed framework uses multiarm bandit algorithms to sequentially search a subset of variables, and assign a level of importance for each feature. The novel contribution of SLSS is its ability to naturally scale to large data sets, evaluate such data in a very small amount of time, and be performed independently of the optimization of any classifier to reduce unnecessary complexity. We demonstrate the capabilities of SLSS on synthetic and real-world data sets.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TNNLS.2017.2697407	DOI Listing

Publication Analysis

Top Keywords

data sets

large data

data

sequential learning

feature subset

subset selection

large

sets

learning approach

approach scaling

Similar Publications

Signals of propaganda-Detecting and estimating political influences in information spread in social networks.

PLoS One

January 2025

Department of Information Technologies, Faculty of Economics and Management, Czech University of Life Sciences Prague, Prague, Czech Republic.

Alon Sela Omer Neter Václav Lohr Petr Cihelka Fan Wang

Social networks are a battlefield for political propaganda. Protected by the anonymity of the internet, political actors use computational propaganda to influence the masses. Their methods include the use of synchronized or individual bots, multiple accounts operated by one social media management tool, or different manipulations of search engines and social network algorithms, all aiming to promote their ideology.

View Article and Find Full Text PDF

Similar Publications

Can Focusing on One Deep Learning Architecture Improve Fault Diagnosis Performance?

J Chem Inf Model

January 2025

Department of Chemical and Materials Engineering, Pontifical Catholic University of Rio de Janeiro, 225, Marquês de São Vicente Street, Gávea, Rio de Janeiro, RJ 22451-900, Brazil.

João G Neto Karla Figueiredo João B P Soares Amanda L T Brandão

Machine learning approaches often involve evaluating a wide range of models due to various available architectures. This standard strategy can lead to a lack of depth in exploring established methods. In this study, we concentrated our efforts on a single deep learning architecture type to assess whether a focused approach could enhance performance in fault diagnosis.

View Article and Find Full Text PDF

Similar Publications

PiNN: Equivariant Neural Network Suite for Modeling Electrochemical Systems.

J Chem Theory Comput

January 2025

Department of Chemistry-Ångström Laboratory, Uppsala University, Lägerhyddsvägen 1, P.O. Box 538, 75121 Uppsala, Sweden.

Jichen Li Lisanne Knijff Zhan-Yun Zhang Linnéa Andersson Chao Zhang

Electrochemical energy storage and conversion play increasingly important roles in electrification and sustainable development across the globe. A key challenge therein is to understand, control, and design electrochemical energy materials with atomistic precision. This requires inputs from molecular modeling powered by machine learning (ML) techniques.

View Article and Find Full Text PDF

Similar Publications

A glass half full of money: Dispositional optimism and wealth accumulation across the income spectrum.

J Pers Soc Psychol

January 2025

Marketing Division, Paul College of Business and Economics, University of New Hampshire.

Joe J Gladstone Justin Pomerance

What drives some people to save more effectively for their future than others? This multistudy investigation (N = 143,461) explores how dispositional optimism-the generalized tendency to hold positive expectations about the future-shapes individuals' financial decisions and outcomes. Leveraging both cross-sectional and longitudinal designs across several countries, our findings reveal that optimism significantly predicts greater savings over time, even when controlling for various demographic, psychological, and financial covariates. Furthermore, we find that the role of optimism varies based on socioeconomic circumstances: Among lower income individuals, optimism is more strongly associated with saving.

View Article and Find Full Text PDF

Similar Publications

A physical optics formulation of Bloch waves and its application to 4D STEM, 3D ED and inelastic scattering simulations.

Acta Crystallogr A Found Adv

March 2025

Department of Physics, Durham University, South Road, Durham, DH1 3LE, United Kingdom.

Budhika G Mendis

Bloch waves are often used in dynamical diffraction calculations, such as simulating electron diffraction intensities for crystal structure refinement. However, this approach relies on matrix diagonalization and is therefore computationally expensive for large unit cell crystals. Here Bloch wave theory is re-formulated using the physical optics concepts underpinning the multislice method.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!