Feature selection in feature network models: finding predictive subsets of features with the Positive Lasso.

Br J Math Stat Psychol

Department of Methodology and Statistics, Utrecht University, The Netherlands.

Published: May 2008

A set of features is the basis for the network representation of proximity data achieved by feature network models (FNMs). Features are binary variables that characterize the objects in an experiment, with some measure of proximity as response variable. Sometimes features are provided by theory and play an important role in the construction of the experimental conditions. In some research settings, the features are not known a priori. This paper shows how to generate features in this situation and how to select an adequate subset of features that takes into account a good compromise between model fit and model complexity, using a new version of least angle regression that restricts coefficients to be non-negative, called the Positive Lasso. It will be shown that features can be generated efficiently with Gray codes that are naturally linked to the FNMs. The model selection strategy makes use of the fact that FNM can be considered as univariate multiple regression model. A simulation study shows that the proposed strategy leads to satisfactory results if the number of objects is less than or equal to 22. If the number of objects is larger than 22, the number of features selected by our method exceeds the true number of features in some conditions.

Download full-text PDF

Source
http://dx.doi.org/10.1348/000711006X119365DOI Listing

Publication Analysis

Top Keywords

features
10
feature network
8
network models
8
positive lasso
8
number objects
8
number features
8
feature selection
4
selection feature
4
models finding
4
finding predictive
4

Similar Publications

GradeDiff-IM: An Ensembles Model-based Grade Classification of Breast Cancer.

Biomed Phys Eng Express

January 2025

School of Engineering and Computing, University of the West of Scotland, University of the West of Scotland - Paisley Campus, Paisley PA1 2BE, UK, City, Paisley, PA1 2BE, UNITED KINGDOM OF GREAT BRITAIN AND NORTHERN IRELAND.

Cancer grade classification is a challenging task identified from the cell structure of healthy and abnormal tissues. The partitioner learns about the malignant cell through the grading and plans the treatment strategy accordingly. A major portion of researchers used DL models for grade classification.

View Article and Find Full Text PDF

This paper systematically evaluates saliency methods as explainability tools for convolutional neural networks trained to diagnose glaucoma using simplified eye fundus images that contain only disc and cup outlines. These simplified images, a methodological novelty, were used to relate features highlighted in the saliency maps to the geometrical clues that experts consider in glaucoma diagnosis. Despite their simplicity, these images retained sufficient information for accurate classification, with balanced accuracies ranging from 0.

View Article and Find Full Text PDF

Mild cognitive impairment (MCI) is a significant predictor of the early progression of Alzheimer's disease, and it can be used as an important indicator of disease progression. However, many existing methods focus mainly on the image itself when processing brain imaging data, ignoring other non-imaging data (e.g.

View Article and Find Full Text PDF

Visibility, Physical Work Environment, and Stress in ICU Nurses.

J Nurs Adm

December 2024

Author Affiliations: Research Associate (Dr Keys), The Center for Health Design, Concord, California; National Senior Director (Dr Fineout-Overholt), Evidence-Based Practice and Implementation Science, at Ascension in St. Louis, MO.

Objective: Relationships among coworker and patient visibility, reactions to physical work environment, and work stress in ICU nurses are explored.

Background: Millions of dollars are invested annually in the building or remodeling of ICUs, yet there is a gap in understanding relationships between the physical layout of nursing units and work stress.

Methods: Using a cross-sectional, correlational, exploratory, predictive design, relationships among variables were studied in a diverse sample of ICU nurses.

View Article and Find Full Text PDF

Learning the language of antibody hypervariability.

Proc Natl Acad Sci U S A

January 2025

Computer Science and Artificial Intelligence Laboratory, Massachusetts Institute of Technology, Cambridge, MA 02139.

Protein language models (PLMs) have demonstrated impressive success in modeling proteins. However, general-purpose "foundational" PLMs have limited performance in modeling antibodies due to the latter's hypervariable regions, which do not conform to the evolutionary conservation principles that such models rely on. In this study, we propose a transfer learning framework called Antibody Mutagenesis-Augmented Processing (AbMAP), which fine-tunes foundational models for antibody-sequence inputs by supervising on antibody structure and binding specificity examples.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!