Application-Dedicated Selection of Filters (ADSF) using covariance maximization and orthogonal projection.

Anal Chim Acta

Irstea, UMR ITAP, 361 rue J-F Breton BP 5095, 34196 Montpellier Cedex 5, France.

Published: May 2016

Visible and near-infrared (Vis-NIR) spectra are generated by the combination of numerous low resolution features. Spectral variables are thus highly correlated, which can cause problems for selecting the most appropriate ones for a given application. Some decomposition bases such as Fourier or wavelet generally help highlighting spectral features that are important, but are by nature constraint to have both positive and negative components. Thus, in addition to complicating the selected features interpretability, it impedes their use for application-dedicated sensors. In this paper we have proposed a new method for feature selection: Application-Dedicated Selection of Filters (ADSF). This method relaxes the shape constraint by enabling the selection of any type of user defined custom features. By considering only relevant features, based on the underlying nature of the data, high regularization of the final model can be obtained, even in the small sample size context often encountered in spectroscopic applications. For larger scale deployment of application-dedicated sensors, these predefined feature constraints can lead to application specific optical filters, e.g., lowpass, highpass, bandpass or bandstop filters with positive only coefficients. In a similar fashion to Partial Least Squares, ADSF successively selects features using covariance maximization and deflates their influences using orthogonal projection in order to optimally tune the selection to the data with limited redundancy. ADSF is well suited for spectroscopic data as it can deal with large numbers of highly correlated variables in supervised learning, even with many correlated responses.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.aca.2016.04.004DOI Listing

Publication Analysis

Top Keywords

application-dedicated selection
8
selection filters
8
filters adsf
8
covariance maximization
8
orthogonal projection
8
highly correlated
8
application-dedicated sensors
8
features
6
application-dedicated
4
filters
4

Similar Publications

Summary: Unsupervised deconvolution algorithms are often used to estimate cell composition from bulk tissue samples. However, applying cell-type deconvolution and interpreting the results remain a challenge, even more without prior training in bioinformatics. Here, we propose a tool for estimating and identifying cell type composition from bulk transcriptomes or methylomes.

View Article and Find Full Text PDF

Objectives: to describe Open Data Covid, an online application dedicated to the pandemic and the health of the population of the province of L'Aquila (Abruzzo Region, Southern Italy), created following the health emergency in Italy and worldwide.

Design: Open Data Covid is the result of a multidisciplinary study group including University of L'Aquila, Local Health Unit 1 Abruzzo, and Gran Sasso Science Institute. In the first phase, the information to be shown was identified and made available based on the pandemic national reports to obtain comparable results.

View Article and Find Full Text PDF

Visible and near-infrared (Vis-NIR) spectra are generated by the combination of numerous low resolution features. Spectral variables are thus highly correlated, which can cause problems for selecting the most appropriate ones for a given application. Some decomposition bases such as Fourier or wavelet generally help highlighting spectral features that are important, but are by nature constraint to have both positive and negative components.

View Article and Find Full Text PDF

Pattern extraction in interictal EEG recordings towards detection of electrodes leading to seizures.

Biomed Sci Instrum

July 2006

Department of Electrical and Computer Engineering, Florida International University, 10555 W. Flagler Street, Miami, FL 33174, USA.

This study introduces an algorithm for a new application dedicated at discriminating between electrodes leading to a seizure onset and those that do not lead to seizure using interictal subdural EEG data. The significance of this study is in determining among all of these channels, all containing interictal spikes that are asynchronously, independent of region and time, which are selected randomly (these EEG portions may or may not contain spikes), and yet through the developed algorithm, we are able to classify those channels that lead to seizure and those that do not. The main zones of ictal activity are supposed to evolve from the tissue located at the channels that present interictal activity, but sometimes this is no the case.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!