Visible and near-infrared (Vis-NIR) spectra are generated by the combination of numerous low resolution features. Spectral variables are thus highly correlated, which can cause problems for selecting the most appropriate ones for a given application. Some decomposition bases such as Fourier or wavelet generally help highlighting spectral features that are important, but are by nature constraint to have both positive and negative components. Thus, in addition to complicating the selected features interpretability, it impedes their use for application-dedicated sensors. In this paper we have proposed a new method for feature selection: Application-Dedicated Selection of Filters (ADSF). This method relaxes the shape constraint by enabling the selection of any type of user defined custom features. By considering only relevant features, based on the underlying nature of the data, high regularization of the final model can be obtained, even in the small sample size context often encountered in spectroscopic applications. For larger scale deployment of application-dedicated sensors, these predefined feature constraints can lead to application specific optical filters, e.g., lowpass, highpass, bandpass or bandstop filters with positive only coefficients. In a similar fashion to Partial Least Squares, ADSF successively selects features using covariance maximization and deflates their influences using orthogonal projection in order to optimally tune the selection to the data with limited redundancy. ADSF is well suited for spectroscopic data as it can deal with large numbers of highly correlated variables in supervised learning, even with many correlated responses.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1016/j.aca.2016.04.004 | DOI Listing |
Bioinform Adv
September 2024
TIMC, UMR 5525, Université Grenoble Alpes, CNRS, Grenoble F-38700, France.
Summary: Unsupervised deconvolution algorithms are often used to estimate cell composition from bulk tissue samples. However, applying cell-type deconvolution and interpreting the results remain a challenge, even more without prior training in bioinformatics. Here, we propose a tool for estimating and identifying cell type composition from bulk transcriptomes or methylomes.
View Article and Find Full Text PDFEpidemiol Prev
July 2023
Objectives: to describe Open Data Covid, an online application dedicated to the pandemic and the health of the population of the province of L'Aquila (Abruzzo Region, Southern Italy), created following the health emergency in Italy and worldwide.
Design: Open Data Covid is the result of a multidisciplinary study group including University of L'Aquila, Local Health Unit 1 Abruzzo, and Gran Sasso Science Institute. In the first phase, the information to be shown was identified and made available based on the pandemic national reports to obtain comparable results.
Anal Chim Acta
May 2016
Irstea, UMR ITAP, 361 rue J-F Breton BP 5095, 34196 Montpellier Cedex 5, France.
Visible and near-infrared (Vis-NIR) spectra are generated by the combination of numerous low resolution features. Spectral variables are thus highly correlated, which can cause problems for selecting the most appropriate ones for a given application. Some decomposition bases such as Fourier or wavelet generally help highlighting spectral features that are important, but are by nature constraint to have both positive and negative components.
View Article and Find Full Text PDFBiomed Sci Instrum
July 2006
Department of Electrical and Computer Engineering, Florida International University, 10555 W. Flagler Street, Miami, FL 33174, USA.
This study introduces an algorithm for a new application dedicated at discriminating between electrodes leading to a seizure onset and those that do not lead to seizure using interictal subdural EEG data. The significance of this study is in determining among all of these channels, all containing interictal spikes that are asynchronously, independent of region and time, which are selected randomly (these EEG portions may or may not contain spikes), and yet through the developed algorithm, we are able to classify those channels that lead to seizure and those that do not. The main zones of ictal activity are supposed to evolve from the tissue located at the channels that present interictal activity, but sometimes this is no the case.
View Article and Find Full Text PDFEnter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!