Deep learning (DL) has been demonstrated to be a valuable tool for analyzing signals such as sounds and images, thanks to its capabilities of automatically extracting relevant patterns as well as its end-to-end training properties. When applied to tabular structured data, DL has exhibited some performance limitations compared to shallow learning techniques. This work presents a novel technique for tabular data called adaptive multiscale attention deep neural network architecture (also named excited attention). By exploiting parallel multilevel feature weighting, the adaptive multiscale attention can successfully learn the feature attention and thus achieve high levels of F1-score on seven different classification tasks (on small, medium, large, and very large datasets) and low mean absolute errors on four regression tasks of different size. In addition, adaptive multiscale attention provides four levels of explainability (i.e., comprehension of its learning process and therefore of its outcomes): 1) calculates attention weights to determine which layers are most important for given classes; 2) shows each feature's attention across all instances; 3) understands learned feature attention for each class to explore feature attention and behavior for specific classes; and 4) finds nonlinear correlations between co-behaving features to reduce dataset dimensionality and improve interpretability. These interpretability levels, in turn, allow for employing adaptive multiscale attention as a useful tool for feature ranking and feature selection.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2024.3392355DOI Listing

Publication Analysis

Top Keywords

adaptive multiscale
20
multiscale attention
20
feature attention
12
attention
11
attention deep
8
deep neural
8
neural network
8
tabular data
8
feature
6
multiscale
5

Similar Publications

As combination therapy becomes more common in clinical applications, predicting adverse effects of combination medications is a challenging task. However, there are three limitations of the existing prediction models. First, they rely on a single view of the drug and cannot fully utilize multiview information, resulting in limited performance when capturing complex structures.

View Article and Find Full Text PDF

Coal-gangue recognition technology plays an important role in the intelligent realization of integrated working faces and coal quality improvement. However, the existing methods are easily affected by high dust, noise, and other disturbances, resulting in unstable recognition results that make it difficult to meet the needs of industrial applications. To realize accurate recognition of coal-gangue in noisy environments, this paper proposes an end-to-end multi-scale feature fusion convolutional neural network (MCNN-BILSTM) based gangue recognition method, which can automatically learn and fuse complementary information from multiple signal components of vibration signals.

View Article and Find Full Text PDF

ChromatinHD connects single-cell DNA accessibility and conformation to gene expression through scale-adaptive machine learning.

Nat Commun

January 2025

Laboratory of Systems Biology and Genetics, Institute of Bio-engineering and Global Health Institute, School of Life Sciences, Swiss Federal Institute of Technology (EPFL), Lausanne, Switzerland.

Gene regulation is inherently multiscale, but scale-adaptive machine learning methods that fully exploit this property in single-nucleus accessibility data are still lacking. Here, we develop ChromatinHD, a pair of scale-adaptive models that uses the raw accessibility data, without peak-calling or windows, to link regions to gene expression and determine differentially accessible chromatin. We show how ChromatinHD consistently outperforms existing peak and window-based approaches and find that this is due to a large number of uniquely captured, functional accessibility changes within and outside of putative cis-regulatory regions.

View Article and Find Full Text PDF

Bone tissue is a biological composite material with a complex hierarchical structure that could continuously adjust its internal structure to adapt to the alterations in the external load environment. The fluid flow within bone is the main route of osteocyte metabolism, and the pore pressure as well as the fluid shear stress generated by it are important mechanical stimuli perceived by osteocytes. Owing to the irregular multiscale structure of bone tissue, the fluid stimulation that lacunar-canalicular network (LCN) in different regions of the tissue underwent remained unclear.

View Article and Find Full Text PDF

Multimodal sleep staging network based on obstructive sleep apnea.

Front Comput Neurosci

December 2024

School of Electrical and Electronic Engineering, Chongqing University of Technology, Chongqing, China.

Background: Automatic sleep staging is essential for assessing sleep quality and diagnosing sleep disorders. While previous research has achieved high classification performance, most current sleep staging networks have only been validated in healthy populations, ignoring the impact of Obstructive Sleep Apnea (OSA) on sleep stage classification. In addition, it remains challenging to effectively improve the fine-grained detection of polysomnography (PSG) and capture multi-scale transitions between sleep stages.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!