Orthogonal Gated Recurrent Unit With Neumann-Cayley Transformation.

Neural Comput

Department of Mathematics, University of Kentucky, Lexington, KY 40506, U.S.A.

Published: November 2024

In recent years, orthogonal matrices have been shown to be a promising way to improve the training, stability, and convergence of recurrent neural networks (RNNs), particularly by controlling gradients. While gated recurrent unit (GRU) and long short-term memory (LSTM) architectures address the vanishing gradient problem through gates and memory cells, they remain prone to the exploding gradient problem. In this work, we analyze the gradients in GRU and propose using orthogonal matrices to prevent exploding gradients and enhance long-term memory. We study where to use orthogonal matrices and propose a Neumann series-based scaled Cayley transformation for training orthogonal matrices in GRU; we call the resulting model the Neumann-Cayley orthogonal GRU (NC-GRU). We present detailed experiments on several synthetic and real-world tasks, which show that NC-GRU significantly outperforms GRU and several other RNNs.
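The abstract names the technique but does not give its formula. As a minimal sketch, assume the commonly used scaled Cayley form W = (I + A)^{-1}(I - A)D, with A skew-symmetric and D a diagonal matrix of ±1 entries, and assume the inverse is approximated by a truncated Neumann series, (I + A)^{-1} ≈ Σ_{k=0}^{K} (-A)^k, which converges only when the spectral norm of A is below 1. The function name `scaled_cayley_neumann`, the truncation order, and the test matrices are illustrative choices and are not taken from the paper.

```python
import numpy as np

def scaled_cayley_neumann(A, D, order=20):
    """Approximate W = (I + A)^{-1} (I - A) D with a truncated Neumann series.

    Assumes A is skew-symmetric with spectral norm < 1 (needed for the
    series to converge) and D is diagonal with +1/-1 entries.
    """
    n = A.shape[0]
    identity = np.eye(n)
    inv_approx = np.zeros_like(A)
    term = np.eye(n)                 # holds (-A)^k, starting at k = 0
    for _ in range(order + 1):
        inv_approx += term
        term = term @ (-A)
    return inv_approx @ (identity - A) @ D

# Illustrative inputs (hypothetical, chosen only for the demonstration).
rng = np.random.default_rng(0)
n = 4
M = rng.standard_normal((n, n))
A = M - M.T                          # skew-symmetric by construction
A *= 0.3 / np.linalg.norm(A, 2)      # rescale so the spectral norm is 0.3
D = np.diag([1.0, 1.0, -1.0, -1.0])

W = scaled_cayley_neumann(A, D, order=40)

# Compare against the exact scaled Cayley transform computed with a linear solve.
W_exact = np.linalg.solve(np.eye(n) + A, (np.eye(n) - A) @ D)
approx_error = np.linalg.norm(W - W_exact)
orth_error = np.linalg.norm(W.T @ W - np.eye(n))
print(approx_error, orth_error)
```

Under these assumptions, the appeal of the series is that it replaces a matrix inverse with repeated matrix products. The cost is that W is only approximately orthogonal, with a truncation error that shrinks geometrically in the order as long as the norm of A stays below 1.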

Source
http://dx.doi.org/10.1162/neco_a_01710
