Robust Differentiable SVD.

IEEE Trans Pattern Anal Mach Intell

Published: September 2022

Eigendecomposition of symmetric matrices is at the heart of many computer vision algorithms. However, the derivatives of the eigenvectors tend to be numerically unstable, whether using the SVD to compute them analytically or using the Power Iteration (PI) method to approximate them. This instability arises in the presence of eigenvalues that are close to each other. This makes integrating eigendecomposition into deep networks difficult and often results in poor convergence, particularly when dealing with large matrices. While this can be mitigated by partitioning the data into small arbitrary groups, doing so has no theoretical basis and makes it impossible to exploit the full power of eigendecomposition. In previous work, we mitigated this using SVD during the forward pass and PI to compute the gradients during the backward pass. However, the iterative deflation procedure required to compute multiple eigenvectors using PI tends to accumulate errors and yield inaccurate gradients. Here, we show that the Taylor expansion of the SVD gradient is theoretically equivalent to the gradient obtained using PI without relying in practice on an iterative process and thus yields more accurate gradients. We demonstrate the benefits of this increased accuracy for image classification and style transfer.
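The instability described above can be illustrated with a small numerical sketch. It assumes, as the abstract does not spell out, that the problematic quantity is the factor 1/(λ_i − λ_j) in the analytic eigenvector gradient, and that the Taylor expansion replaces it with a truncated geometric series in λ_j/λ_i for λ_i > λ_j ≥ 0. The function names and the truncation order are hypothetical choices for illustration, not the authors' implementation.

```python
import numpy as np


def exact_coefficient(lam_i: float, lam_j: float) -> float:
    """Exact term 1 / (lam_i - lam_j); it diverges as the eigenvalues approach each other."""
    return 1.0 / (lam_i - lam_j)


def taylor_coefficient(lam_i: float, lam_j: float, order: int = 9) -> float:
    """Truncated series for 1 / (lam_i - lam_j), assuming lam_i > lam_j >= 0.

    1 / (lam_i - lam_j) = (1 / lam_i) * 1 / (1 - q), with q = lam_j / lam_i,
    and 1 / (1 - q) is replaced by 1 + q + ... + q**order.
    The result can never exceed (order + 1) / lam_i.
    """
    q = lam_j / lam_i
    powers = q ** np.arange(order + 1)
    return float(powers.sum() / lam_i)


# Illustrative eigenvalue pairs (assumed values, not taken from the paper).
well_separated = (4.0, 1.0)
nearly_equal = (1.0, 1.0 - 1e-9)

for lam_i, lam_j in (well_separated, nearly_equal):
    print(lam_i, lam_j,
          exact_coefficient(lam_i, lam_j),
          taylor_coefficient(lam_i, lam_j))
```

For well-separated eigenvalues the two coefficients nearly agree, while for nearly equal eigenvalues the exact term grows without bound and the truncated series stays bounded. That bound is what keeps the gradient finite in this sketch, at the cost of approximating the true gradient when the eigenvalues are close.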

Source
http://dx.doi.org/10.1109/TPAMI.2021.3072422

Similar Publications

This research introduces an innovative approach to optimal control for a class of linear systems with input saturation. It leverages the synergy of Takagi-Sugeno (T-S) fuzzy models and reinforcement learning (RL) techniques. To enhance interpretability and analytical accessibility, our approach applies T-S models to approximate the value function and generate optimal control laws while incorporating prior knowledge.

Ensuring the health, safety, and well-being of household pets such as cats has become challenging in recent years. To estimate a cat's behavior, objective observations of both the frequency and variability of specific behavior traits are required, which can be difficult to obtain in a cat's ordinary life. There is very little research on cat activity and cat disease analysis based on real-time data.

Generative modeling of the Circle of Willis using 3D-StyleGAN.

Neuroimage

December 2024

CLAIM - Charité Lab for Artificial Intelligence in Medicine, Charité Universitätsmedizin Berlin, Germany; Department of Neurosurgery, Charité Universitätsmedizin Berlin, Germany.

The circle of Willis (CoW) is a network of cerebral arteries with significant inter-individual anatomical variations. Deep learning has been used to characterize and quantify the status of the CoW in various applications for the diagnosis and treatment of cerebrovascular disease. In medical imaging, the performance of deep learning models is limited by the diversity and size of training datasets.

This study investigated the categorical perception (CP) of linguistic pitch (lexical tones) and nonlinguistic pitch (pure tones), as well as tonal production in Mandarin-speaking children with autism spectrum disorders (ASD). A total of 26 Mandarin-speaking children with ASD and 29 age-matched typically developing (TD) children were recruited for this study. The Mandarin T2-T3 contrast and corresponding pure tones with identical pitch contours were adopted to assess the nuanced pitch processing abilities of the child participants via the CP paradigm.

Purpose: The complex signal decay during the transient FLASH MRI readout can lead to artifacts in magnitude and phase images. We show that target-driven optimization of individual RF flip angles and phases can realize near-ideal signal behavior and mitigate artifacts.

Methods: The differentiable end-to-end optimization framework MR-zero is used to optimize RF trains of the FLASH sequence.

