Robust Differentiable SVD.

Wei Wang Zheng Dang Yinlin Hu Pascal Fua Mathieu Salzmann

IEEE Trans Pattern Anal Mach Intell

Published: September 2022

Eigendecomposition of symmetric matrices is at the heart of many computer vision algorithms. However, the derivatives of the eigenvectors tend to be numerically unstable, whether using the SVD to compute them analytically or using the Power Iteration (PI) method to approximate them. This instability arises in the presence of eigenvalues that are close to each other. This makes integrating eigendecomposition into deep networks difficult and often results in poor convergence, particularly when dealing with large matrices. While this can be mitigated by partitioning the data into small arbitrary groups, doing so has no theoretical basis and makes it impossible to exploit the full power of eigendecomposition. In previous work, we mitigated this using SVD during the forward pass and PI to compute the gradients during the backward pass. However, the iterative deflation procedure required to compute multiple eigenvectors using PI tends to accumulate errors and yield inaccurate gradients. Here, we show that the Taylor expansion of the SVD gradient is theoretically equivalent to the gradient obtained using PI without relying in practice on an iterative process and thus yields more accurate gradients. We demonstrate the benefits of this increased accuracy for image classification and style transfer.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TPAMI.2021.3072422	DOI Listing

Publication Analysis

Top Keywords

robust differentiable

svd

differentiable svd

svd eigendecomposition

eigendecomposition symmetric

symmetric matrices

matrices heart

heart computer

computer vision

vision algorithms

Similar Publications

Fuzzy reinforcement learning based control of linear systems with input saturation.

ISA Trans

January 2025

Toronto Metropolitan University, Toronto, Canada. Electronic address:

Kainan Liu Xiaojun Ban Shengkun Xie

This research introduces an innovative approach to optimal control for a class of linear systems with input saturation. It leverages the synergy of Takagi-Sugeno (T-S) fuzzy models and reinforcement learning (RL) techniques. To enhance interpretability and analytical accessibility, our approach applies T-S models to approximate the value function and generate optimal control laws while incorporating prior knowledge.

View Article and Find Full Text PDF

Similar Publications

Automated Pipeline for Robust Cat Activity Detection Based on Deep Learning and Wearable Sensor Data.

Sensors (Basel)

November 2024

Institute of Digital Anti-Aging Healthcare, Inje University, Gimhae 50834, Republic of Korea.

Md Ariful Islam Mozumder Tagne Poupi Theodore Armand Rashadul Islam Sumon Shah Muhammad Imtiyaj Uddin Hee-Cheol Kim

The health, safety, and well-being of household pets such as cats has become a challenging task in previous years. To estimate a cat's behavior, objective observations of both the frequency and variability of specific behavior traits are required, which might be difficult to come by in a cat's ordinary life. There is very little research on cat activity and cat disease analysis based on real-time data.

View Article and Find Full Text PDF

Similar Publications

Generative modeling of the Circle of Willis using 3D-StyleGAN.

Neuroimage

December 2024

CLAIM - Charité Lab for Artificial Intelligence in Medicine, Charité Universitätsmedizin Berlin, Germany; Department of Neurosurgery, Charité Universitätsmedizin Berlin, Germany. Electronic address:

Orhun Utku Aydin Adam Hilbert Alexander Koch Felix Lohrke Jana Rieger

The circle of Willis (CoW) is a network of cerebral arteries with significant inter-individual anatomical variations. Deep learning has been used to characterize and quantify the status of the CoW in various applications for the diagnosis and treatment of cerebrovascular disease. In medical imaging, the performance of deep learning models is limited by the diversity and size of training datasets.

View Article and Find Full Text PDF

Similar Publications

Perception and Production of Pitch Information in Mandarin-Speaking Children with Autism Spectrum Disorders.

J Autism Dev Disord

November 2024

School of Foreign Languages and Literature, Shandong University, Jinan, China.

Wen Ma Xuequn Dai Hao Zhang

This study investigated the categorical perception (CP) of linguistic pitch (lexical tones) and nonlinguistic pitch (pure tones), as well as tonal production in Mandarin-speaking children with autism spectrum disorders (ASD). A total of 26 Mandarin-speaking children with ASD and 29 age-matched typically developing (TD) children were recruited for this study. The Mandarin T2-T3 contrast and corresponding pure tones with identical pitch contours were adopted to assess the nuanced pitch processing abilities of the child participants via the CP paradigm.

View Article and Find Full Text PDF

Similar Publications

MR-zero meets FLASH - controlling the transient signal decay in gradient- and RF-spoiled gradient echo sequences.

Magn Reson Med

March 2025

Institute of Neuroradiology, Uniklinikum Erlangen, Friedrich-Alexander-Universität Erlangen-Nürnberg (FAU), Erlangen, Germany.

Simon Weinmüller Jonathan Endres Nam Dang Rudolf Stollberger Moritz Zaiss

Purpose: The complex signal decay during the transient FLASH MRI readout can lead to artifacts in magnitude and phase images. We show that target-driven optimization of individual RF flip angles and phases can realize near-ideal signal behavior and mitigate artifacts.

Methods: The differentiable end-to-end optimization framework MR-zero is used to optimize RF trains of the FLASH sequence.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!