Interpolated Joint Space Adversarial Training for Robust and Generalizable Defenses.

Chun Pong Lau Jiang Liu Hossein Souri Wei-An Lin Soheil Feizi Rama Chellappa

IEEE Trans Pattern Anal Mach Intell

Published: November 2023

Adversarial training (AT) is considered to be one of the most reliable defenses against adversarial attacks. However, models trained with AT sacrifice standard accuracy and do not generalize well to unseen attacks. Recent works show generalization improvement with adversarial samples under unseen threat models such as on-manifold threat model or neural perceptual threat model. However, the former requires exact manifold information while the latter requires algorithm relaxation. Motivated by these considerations, we propose a novel threat model called Joint Space Threat Model (JSTM), which exploits the underlying manifold information with Normalizing Flow, ensuring that the exact manifold assumption holds. Under JSTM, we develop novel adversarial attacks and defenses. Specifically, we propose the Robust Mixup strategy in which we maximize the adversity of the interpolated images and gain robustness and prevent overfitting. Our experiments show that Interpolated Joint Space Adversarial Training (IJSAT) achieves good performance in standard accuracy, robustness, and generalization. IJSAT is also flexible and can be used as a data augmentation method to improve standard accuracy and combined with many existing AT approaches to improve robustness. We demonstrate the effectiveness of our approach on three benchmark datasets, CIFAR-10/100, OM-ImageNet and CIFAR-10-C.

Download full-text PDF	Source
http://dx.doi.org/10.1109/TPAMI.2023.3286772	DOI Listing

Publication Analysis

Top Keywords

threat model

joint space

adversarial training

standard accuracy

interpolated joint

space adversarial

defenses adversarial

adversarial attacks

exact manifold

adversarial

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!