AI Article Synopsis

  • Video anomaly detection (VAD) is vital for smart surveillance, but scene-dependent anomalies have been overlooked, and the related task of video anomaly anticipation (VAA) needs more attention.
  • To address these gaps, the researchers built the NWPU Campus dataset, the largest semi-supervised VAD dataset and the first dataset for scene-dependent VAD and VAA.
  • They propose a forward-backward framework built on a scene-dependent generative model in latent space: a hierarchical variational auto-encoder extracts scene-generic features, and a score-based diffusion model refines them and generates scene-dependent features. The method achieves state-of-the-art performance on the ShanghaiTech, CUHK Avenue, and NWPU Campus datasets.

Article Abstract

Video anomaly detection (VAD) plays a crucial role in intelligent surveillance. However, an essential type of anomaly, the scene-dependent anomaly, has been overlooked. Moreover, the task of video anomaly anticipation (VAA) also deserves attention. To fill these gaps, we build a comprehensive dataset named NWPU Campus, which is the largest semi-supervised VAD dataset and the first dataset for scene-dependent VAD and VAA. We also introduce a novel forward-backward framework for scene-dependent VAD and VAA, in which the forward network solves VAD on its own and solves VAA jointly with the backward network. In particular, we propose a scene-dependent generative model in latent space for the forward and backward networks. First, we propose a hierarchical variational auto-encoder to extract scene-generic features. Next, we design a score-based diffusion model in latent space that refines these features into a more compact form for the task and, together with a scene information auto-encoder, generates scene-dependent features, modeling the relationships between video events and scenes. Finally, we develop a temporal loss from key frames to constrain the motion consistency of video clips. Extensive experiments demonstrate that our method handles both scene-dependent anomaly detection and anticipation well, achieving state-of-the-art performance on the ShanghaiTech, CUHK Avenue, and proposed NWPU Campus datasets.
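
The notion of a scene-dependent anomaly can be made concrete with a toy example. The sketch below is not the paper's generative model; it is a minimal illustration, assuming each event is reduced to a single hypothetical scalar feature (say, object speed) and that each scene's normal behavior is summarized by a mean and standard deviation fitted on normal-only training data, as in the semi-supervised setting. The names `fit_scene_models` and `anomaly_score` and the scene labels are invented for illustration.

```python
from collections import defaultdict
from statistics import mean, pstdev


def fit_scene_models(normal_samples):
    """Fit one (mean, std) pair per scene from normal-only samples.

    normal_samples: iterable of (scene_id, feature) pairs, where feature
    is a hypothetical scalar such as object speed.
    """
    by_scene = defaultdict(list)
    for scene_id, feature in normal_samples:
        by_scene[scene_id].append(feature)
    # Floor the standard deviation so the score never divides by zero.
    return {s: (mean(v), max(pstdev(v), 1e-6)) for s, v in by_scene.items()}


def anomaly_score(models, scene_id, feature):
    """Distance from the scene's normal mean, in standard deviations."""
    mu, sigma = models[scene_id]
    return abs(feature - mu) / sigma


# Hypothetical training data: fast motion is normal on the road,
# slow motion is normal on the square.
train = [
    ("road", 5.0), ("road", 6.0), ("road", 7.0),
    ("square", 1.0), ("square", 1.2), ("square", 0.8),
]
models = fit_scene_models(train)

# The same event (speed 6.0) is scored against each scene's own model.
road_score = anomaly_score(models, "road", 6.0)
square_score = anomaly_score(models, "square", 6.0)
print(f"road: {road_score:.2f}, square: {square_score:.2f}")
```

The same feature value scores near zero in one scene and very high in the other, which is the behavior a scene-agnostic detector cannot reproduce. The method in the abstract replaces these per-scene statistics with learned scene-dependent features in latent space.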

Source
http://dx.doi.org/10.1109/TPAMI.2024.3461718

Publication Analysis

Top Keywords

latent space: 12
video anomaly: 12
anomaly detection: 12
detection anticipation: 8
scene-dependent anomaly: 8
nwpu campus: 8
scene-dependent vad: 8
vad vaa: 8
model latent: 8
scene-dependent: 7
