Despite the prevailing transition from single-task to multi-task approaches in video anomaly detection, we observe that many of these approaches adopt sub-optimal frameworks for their individual proxy tasks. Motivated by this, we contend that optimizing single-task frameworks can advance both single- and multi-task approaches. Accordingly, we adopt middle-frame prediction as the primary proxy task and introduce a hybrid framework designed to generate accurate predictions for normal frames and flawed predictions for abnormal frames. This hybrid framework is built upon a bi-directional structure that integrates vision transformers and ConvLSTMs. Specifically, the bi-directional structure analyzes the temporal dimension by predicting frames in both the forward and backward directions, which significantly improves detection stability. Given the transformer's capacity to model long-range contextual dependencies, we develop a convolutional temporal transformer that efficiently associates feature maps from all context frames to generate attention-based predictions for target frames. Furthermore, we devise a layer-interactive ConvLSTM bridge that lets low-level features flow across layers and time-steps, thereby strengthening predictions with fine details. Anomalies are ultimately identified by measuring the discrepancies between target frames and their corresponding predictions. Experiments conducted on public benchmarks affirm the efficacy of our hybrid framework, whether used as a standalone single-task approach or integrated as a branch in a multi-task approach. These experiments also underscore the advantages of combining vision transformers and ConvLSTMs for video anomaly detection. The implementation of our hybrid framework is available at https://github.com/SHENGUODONG19951126/ConvTTrans-ConvLSTM.
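
The abstract says that anomalies are identified from the discrepancy between each target frame and its prediction, but it does not say which discrepancy measure is used. The sketch below illustrates one common choice in prediction-based video anomaly detection, assumed here rather than taken from the paper: compute the PSNR between each target frame and its prediction, then min-max normalize over the video so that poorly predicted frames receive scores near 1. The function names `psnr` and `anomaly_scores` and the `max_val` parameter are hypothetical and are not drawn from the authors' repository.

```python
import numpy as np


def psnr(target, prediction, max_val=1.0):
    """Peak signal-to-noise ratio (dB) between a target frame and its prediction."""
    diff = target.astype(np.float64) - prediction.astype(np.float64)
    mse = np.mean(diff ** 2)
    mse = max(mse, 1e-12)  # guard against log of zero for a perfect prediction
    return 10.0 * np.log10(max_val ** 2 / mse)


def anomaly_scores(targets, predictions, max_val=1.0):
    """Per-frame anomaly scores in [0, 1] for one video.

    Assumption (not from the paper): PSNR values are min-max normalized
    over the video and inverted, so a low PSNR maps to a high score.
    """
    psnrs = np.array([psnr(t, p, max_val) for t, p in zip(targets, predictions)])
    span = psnrs.max() - psnrs.min()
    if span == 0:
        # Every frame is predicted equally well; nothing stands out.
        return np.zeros_like(psnrs)
    normality = (psnrs - psnrs.min()) / span
    return 1.0 - normality
```

Given arrays of target frames and predicted frames with pixel values in [0, 1], `anomaly_scores(targets, predictions)` returns one score per frame, and frames whose score exceeds a chosen threshold would be flagged as abnormal. How the forward and backward predictions are combined before scoring is not described in the abstract, so that step is left out of this sketch.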


Source: http://dx.doi.org/10.1109/TIP.2024.3512369


