Video anomaly detection (VAD) aims at localizing the snippets containing anomalous events in long unconstrained videos. The weakly supervised (WS) setting, where solely video-level labels are available during training, has attracted considerable attention, owing to its satisfactory trade-off between the detection performance and annotation cost. However, due to lack of snippet-level dense labels, the existing WS-VAD methods still get easily stuck on the detection errors, caused by false alarms and incomplete localization. To address this dilemma, in this paper, we propose to inject text clues of anomaly-event categories for improving WS-VAD, via a dedicated dual-branch framework. For suppressing the response of confusing normal contexts, we first present a text-guided anomaly discovering (TAG) branch based on a hierarchical matching scheme, which utilizes the label-text queries to search the discriminative anomalous snippets in a global-to-local fashion. To facilitate the completeness of anomaly-instance localization, an anomaly-conditioned text completion (ATC) branch is further designed to perform an auxiliary generative task, which intrinsically forces the model to gather sufficient event semantics from all the relevant anomalous snippets for completely reconstructing the masked description sentence. Furthermore, to encourage the cross-branch knowledge sharing, a mutual learning strategy is introduced by imposing a consistency constraint on the anomaly scores of these two branches. Extensive experimental results on two public benchmarks validate that the proposed method achieves superior performance over the competing methods.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/TIP.2024.3477351 | DOI Listing |
Neural Netw
January 2025
College of Intelligence and Computing, Tianjin University, Tianjin, 300350, China; Tianjin Key Laboratory of Cognitive Computing and Application, Tianjin, 300350, China; Tianjin Key Laboratory of Advanced Networking, Tianjin, 300350, China. Electronic address:
Distant supervision aligns the unstructured text to the knowledge base, thereby enabling automatic machine annotation. Nevertheless, this inevitably introduces a considerable amount of noise. Distant supervised relation extraction models aggregate all sentences sharing the same entity pairs into bags and employ various attention mechanisms to reduce the impact of noisy instances.
View Article and Find Full Text PDFBMC Public Health
December 2024
Department of Hospital Infection Control, Tongji Hospital, School of Medicine, Tongji University, Shanghai, 200065, China.
Am J Chin Med
December 2024
College of Pharmacy, Shaanxi University of Chinese Medicine, Xianyang 712046, P. R. China.
Irritable bowel syndrome (IBS) is the functional gastrointestinal disorder, characterized by abdominal pain and altered bowel habits. The interest in intestinal immune activation as a potential disease mechanism for IBS has increased exponentially in recent years. This study was designed to summarize the Chinese herbal medicine (CHM) that potentially exert protective effects against IBS through inhibition of intestinal immune activation.
View Article and Find Full Text PDFAm J Clin Dermatol
January 2025
Department of Dermatology, Wayne State University, 5250 Auto Club Dr, Suite 290A, Dearborn, MI, 48126, USA.
Secondary hyperhidrosis is a multifactorial condition that poses unique diagnostic and management challenges. Distinguishing secondary from primary hyperhidrosis remains difficult due to overlapping symptoms. This review consolidates existing evidence on the numerous underlying causes and pathophysiologic mechanisms of secondary hyperhidrosis across various disciplines.
View Article and Find Full Text PDFPharmacoecon Open
January 2025
National Pharmaceutical Council, Washington, D.C., USA.
Background: The US Centers for Medicare and Medicaid Services (CMS) held patient-focused listening sessions in Fall 2023 for each of the first ten drugs selected for the Inflation Reduction Act's (IRA) Drug Price Negotiation Program (DPNP). This study aimed to quantitatively describe speaker input at the sessions, including the absolute and relative time allocated to key areas of interest for the DPNP.
Methods: In this descriptive analysis of speaker remarks from ten CMS-hosted patient-focused listening sessions, speaker demographics were examined using video streams, time-stamped transcripts, public data, contextual clues, validated tools, and visual assessment when necessary.
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!