Video compression is indispensable to most video analysis systems. Although it saves transmission bandwidth, it also degrades downstream video understanding tasks, especially at low bitrates. To investigate this problem systematically, we first review previous methods and find that three principles, i.e., task-decoupled, label-free, and data-emerged semantic prior, are critical to a machine-friendly coding framework but have not been fully satisfied so far. In this paper, we propose a traditional-neural mixed coding framework that fulfills all three principles simultaneously by taking advantage of both traditional codecs and neural networks (NNs). On one hand, traditional codecs can efficiently encode the pixel signal of videos but may distort the semantic information. On the other hand, highly non-linear NNs are proficient at condensing video semantics into a compact representation. The framework is optimized by ensuring that a transmission-efficient semantic representation of the video is preserved through the coding procedure; this representation is learned from unlabeled data in a self-supervised manner. The videos decoded jointly from the two streams (codec and NN) are semantically rich and visually photo-realistic, and they empirically improve performance on several mainstream downstream video analysis tasks without any post-adaptation procedure. Furthermore, introducing an attention mechanism and an adaptive modeling scheme further enhances the video semantic modeling ability of our approach. Finally, we build a low-bitrate video understanding benchmark with three downstream tasks on eight datasets, demonstrating the notable superiority of our approach. All code, data, and models will be open-sourced to facilitate future research.

Source
http://dx.doi.org/10.1109/TPAMI.2024.3367879
