Efficient Transformer-Based Compressed Video Modeling via Informative Patch Selection.

Sensors (Basel)

Department of Electronics and Electrical Engineering, Faculty of Science and Technology, Keio University, 3-14-1, Hiyoshi, Kohoku-ku, Yokohama 223-8522, Kanagawa, Japan.

Published: December 2022

Recently, Transformer-based video recognition models have achieved state-of-the-art results on major video recognition benchmarks. However, their high inference cost significantly limits research speed and practical use. In video compression, methods considering small motions and residuals that are less informative and assigning short code lengths to them (e.g., MPEG4) have successfully reduced the redundancy of videos. Inspired by this idea, we propose Informative Patch Selection (IPS), which efficiently reduces the inference cost by excluding redundant patches from the input of the Transformer-based video model. The redundancy of each patch is calculated from motions and residuals obtained while decoding a compressed video. The proposed method is simple and effective in that it can dynamically reduce the inference cost depending on the input without any policy model or additional loss term. Extensive experiments on action recognition demonstrated that our method could significantly improve the trade-off between the accuracy and inference cost of the Transformer-based video model. Although the method does not require any policy model or additional loss term, its performance approaches that of existing methods that do require them.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9823838PMC
http://dx.doi.org/10.3390/s23010244DOI Listing

Publication Analysis

Top Keywords

inference cost
16
transformer-based video
12
compressed video
8
informative patch
8
patch selection
8
video recognition
8
motions residuals
8
video model
8
policy model
8
model additional
8

Similar Publications

Background: In infected hosts, immune responses trigger a systemic energy reallocation away from energy storage and growth, to fuel a costly defense program. The exact energy costs of immune defense are however unknown in general. Life history theory predicts that such costs underpin trade-offs between host disease resistance and other fitness related traits, yet this has been seldom assessed.

View Article and Find Full Text PDF

Background: Hospital regionalization involves balancing hospital volume and travel time. We investigated how hospital volume and travel time affect perinatal mortality and the risk of delivery in transit using three different study designs.

Methods: This nationwide cohort study used data from the Medical Birth Registry of Norway (1999-2016) and Statistics Norway.

View Article and Find Full Text PDF

Introduction: Weeds are a major factor affecting crop yield and quality. Accurate identification and localization of crops and weeds are essential for achieving automated weed management in precision agriculture, especially given the challenges in recognition accuracy and real-time processing in complex field environments. To address this issue, this paper proposes an efficient crop-weed segmentation model based on an improved UNet architecture and attention mechanisms to enhance both recognition accuracy and processing speed.

View Article and Find Full Text PDF

Typhoon localization detection algorithm based on TGE-YOLO.

Sci Rep

January 2025

College of Mathematics and Computer Science, Guangdong Ocean University, Zhanjiang, 524088, China.

To address the problems of complex cloud features in satellite cloud maps, inaccurate typhoon localization, and poor target detection accuracy, this paper proposes a new typhoon localization algorithm, named TGE-YOLO. It is based on the YOLOv8n model with excellent high-low feature fusion capability and innovatively achieves the organic combination of feature fusion, computational efficiency, and localization accuracy. Firstly, the TFAM_Concat module is creatively designed in the neck network, which comprehensively utilizes the detailed information of shallow features and the semantic information of deeper features, enhancing the fusion ability of features at each layer.

View Article and Find Full Text PDF

Background And Aims: Genome size varies by orders of magnitude across land plants, and the factors driving evolutionary increases and decreases in genome size vary across lineages. Bryophytes have the smallest genomes relative to other land plants and there is growing evidence for frequent whole genome duplication (WGD) across the lineage. However, the broad patterns of genome size, chromosome number, and WGD have yet to be characterized across bryophytes in a phylogenetic context.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!