Publications by Linger Deng

Publications by authors named "Linger Deng"

VimTS: A Unified Video and Image Text Spotter for Enhancing the Cross-Domain Generalization.

Yuliang Liu, Mingxin Huang, Hao Yan, Linger Deng, Weijia Wu

IEEE Trans Pattern Anal Mach Intell

April 2025

Text spotting, the task of extracting textual information from images or video sequences, faces challenges in cross-domain adaptation, such as image-to-image and image-to-video generalization. In this paper, we introduce a new method, termed VimTS, which enhances the generalization ability of the model by achieving better synergy among different tasks. Specifically, we propose a Prompt Queries Generation Module and a Tasks-aware Adapter to effectively convert the original single-task model into a multi-task model suitable for both image and video scenarios with minimal additional parameters.
