Text that appears in a scene or is graphically added to video can provide an important supplemental source of index information as well as clues for decoding the video's structure and for classification. In this work, we present algorithms for detecting and tracking text in digital video. Our system implements a scale-space feature extractor that feeds an artificial neural processor to detect text blocks. Our text tracking scheme consists of two modules: a sum of squared difference (SSD)-based module to find the initial position and a contour-based module to refine the position. Experiments conducted with a variety of video sources show that our scheme can detect and track text robustly.
Download full-text PDF |
Source |
---|---|
http://dx.doi.org/10.1109/83.817607 | DOI Listing |
Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!