Text Font Correction and Alignment Method for Scene Text Recognition.

Sensors (Basel)

School of Digital and Intelligent Industry, Inner Mongolia University of Science and Technology, Baotou 014010, China.

Published: December 2024

Text recognition is a rapidly evolving task with practical applications across many industries. It remains challenging, however, because text in natural images can be arranged in arbitrary shapes, rendered in irregular fonts, and partially occluded. To handle images with arbitrary-shape text arrangement and irregular text fonts, we designed the Discriminative Standard Text Font (DSTF) and the Feature Alignment and Complementary Fusion (FACF). To address unintended occlusion of the font, we propose a Dual Attention Serial Module (DASM), which is inserted between residual modules to strengthen the focus on text texture. Together, these components improve recognition by correcting irregular text and aligning the corrected result with the original extracted features, so that the two complement each other in the overall recognition process. To support research on text recognition in natural scenes, we also developed the VBC Chinese dataset, captured under varying lighting conditions, including strong light, weak light, darkness, and other natural environments. Experimental results show that our method achieves competitive performance, with an accuracy of 90.8% on the VBC dataset and an overall average accuracy of 93.8%.
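
As an illustration of what "dual attention applied in series between residual modules" can look like, the following is a minimal sketch and not the authors' DASM. The abstract does not state which two attention mechanisms are used or how they are parameterized, so this sketch assumes a channel gate followed by a spatial gate, both parameter-free (no learned weights). The function names and the stand-in residual block are hypothetical.

```python
import math


def sigmoid(x):
    """Logistic function mapping any real number into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def channel_attention(feat):
    """Scale each channel by a gate derived from its global average.

    feat is a list of C channels, each an H x W list of lists.
    Assumption: a parameter-free gate; a real module would learn weights.
    """
    out = []
    for channel in feat:
        count = sum(len(row) for row in channel)
        mean = sum(sum(row) for row in channel) / count
        gate = sigmoid(mean)
        out.append([[v * gate for v in row] for row in channel])
    return out


def spatial_attention(feat):
    """Scale each spatial position by a gate derived from the channel mean."""
    c = len(feat)
    h, w = len(feat[0]), len(feat[0][0])
    gates = [
        [sigmoid(sum(feat[k][i][j] for k in range(c)) / c) for j in range(w)]
        for i in range(h)
    ]
    return [
        [[feat[k][i][j] * gates[i][j] for j in range(w)] for i in range(h)]
        for k in range(c)
    ]


def dual_attention_serial(feat):
    """Apply the two gates in series: channel first, then spatial (assumed order)."""
    return spatial_attention(channel_attention(feat))


def residual_block(feat):
    """Hypothetical stand-in for a residual module: ReLU branch plus identity skip."""
    return [[[v + max(v, 0.0) for v in row] for row in ch] for ch in feat]


def backbone(feat):
    """Place the attention step between two residual modules."""
    return residual_block(dual_attention_serial(residual_block(feat)))
```

Calling `dual_attention_serial` on a small `C x H x W` nested list returns a feature map of the same shape in which every value has been scaled by two gates in (0, 1), so positive activations shrink and all-zero channels stay zero.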

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11679380
DOI: http://dx.doi.org/10.3390/s24247917
