A hybrid approach to detect and localize texts in natural scene images.

IEEE Trans Image Process

National Laboratory of Pattern Recognition (NLPR), Institute of Automation, Chinese Academy of Sciences (CASIA), Beijing 100190, China.

Published: March 2011

Text detection and localization in natural scene images is important for content-based image analysis. This problem is challenging due to the complex background, the non-uniform illumination, the variations of text font, size and line orientation. In this paper, we present a hybrid approach to robustly detect and localize texts in natural scene images. A text region detector is designed to estimate the text existing confidence and scale information in image pyramid, which help segment candidate text components by local binarization. To efficiently filter out the non-text components, a conditional random field (CRF) model considering unary component properties and binary contextual component relationships with supervised parameter learning is proposed. Finally, text components are grouped into text lines/words with a learning-based energy minimization method. Since all the three stages are learning-based, there are very few parameters requiring manual tuning. Experimental results evaluated on the ICDAR 2005 competition dataset show that our approach yields higher precision and recall performance compared with state-of-the-art methods. We also evaluated our approach on a multilingual image dataset with promising results.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TIP.2010.2070803DOI Listing

Publication Analysis

Top Keywords

natural scene
12
scene images
12
hybrid approach
8
detect localize
8
localize texts
8
texts natural
8
images text
8
text components
8
text
7
approach detect
4

Similar Publications

Drones are extensively utilized in both military and social development processes. Eliminating the reliance of drone positioning systems on GNSS and enhancing the accuracy of the positioning systems is of significant research value. This paper presents a novel approach that employs a real-scene 3D model and image point cloud reconstruction technology for the autonomous positioning of drones and attains high positioning accuracy.

View Article and Find Full Text PDF

Roadside tree segmentation and parameter extraction play an essential role in completing the virtual simulation of road scenes. Point cloud data of roadside trees collected by LiDAR provide important data support for achieving assisted autonomous driving. Due to the interference from trees and other ground objects in street scenes caused by mobile laser scanning, there may be a small number of missing points in the roadside tree point cloud, which makes it familiar for under-segmentation and over-segmentation phenomena to occur in the roadside tree segmentation process.

View Article and Find Full Text PDF

Interacting hand reconstruction presents significant opportunities in various applications. However, it currently faces challenges such as the difficulty in distinguishing the features of both hands, misalignment of hand meshes with input images, and modeling the complex spatial relationships between interacting hands. In this paper, we propose a multilevel feature fusion interactive network for hand reconstruction (HandFI).

View Article and Find Full Text PDF

Multi-Band Scattering Characteristics of Miniature Masson Pine Canopy Based on Microwave Anechoic Chamber Measurement.

Sensors (Basel)

December 2024

Laboratory of Target Microwave Properties, Deqing Academy of Satellite Applications, Deqing 313200, China.

Using microwave remote sensing to invert forest parameters requires clear canopy scattering characteristics, which can be intuitively investigated through scattering measurements. However, there are very few ground-based measurements on forest branches, needles, and canopies. In this study, a quantitative analysis of the canopy branches, needles, and ground contribution of Masson pine scenes in C-, X-, and Ku-bands was conducted based on a microwave anechoic chamber measurement platform.

View Article and Find Full Text PDF

Music pre-processing methods are currently becoming a recognized area of research with the goal of making music more accessible to listeners with a hearing impairment. Our previous study showed that hearing-impaired listeners preferred spectrally manipulated multi-track mixes. Nevertheless, the acoustical basis of mixing for hearing-impaired listeners remains poorly understood.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!