Arbitrary-shape text detection is challenging because text varies widely in size and aspect ratio, appears in arbitrary orientations and shapes, and is often inaccurately annotated. Because pixel-level prediction scales to any shape, segmentation-based methods can adapt to text of various shapes and have therefore attracted considerable attention recently. However, accurate pixel-level annotations of text are difficult to obtain, and the existing datasets for scene text detection provide only coarse-grained boundary annotations. Consequently, numerous misclassified text pixels or background pixels exist inside the annotations, degrading the performance of segmentation-based text detection methods. Generally speaking, whether a pixel belongs to text is closely related to its distance to the adjacent annotation boundary. Based on this observation, we propose a robust segmentation-based detection method that uses probability maps to detect text instances accurately. Concretely, we adopt a Sigmoid Alpha Function (SAF) to convert the distances between boundaries and the pixels inside them into a probability map. However, a single probability map cannot cover complex probability distributions well because of the uncertainty of coarse-grained text boundary annotations. Therefore, we adopt a group of probability maps computed by a series of Sigmoid Alpha Functions to describe the possible probability distributions. In addition, we propose an iterative model that learns to predict and assimilate probability maps, providing enough information to reconstruct text instances. Finally, simple region growth algorithms aggregate the probability maps into complete text instances. Experimental results demonstrate that our method achieves state-of-the-art detection accuracy on several benchmarks. Notably, our method with the Watershed Algorithm as post-processing achieves the best F-measure on Total-Text (88.79%), CTW1500 (85.75%), and MSRA-TD500 (88.93%). In addition, our method achieves promising performance on a multi-oriented dataset (ICDAR2015) and a multilingual dataset (ICDAR2017-MLT). Code is available at: https://github.com/GXYM/TextPMs.
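The idea of turning boundary distances into a group of probability maps can be illustrated with a minimal sketch. The abstract does not give the exact form of the Sigmoid Alpha Function, so the function below is an assumption: a rescaled sigmoid, 2*sigmoid(alpha*d) - 1, normalized to reach 1 at the largest distance. The names `sigmoid_alpha`, `boundary_distance`, and `probability_maps`, the 4-connected distance, and the alpha values are likewise hypothetical and are not taken from the authors' code.

```python
import math
from collections import deque

def sigmoid_alpha(d, alpha):
    """Assumed SAF: map a normalized distance d in [0, 1] to a probability.

    Uses 2*sigmoid(alpha*d) - 1 == tanh(alpha*d/2), rescaled so that d = 1
    maps to 1. This form is an illustration, not the paper's definition.
    """
    return math.tanh(alpha * d / 2.0) / math.tanh(alpha / 2.0)

def boundary_distance(mask):
    """4-connected distance from each pixel to the nearest background pixel.

    `mask` is a list of rows with 1 inside the annotation and 0 outside.
    Assumes the mask contains at least one background pixel.
    """
    h, w = len(mask), len(mask[0])
    dist = [[0 if mask[y][x] == 0 else None for x in range(w)] for y in range(h)]
    queue = deque((y, x) for y in range(h) for x in range(w) if mask[y][x] == 0)
    while queue:  # breadth-first search outward from all background pixels
        y, x = queue.popleft()
        for dy, dx in ((1, 0), (-1, 0), (0, 1), (0, -1)):
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w and dist[ny][nx] is None:
                dist[ny][nx] = dist[y][x] + 1
                queue.append((ny, nx))
    return dist

def probability_maps(mask, alphas):
    """One probability map per alpha for a single annotated text instance."""
    dist = boundary_distance(mask)
    d_max = max(max(row) for row in dist)
    if d_max == 0:  # no text pixels: every map is all zeros
        return [[[0.0] * len(row) for row in dist] for _ in alphas]
    return [
        [[sigmoid_alpha(d / d_max, alpha) for d in row] for row in dist]
        for alpha in alphas
    ]

# A 3x5 rectangular "annotation" surrounded by background.
mask = [
    [0, 0, 0, 0, 0, 0, 0],
    [0, 1, 1, 1, 1, 1, 0],
    [0, 1, 1, 1, 1, 1, 0],
    [0, 1, 1, 1, 1, 1, 0],
    [0, 0, 0, 0, 0, 0, 0],
]
alphas = (1.0, 2.0, 4.0)  # hypothetical values
maps = probability_maps(mask, alphas)

for alpha, pm in zip(alphas, maps):
    print(f"alpha={alpha}: edge pixel p={pm[1][1]:.3f}, center pixel p={pm[2][3]:.3f}")
```

Under this assumed form, a larger alpha assigns higher probability to pixels near the boundary, so the group of maps ranges from conservative (only the center is confidently text) to permissive (almost the whole annotation is text), which is one way to represent uncertainty in a coarse boundary.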

Source: http://dx.doi.org/10.1109/TPAMI.2022.3176122
