The main task in Optical Character Recognition (OCR) is to get and convert all the text characters on an image as a plain text data. However, if the image has low contrast and low exposure, an issue may occur. The characters may be hidden and can't be recovered completely. One solution that has been done and reported in 2017 is by applying histogram equalization as a pre-processing step in OCR. Here, we deliver a total of 30 sample data, some of which had been used on the research's experiment reported in 2017, and some others were added later.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC6727008PMC
http://dx.doi.org/10.1016/j.dib.2019.104397DOI Listing

Publication Analysis

Top Keywords

reported 2017
8
images hidden
4
hidden data
4
data set
4
set retrieval
4
retrieval usage
4
usage main
4
main task
4
task optical
4
optical character
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!