This study aimed to develop an English version of a doping drug-recognition system using deep learning-based optical character recognition (OCR) technology. A database of 336 banned substances was built based on the World Anti-Doping Agency's International Standard Prohibited List and the Korean Pharmaceutical Information Center's Drug Substance Information. For accuracy and validity analysis, 886 drug substance images, including 152 images of prescriptions and drug substance labels collected using data augmentation, were used. The developed hybrid system, based on the Tesseract OCR model, can be accessed by both a smartphone and website. A total of 5379 words were extracted, and the system showed character recognition errors regarding 91 words, showing high accuracy (98.3%). The system correctly classified all 624 images for acceptable substances, 218 images for banned substances, and incorrectly recognized 44 of the banned substances as acceptable. The validity analysis showed a high level of accuracy (0.95), sensitivity (1.00), and specificity (0.93), suggesting system validity. The system has the potential of allowing athletes who lack knowledge about doping to quickly and accurately check whether they are taking banned substances. It may also serve as an efficient option to support the development of a fair and healthy sports culture.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10297893PMC
http://dx.doi.org/10.3390/healthcare11121769DOI Listing

Publication Analysis

Top Keywords

banned substances
16
drug substance
12
deep learning-based
8
character recognition
8
validity analysis
8
system
7
drug
5
substances
5
validation study
4
study deep
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!