Background: The International Classification of Diseases (ICD) code is widely used as the reference in medical system and billing purposes. However, classifying diseases into ICD codes still mainly relies on humans reading a large amount of written material as the basis for coding. Coding is both laborious and time-consuming. Since the conversion of ICD-9 to ICD-10, the coding task became much more complicated, and deep learning- and natural language processing-related approaches have been studied to assist disease coders.

Objective: This paper aims at constructing a deep learning model for ICD-10 coding, where the model is meant to automatically determine the corresponding diagnosis and procedure codes based solely on free-text medical notes to improve accuracy and reduce human effort.

Methods: We used diagnosis records of the National Taiwan University Hospital as resources and apply natural language processing techniques, including global vectors, word to vectors, embeddings from language models, bidirectional encoder representations from transformers, and single head attention recurrent neural network, on the deep neural network architecture to implement ICD-10 auto-coding. Besides, we introduced the attention mechanism into the classification model to extract the keywords from diagnoses and visualize the coding reference for training freshmen in ICD-10. Sixty discharge notes were randomly selected to examine the change in the F-score and the coding time by coders before and after using our model.

Results: In experiments on the medical data set of National Taiwan University Hospital, our prediction results revealed F-scores of 0.715 and 0.618 for the ICD-10 Clinical Modification code and Procedure Coding System code, respectively, with a bidirectional encoder representations from transformers embedding approach in the Gated Recurrent Unit classification model. The well-trained models were applied on the ICD-10 web service for coding and training to ICD-10 users. With this service, coders can code with the F-score significantly increased from a median of 0.832 to 0.922 (P<.05), but not in a reduced interval.

Conclusions: The proposed model significantly improved the F-score but did not decrease the time consumed in coding by disease coders.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8441604PMC
http://dx.doi.org/10.2196/23230DOI Listing

Publication Analysis

Top Keywords

icd-10 coding
12
neural network
12
coding
9
coding training
8
deep neural
8
diseases icd
8
natural language
8
national taiwan
8
taiwan university
8
university hospital
8

Similar Publications

Background: Antiplatelet drugs, such as clopidogrel, ticagrelor, prasugrel, and acetylsalicylic acid, may be associated with a risk of adverse events (AEs). Vanessa's Law was enacted to strengthen regulations to protect Canadians from drug-related side effects (with mandatory reporting of serious adverse events [SAEs]).

Objective: To determine whether Vanessa's Law has led to an increase in SAE reporting among antiplatelet users.

View Article and Find Full Text PDF

Epidemiology of heart failure in Türkiye.

Turk J Med Sci

December 2024

Department of Cardiology, Faculty of Medicine, Mersin University, Mersin, Turkiye.

Background/aim: The epidemiological data on heart failure (HF) vary between regions within the same country. We aimed to investigate the epidemiological data on HF in Türkiye across all age groups regarding seven geographical regions.

Materials And Methods: We included all patients from the Turkish population who received a first diagnosis of HF between January 1, 2016 and December 31, 2022, using ICD-10 codes from the National Electronic Healthcare Database.

View Article and Find Full Text PDF

Approaching a nationwide registry: analyzing big data in patients with heart failure.

Turk J Med Sci

December 2024

Department of Biostatistics and Medical Informatics, Faculty of Medicine, Mersin University, Mersin, Turkiye.

Background/aim: Randomized controlled trials usually lack generabilizity to real-world context. Real-world data, enabled by the use of big data analysis, serve as a connection between the results of trials and the implementation of findings in clinical practice. Nevertheless, using big data in the healthcare has difficulties such as ensuring data quality and consistency.

View Article and Find Full Text PDF

Background/aim: Heart failure (HF) is associated with a wide range of comorbidities that negatively impact clinical outcomes and cause high economic burden. We aimed to evaluate the frequency and burden of comorbidities in HF patients in Türkiye and their relationships with patients' demographic characteristics.

Materials And Methods: Based on ICD-10 codes in the national electronic database of the Turkish Ministry of Health covering the entire population of Türkiye (n = 85,279,553) from 1 January 2016 to 31 December 2022, data on the prevalence of comorbidities in HF patients were obtained.

View Article and Find Full Text PDF

Background Combining left atrial appendage closure with catheter ablation (LAACCA) has been proposed as a potential approach to improving outcomes by simultaneously addressing arrhythmia and reducing stroke risk. This study compares the in-hospital outcomes of LAACCA vs. catheter ablation (CA) alone for atrial fibrillation (AFib) in patients with heart failure with reduced ejection fraction (HFrEF).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!