AnEMIC: A Framework for Benchmarking ICD Coding Models.

Juyong Kim Abheesht Sharma Suhas Shanbhogue Pradeep Ravikumar Jeremy C Weiss

Proc Conf Empir Methods Nat Lang Process

National Library of Medicine, National Institutes of Health.

Published: December 2022

Diagnostic coding, or ICD coding, is the task of assigning diagnosis codes defined by the ICD (International Classification of Diseases) standard to patient visits based on clinical notes. The current process of manual ICD coding is time-consuming and often error-prone, which suggests the need for automatic ICD coding. However, despite the long history of automatic ICD coding, there have been no standardized frameworks for benchmarking ICD coding models. We open-source an easy-to-use tool named , which provides a streamlined pipeline for preprocessing, training, and evaluating for automatic ICD coding. We correct errors in preprocessing by existing works, and provide key models and weights trained on the correctly preprocessed datasets. We also provide an interactive demo performing real-time inference from custom inputs, and visualizations drawn from explainable AI to analyze the models. We hope the framework helps move the research of ICD coding forward and helps professionals explore the potential of ICD coding. The framework and the associated code are available here.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10929571	PMC
http://dx.doi.org/10.18653/v1/2022.emnlp-demos.11	DOI Listing

Publication Analysis

Top Keywords

icd coding

automatic icd

icd

coding

benchmarking icd

coding models

anemic framework

framework benchmarking

models

models diagnostic

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!