CMCN: Chinese medical concept normalization using continual learning and knowledge-enhanced.

Artif Intell Med

School of Basic Medical Sciences, Nanjing Medical University, Nanjing 210029, China. Electronic address:

Published: November 2024

Medical Concept Normalization (MCN) is a crucial process for deep information extraction and natural language processing tasks, which plays a vital role in biomedical research. Although MCN in English has achieved significant research achievements, Chinese medical concept normalization (CMCN) remains insufficiently explored due to its complex syntactic structure and the paucity of Chinese medical semantic and ontology resources. In recent years, deep learning has been extensively applied across numerous natural language processing tasks, owing to its robust learning capabilities, adaptability, and transferability. It has proven to be well suited for intricate and specialized knowledge discovery research in the biomedical field. In this study, we conduct research on CMCN through the lens of deep learning. Specifically, our research introduces a model that leverages polymorphic semantic information and knowledge enhanced through multi-task learning and retain more important medical features through continual learning. As the cornerstone of CMCN, disease names are the main focus of this research. We evaluated various methodologies on Chinese disease dataset built by ourselves, finally achieving 76.12 % on Accuracy@1, 87.20 % on Accuracy@5 and 90.02 % on Accuracy@10 with our best-performing model GCBM-BSCL. This research not only advances the fields of knowledge mining and medical concept normalization but also enhances the integration and application of artificial intelligence in the medical and health field. We have published the source code and results on https://github.com/BearLiX/CMCN.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.artmed.2024.102965DOI Listing

Publication Analysis

Top Keywords

medical concept
16
concept normalization
16
chinese medical
12
continual learning
8
natural language
8
language processing
8
processing tasks
8
deep learning
8
medical
7
learning
6

Similar Publications

The objective of this retrospective observational study was to estimate the prevalence of actinic keratosis (AK) in individuals aged ≥ 40 years in France, to describe the characteristics of affected patients, and to describe treatments. A representative panel of 20,000 households with ≥ 1 member aged ≥ 40 years were invited to participate. Participants who reported AK lesions diagnosed by a physician were eligible.

View Article and Find Full Text PDF

Diabetes is a growing health concern in developing countries, causing considerable mortality rates. While machine learning (ML) approaches have been widely used to improve early detection and treatment, several studies have shown low classification accuracies due to overfitting, underfitting, and data noise. This research employs parallel and sequential ensemble ML approaches paired with feature selection techniques to boost classification accuracy.

View Article and Find Full Text PDF

Development of a Patient Reported Outcome Measure of Side Effects for Patients Taking Calcineurin Inhibitors: The FACIT-CNI-Ntx.

Am J Kidney Dis

January 2025

Department of Medical Social Sciences, Northwestern University Feinberg School of Medicine, Chicago, USA; Northwestern University Transplant Outcomes Research Collaborative, Comprehensive Transplant Center, Feinberg School of Medicine, Chicago, IL, USA; Centre for Patient Reported Outcomes Research, Department of Applied Health Sciences, University of Birmingham, Edgbaston, Birmingham, UK.

Rationale & Objective: Valid measures of side effects are important to inform clinical use of calcineurin inhibitors (CNIs). This study sought to develop and establish the content validity of a PRO measure to capture side effects among kidney transplant recipients taking CNIs.

Study Design: Qualitative interviews for concept elicitation and cognitive debriefing.

View Article and Find Full Text PDF

ARCH: Large-scale knowledge graph via aggregated narrative codified health records analysis.

J Biomed Inform

January 2025

Harvard T.H. Chan School of Public Health, 677 Huntington Ave, Boston, 02115, MA, USA; VA Boston Healthcare System, 150 S Huntington Ave, Boston, 02130, MA, USA. Electronic address:

Objective: Electronic health record (EHR) systems contain a wealth of clinical data stored as both codified data and free-text narrative notes (NLP). The complexity of EHR presents challenges in feature representation, information extraction, and uncertainty quantification. To address these challenges, we proposed an efficient Aggregated naRrative Codified Health (ARCH) records analysis to generate a large-scale knowledge graph (KG) for a comprehensive set of EHR codified and narrative features.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!