A benchmark dataset and case study for Chinese medical question intent classification.

BMC Med Inform Decis Mak

Inner Mongolia Key Laboratory of Mongolian Information Processing Technology, College of Computer Science, Inner Mongolia University, University West Road, Hohhot, China.

Published: July 2020

Article Abstract

Background: To provide satisfactory answers, a medical QA system must understand the intent behind users' questions precisely. Training a deep-learning intent classifier in a supervised way requires a high-quality dataset. Currently, there is no public dataset for Chinese medical intent classification, and datasets from other fields are not applicable to medical QA systems. To solve this problem, we construct a Chinese medical intent dataset (CMID) from questions posted on medical QA websites. On this basis, we compare four intent classification models on CMID in a case study.

Methods: The questions in CMID are obtained from several medical QA websites. The intent annotation standard, developed by medical experts, includes four types and 36 subtypes of user intent. Besides the intent label, CMID provides two types of additional information: word segmentation and named entities. We use crowdsourcing to annotate the intent of each Chinese medical question. Word segmentation and named entities are obtained using Jieba and a well-trained Lattice-LSTM model. To obtain more accurate word segmentation, we loaded a Chinese medical dictionary of 530,000 entries. We also select four popular deep learning-based models and compare their intent classification performance on CMID.
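To illustrate why a domain dictionary matters for segmentation, here is a minimal sketch using forward maximum matching. This is a stand-in technique chosen for a dependency-free example, not the algorithm Jieba uses, and the tiny word sets and the sample question are made up for illustration.

```python
def segment(text, dictionary, max_len=6):
    """Forward maximum matching: at each position, take the longest
    dictionary word; fall back to a single character if none matches."""
    tokens = []
    i = 0
    while i < len(text):
        for size in range(min(max_len, len(text) - i), 0, -1):
            piece = text[i:i + size]
            if size == 1 or piece in dictionary:
                tokens.append(piece)
                i += size
                break
    return tokens


# Hypothetical toy dictionaries (the real medical dictionary is far larger).
general = {"怎么", "治疗"}
medical = general | {"糖尿病", "并发症"}

question = "糖尿病并发症怎么治疗"
without_medical = segment(question, general)
with_medical = segment(question, medical)

print(without_medical)  # medical terms fall apart into single characters
print(with_medical)     # medical terms are kept as whole words
```

Without the medical entries, terms such as 糖尿病 are split into single characters, which is the kind of error a large domain dictionary is meant to prevent.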

Results: The final CMID contains 12,000 Chinese medical questions and is organized in JSON format. Each question is labeled with intent, word segmentation, and named entity information. Statistics such as question length and number of entities are also analyzed in detail. Among FastText, TextCNN, TextRNN, and TextGCN, the FastText and TextCNN models achieved the best results in four-type and 36-subtype intent classification, respectively.
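As a rough sketch of how such a JSON dataset might be consumed, the snippet below parses two made-up records and computes the kind of statistics mentioned above. The field names (`question`, `intent_type`, `intent_subtype`, `words`, `entities`) and the label values are assumptions for illustration; the actual CMID schema may differ.

```python
import json
from collections import Counter

# Two hypothetical records; field names and labels are illustrative only.
RAW = """
[
  {"question": "糖尿病怎么治疗", "intent_type": "treatment",
   "intent_subtype": "treatment_method",
   "words": ["糖尿病", "怎么", "治疗"],
   "entities": [{"text": "糖尿病", "type": "disease"}]},
  {"question": "阿司匹林有什么副作用", "intent_type": "drug",
   "intent_subtype": "side_effect",
   "words": ["阿司匹林", "有", "什么", "副作用"],
   "entities": [{"text": "阿司匹林", "type": "drug"}]}
]
"""

records = json.loads(RAW)

# Distribution of top-level intent types.
type_counts = Counter(r["intent_type"] for r in records)

# Average question length in characters and average entities per question.
mean_len = sum(len(r["question"]) for r in records) / len(records)
mean_entities = sum(len(r["entities"]) for r in records) / len(records)

print(type_counts, mean_len, mean_entities)
```

The same loop would apply to the full file by replacing `json.loads(RAW)` with `json.load` on an open file handle.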

Conclusions: In this work, we provide a dataset for Chinese medical intent classification that can be used in medical QA and related fields. We performed an intent classification task on CMID and also analyzed the content of the dataset.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7346345
DOI: http://dx.doi.org/10.1186/s12911-020-1122-3
