AI Article Synopsis

  • The study discusses the limitations of using graph convolutional networks (GCN) for classifying natural language texts, particularly in terms of memory usage and distribution.
  • It introduces a new model called FastMPN, which features a message passing architecture that allows for adjustable node embeddings and edge weights, improving the GCN's problem-solving ability.
  • The FastMPN model was tested on extracting clinical data from cancer pathology reports, outperforming or matching existing models and training quickly on a large dataset using advanced hardware.

Article Abstract

Background: Applying graph convolutional networks (GCN) to the classification of free-form natural language texts leveraged by graph-of-words features (TextGCN) was studied and confirmed to be an effective means of describing complex natural language texts. However, the text classification models based on the TextGCN possess weaknesses in terms of memory consumption and model dissemination and distribution. In this paper, we present a fast message passing network (FastMPN), implementing a GCN with message passing architecture that provides versatility and flexibility by allowing trainable node embedding and edge weights, helping the GCN model find the better solution. We applied the FastMPN model to the task of clinical information extraction from cancer pathology reports, extracting the following six properties: main site, subsite, laterality, histology, behavior, and grade.

Results: We evaluated the clinical task performance of the FastMPN models in terms of micro- and macro-averaged F1 scores. A comparison was performed with the multi-task convolutional neural network (MT-CNN) model. Results show that the FastMPN model is equivalent to or better than the MT-CNN.

Conclusions: Our implementation revealed that our FastMPN model, which is based on the PyTorch platform, can train a large corpus (667,290 training samples) with 202,373 unique words in less than 3 minutes per epoch using one NVIDIA V100 hardware accelerator. Our experiments demonstrated that using this implementation, the clinical task performance scores of information extraction related to tumors from cancer pathology reports were highly competitive.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11409592PMC
http://dx.doi.org/10.1186/s12911-024-02662-5DOI Listing

Publication Analysis

Top Keywords

cancer pathology
12
pathology reports
12
fastmpn model
12
graph convolutional
8
convolutional networks
8
natural language
8
language texts
8
message passing
8
clinical task
8
task performance
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!