AI Article Synopsis

  • Recent advances in automatic medical report generation use deep learning, combining CNNs for image encoding and RNNs for report decoding, but face issues like incomplete optimization, simplistic attention mechanisms, and repeated output generation.
  • The article introduces HReMRG-MR, a new method that employs a hybrid reward system, m-linear attention for improved feature interaction, and a repetition penalty to enhance report accuracy and detail.
  • Experimental results confirm that HReMRG-MR outperforms existing methods in efficiency and quality while demonstrating that its components effectively address previous limitations, including significantly reducing weight search time without sacrificing performance.

Article Abstract

To reduce doctors' workload, deep-learning-based automatic medical report generation has recently attracted more and more research efforts, where deep convolutional neural networks (CNNs) are employed to encode the input images, and recurrent neural networks (RNNs) are used to decode the visual features into medical reports automatically. However, these state-of-the-art methods mainly suffer from three shortcomings: 1) incomprehensive optimization; 2) low-order and unidimensional attention; and 3) repeated generation. In this article, we propose a hybrid reinforced medical report generation method with m-linear attention and repetition penalty mechanism (HReMRG-MR) to overcome these problems. Specifically, a hybrid reward with different weights is employed to remedy the limitations of single-metric-based rewards, and a local optimal weight search algorithm is proposed to significantly reduce the complexity of searching the weights of the rewards from exponential to linear. Furthermore, we use m-linear attention modules to learn multidimensional high-order feature interactions and to achieve multimodal reasoning, while a new repetition penalty is proposed to apply penalties to repeated terms adaptively during the model's training process. Extensive experimental studies on two public benchmark datasets show that HReMRG-MR greatly outperforms the state-of-the-art baselines in terms of all metrics. The effectiveness and necessity of all components in HReMRG-MR are also proved by ablation studies. Additional experiments are further conducted and the results demonstrate that our proposed local optimal weight search algorithm can significantly reduce the search time while maintaining superior medical report generation performances.

Download full-text PDF

Source
http://dx.doi.org/10.1109/TNNLS.2023.3343391DOI Listing

Publication Analysis

Top Keywords

medical report
16
report generation
16
m-linear attention
12
repetition penalty
12
hybrid reinforced
8
reinforced medical
8
attention repetition
8
neural networks
8
local optimal
8
optimal weight
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!