Purpose: Automatic localization of pneumonia on chest X-rays (CXRs) is highly desirable both as an interpretive aid to the radiologist and for timely diagnosis of the disease. However, pneumonia's amorphous appearance on CXRs and the complexity of normal anatomy in the chest present key challenges that hinder accurate localization. Existing studies in this area are either not optimized to preserve spatial information about the abnormality or depend on expensive expert-annotated bounding boxes. We present a novel generative adversarial network (GAN)-based machine learning approach for this problem, which is weakly supervised (it does not require any location annotations), was trained to retain spatial information, and can produce pixel-wise abnormality maps highlighting regions of abnormality (as opposed to bounding boxes around the abnormality).

Methods: Our method is based on the Wasserstein GAN framework and is, to the best of our knowledge, the first application of GANs to this problem. Specifically, from an abnormal CXR as input, we generated the corresponding pseudo normal CXR image as output. The pseudo normal CXR is the "hypothetical" normal, that is, the same CXR as it would appear without any abnormalities. We surmise that the difference between the pseudo normal and the abnormal CXR highlights the pixels suspected to have pneumonia, and we therefore use it as our output abnormality map. We trained our algorithm on an "unpaired" data set of abnormal and normal CXRs and did not require any location annotations such as bounding boxes or segmentations of abnormal regions. Furthermore, we incorporated additional prior knowledge/constraints into the model and showed that they help improve localization performance. We validated the model on a data set consisting of 14 184 CXRs from the Radiological Society of North America pneumonia detection challenge.
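
The difference step described above can be illustrated with a minimal sketch. It assumes the pseudo normal image has already been produced by a trained generator (not shown here) and that both images are aligned 2-D arrays scaled to [0, 1]. The function name `abnormality_map`, the use of the absolute difference, and the rescaling to [0, 1] are illustrative assumptions; the abstract does not specify how the difference is computed or post-processed.

```python
import numpy as np


def abnormality_map(abnormal, pseudo_normal):
    """Pixel-wise abnormality map as the absolute difference of two images.

    Assumption: both inputs are 2-D arrays of the same shape, aligned and
    intensity-normalised to [0, 1]. This is a hypothetical helper, not the
    authors' implementation.
    """
    abnormal = np.asarray(abnormal, dtype=np.float64)
    pseudo_normal = np.asarray(pseudo_normal, dtype=np.float64)
    if abnormal.shape != pseudo_normal.shape:
        raise ValueError("images must have the same shape")
    diff = np.abs(abnormal - pseudo_normal)
    peak = diff.max()
    # Rescale to [0, 1] so maps from different images are comparable;
    # an all-zero difference is returned unchanged.
    return diff / peak if peak > 0 else diff


# Toy data: a synthetic "pseudo normal" image and an "abnormal" image that
# differs from it only by a brighter 3x3 patch (a stand-in for an opacity).
rng = np.random.default_rng(0)
pseudo_normal = rng.uniform(0.2, 0.4, size=(8, 8))
abnormal = pseudo_normal.copy()
abnormal[2:5, 3:6] += 0.5
amap = abnormality_map(abnormal, pseudo_normal)
```

In this toy case the map is nonzero only inside the altered patch, which is the behaviour the method relies on: pixels the generator had to change to make the image look normal are the ones flagged as suspicious.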

Results: We evaluated our methods by comparing the generated abnormality maps with radiologist-annotated bounding boxes using receiver operating characteristic (ROC) analysis, image similarity metrics such as normalized cross-correlation and mutual information, and abnormality detection rate. We also present visual examples of the abnormality maps, covering various scenarios of abnormality occurrence. Results demonstrate the ability to highlight regions of abnormality, with the best method achieving an ROC area under the curve (AUC) of 0.77 and a detection rate of 85%. The GAN tended to perform better as prior knowledge/constraints were incorporated into the model.
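
To make the comparison between a pixel-wise map and bounding-box annotations concrete, the following sketch rasterises boxes into a binary mask and scores a map against it with a pixel-level ROC AUC and a zero-mean normalized cross-correlation. All function names, the `(row, col, height, width)` box format, and the choice to pool pixels within a single image are assumptions made for illustration; the abstract does not describe the exact evaluation protocol.

```python
import numpy as np


def boxes_to_mask(shape, boxes):
    """Binary mask that is True inside any (row, col, height, width) box.

    The box format is a hypothetical convention chosen for this sketch.
    """
    mask = np.zeros(shape, dtype=bool)
    for r, c, h, w in boxes:
        mask[r:r + h, c:c + w] = True
    return mask


def pixel_auc(scores, mask):
    """ROC AUC over pixels via the Mann-Whitney U statistic.

    Tied scores receive their average rank, so constant maps score 0.5.
    """
    s = np.asarray(scores, dtype=np.float64).ravel()
    y = np.asarray(mask, dtype=bool).ravel()
    n_pos = int(y.sum())
    n_neg = y.size - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("mask must contain both positive and negative pixels")
    _, inverse, counts = np.unique(s, return_inverse=True, return_counts=True)
    last_rank = np.cumsum(counts)              # 1-based rank of last item in each tie group
    avg_rank = last_rank - (counts - 1) / 2.0  # mean rank within each tie group
    ranks = avg_rank[inverse]
    u = ranks[y].sum() - n_pos * (n_pos + 1) / 2.0
    return float(u / (n_pos * n_neg))


def normalized_cross_correlation(a, b):
    """Zero-mean normalized cross-correlation between two same-sized arrays."""
    a = np.asarray(a, dtype=np.float64).ravel()
    b = np.asarray(b, dtype=np.float64).ravel()
    a = a - a.mean()
    b = b - b.mean()
    denom = np.sqrt((a * a).sum() * (b * b).sum())
    return float((a * b).sum() / denom) if denom > 0 else 0.0


# Toy check: a map that matches the annotated box exactly.
mask = boxes_to_mask((8, 8), [(2, 3, 3, 3)])
perfect_map = mask.astype(np.float64)
auc = pixel_auc(perfect_map, mask)
ncc = normalized_cross_correlation(perfect_map, mask)
```

Because bounding boxes are coarser than the true extent of an abnormality, a pixel-level AUC computed this way treats every pixel inside a box as positive, which is one reason such scores should be read as approximate.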

Conclusions: We presented a novel GAN-based approach for localizing pneumonia on CXRs that (1) does not require expensive hand-annotated location ground truth and (2) was trained to produce abnormality maps at the pixel level as opposed to bounding boxes. We demonstrated the efficacy of our methods via quantitative and qualitative results.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10997001
DOI: http://dx.doi.org/10.1002/mp.15185
