Background: While deep learning classifiers have shown remarkable results in detecting chest X-ray (CXR) pathologies, their adoption in clinical settings is often hampered by the lack of transparency. To bridge this gap, this study introduces the neural prototype tree (NPT), an interpretable image classifier that combines the diagnostic capability of deep learning models and the interpretability of the decision tree for CXR pathology detection.

Objective: This study aimed to investigate the utility of the NPT classifier in 3 dimensions, including performance, interpretability, and fairness, and subsequently examined the complex interaction between these dimensions. We highlight both local and global explanations of the NPT classifier and discuss its potential utility in clinical settings.

Methods: This study used CXRs from the publicly available Chest X-ray 14, CheXpert, and MIMIC-CXR datasets. We trained 6 separate classifiers for each CXR pathology in all datasets, 1 baseline residual neural network (ResNet)-152, and 5 NPT classifiers with varying levels of interpretability. Performance, interpretability, and fairness were measured using the area under the receiver operating characteristic curve (ROC AUC), interpretation complexity (IC), and mean true positive rate (TPR) disparity, respectively. Linear regression analyses were performed to investigate the relationship between IC and ROC AUC, as well as between IC and mean TPR disparity.

Results: The performance of the NPT classifier improved as the IC level increased, surpassing that of ResNet-152 at IC level 15 for the Chest X-ray 14 dataset and IC level 31 for the CheXpert and MIMIC-CXR datasets. The NPT classifier at IC level 1 exhibited the highest degree of unfairness, as indicated by the mean TPR disparity. The magnitude of unfairness, as measured by the mean TPR disparity, was more pronounced in groups differentiated by age (chest X-ray 14 0.112, SD 0.015; CheXpert 0.097, SD 0.010; MIMIC 0.093, SD 0.017) compared to sex (chest X-ray 14 0.054 SD 0.012; CheXpert 0.062, SD 0.008; MIMIC 0.066, SD 0.013). A significant positive relationship between interpretability (ie, IC level) and performance (ie, ROC AUC) was observed across all CXR pathologies (P<.001). Furthermore, linear regression analysis revealed a significant negative relationship between interpretability and fairness (ie, mean TPR disparity) across age and sex subgroups (P<.001).

Conclusions: By illuminating the intricate relationship between performance, interpretability, and fairness of the NPT classifier, this research offers insightful perspectives that could guide future developments in effective, interpretable, and equitable deep learning classifiers for CXR pathology detection.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11659703PMC
http://dx.doi.org/10.2196/59045DOI Listing

Publication Analysis

Top Keywords

chest x-ray
24
npt classifier
16
performance interpretability
12
interpretability fairness
12
roc auc
12
tpr disparity
12
neural prototype
8
prototype tree
8
deep learning
8
cxr pathologies
8

Similar Publications

CXR-LLaVA: a multimodal large language model for interpreting chest X-ray images.

Eur Radiol

January 2025

Department of Radiology, Seoul National University College of Medicine, Seoul National University Hospital, Seoul, Republic of Korea.

Objective: This study aimed to develop an open-source multimodal large language model (CXR-LLaVA) for interpreting chest X-ray images (CXRs), leveraging recent advances in large language models (LLMs) to potentially replicate the image interpretation skills of human radiologists.

Materials And Methods: For training, we collected 592,580 publicly available CXRs, of which 374,881 had labels for certain radiographic abnormalities (Dataset 1) and 217,699 provided free-text radiology reports (Dataset 2). After pre-training a vision transformer with Dataset 1, we integrated it with an LLM influenced by the LLaVA network.

View Article and Find Full Text PDF

Introduction: Nemaline myopathy (NM), also known as Nemalinosis, is a rare congenital muscle disease with an incidence of 1 in 50000. It is characterized by nemaline rods in muscle fibers, leading to muscle weakness. We reported a case of NM revealed by cardiac involvement, and we highlighted the challenges in diagnosing this condition as well as its poor prognosis.

View Article and Find Full Text PDF

Structural, architectural, contractile or electrophysiological alterations may occur in the left atrium (LA). The concept of LA cardiopathy is supported by accumulating scientific evidence demonstrating that LA remodeling has become a cornerstone diagnostic and prognostic marker. The structure and the function of LA and left atrial appendage (LAA) which is an integral part of the LA, are key elements for a better understanding of multiple clinical conditions, most notably atrial fibrillation (AF), cardioembolism, heart failure and mitral valve diseases.

View Article and Find Full Text PDF

Diffusion models, variational autoencoders, and generative adversarial networks (GANs) are three common types of generative artificial intelligence models for image generation. Among these, GANs are the most frequently used for medical image generation and are often employed for data augmentation in various studies. However, due to the adversarial nature of GANs, where the generator and discriminator compete against each other, the training process can sometimes end with the model unable to generate meaningful images or even producing noise.

View Article and Find Full Text PDF

A Structured, Anatomy-Based Chest CT Interpretation Curriculum for Pulmonary Fellows Covering the Main Patterns of Parenchymal Lung Disease.

MedEdPORTAL

January 2025

Associate Professor, Division of Pulmonary, Critical Care and Sleep Medicine, University of Washington School of Medicine; Staff Physician, Pulmonary, Critical Care and Sleep Medicine Section, Veterans Affairs Puget Sound Healthcare System.

Introduction: Chest computed tomography (CT) interpretation is a key competency for pulmonary fellows, with many resources intended for radiologists but very few for this specific group. We endeavored to create a curriculum to teach chest CT interpretation to first-year pulmonary fellows.

Methods: We assembled a team of two pulmonologists, one radiologist, and a fellow with computer drafting software experience.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!