Objective: To evaluate ChatGPT's performance in brain glioma adjuvant therapy decision-making.

Methods: We randomly selected 10 patients with brain gliomas discussed at our institution's central nervous system tumour board (CNS TB). Patients' clinical status, surgical outcome, textual imaging information and immuno-pathology results were provided to ChatGPT V.3.5 and seven CNS tumour experts. The chatbot was asked to give the adjuvant treatment choice, and the regimen while considering the patient's functional status. The experts rated the artificial intelligence-based recommendations from 0 (complete disagreement) to 10 (complete agreement). An intraclass correlation coefficient agreement (ICC) was used to measure the inter-rater agreement.

Results: Eight patients (80%) met the criteria for glioblastoma and two (20%) were low-grade gliomas. The experts rated the quality of ChatGPT recommendations as poor for diagnosis (median 3, IQR 1-7.8, ICC 0.9, 95% CI 0.7 to 1.0), good for treatment recommendation (7, IQR 6-8, ICC 0.8, 95% CI 0.4 to 0.9), good for therapy regimen (7, IQR 4-8, ICC 0.8, 95% CI 0.5 to 0.9), moderate for functional status consideration (6, IQR 1-7, ICC 0.7, 95% CI 0.3 to 0.9) and moderate for overall agreement with the recommendations (5, IQR 3-7, ICC 0.7, 95% CI 0.3 to 0.9). No differences were observed between the glioblastomas and low-grade glioma ratings.

Conclusions: ChatGPT performed poorly in classifying glioma types but was good for adjuvant treatment recommendations as evaluated by CNS TB experts. Even though the ChatGPT lacks the precision to replace expert opinion, it may serve as a promising supplemental tool within a human-in-the-loop approach.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10314415PMC
http://dx.doi.org/10.1136/bmjhci-2023-100775DOI Listing

Publication Analysis

Top Keywords

icc 95% ci
20
glioma adjuvant
8
adjuvant therapy
8
adjuvant treatment
8
functional status
8
experts rated
8
95% ci good
8
95% ci moderate
8
icc
6
chatgpt
5

Similar Publications

Background: The complex interactions of the tumor micromilieu may be reflected by diffusion-weighted imaging (DWI) derived from the magnetic resonance imaging (MRI). The present study investigated the association between apparent diffusion coefficient (ADC) values and histopathologic features in uterine cervical cancer.

Methods: In this retrospective study, prebiopsy MRI was used to analyze histogram ADC-parameters.

View Article and Find Full Text PDF

Background: Total joint arthroplasty (TJA) is increasingly being performed as an outpatient (i.e., same-day discharge) procedure.

View Article and Find Full Text PDF

Objectives: To assess the consistency of automated measurements of coronary artery calcification (CAC) burden and emphysema extent on computed tomography (CT) images acquired with different radiation dose protocols in a lung cancer screening (LCS) population.

Materials And Methods: The patient cohort comprised 361 consecutive screenees who underwent a low-dose CT (LDCT) scan and an ultra-low-dose CT (ULDCT) scan at an incident screening round. Exclusion criteria for CAC measurements were software failure and previous history of CVD, including coronary stenting, whereas for emphysema assessment, software failure only.

View Article and Find Full Text PDF

Background: Respiratory function is impaired in chronic obstructive pulmonary disease (COPD). Automation of multi-volume CT-based measurements of different components of breathing-related airway deformations will help understand multi-pathway impairments in respiratory mechanics in COPD.

Purpose: To develop and evaluate multi-volume chest CT-based automated measurements of breathing-related radial and longitudinal expansion of individual airways between inspiratory and expiratory lung volumes.

View Article and Find Full Text PDF

Background: The Upper Quarter Y Balance Test (YBT-UQ) assesses upper limb dynamic balance in able-bodied individuals but lacks a reliable version for those with disabilities.

Objective: This study aimed to introduce a modified YBT-UQ (mYBT-UQ) for physically impaired individuals (PI) and establish its validity and reliability.

Methods: The study involved 33 male athletes aged 18-55, divided into three equal groups: able-bodied, spinal cord injury with trunk control (SCI), and below-the-knee amputation (BKA).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!