To trust or not to trust: evaluating the reliability and safety of AI responses to laryngeal cancer queries.

Magdalena Ostrowska Paulina Kacała Deborah Onolememen Katie Vaughan-Lane Anitta Sisily Joseph Adam Ostrowski Wioletta Pietruszewska Jacek Banaszewski Maciej J Wróbel

Eur Arch Otorhinolaryngol

Department of Otolaryngology and Laryngological Oncology, Collegium Medicum, Nicolaus Copernicus University in Torun, ul.Marie Sklodowskiej-Curie 9, 85-094, Bydgoszcz, Poland.

Published: November 2024

Purpose: As online health information-seeking surges, concerns mount over the quality and safety of accessible content, potentially leading to patient harm through misinformation. On one hand, the emergence of Artificial Intelligence (AI) in healthcare could prevent it; on the other hand, questions raise regarding the quality and safety of the medical information provided. As laryngeal cancer is a prevalent head and neck malignancy, this study aims to evaluate the utility and safety of three large language models (LLMs) as sources of patient information about laryngeal cancer.

Methods: A cross-sectional study was conducted using three LLMs (ChatGPT 3.5, ChatGPT 4.0, and Bard). A questionnaire comprising 36 inquiries about laryngeal cancer was categorised into diagnosis (11 questions), treatment (9 questions), novelties and upcoming treatments (4 questions), controversies (8 questions), and sources of information (4 questions). The population of reviewers consisted of 3 groups, including ENT specialists, junior physicians, and non-medicals, who graded the responses. Each physician evaluated each question twice for each model, while non-medicals only once. Everyone was blinded to the model type, and the question order was shuffled. Outcome evaluations were based on a safety score (1-3) and a Global Quality Score (GQS, 1-5). Results were compared between LLMs. The study included iterative assessments and statistical validations.

Results: Analysis revealed that ChatGPT 3.5 scored highest in both safety (mean: 2.70) and GQS (mean: 3.95). ChatGPT 4.0 and Bard had lower safety scores of 2.56 and 2.42, respectively, with corresponding quality scores of 3.65 and 3.38. Inter-rater reliability was consistent, with less than 3% discrepancy. About 4.2% of responses fell into the lowest safety category (1), particularly in the novelty category. Non-medical reviewers' quality assessments correlated moderately (r = 0.67) with response length.

Conclusions: LLMs can be valuable resources for patients seeking information on laryngeal cancer. ChatGPT 3.5 provided the most reliable and safe responses among the models evaluated.

Download full-text PDF	Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512842	PMC
http://dx.doi.org/10.1007/s00405-024-08643-8	DOI Listing

Publication Analysis

Top Keywords

laryngeal cancer

safety

quality safety

chatgpt bard

questions

laryngeal

quality

chatgpt

trust trust

trust evaluating

Similar Publications

Lymph Node Metastasis for pN+ Superficial Esophageal Squamous Cell Carcinoma.

Thorac Cancer

January 2025

Department of Thoracic Surgery, National Cancer Center/Cancer Hospital, Chinese Academy of MedicalSciences and Peking Union Medical College, Beijing, China.

Yafan Yang Liyan Xue Xiankai Chen Mingqiang Kang Renquan Zhang

Objectives: This study aimed to analyze lymph node metastasis (LNM) distribution in superficial esophageal squamous cell carcinoma (ESCC) and its impact factors on survival.

Methods: We reviewed 241 pT1N+ ESCC cases between February 2012 and April 2022 from 10 Chinese hospitals with a high volume of esophageal cancer (EC). We analyzed clinicopathological data to identify overall survival (OS) risk factors and LNM distribution in relation to tumor invasion depth.

View Article and Find Full Text PDF

Similar Publications

Superior 125-month outcome through cetuximab in the larynx organ preservation trial DeLOS-II: a single study center's experience.

Front Oncol

December 2024

Clinic for Otorhinolaryngology, University Hospital Leipzig, Leipzig, Germany.

Gunnar Wichmann Theresa Wald Veit Zebralla Matthaeus Stoehr Markus Pirlich

Introduction: The larynx organ preservation (LOP) trial DeLOS-II enrolled = 173 patients with advanced laryngeal/hypopharyngeal squamous cell carcinoma (LHSCC) amenable (only curatively resectable) through total laryngectomy (TL) to receive induction chemotherapy (IC) with TPF [docetaxel (T), cisplatin (P), and 5-fluorouracil (F)] (arm A, 85 patients) or additional cetuximab (E) weekly (arm B, 88 patients). Responders with endoscopic estimated tumor surface shrinkage (ETSS) ≥30% after 1 cycle IC (IC-1) received a further two cycles of IC followed by radiotherapy (RT), whereas TL was recommended for non-responders. Arm B failed to show superior 24-month laryngectomy-free survival (LFS) and overall survival (OS), the protocol-specified primary and secondary endpoints.

View Article and Find Full Text PDF

Similar Publications

Upregulation of WDR4 mediated by RBFOX2 promotes laryngeal cancer progression through the WDR4/m7G/lncRNA ZFAS1/RBFOX2 axis.

Naunyn Schmiedebergs Arch Pharmacol

January 2025

Department of Otolaryngology, the First Affiliated Hospital of Xinxiang Medical University, No. 88, Jiankang Road, Xinxiang, Henan, 453100, China.

Baocai Lu Xiao Li Wenjie Miao Qi Liu Ruixue Li

High levels of the N7 methylguanosine (m7G) methyltransferase WD repeat domain 4 (WDR4) are associated with the progression of multiple tumors, including head and neck squamous cell carcinoma. Laryngeal cancer (LC) is the second most common malignant tumor of the head and neck. However, the role of WDR4 in LC remains unclear.

View Article and Find Full Text PDF

Similar Publications

A Critical Review on microRNAs as Prognostic Biomarkers in Laryngeal Carcinoma.

Int J Mol Sci

December 2024

Department of ENT, Medical University, 1000 Sofia, Bulgaria.

Kristina S Komitova Lyuben D Dimitrov Gergana S Stancheva Silva G Kyurkchiyan Veronika Petkova

During the past decade, a vast number of studies were dedicated to unravelling the obscurities of non-coding RNAs in all fields of the medical sciences. A great amount of data has been accumulated, and consequently a natural need for organization and classification in all subfields arises. The aim of this review is to summarize all reports on microRNAs that were delineated as prognostic biomarkers in laryngeal carcinoma.

View Article and Find Full Text PDF

Similar Publications

Prevention and Management of Recurrent Laryngeal Nerve Palsy in Minimally Invasive Esophagectomy: Current Status and Future Perspectives.

J Clin Med

December 2024

Department of Surgery, Tohoku University Graduate School of Medicine, 2-1 Seiryo-machi, Aoba-ku, Sendai 980-8575, Miyagi, Japan.

Yusuke Taniyama Hiroshi Okamoto Chiaki Sato Yohei Ozawa Hirotaka Ishida

Recurrent laryngeal nerve palsy remains a significant complication following minimally invasive esophagectomy for esophageal cancer. Despite advancements in surgical techniques and lymphadenectomy precision, the incidence of recurrent laryngeal nerve palsy has not been improved. Recurrent laryngeal nerve palsy predominantly affects the left side and may lead to unilateral or bilateral vocal cord paralysis, resulting in hoarseness, dysphagia, and an increased risk of aspiration pneumonia.

View Article and Find Full Text PDF

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!