Purpose: As online health information-seeking surges, concerns mount over the quality and safety of accessible content, potentially leading to patient harm through misinformation. On one hand, the emergence of Artificial Intelligence (AI) in healthcare could prevent it; on the other hand, questions raise regarding the quality and safety of the medical information provided. As laryngeal cancer is a prevalent head and neck malignancy, this study aims to evaluate the utility and safety of three large language models (LLMs) as sources of patient information about laryngeal cancer.

Methods: A cross-sectional study was conducted using three LLMs (ChatGPT 3.5, ChatGPT 4.0, and Bard). A questionnaire comprising 36 inquiries about laryngeal cancer was categorised into diagnosis (11 questions), treatment (9 questions), novelties and upcoming treatments (4 questions), controversies (8 questions), and sources of information (4 questions). The population of reviewers consisted of 3 groups, including ENT specialists, junior physicians, and non-medicals, who graded the responses. Each physician evaluated each question twice for each model, while non-medicals only once. Everyone was blinded to the model type, and the question order was shuffled. Outcome evaluations were based on a safety score (1-3) and a Global Quality Score (GQS, 1-5). Results were compared between LLMs. The study included iterative assessments and statistical validations.

Results: Analysis revealed that ChatGPT 3.5 scored highest in both safety (mean: 2.70) and GQS (mean: 3.95). ChatGPT 4.0 and Bard had lower safety scores of 2.56 and 2.42, respectively, with corresponding quality scores of 3.65 and 3.38. Inter-rater reliability was consistent, with less than 3% discrepancy. About 4.2% of responses fell into the lowest safety category (1), particularly in the novelty category. Non-medical reviewers' quality assessments correlated moderately (r = 0.67) with response length.

Conclusions: LLMs can be valuable resources for patients seeking information on laryngeal cancer. ChatGPT 3.5 provided the most reliable and safe responses among the models evaluated.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11512842PMC
http://dx.doi.org/10.1007/s00405-024-08643-8DOI Listing

Publication Analysis

Top Keywords

laryngeal cancer
16
safety
8
quality safety
8
chatgpt bard
8
questions
6
laryngeal
5
quality
5
chatgpt
5
trust trust
4
trust evaluating
4

Similar Publications

Objectives: This study aimed to analyze lymph node metastasis (LNM) distribution in superficial esophageal squamous cell carcinoma (ESCC) and its impact factors on survival.

Methods: We reviewed 241 pT1N+ ESCC cases between February 2012 and April 2022 from 10 Chinese hospitals with a high volume of esophageal cancer (EC). We analyzed clinicopathological data to identify overall survival (OS) risk factors and LNM distribution in relation to tumor invasion depth.

View Article and Find Full Text PDF

Introduction: The larynx organ preservation (LOP) trial DeLOS-II enrolled = 173 patients with advanced laryngeal/hypopharyngeal squamous cell carcinoma (LHSCC) amenable (only curatively resectable) through total laryngectomy (TL) to receive induction chemotherapy (IC) with TPF [docetaxel (T), cisplatin (P), and 5-fluorouracil (F)] (arm A, 85 patients) or additional cetuximab (E) weekly (arm B, 88 patients). Responders with endoscopic estimated tumor surface shrinkage (ETSS) ≥30% after 1 cycle IC (IC-1) received a further two cycles of IC followed by radiotherapy (RT), whereas TL was recommended for non-responders. Arm B failed to show superior 24-month laryngectomy-free survival (LFS) and overall survival (OS), the protocol-specified primary and secondary endpoints.

View Article and Find Full Text PDF

High levels of the N7 methylguanosine (m7G) methyltransferase WD repeat domain 4 (WDR4) are associated with the progression of multiple tumors, including head and neck squamous cell carcinoma. Laryngeal cancer (LC) is the second most common malignant tumor of the head and neck. However, the role of WDR4 in LC remains unclear.

View Article and Find Full Text PDF

During the past decade, a vast number of studies were dedicated to unravelling the obscurities of non-coding RNAs in all fields of the medical sciences. A great amount of data has been accumulated, and consequently a natural need for organization and classification in all subfields arises. The aim of this review is to summarize all reports on microRNAs that were delineated as prognostic biomarkers in laryngeal carcinoma.

View Article and Find Full Text PDF

Recurrent laryngeal nerve palsy remains a significant complication following minimally invasive esophagectomy for esophageal cancer. Despite advancements in surgical techniques and lymphadenectomy precision, the incidence of recurrent laryngeal nerve palsy has not been improved. Recurrent laryngeal nerve palsy predominantly affects the left side and may lead to unilateral or bilateral vocal cord paralysis, resulting in hoarseness, dysphagia, and an increased risk of aspiration pneumonia.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!