Objective: Evaluate the quality of responses from Chat Generative Pre-Trained Transformer (ChatGPT) models compared to the answers for "Frequently Asked Questions" (FAQs) from the American Academy of Otolaryngology-Head and Neck Surgery (AAO-HNS) Clinical Practice Guidelines (CPG) for Ménière's disease (MD).

Study Design: Comparative analysis.

Setting: The AAO-HNS CPG for MD includes FAQs that clinicians can give to patients for MD-related questions. The ability of ChatGPT to properly educate patients regarding MD is unknown.

Methods: ChatGPT-3.5 and 4.0 were each prompted with 16 questions from the MD FAQs. Each response was rated in terms of (1) comprehensiveness, (2) extensiveness, (3) presence of misleading information, and (4) quality of resources. Readability was assessed using Flesch-Kincaid Grade Level (FKGL) and Flesch Reading Ease Score (FRES).
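As a rough illustration of the two readability metrics, the sketch below applies the standard published Flesch formulas to pre-counted words, sentences, and syllables. The function names and the sample counts are hypothetical, and the abstract does not say which tool the authors used to obtain their counts or scores.

```python
def flesch_reading_ease(words, sentences, syllables):
    """Flesch Reading Ease Score (FRES): higher means easier to read."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words, sentences, syllables):
    """Flesch-Kincaid Grade Level (FKGL): approximate US school grade."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


# Hypothetical passage: 100 words, 5 sentences, 150 syllables.
fres = flesch_reading_ease(100, 5, 150)
fkgl = flesch_kincaid_grade(100, 5, 150)
print(f"FRES = {fres:.1f}, FKGL = {fkgl:.1f}")  # FRES = 59.6, FKGL = 9.9
```

Against the thresholds cited in the Results (FRES of at least 60, FKGL of 6 or lower), this hypothetical passage would fall short on both measures.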

Results: ChatGPT-3.5 was comprehensive in 5 responses whereas ChatGPT-4.0 was comprehensive in 9 (31.3% vs 56.3%, P = .2852). ChatGPT-3.5 and 4.0 were extensive in all responses (P = 1.0000). ChatGPT-3.5 was misleading in 5 responses whereas ChatGPT-4.0 was misleading in 3 (31.3% vs 18.75%, P = .6851). ChatGPT-3.5 had quality resources in 10 responses whereas ChatGPT-4.0 had quality resources in 16 (62.5% vs 100%, P = .0177). AAO-HNS CPG FRES (62.4 ± 16.6) demonstrated an appropriate readability score of at least 60, while both ChatGPT-3.5 (39.1 ± 7.3) and 4.0 (42.8 ± 8.5) failed to meet this standard. All platforms had FKGL means that exceeded the recommended level of 6 or lower.
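The abstract does not name the statistical test behind these comparisons. As a sketch under the assumption that a two-sided Fisher's exact test was applied to the counts out of 16 questions per model, the following reproduces the four reported values. The function name and table layout are illustrative, not the authors' code.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2 = a + b, c + d
    col1 = a + c
    n = row1 + row2

    def weight(k):
        # Unnormalised hypergeometric weight of a table whose top-left cell is k.
        return comb(row1, k) * comb(row2, col1 - k)

    k_min = max(0, col1 - row2)
    k_max = min(row1, col1)
    observed = weight(a)
    # Sum every table that is no more likely than the observed one.
    extreme = sum(
        weight(k) for k in range(k_min, k_max + 1) if weight(k) <= observed
    )
    return extreme / comb(n, col1)


# (yes, no) counts per model out of 16 questions, taken from the Results.
comparisons = {
    "comprehensive": ((5, 11), (9, 7)),
    "extensive": ((16, 0), (16, 0)),
    "misleading": ((5, 11), (3, 13)),
    "quality resources": ((10, 6), (16, 0)),
}
for label, ((a, b), (c, d)) in comparisons.items():
    print(f"{label}: P = {fisher_exact_two_sided(a, b, c, d):.4f}")
# comprehensive: P = 0.2852
# extensive: P = 1.0000
# misleading: P = 0.6851
# quality resources: P = 0.0177
```

The weights are compared as exact integers, which avoids floating-point ties when deciding which tables count as at least as extreme as the observed one.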

Conclusion: While ChatGPT-4.0 had significantly better resource reporting, both models have room for improvement in being more comprehensive, more readable, and less misleading for patients.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11225079
DOI: http://dx.doi.org/10.1002/oto2.163

Similar Publications

Many patients with cancer approaching the end of life (EOL) continue to receive treatments that are unlikely to provide meaningful clinical benefit, potentially causing more harm than good. This is called overtreatment at the EOL. Overtreatment harms patients by causing side-effects, increasing health care costs, delaying important discussions about and preparation for EOL care, and occasionally accelerating death.

The long-term presence of antibiotics in the aquatic environment affects ecology and human health. Techniques for determining antibiotics are often time-consuming, labor-intensive, and costly, so new methods for rapid prediction of antibiotics are desirable. Many scholars have shown the effectiveness of machine learning in water quality prediction; however, its effectiveness in predicting antibiotic concentrations in the aquatic environment remains inconclusive.

Alternariol (AOH) has attracted much attention as an emerging toxin in edible herbs that can pose potential carcinogenic risks to humans. However, the rapid detection of AOH to ensure food safety remains a challenge. Here, a CRISPR-Cas12a-mediated aptamer-based sensor (aptasensor) was proposed for the sensitive quantification of AOH by using a personal glucose meter.

DNAzyme-mediated isothermal catalytic hairpin assembly for rapid and enzyme-free amplified detection of lead(Ⅱ) ion.

J Hazard Mater

January 2025

Shanghai Engineering Research Center of Aquatic-Product Processing & Preservation, Shanghai Ocean University, Shanghai 201306, China; Laboratory of Quality and Safety Risk Assessment for Aquatic Product on Storage and Preservation (Shanghai), Ministry of Agriculture, Shanghai Ocean University, Shanghai 201306, China; Shanghai Ocean University, Shanghai 201306, China.

The detection of heavy metal ions, particularly lead (Pb²⁺), in environmental samples is crucial for public health and safety. Current nucleic acid signal amplification methods for Pb²⁺ detection often rely on biological enzymes and are limited in applicability due to high costs, prolonged detection times, and nonspecific adsorption. In this study, we introduce an enzyme-free, DNAzyme-mediated isothermal catalytic hairpin assembly (DMICHA) assay, which combines a DNAzyme-based Pb²⁺ recognition module with a signal amplification process utilizing isothermal catalytic hairpin assembly (CHA).

Evaluating the influence of sports-induced trauma on temporomandibular disorders: A systematic review and meta-analysis.

Arch Oral Biol

December 2024

Department of Research Analytics, Saveetha Dental College and Hospitals, Saveetha Institute of Medical and Technical Sciences, Saveetha University, Chennai, India; Department of Prosthodontics, Faculty of Stomatology, Yerevan State Medical University after Mkhitar Heratsi, Yerevan, Armenia; Department of Prosthodontics, School of Dentistry, Tehran University of Medical Sciences, Tehran, Iran.

Objective: The main aim of this study was to identify the existing literature on the association between sporting activities and temporomandibular disorders and to critically appraise evidence of this association through a systematic review and meta-analysis.

Design: A comprehensive search was conducted using PubMed, ScienceDirect, Dimensions, Google Scholar, Cochrane Library, and the Education Resources Information Centre (ERIC). Articles were selected using pre-specified eligibility criteria.
