Purpose: Patients and clinicians commonly use ChatGPT as a source of information. However, it can be prone to error, and its output requires validation. We sought to assess the quality and accuracy of information on corneal transplantation and Fuchs dystrophy from 2 iterations of ChatGPT and to determine whether its answers improve over time.

Methods: A total of 10 corneal specialists collaborated to assess the algorithm's responses to 10 commonly asked questions related to endothelial keratoplasty and Fuchs dystrophy. These questions were posed to both ChatGPT-3.5 and its newer generation, GPT-4. Assessments covered the quality, safety, accuracy, and bias of the information. Chi-squared tests, Fisher exact tests, and regression analyses were conducted.

Results: We analyzed 180 valid responses. On a 1 (A+) to 5 (F) scale, the average score given by all specialists across questions was 2.5 for ChatGPT-3.5 and 1.4 for GPT-4, a significant improvement (P < 0.0001). Most responses by both ChatGPT-3.5 (61%) and GPT-4 (89%) used correct facts, a proportion that significantly improved across iterations (P < 0.00001). Approximately a third (35%) of responses from ChatGPT-3.5 were considered against the scientific consensus, a notable rate of error that decreased to only 5% of answers from GPT-4 (P < 0.00001).

Conclusions: The quality of ChatGPT's responses improved significantly between versions 3.5 and 4, and the odds of providing information against the scientific consensus decreased. However, the technology is still capable of producing inaccurate statements. Corneal specialists are uniquely positioned to help users discern the veracity and application of such information.

Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11076168
DOI: http://dx.doi.org/10.1097/ICO.0000000000003439
