Background: As artificial intelligence (AI) becomes increasingly prevalent in the medical field, the effectiveness of AI-generated medical reports in disease diagnosis remains to be evaluated. ChatGPT is a large language model developed by open AI with a notable capacity for text abstraction and comprehension. This study aimed to explore the capabilities, limitations, and potential of Generative Pre-trained Transformer (GPT)-4 in analyzing thyroid cancer ultrasound reports, providing diagnoses, and recommending treatment plans.

Methods: Using 109 diverse thyroid cancer cases, we evaluated GPT-4's performance by comparing its generated reports to those from doctors with various levels of experience. We also conducted a Turing Test and a consistency analysis. To enhance the interpretability of the model, we applied the Chain of Thought (CoT) method to deconstruct the decision-making chain of the GPT model.

Results: GPT-4 demonstrated proficiency in report structuring, professional terminology, and clarity of expression, but showed limitations in diagnostic accuracy. In addition, our consistency analysis highlighted certain discrepancies in the AI's performance. The CoT method effectively enhanced the interpretability of the AI's decision-making process.

Conclusions: GPT-4 exhibits potential as a supplementary tool in healthcare, especially for generating thyroid gland diagnostic reports. Our proposed online platform, "ThyroAIGuide", alongside the CoT method, underscores the potential of AI to augment diagnostic processes, elevate healthcare accessibility, and advance patient education. However, the journey towards fully integrating AI into healthcare is ongoing, requiring continuous research, development, and careful monitoring by medical professionals to ensure patient safety and quality of care.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10895085PMC
http://dx.doi.org/10.21037/qims-23-1180DOI Listing

Publication Analysis

Top Keywords

cot method
12
chain thought
8
thyroid cancer
8
consistency analysis
8
assessing role
4
gpt-4
4
role gpt-4
4
thyroid
4
gpt-4 thyroid
4
thyroid ultrasound
4

Similar Publications

Background: Large language models (LLMs) have demonstrated impressive performance on medical licensing and diagnosis-related exams. However, comparative evaluations to optimize LLM performance and ability in the domain of comprehensive medication management (CMM) are lacking. The purpose of this evaluation was to test various LLMs performance optimization strategies and performance on critical care pharmacotherapy questions used in the assessment of Doctor of Pharmacy students.

View Article and Find Full Text PDF

This study presents a family of coexisting multi-scroll chaos in a network of coupled non-oscillatory neurons. The dynamics of the system are analyzed using phase portraits, basins of attraction, time series, bifurcation diagrams, and spectra of Lyapunov exponents. The coexistence of multiple bifurcation diagrams leads to a complex pattern of multi-scroll formation, which is further complicated by the presence of coexisting single-scroll attractors that merge to form multi-scroll chaos.

View Article and Find Full Text PDF

Introduction: We aimed to explore if anatomical and technical features could interact and favor the chances of reperfusion according to the treatment strategy: combined technique (CoT) of mechanical thrombectomy (MT) with contact aspiration and stent-retriever (SR) versus SR alone.

Methods: Retrospective analysis of a prospective MT database for carotid terminus or MCA-M1 occlusion, first-line SR alone or CoT, and angiographic run with SR deployed on the first pass. The primary analysis involved the interaction between clinical and angiographic characteristics and first-line MT modality on first-pass effect (FPE; first pass eTICI2c-3).

View Article and Find Full Text PDF

Introduction: American College of Surgeons-Committee on Trauma (ACS-COT) defines minimum Standard Criteria (SC) for Level 1 trauma. In our hospital, discretion of prehospital personnel ("Paramedic Judgment" [PJ]) can initiate Full Trauma Triage Activation (FTTA) in the absence of ACS-COT criteria. The aim of this study was to evaluate overtriage and undertriage for PJ vs SC.

View Article and Find Full Text PDF

Cotinine, trans-3'-hydroxycotinine, and nicotine metabolite ratio indicate association between smoking and tooth loss.

J Periodontol

January 2025

Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'an Jiaotong University, Xi'an, Shaanxi, China.

Background: Previous research has indicated a potential connection between smoking and tooth loss, but it remains unclear how the metabolites of nicotine, cotinine (COT) and trans-3'-hydroxycotinine (HC), and the nicotine metabolite ratio (NMR) affect the occurrence and progress of tooth loss. In this study, we aimed to investigate the relationship between tooth loss and smoking metabolites, then verify how the systemic immunoinflammatory index (SII) or monocyte to high-density lipoprotein cholesterol ratio (MHR) levels mediate this process.

Methods: The cross-sectional study data were collected from the National Health and Nutrition Examination Survey (NHANES).

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!