Artificial intelligence (ChatGPT) ready to evaluate ECG in real life? Not yet!

Digit Health

Bahçeşehir Üniversite Hastanesi Medical Park Göztepe, İstanbul, Turkey.

Published: March 2025

Objective: This study aims at evaluating if ChatGPT-based artificial intelligence (AI) models are effective in interpreting electrocardiograms (ECGs) and determine their accuracy as compared to those of cardiologists. The purpose is therefore to explore if ChatGPT can be employed for clinical setting, particularly where there are no available cardiologists.

Methods: A total of 107 ECG cases classified according to difficulty (simple, intermediate, complex) were analyzed using three AI models (GPT-ECGReader, GPT-ECGAnalyzer, GPT-ECGInterpreter) and compared with the performance of two cardiologists. The statistical analysis was conducted using chi-square and Fisher exact tests using scikit-learn library in Python 3.8.

Results: Cardiologists demonstrated superior accuracy (92.52%) compared to ChatGPT-based models (GPT-ECGReader: 57.94%, GPT-ECGInterpreter: 62.62%, GPT-ECGAnalyzer: 62.62%). Statistically significant differences were observed between cardiologists and AI models ( < 0.05). ChatGPT models exhibited enhanced performance with female patients; however, the differences found were not statistically significant. Cardiologists significantly outperformed AI models across all difficulty levels. When it comes to diagnosing patients with arrhythmia (A) and cardiac structural disease ECG patterns, cardiologists gave the best results though there was no statistical difference between them and AI models in diagnosing people with normal (N) ECG patterns.

Conclusions: ChatGPT-based models have potential in ECG interpretation; however, they currently lack adequate reliability beyond oversight from a doctor. Additionally, further studies that would improve the accuracy of these models, especially in intricate diagnoses are needed.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11898233PMC
http://dx.doi.org/10.1177/20552076251325279DOI Listing

Publication Analysis

Top Keywords

artificial intelligence
8
models gpt-ecgreader
8
intelligence chatgpt
4
chatgpt ready
4
ready evaluate
4
evaluate ecg
4
ecg real
4
real life?
4
life? yet!
4
yet! objective
4

Similar Publications

Early and precise diagnosis of cancer is pivotal for effective therapeutic intervention. Traditional diagnostic methods, despite their reliability, often face limitations such as invasiveness, high costs, labor-intensive procedures, extended processing times, and reduced sensitivity for early-stage detection. Electrochemical biosensing is a revolutionary method that provides rapid, cost-effective, and highly sensitive detection of cancer biomarkers.

View Article and Find Full Text PDF

Large Language Model-Based Critical Care Big Data Deployment and Extraction: Descriptive Analysis.

JMIR Med Inform

March 2025

Department of Critical Care Medicine, Beijing Tiantan Hospital, Capital Medical University, No.119 Nansihuanxi Road, Fengtai District, Beijing, 100070, China, 86 17611757717.

Background: Publicly accessible critical care-related databases contain enormous clinical data, but their utilization often requires advanced programming skills. The growing complexity of large databases and unstructured data presents challenges for clinicians who need programming or data analysis expertise to utilize these systems directly.

Objective: This study aims to simplify critical care-related database deployment and extraction via large language models.

View Article and Find Full Text PDF

Background: Breast cancer, a highly prevalent global cancer, poses significant challenges, especially in advanced stages. Prognostic models are crucial to enhance patient outcomes. Tertiary lymphoid structures (TLS) within the tumor microenvironment have been associated with better prognostic outcomes.

View Article and Find Full Text PDF

The landscape of artificial intelligence (AI) research is witnessing a transformative shift with the emergence of the Kolmogorov-Arnold network (KAN), presenting a novel architectural paradigm aimed to redefine the structural foundations of AI models, which are based on multilayer perceptron (MLP). Through rigorous experimentation and evaluation, we introduce the KAN-electroencephalogram (EEG) model, a tailored design for efficient seizure detection. Our proposed network is tested and successfully generalized on three different datasets, one from the USA, one from Europe, and one from Oceania, recorded with different front-end hardware.

View Article and Find Full Text PDF

Artificial Intelligence Screening Tool for Obstructive Sleep Apnoea: A Study Based on Outpatients at a Sleep Medical Centre.

Nat Sci Sleep

March 2025

Department of Otorhinolaryngology, the Central Hospital of Wuhan, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, 430014, People's Republic of China.

Purpose: Due to the lack of clear screening guidelines for different populations, identify strategies for obstructive sleep apnea (OSA) in the outpatient population are unclear, a large number of potential OSA outpatients have not been identified in time. The purpose of our study was to evaluate the applicability and accuracy of artificial intelligence sleep screening in outpatients and to provide a reference for OSA screening in different populations.

Methods: A type IV wearable artificial intelligence sleep monitoring (AISM) device was used to screen adults in the sleep clinic of the Sleep Medical Center for OSA screening, and the general demographic data of the patients were collected.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!