The recent rise in telemedicine, notably during the COVID-19 pandemic, highlights the potential of integrating artificial intelligence tools in healthcare. This study assessed the effectiveness of ChatGPT versus medical oncologists in the telemedicine-based management of metastatic prostate cancer. In this retrospective study, 102 patients who met inclusion criteria were analyzed to compare the competencies of ChatGPT and oncologists in telemedicine consultations. ChatGPT's role in pre-charting and determining the need for in-person consultations was evaluated. The primary outcome was the concordance between ChatGPT and oncologists in treatment decisions. Results showed a moderate concordance (Cohen's Kappa = 0.43, < 0.001). The number of diagnoses made by both parties was not significantly different (median number of diagnoses: 5 vs. 5, = 0.12). In conclusion, ChatGPT exhibited moderate agreement with oncologists in management via telemedicine, indicating the need for further research to explore its healthcare applications.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11394468PMC
http://dx.doi.org/10.3390/diagnostics14171899DOI Listing

Publication Analysis

Top Keywords

effectiveness chatgpt
8
telemedicine-based management
8
management metastatic
8
metastatic prostate
8
chatgpt oncologists
8
number diagnoses
8
chatgpt telemedicine-based
4
prostate carcinoma
4
carcinoma rise
4
rise telemedicine
4

Similar Publications

A comparison of the persuasiveness of human and ChatGPT generated pro-vaccine messages for HPV.

Front Public Health

January 2025

CAS Key Laboratory of Behavioral Science, Institute of Psychology, Chinese Academy of Sciences, Beijing, China.

Introduction: Public health messaging is crucial for promoting beneficial health outcomes, and the latest advancements in artificial intelligence offer new opportunities in this field. This study aimed to evaluate the effectiveness of ChatGPT-4 in generating pro-vaccine messages on different topics for Human Papillomavirus (HPV) vaccination.

Methods: In this study ( = 60), we examined the persuasive effect of pro-vaccine messages generated by GPT-4 and humans, which were constructed based on 17 factors impacting HPV vaccination.

View Article and Find Full Text PDF

Classification performance and reproducibility of GPT-4 omni for information extraction from veterinary electronic health records.

Front Vet Sci

January 2025

Department of Pathology, Microbiology and Immunology, School of Veterinary Medicine, University of California, Davis, Davis, CA, United States.

Large language models (LLMs) can extract information from veterinary electronic health records (EHRs), but performance differences between models, the effect of hyperparameter settings, and the influence of text ambiguity have not been previously evaluated. This study addresses these gaps by comparing the performance of GPT-4 omni (GPT-4o) and GPT-3.5 Turbo under different conditions and by investigating the relationship between human interobserver agreement and LLM errors.

View Article and Find Full Text PDF

Transformer Graph Variational Autoencoder for Generative Molecular Design.

Biophys J

January 2025

Department of Machine Learning, Moffitt Cancer Center, Tampa, Florida, United States. Electronic address:

In the field of drug discovery, the generation of new molecules with desirable properties remains a critical challenge. Traditional methods often rely on SMILES (Simplified Molecular Input Line Entry System) representations for molecular input data, which can limit the diversity and novelty of generated molecules. To address this, we present the Transformer Graph Variational Autoencoder (TGVAE), an innovative AI model that employs molecular graphs as input data, thus captures the complex structural relationships within molecules more effectively than string models.

View Article and Find Full Text PDF

Background: The implementation of large language models (LLMs), such as BART (Bidirectional and Auto-Regressive Transformers) and GPT-4, has revolutionized the extraction of insights from unstructured text. These advancements have expanded into health care, allowing analysis of social media for public health insights. However, the detection of drug discontinuation events (DDEs) remains underexplored.

View Article and Find Full Text PDF

Background: Fetal growth restriction (FGR) is a leading risk factor for stillbirth, yet the diagnosis of FGR confers considerable prognostic uncertainty, as most infants with FGR do not experience any morbidity. Our objective was to use data from a large, deeply phenotyped observational obstetric cohort to develop a probabilistic graphical model (PGM), a type of "explainable artificial intelligence (AI)", as a potential framework to better understand how interrelated variables contribute to perinatal morbidity risk in FGR.

Methods: Using data from 9,558 pregnancies delivered at ≥ 20 weeks with available outcome data, we derived and validated a PGM using randomly selected sub-cohorts of 80% (n = 7645) and 20% (n = 1,912), respectively, to discriminate cases of FGR resulting in composite perinatal morbidity from those that did not.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!