Enhancing perinatal health patient information through ChatGPT - An accuracy study.

P L M de Vries, D Baud, S Baggio, M Ceulemans, G Favre, E Gerbier, H Legardeur, E Maisonneuve, C Pena-Reyes, L Pomar, U Winterfeld, A Panchaud

PEC Innov

Institute of Primary Health Care (BIHAM), University of Bern, Bern, Switzerland.

Published: June 2025

Objectives: To evaluate ChatGPT's accuracy as an information source for women and maternity-care workers on "nutrition" and "red flags" in pregnancy.

Methods: Eight raters assessed the accuracy of ChatGPT-generated recommendations on a 5-point Likert scale for ten indicators per topic in four languages (French, English, German and Dutch). Accuracy and inter-rater agreement were calculated per topic and language.

Results: For both topics, median accuracy scores of ChatGPT-generated recommendations were excellent (5.0; IQR 4-5), independent of language. Depending on how the question was framed, median accuracy scores varied by at most 1 point on the 5-point Likert scale. Overall accuracy scores were 83-89% for 'nutrition in pregnancy' versus 96-98% for 'red flags in pregnancy'. Inter-rater agreement was good to excellent for both topics.

Conclusion: Although ChatGPT generated accurate recommendations regarding the tested indicators for nutrition and red flags during pregnancy, women should be aware of ChatGPT's limitations, such as inconsistencies that depend on question formulation, language and the woman's personal context.

Innovation: Despite a growing interest in the potential use of artificial intelligence in healthcare, this is, to the best of our knowledge, the first study assessing potential limitations, such as language and question framing, that may affect the accuracy of ChatGPT-generated recommendations in key domains of perinatal health.

PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11872132
DOI: http://dx.doi.org/10.1016/j.pecinn.2025.100381

Similar Publications

ChatGPT 4.0's efficacy in the self-diagnosis of non-traumatic hand conditions.

J Hand Microsurg

May 2025

Division of Hand Surgery, Department of Orthopedic Surgery, Mayo Clinic - 5777 E. Mayo Blvd, Phoenix, AZ, 85054, USA.

Krishna D Unadkat, Isra Abdulwadood, Annika N Hiredesai, Carina P Howlett, Laura E Geldmaker

Background: With advancements in artificial intelligence, patients increasingly turn to generative AI models like ChatGPT for medical advice. This study explores the utility of ChatGPT 4.0 (GPT-4.

Application of ChatGPT as a content generation tool in continuing medical education: acne as a test topic.

Dermatol Reports

November 2024

Zadig ltd Benefit Company, CME national provider, Milan.

Luigi Naldi, Vincenzo Bettoli, Eugenio Santoro, Maria Rosa Valetto, Anna Bolzon

The large language model (LLM) ChatGPT can answer open-ended and complex questions, but its accuracy in providing reliable medical information requires careful assessment. As part of the AICHECK (Artificial Intelligence for CME Health E-learning Contents and Knowledge) Study, which aims to evaluate the potential of ChatGPT in continuing medical education (CME), we compared ChatGPT-generated educational content with the recommendations of the National Institute for Health and Care Excellence (NICE) guidelines on acne vulgaris. ChatGPT version 4 was given a 23-item questionnaire developed by an experienced dermatologist.

Patient preferences for carpal tunnel release education: a comparison of education materials from popular healthcare websites and ChatGPT.

Hand Surg Rehabil

February 2025

Department of Orthopaedic Surgery, Rothman Orthopaedic Institute, Philadelphia, PA, United States.

Nicholas B Pohl, Omar H Tarawneh, Evan Johnson, Daren Aita, Madeline Tadley

Introduction: ChatGPT has been increasingly used to create, simplify, and revise hand surgery patient education materials. While significant research has examined the quality and readability of ChatGPT-derived hand surgery patient education, the patient perspective has not previously been evaluated. This study compared patient-reported clarity and readability grades, as well as patient preferences, for carpal tunnel surgery educational information from medical education websites and ChatGPT.

Trustworthiness, Value, Danger, and Readability of ChatGPT-Generated Responses to Health Questions Related to Pulmonary Arterial Hypertension.

Cureus

October 2024

Public Health, Kahramanmaraş Sütçü İmam University, Kahramanmaraş, TUR.

Murat Kerkütlüoğlu, Erhan Kaya, Rasim Gökmen

Aim: To enhance outcomes for patients with pulmonary arterial hypertension (PAH), comprehensive and individualized therapy is needed. ChatGPT, a large language model based on the Generative Pre-trained Transformer architecture, has the ability to provide expert yet patient-friendly care. We wanted to determine how accurately and consistently ChatGPT could respond to inquiries about PAH knowledge and management.
