Objectives: To evaluate ChatGPT's accuracy as an information source for women and maternity-care workers on "nutrition" and "red flags" in pregnancy.
Methods: Eight raters assessed the accuracy of ChatGPT-generated recommendations on a 5-point Likert scale for ten indicators per topic in four languages (French, English, German and Dutch). Accuracy and inter-rater agreement were calculated per topic and language.
Results: For both topics, median accuracy scores of ChatGPT-generated recommendations were excellent (5.0; IQR 4-5) regardless of language. Median accuracy scores varied by at most 1 point on the 5-point Likert scale depending on how the question was framed. Overall accuracy scores were 83-89% for 'nutrition in pregnancy' versus 96-98% for 'red flags in pregnancy'. Inter-rater agreement was good to excellent for both topics.
Conclusion: Although ChatGPT generated accurate recommendations for the tested indicators on nutrition and red flags during pregnancy, women should be aware of ChatGPT's limitations, such as inconsistencies depending on question formulation, language and the woman's personal context.
Innovation: Despite growing interest in the potential use of artificial intelligence in healthcare, this is, to the best of our knowledge, the first study to assess potential limitations, such as language and question framing, that may affect the accuracy of ChatGPT-generated recommendations in key domains of perinatal health.
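The per-topic, per-language summaries mentioned in the Methods (median, IQR and an overall accuracy percentage) could be computed roughly as in the sketch below. This is an illustration only, not the authors' analysis: the function name `summarize`, the sample ratings, and the definition of overall accuracy as the share of the maximum attainable score are all assumptions. Inter-rater agreement is left out because the abstract does not name the statistic used.

```python
from statistics import median, quantiles


def summarize(scores, scale_max=5):
    """Summarize Likert ratings (1..scale_max) pooled over raters and indicators."""
    # Quartiles by the "inclusive" method; other quartile conventions may differ slightly.
    q1, _, q3 = quantiles(scores, n=4, method="inclusive")
    return {
        "median": median(scores),
        "iqr": (q1, q3),
        # Assumption: overall accuracy = sum of scores as a share of the maximum possible.
        "percent_of_max": 100 * sum(scores) / (scale_max * len(scores)),
    }


# Made-up ratings for illustration only; these are NOT data from the study.
ratings = {
    ("nutrition", "English"): [5, 4, 5, 5, 3, 4, 5, 5],
    ("red flags", "English"): [5, 5, 5, 4, 5, 5, 5, 5],
}

# One summary per (topic, language) pair, mirroring "per topic and language".
summary = {key: summarize(values) for key, values in ratings.items()}

for (topic, language), stats in summary.items():
    print(topic, language, stats)
```

Keying the ratings by (topic, language) keeps each group's scores separate, so the same function can be reused for every combination the study describes.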
Download full-text PDF | Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11872132 | PMC |
http://dx.doi.org/10.1016/j.pecinn.2025.100381 | DOI Listing |
PEC Innov
June 2025
Institute of Primary Health Care (BIHAM), University of Bern, Bern, Switzerland.
J Hand Microsurg
May 2025
Division of Hand Surgery, Department of Orthopedic Surgery, Mayo Clinic - 5777 E. Mayo Blvd, Phoenix, AZ, 85054, USA.
Background: With advancements in artificial intelligence, patients increasingly turn to generative AI models like ChatGPT for medical advice. This study explores the utility of ChatGPT 4.0 (GPT-4.
Dermatol Reports
November 2024
Zadig ltd Benefit Company, CME national provider, Milan.
The large language model (LLM) ChatGPT can answer open-ended and complex questions, but its accuracy in providing reliable medical information requires a careful assessment. As part of the AICHECK (Artificial Intelligence for CME Health E-learning Contents and Knowledge) Study, aimed at evaluating the potential of ChatGPT in continuous medical education (CME), we compared ChatGPT-generated educational contents to the recommendations of the National Institute for Health and Care Excellence (NICE) guidelines on acne vulgaris. ChatGPT version 4 was exposed to a 23-item questionnaire developed by an experienced dermatologist.
Hand Surg Rehabil
February 2025
Department of Orthopaedic Surgery, Rothman Orthopaedic Institute, Philadelphia, PA, United States.
Introduction: ChatGPT has been increasingly utilized to create, simplify, and revise hand surgery patient education materials. While significant research has examined the quality and readability of ChatGPT-derived hand surgery patient education, the patient perspective has not previously been evaluated. This study compared patient reported clarity and readability grades as well as patient preferences for carpal tunnel surgery educational information from medical education websites and ChatGPT.
Cureus
October 2024
Public Health, Kahramanmaraş Sütçü İmam University, Kahramanmaraş, TUR.
Aim: To enhance outcomes for patients with pulmonary arterial hypertension (PAH), comprehensive and individualized therapy is needed. A large language model called Generative Pre-trained Transformer (ChatGPT) has the ability to provide expert yet patient-friendly care. We wanted to determine how well ChatGPT could accurately and consistently respond to inquiries on knowledge and management for PAH.