Background: Although uncertainties exist regarding implementation, artificial intelligence-driven generative language models (GLMs) have enormous potential in medicine. Deployment of GLMs could improve patient comprehension of clinical texts and help address low health literacy.

Objective: The goal of this study is to evaluate the potential of ChatGPT-3.5 and GPT-4 to tailor the complexity of medical information to a patient-specific input education level, which is crucial if these models are to serve as tools in addressing low health literacy.

Methods: Input templates related to 2 prevalent chronic diseases (type II diabetes and hypertension) were designed. Each clinical vignette was adjusted for hypothetical patient education levels to evaluate output personalization. To assess the success of a GLM (GPT-3.5 and GPT-4) in tailoring output writing, the readability of pre- and posttransformation outputs was quantified using the Flesch reading ease score (FKRE) and the Flesch-Kincaid grade level (FKGL).
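For readers unfamiliar with the two readability metrics, the following is a minimal sketch of how they are commonly computed from word, sentence, and syllable counts, using the standard published Flesch and Flesch-Kincaid coefficients. The function names and the vowel-group syllable counter are illustrative assumptions, not the tooling used in the study; dedicated readability software uses more careful syllable and sentence detection and will give somewhat different scores.

```python
import re


def flesch_reading_ease(words, sentences, syllables):
    """Standard Flesch reading ease formula (higher = easier to read)."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


def flesch_kincaid_grade(words, sentences, syllables):
    """Standard Flesch-Kincaid grade level formula (approximate US school grade)."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def count_syllables(word):
    """Rough heuristic (an assumption): one syllable per run of vowels.

    This overcounts words with a silent final 'e' such as 'take'.
    """
    groups = re.findall(r"[aeiouy]+", word.lower())
    return max(1, len(groups))


def text_counts(text):
    """Return (words, sentences, syllables) for a non-empty piece of text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return len(words), len(sentences), syllables


if __name__ == "__main__":
    sample = "Take your blood pressure pill every morning. Call the clinic if you feel dizzy."
    w, s, syl = text_counts(sample)
    print("FKRE:", round(flesch_reading_ease(w, s, syl), 2))
    print("FKGL:", round(flesch_kincaid_grade(w, s, syl), 2))
```

Longer sentences and more syllables per word lower the reading ease score and raise the grade level, which is why the two metrics move in opposite directions across the education levels reported below.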

Results: Responses (n=80) were generated using GPT-3.5 and GPT-4 across 2 clinical vignettes. For GPT-3.5, FKRE means were 57.75 (SD 4.75), 51.28 (SD 5.14), 32.28 (SD 4.52), and 28.31 (SD 5.22) for 6th grade, 8th grade, high school, and bachelor's, respectively; FKGL mean scores were 9.08 (SD 0.90), 10.27 (SD 1.06), 13.4 (SD 0.80), and 13.74 (SD 1.18). GPT-3.5 aligned with the prespecified education level only for the bachelor's degree. Conversely, GPT-4's FKRE mean scores were 74.54 (SD 2.6), 71.25 (SD 4.96), 47.61 (SD 6.13), and 13.71 (SD 5.77), with FKGL mean scores of 6.3 (SD 0.73), 6.7 (SD 1.11), 11.09 (SD 1.26), and 17.03 (SD 1.11) for the same respective education levels. GPT-4 met the target readability for all groups except the 6th-grade FKRE average. Both GLMs produced outputs with statistically significant differences (FKRE: 6th grade P<.001; 8th grade P<.001; high school P<.001; bachelor's P=.003; FKGL: 6th grade P=.001; 8th grade P<.001; high school P<.001; bachelor's P<.001) between mean FKRE and FKGL across input education levels.

Conclusions: GLMs can change the structure and readability of medical text outputs according to input-specified education. However, GLMs categorize input education designation into 3 broad tiers of output readability: easy (6th and 8th grade), medium (high school), and difficult (bachelor's degree). This is the first result to suggest that there are broader boundaries in the success of GLMs in output text simplification. Future research must establish how GLMs can reliably personalize medical texts to prespecified education levels to enable a broader impact on health care literacy.


Source
PMC: http://www.ncbi.nlm.nih.gov/pmc/articles/PMC11350306
DOI: http://dx.doi.org/10.2196/54371


