Introduction: ChatGPT, a sophisticated large language model (LLM), has garnered widespread attention for its ability to mimic human-like communication. As recent studies indicate a potential supportive role of ChatGPT in academic writing, we assessed the LLM's capacity to generate accurate and comprehensive scientific abstracts from published Randomised Controlled Trial (RCT) data, focusing on the adherence to the Consolidated Standards of Reporting Trials for Abstracts (CONSORT-A) statement, in comparison to the original authors' abstracts.

Methodology: RCTs, identified in a PubMed/MEDLINE search post-September 2021 across various medical disciplines, were subjected to abstract generation via ChatGPT versions 3.5 and 4, following the guidelines of the respective journals. The overall quality score (OQS) of each abstract was determined by the total number of adequately reported components from the 18-item CONSORT-A checklist. Additional outcome measures included percent adherence to each CONOSORT-A item, readability, hallucination rate, and regression analysis of reporting quality determinants.

Results: Original abstracts achieved a mean OQS of 11.89 (95% CI: 11.23-12.54), outperforming GPT 3.5 (7.89; 95% CI: 7.32-8.46) and GPT 4 (5.18; 95% CI: 4.64-5.71). Compared to GPT 3.5 and 4 outputs, original abstracts were more adherent with 10 and 14 CONSORT-A items, respectively. In blind assessments, GPT 3.5-generated abstracts were deemed most readable in 62.22% of cases which was significantly greater than the original (31.11%; P = 0.003) and GPT 4-generated (6.67%; P<0.001) abstracts. Moreover, ChatGPT 3.5 exhibited a hallucination rate of 0.03 items per abstract compared to 1.13 by GPT 4. No determinants for improved reporting quality were identified for GPT-generated abstracts.

Conclusions: While ChatGPT could generate more readable abstracts, their overall quality was inferior to the original abstracts. Yet, its proficiency to concisely relay key information with minimal error holds promise for medical research and warrants further investigations to fully ascertain the LLM's applicability in this domain.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10866463PMC
http://journals.plos.org/plosone/article?id=10.1371/journal.pone.0297701PLOS

Publication Analysis

Top Keywords

original abstracts
12
scientific abstracts
8
abstracts
7
chatgpt
5
original
5
gpt
5
chatgpt assist
4
assist authors
4
authors abstract
4
abstract writing
4

Similar Publications

Introduction: Atrial fibrillation (AF) is the most prevalent form of cardiac arrhythmia worldwide. Early diagnosis and treatment are essential, emphasizing the need to develop novel biomarkers. Lipoprotein(a) [Lp(a)] has recently been widely investigated as a potential risk factor for various cardiovascular conditions, including AF.

View Article and Find Full Text PDF

Introduction: Work engagement enhances nurses' physical and mental health, well-being, job performance and satisfaction. This reduces turnover rates and improves patient care quality, making work engagement a crucial factor in the nursing workplace. However, no systematic review or meta-analysis has explored the effects of randomised controlled trial (RCT) interventions aimed at improving nurses' work engagement.

View Article and Find Full Text PDF

Objective: To compare the rates of surgical site infection (SSI) after hysterectomy using vaginal antisepsis with chlorhexidine gluconate (CHG) versus povidone-iodine (PI).

Data Sources: PubMed, Embase, and Clinicaltrials.gov databases were queried from January 1, 1985 through Dec 7, 2023.

View Article and Find Full Text PDF

Background: Intensive care units (ICUs) handle the most critical patients with a high risk of mortality. Due to those conditions, close monitoring is necessary and therefore, a large volume of data is collected. Collaborative ventures have enabled the emergence of large open access databases, leading to numerous publications in the field.

View Article and Find Full Text PDF

The disencapsulated mind: A premotor theory of human imagination.

Psychol Rev

January 2025

Department of Psychological and Brain Sciences, Dartmouth College.

Our premodern ancestors had perceptual, motoric, and cognitive functional domains that were modularly encapsulated. Some of these came to interact through a new type of cross-modular binding in our species. This allowed previously domain-dedicated, encapsulated motoric and sensory operators to operate on operands for which they had not evolved.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!