The over-representation of significant p values in abstracts compared to corresponding full texts: A systematic review of surgical randomized trials.

Contemp Clin Trials Commun

University of New South Wales, South Western Sydney Clinical School, Liverpool Hospital, Liverpool, Australia.

Published: September 2017

AI Article Synopsis

Article Abstract

Background: Abstracts are often the only read summaries of research findings, and it is essential that they accurately represent of the contents of the full text of the randomised control trial (RCT). We investigated whether outcomes in surgical trials were selectively reported in abstracts based on their statistical significance.

Objective: To compare the proportion of significant p-values reported in abstracts to their corresponding full texts in surgical RCTs.

Method: A Meta-analysis of 350 full text RCTs conducted on humans that compared a surgical intervention to any other intervention. An electronic search of MEDLINE, EMBASE, and the Cochrane Central Register of Controlled Trials (CENTRAL) was conducted. All outcomes were extracted from the abstract and the full text. Frequency histograms were used to plot the distribution of numerically reported p-values across the statistical significance spectrum. For each RCT, a 2 × 2 table was populated with that trial's outcomes and whether the outcome was statistically significant (p < 0.05). From each 2 × 2 table, an odds ratio (OR) was calculated describing the association between statistical significance, and reporting in the abstract. ORs were pooled in random effects meta-analysis for an overall estimate of the association.

Results: A total of 8258 reported outcomes were included. Outcomes reported in a surgical RCT abstract had three times the odds of being significant when compared to the corresponding full text (OR = 3.0, 95% confidence interval 2.5-3.6, p < 0.001). This finding was consistent and not subject to heterogeneity (I = 0%). Both histograms demonstrated a large drop in the frequency of reported p values between 0.04 and 0.05, and after the 0.06 thresholds.

Conclusions: Data presented in abstracts is biased to statistically significant outcomes. Clinicians and policy makers should do not rely solely on information presented in abstracts for their decision-making.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC5898552PMC
http://dx.doi.org/10.1016/j.conctc.2017.07.007DOI Listing

Publication Analysis

Top Keywords

full text
12
corresponding full
8
full texts
8
reported abstracts
8
full
5
over-representation values
4
abstracts
4
values abstracts
4
abstracts compared
4
compared corresponding
4

Similar Publications

Background: There is potential for digital mental health interventions to provide affordable, efficient, and scalable support to individuals. Digital interventions, including cognitive behavioral therapy, stress management, and mindfulness programs, have shown promise when applied in workplace settings.

Objective: The aim of this study is to conduct an umbrella review of systematic reviews in order to critically evaluate, synthesize, and summarize evidence of various digital mental health interventions available within a workplace setting.

View Article and Find Full Text PDF

Background: Recent advancements in artificial intelligence (AI) have changed the care processes in mental health, particularly in decision-making support for health care professionals and individuals with mental health problems. AI systems provide support in several domains of mental health, including early detection, diagnostics, treatment, and self-care. The use of AI systems in care flows faces several challenges in relation to decision-making support, stemming from technology, end-user, and organizational perspectives with the AI disruption of care processes.

View Article and Find Full Text PDF

Purpose: The purpose of this systematic review was to consolidate and summarize available data comparing virtual reality perimetry (VRP) with standard automated perimetry (SAP) in adults with glaucoma. Understanding the utility and diagnostic performance of emerging VRP technology may expand access to visual field testing but requires evidence-based validation.

Methods: A systematic literature search was conducted in 3 databases (PubMed Central, Embase, and Cochrane Central Register of Controlled Trials) from the date of inception to 10/09/2024.

View Article and Find Full Text PDF

The reliability of cardiotocographs as diagnostic tools for fetal well-being is hampered by interpretational variations among healthcare professionals, contributing to high rates of cesarean sections and instrumental deliveries. While adjunct technologies may be used to confirm cases of fetal distress, those in resource constrained areas continue to rely on visual cardiotocograph interpretation to come up with the diagnosis of fetal hypoxia. This study investigated the factors contributing to variations in the visual interpretation of intrapartum cardiotocograph among healthcare professionals in the absence of adjunctive technologies.

View Article and Find Full Text PDF

Introduction: Assessment of functional capacity is an essential part of peri-operative risk stratification. Subjective functional capacity is easier to examine than objective tests of patient fitness. However, the association between subjective functional capacity and postoperative outcomes has not been established.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!