Objective: To assess the methodological quality of studies on prediction models developed using machine learning techniques across all medical specialties.

Design: Systematic review.

Data Sources: PubMed from 1 January 2018 to 31 December 2019.

Eligibility Criteria: Articles reporting on the development, with or without external validation, of a multivariable prediction model (diagnostic or prognostic) developed using supervised machine learning for individualised predictions. No restrictions applied for study design, data source, or predicted patient related health outcomes.

Review Methods: Methodological quality of the studies was determined and risk of bias evaluated using the prediction risk of bias assessment tool (PROBAST). This tool contains 21 signalling questions tailored to identify potential biases in four domains. Risk of bias was measured for each domain (participants, predictors, outcome, and analysis) and each study (overall).

Results: 152 studies were included: 58 (38%) included a diagnostic prediction model and 94 (62%) a prognostic prediction model. PROBAST was applied to 152 developed models and 19 external validations. Of these 171 analyses, 148 (87%, 95% confidence interval 81% to 91%) were rated at high risk of bias. The analysis domain was most frequently rated at high risk of bias. Of the 152 models, 85 (56%, 48% to 64%) were developed with an inadequate number of events per candidate predictor, 62 handled missing data inadequately (41%, 33% to 49%), and 59 assessed overfitting improperly (39%, 31% to 47%). Most models used appropriate data sources to develop (73%, 66% to 79%) and externally validate the machine learning based prediction models (74%, 51% to 88%). Information about blinding of outcome and blinding of predictors was, however, absent in 60 (40%, 32% to 47%) and 79 (52%, 44% to 60%) of the developed models, respectively.

Conclusion: Most studies on machine learning based prediction models show poor methodological quality and are at high risk of bias. Factors contributing to risk of bias include small study size, poor handling of missing data, and failure to deal with overfitting. Efforts to improve the design, conduct, reporting, and validation of such studies are necessary to boost the application of machine learning based prediction models in clinical practice.

Systematic Review Registration: PROSPERO CRD42019161764.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8527348PMC
http://dx.doi.org/10.1136/bmj.n2281DOI Listing

Publication Analysis

Top Keywords

risk bias
32
machine learning
24
prediction models
20
methodological quality
12
prediction model
12
high risk
12
learning based
12
based prediction
12
prediction
9
models
9

Similar Publications

Background: The systemic immune-inflammation index (SII) is an emerging marker of inflammation, and the onset of psoriasis is associated with inflammation. The aim of our study was to investigate the potential impact of SII on the incidence rate of adult psoriasis.

Methods: We conducted a cross-sectional study based on the National Health and Nutrition Examination Survey (NHANES) 2011-2014 data sets.

View Article and Find Full Text PDF

The presence of an aberrant right hepatic artery (a-RHA) could influence the oncological and postoperative outcomes after pancreaticoduodenectomy (PD). A comparative study was conducted, including patients who underwent PD with a-RHA or with normal RHA anatomy. The primary endpoints were R1 resection in all margins (pancreatic, anterior, posterior, superior mesenteric artery, and portal groove), overall survival (OS), and disease-free survival (DFS).

View Article and Find Full Text PDF

Background: Hyaluronidase remains the mainstay treatment for impending filler-induced facial skin necrosis. Complete resolution of impending skin necrosis following hyaluronidase injection is estimated to be around 77.8%.

View Article and Find Full Text PDF

Background: Minimally invasive pancreatoduodenectomy has gained widespread acceptance among hepatopancreatobiliary surgeons due to its demonstrated advantages in perioperative outcomes compared to the conventional open approach. This meta-analysis, along with trial sequential analysis, aimed to compare the outcomes of robotic pancreatoduodenectomy and laparoscopic pancreatoduodenectomy based on the current available evidence.

Methods: A systematic search of PubMed, Cochrane, Scopus, and Web of Science was conducted from inception to July 2024.

View Article and Find Full Text PDF

Brucellosis is a bacterial disease of many domestic and wild animals with great economic and public health importance. Although it has a major constraint in dairy production, comprehensive information regarding the epidemiology of brucellosis in dairy herds is limited. Besides, evaluating the dairy farmers' knowledge, attitude, and practice (KAP) regarding brucellosis is crucial for generating information that can enhance control programs and public health interventions.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!