Predicting discharge mortality after acute ischemic stroke using balanced data.

AMIA Annu Symp Proc

Department of Bioengineering, University of California, Los Angeles, CA ; Medical Imaging Informatics, Department of Radiological Sciences, University of California, Los Angeles, CA.

Published: September 2015

Several models have been developed to predict stroke outcomes (e.g., stroke mortality, patient dependence, etc.) in recent decades. However, there is little discussion regarding the problem of between-class imbalance in stroke datasets, which leads to prediction bias and decreased performance. In this paper, we demonstrate the use of the Synthetic Minority Over-sampling Technique to overcome such problems. We also compare state of the art machine learning methods and construct a six-variable support vector machine (SVM) model to predict stroke mortality at discharge. Finally, we discuss how the identification of a reduced feature set allowed us to identify additional cases in our research database for validation testing. Our classifier achieved a c-statistic of 0.865 on the cross-validated dataset, demonstrating good classification performance using a reduced set of variables.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC4419881PMC

Publication Analysis

Top Keywords

predict stroke
8
stroke mortality
8
stroke
5
predicting discharge
4
discharge mortality
4
mortality acute
4
acute ischemic
4
ischemic stroke
4
stroke balanced
4
balanced data
4

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!