AI Article Synopsis

  • In 2016, the CDC began tracking pregnant women infected with Zika virus and their infants across the U.S. to identify Zika-associated birth defects through manual review of medical data.
  • As the number of reported cases increased during the outbreak, the surveillance system faced challenges, leading to the exploration of machine learning as a way to predict case status.
  • Ensemble machine learning models were developed to effectively identify cases with high sensitivity, resulting in significant reductions in the amount of data requiring manual review, indicating a promising approach for future public health emergencies.

Article Abstract

In 2016, Centers for Disease Control and Prevention (CDC) established surveillance of pregnant women with Zika virus infection and their infants in the U.S. states, territories, and freely associated states. To identify cases of Zika-associated birth defects, subject matter experts review data reported from medical records of completed pregnancies to identify findings that meet surveillance case criteria (manual review). The volume of reported data increased over the course of the Zika virus outbreak in the Americas, challenging the resources of the surveillance system to conduct manual review. Machine learning was explored as a possible method for predicting case status. Ensemble models (using machine learning algorithms including support vector machines, logistic regression, random forests, k-nearest neighbors, gradient boosted trees, and decision trees) were developed and trained using data collected from January 2016-October 2017. Models were developed separately, on data from the U.S. states, non-Puerto Rico territories, and freely associated states (referred to as the U.S. Zika Pregnancy and Infant Registry [USZPIR]) and data from Puerto Rico (referred to as the Zika Active Pregnancy Surveillance System [ZAPSS]) due to differences in data collection and storage methods. The machine learning models demonstrated high sensitivity for identifying cases while potentially reducing volume of data for manual review (USZPIR: 96% sensitivity, 25% reduction in review volume; ZAPSS: 97% sensitivity, 50% reduction in review volume). Machine learning models show potential for identifying cases of Zika-associated birth defects and for reducing volume of data for manual review, a potential benefit in other public health emergency response settings.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC8054247PMC
http://dx.doi.org/10.1002/bdr2.1767DOI Listing

Publication Analysis

Top Keywords

machine learning
20
manual review
16
zika-associated birth
12
birth defects
12
review volume
12
zika virus
8
territories freely
8
freely associated
8
associated states
8
cases zika-associated
8

Similar Publications

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!