According to the World Health Organization (WHO), around 60% of all outbreaks are detected using informal sources. In many public health institutes, including the WHO and the Robert Koch Institute (RKI), dedicated groups of public health agents sift through numerous articles and newsletters to detect relevant events. This media screening is one important part of event-based surveillance (EBS). Reading the articles, discussing their relevance, and entering key information into a database is a time-consuming process. To support EBS, and also to gain insight into what makes an article and the event it describes relevant, we developed a natural language processing framework for automated information extraction and relevance scoring. First, we scraped the sources used for EBS at the RKI (WHO Disease Outbreak News and ProMED) and automatically extracted each article's key data: disease, country, date, and confirmed-case count. For this, we performed named entity recognition in two steps. In the first step, EpiTator, an open-source epidemiological annotation tool, suggested many candidate entities for each of these fields. In the second step, we selected the key entity among the candidates: we extracted the key country and disease using a heuristic, with good results, and we trained a naive Bayes classifier to find the key date and confirmed-case count, using the RKI's EBS database as labels; this classifier performed modestly. Then, for relevance scoring, we defined two classes to which any article might belong: an article is relevant if it is in the EBS database and irrelevant otherwise. We compared the performance of different classifiers, using bag-of-words, document embeddings, and word embeddings. The best classifier, a logistic regression, achieved a sensitivity of 0.82 and an index balanced accuracy of 0.61. Finally, we integrated these functionalities into a web application called EventEpi, in which relevant sources are automatically analyzed and put into a database. The user can also provide any URL or text, which is then analyzed in the same way and added to the database.
Each of these steps could be improved, in particular with larger labeled datasets and fine-tuning of the learning algorithms. The overall framework, however, already works well and can be used in production, promising improvements in EBS. The source code and data are publicly available under open licenses.
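
The two metrics reported for the relevance classifier, sensitivity and index balanced accuracy (IBA), can be illustrated with a minimal sketch. It assumes the common definition of IBA as the squared geometric mean of sensitivity and specificity, weighted by a dominance term with alpha = 0.1; the abstract does not state which variant or weighting was used, and the confusion-matrix counts below are invented for illustration only, not results from the study.

```python
def sensitivity(tp, fn):
    """True positive rate: share of relevant articles that were found."""
    return tp / (tp + fn)


def specificity(tn, fp):
    """True negative rate: share of irrelevant articles correctly rejected."""
    return tn / (tn + fp)


def index_balanced_accuracy(tp, fn, tn, fp, alpha=0.1):
    """IBA under the assumed definition: (1 + alpha * dominance) * TPR * TNR.

    dominance = TPR - TNR rewards classifiers that favor the positive
    (here: relevant) class. alpha = 0.1 is an assumption, not a value
    taken from the abstract.
    """
    tpr = sensitivity(tp, fn)
    tnr = specificity(tn, fp)
    dominance = tpr - tnr
    return (1 + alpha * dominance) * tpr * tnr


if __name__ == "__main__":
    # Hypothetical counts for an imbalanced relevant/irrelevant split.
    tp, fn, tn, fp = 40, 10, 180, 20
    print("sensitivity:", sensitivity(tp, fn))
    print("specificity:", specificity(tn, fp))
    print("IBA:", index_balanced_accuracy(tp, fn, tn, fp))
```

Because relevant articles are much rarer than irrelevant ones in media screening, plain accuracy would be dominated by the irrelevant class; sensitivity and IBA put more weight on how well the relevant class is recovered.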
Download full-text PDF |
Source |
---|---|
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC7717563 | PMC |
http://dx.doi.org/10.1371/journal.pcbi.1008277 | DOI Listing |