Background: Infectious diseases represent a major challenge for health systems worldwide. With the recent global pandemic of COVID-19, the need to research strategies to treat these health problems has become even more pressing. Although the literature on big data and data science in health has grown rapidly, few studies have synthesized these individual studies, and none has identified the utility of big data in infectious disease surveillance and modeling.

Objective: The aim of this study was to synthesize research and identify hotspots of big data in infectious disease epidemiology.

Methods: Bibliometric data from 3054 documents that satisfied the inclusion criteria retrieved from the Web of Science database over 22 years (2000-2022) were analyzed and reviewed. The search retrieval occurred on October 17, 2022. Bibliometric analysis was performed to illustrate the relationships between research constituents, topics, and key terms in the retrieved documents.

Results: The bibliometric analysis revealed internet searches and social media as the most utilized big data sources for infectious disease surveillance or modeling. The analysis also placed US and Chinese institutions as leaders in this research area. Disease monitoring and surveillance, utility of electronic health (or medical) records, methodology framework for infodemiology tools, and machine/deep learning were identified as the core research themes.

Conclusions: Proposals for future studies are made based on these findings. This study will provide health care informatics scholars with a comprehensive understanding of big data research in infectious disease epidemiology.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC10071404PMC
http://dx.doi.org/10.2196/42292DOI Listing

Publication Analysis

Top Keywords

big data
24
infectious disease
20
data infectious
16
bibliometric analysis
12
disease epidemiology
8
disease surveillance
8
data
7
big
6
infectious
6
disease
6

Similar Publications

Background: There is limited population-based evidence on the prevalence of cognitive impairment in Mexico, a country with a rapidly aging population and where key risk factors, such as diabetes and obesity, are common. This study describes the distribution of cognitive impairment in adults from Mexico City.

Methods: This cross-sectional population-based study included participants from the Mexico City Prospective Study which recruited 150,000 adults aged ≥ 35 years in 1998-2004.

View Article and Find Full Text PDF

MEFA-Net: A mask enhanced feature aggregation network for polyp segmentation.

Comput Biol Med

December 2024

Fujian Provincial Key Laboratory of Networking Computing and Intelligent Information Processing, College of Computer and Data Science, Fuzhou University, Fuzhou 350116, China; Engineering Research Center of Big Data Intelligence, Ministry of Education, Fuzhou 350116, China. Electronic address:

Accurate polyp segmentation is crucial for early diagnosis and treatment of colorectal cancer. This is a challenging task for three main reasons: (i) the problem of model overfitting and weak generalization due to the multi-center distribution of data; (ii) the problem of interclass ambiguity caused by motion blur and overexposure to endoscopic light; and (iii) the problem of intraclass inconsistency caused by the variety of morphologies and sizes of the same type of polyps. To address these challenges, we propose a new high-precision polyp segmentation framework, MEFA-Net, which consists of three modules, including the plug-and-play Mask Enhancement Module (MEG), Separable Path Attention Enhancement Module (SPAE), and Dynamic Global Attention Pool Module (DGAP).

View Article and Find Full Text PDF

How small is big enough? Big data-driven machine learning predictions for a full-scale wastewater treatment plant.

Water Res

December 2024

State Key Laboratory of Pollution Control and Resource Reuse, School of Environment, Nanjing University, Nanjing 210023, China. Electronic address:

Wastewater treatment plants (WWTPs) generate vast amounts of water quality, operational, and biological data. The potential of these big data, particularly through machine learning (ML), to improve WWTP management is increasingly recognized. However, the costs associated with data collection and processing can rise sharply as datasets grow larger, and research on determining the optimal data volume for effective ML application remains limited.

View Article and Find Full Text PDF

Background: Research on personality types among doctors reveals its impact on medical specialty choices, suggesting that considering personality in career planning may enhance work satisfaction and reduce burnout risks.

Objective: This study, encompassing 2104 medical students, explores how personality types, traits, and gender relate to specialty preferences.

Methods: Participants of this study were medical students from various universities in Poland.

View Article and Find Full Text PDF

Electronic health records (EHRs), though they are maintained and utilized for clinical and billing purposes, may provide a wealth of information for research. Currently, sources are available that offer insight into the health histories of well over a quarter of a billion people. Their use, however, is fraught with hazards, including introduction or reinforcement of biases, clarity of disease definitions, protection of patient privacy, definitions of covariates or confounders, accuracy of medication usage compared with prescriptions, the need to introduce other data sources such as vaccination or death records and the ensuing potential for inaccuracy, duplicative records, and understanding and interpreting the outcomes of data queries.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!