This study investigated how changes in weather factors affect the prevalence of conjunctivitis using public big data in South Korea. A total of 1,428 public big data entries from January 2013 to December 2019 were collected. Disease data and basic climate/air pollutant concentration records were collected from nationally provided big data. Meteorological factors affecting eye diseases were identified using multiple linear regression and machine learning analysis methods such as extreme gradient boosting (XGBoost), decision tree, and random forest. The prediction model with the best performance was XGBoost (1.180), followed by multiple regression (1.195), random forest (1.206), and decision tree (1.544) when using root mean square error (RMSE) values. With the XGBoost model, province was the most important variable (0.352), followed by month (0.289) and carbon monoxide exposure (0.133). Other air pollutants including sulfur dioxide, PM, nitrogen dioxides, and ozone showed low associations with conjunctivitis. We identified factors associated with conjunctivitis using traditional multiple regression analysis and machine learning techniques. Regional factors were important for the prevalence of conjunctivitis as well as the atmosphere and air quality factors.

Download full-text PDF

Source
http://www.ncbi.nlm.nih.gov/pmc/articles/PMC9203752PMC
http://dx.doi.org/10.1038/s41598-022-13344-5DOI Listing

Publication Analysis

Top Keywords

big data
16
prevalence conjunctivitis
12
public big
12
south korea
8
machine learning
8
decision tree
8
random forest
8
multiple regression
8
conjunctivitis
5
data
5

Similar Publications

Rapid growth in bio-logging-the use of animal-borne electronic tags to document the movements, behaviour, physiology and environments of wildlife-offers opportunities to mitigate biodiversity threats and expand digital natural history archives. Here we present a vision to achieve such benefits by accounting for the heterogeneity inherent to bio-logging data and the concerns of those who collect and use them. First, we can enable data integration through standard vocabularies, transfer protocols and aggregation protocols, and drive their wide adoption.

View Article and Find Full Text PDF

Background: In-person interaction offers invaluable benefits to people. To guarantee safe in-person activities during a COVID-19 outbreak, effective identification of infectious individuals is essential. In this study, we aim to analyze the impact of screening with antigen tests in schools and workplaces on identifying COVID-19 infections.

View Article and Find Full Text PDF

Background: Coronavirus disease-2019 (COVID-19), caused by SARS-CoV-2 virus infection, is characterized as a multisystem disease, potentially yielding multifaceted consequences on various organs at multiple levels. At the end of 2022, over 90% of the Chinese population was infected by SARS-CoV-2 within 35 days because of adjustments to epidemic prevention and control policies. This short-term change provides an unprecedented opportunity for comparative studies on COVID-19 infection among large populations.

View Article and Find Full Text PDF

Integrating crowdsourced data in the built environment studies: A systematic review.

J Environ Manage

January 2025

Department of Landscape Architecture, University of Nevada, Las Vegas, NV, USA. Electronic address:

The integration of crowdsourced data has become central to contemporary built environment studies, driven by the rapid growth in digital technologies and participatory approaches that characterize modern urbanism. Despite its potential, a systematic framework for its analysis remains underdeveloped. This review, conducted in accordance with the PRISMA protocol, examines the use of crowdsourced data in shaping the built environment, scrutinizing its applications, crowdsourcing techniques, methodologies, and comparison with other big data forms.

View Article and Find Full Text PDF

Background: The pervasiveness of drug culture has become evident in popular music and social media. Previous research has examined drug abuse content in both social media and popular music; however, to our knowledge, the intersection of drug abuse content in these 2 domains has not been explored. To address the ongoing drug epidemic, we analyzed drug-related content on Twitter (subsequently rebranded X), with a specific focus on lyrics.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!