Advanced forecasting of COVID-19 epidemic: Leveraging ensemble models, advanced optimization, and decomposition techniques.

Comput Biol Med

Department of Information Technology, College of Computer and Information Sciences, Princess Nourah bint Abdulrahman University, P.O.BOX 84428, Riyadh 11671, Saudi Arabia. Electronic address:

Published: June 2024

AI Article Synopsis

  • Accurate forecasting of COVID-19 outbreak patterns is essential for effective interventions, but it's challenging due to volatile data and limitations in prior individual forecasting methods.
  • This study presents a novel ensemble framework that combines three machine learning techniques with optimization methods to enhance predictive accuracy for COVID-19 cases in Italy and Poland.
  • Results indicate that this new model significantly outperforms other forecasting models, achieving high correlation coefficients for predicting cases 10 and 14 days in advance.

Article Abstract

In the global effort to address the outbreak of the new coronavirus pneumonia (COVID-19) pandemic, accurate forecasting of epidemic patterns has become crucial for implementing successful interventions aimed at preventing and controlling the spread of the disease. The correct prediction of the course of COVID-19 outbreaks is a complex and challenging task, mainly because of the significant volatility in the data series related to COVID-19. Previous studies have been limited by the exclusive use of individual forecasting techniques in epidemic modeling, disregarding the integration of diverse prediction procedures. The lack of attention to detail in this situation can yield worse-than-ideal results. Consequently, this study introduces a novel ensemble framework that integrates three machine learning methods (kernel ridge regression (KRidge), Deep random vector functional link (dRVFL), and ridge regression) within a linear relationship (L-K-d-R). The optimization of this framework is accomplished through a distinctive approach, specifically adaptive differential evolution and particle swarm optimization (A-DEPSO). Moreover, an effective decomposition method, known as time-varying filter empirical mode decomposition (TVF-EMD), is employed to decompose the input variables. A feature selection technique, specifically using the light gradient boosting machine (LGBM), is also implemented to extract the most influential input variables. The daily datasets of COVID-19 collected from two countries, namely Italy and Poland, were used as the experimental examples. Additionally, all models are implemented to forecast COVID-19 at two-time horizons: 10- and 14-day ahead (t+10 and t+14). According to the results, the proposed model can yield higher correlation coefficient (R) for both case studies: Italy (t+10 = 0.965, t+14 = 0.961) and Poland (t+10 = 0.952, t+14 = 0.940) than the other models. The experimental results demonstrate that the model suggested in this paper has outstanding results in various kinds of complex epidemic prediction situations. The proposed ensemble model demonstrates exceptional accuracy and resilience, outperforming all similar models in terms of efficacy.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.compbiomed.2024.108442DOI Listing

Publication Analysis

Top Keywords

ridge regression
8
input variables
8
covid-19
6
advanced forecasting
4
forecasting covid-19
4
epidemic
4
covid-19 epidemic
4
epidemic leveraging
4
leveraging ensemble
4
models
4

Similar Publications

Objectives: The objective of this study was to evaluate the long-term clinical outcomes of one-piece narrow-diameter implants (NDIs), with diameters of 2.5 mm and 3.0 mm, and to investigate the factors that affect marginal bone loss (MBL) around these implants.

View Article and Find Full Text PDF

Introduction: Variations in cervical cancer incidence rates and trends have been reported by sociodemographic characteristics. However, research on economic characteristics is limited especially among younger women in the United States.

Methods: We analyzed United States Cancer Statistics data to examine age-standardized cervical cancer incidence rates among women aged 15-29 years during 2007-2020.

View Article and Find Full Text PDF

Background: Words are a natural way to describe mental states in humans, while numerical values are a convenient and effective way to carry out quantitative psychological research. With the growing interest of researchers in gaming disorder, the number of screening tools is growing. However, they all require self-quantification of mental states.

View Article and Find Full Text PDF

Reconfiguration of brain network dynamics in bipolar disorder: a hidden Markov model approach.

Transl Psychiatry

December 2024

School of Computer Science and Technology (School of Data Science), Taiyuan University of Technology, Taiyuan, 030024, China.

Bipolar disorder (BD) is a neuropsychiatric disorder characterized by severe disturbance and fluctuation in mood. Dynamic functional connectivity (dFC) has the potential to more accurately capture the evolving processes of emotion and cognition in BD. Nevertheless, prior investigations of dFC typically centered on larger time scales, limiting the sensitivity to transient changes.

View Article and Find Full Text PDF

When the traditional random forest (RF) algorithm is used to select feature elements in biostatistical data, a large amount of noise data and parameters can affect the importance of the selected feature elements, making the control of feature selection difficult. Therefore, it is a challenge for the traditional RF algorithm to preserve the accuracy of algorithm results in the presence of noise data. Generally, directly removing noise data can result in significant bias in the results.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!