Evaluation of data preprocessing and feature selection process for prediction of hourly PM concentration using long short-term memory models.

Environ Pollut

Department of International Trade and Finance, Faculty of Applied Science, Sakarya University of Applied Science, Sakarya, Turkey; AI Research and Application Center, Sakarya University of Applied Sciences, Sakarya, Turkey.

Published: October 2022

Studies have confirmed that PM, defined as respirable particles with diameters of 10 μm and smaller, has adverse effects on human health and the environment. Various estimation methods are employed to determine the PM concentration using historical data on controlling PM air pollution, early warning, and protecting public health and the environment. The present study analyses different Long Short-Term Memory (LSTM) models that can predict hourly PM concentration. In parallel, the study also investigates the effectiveness of the data preprocessing and feature selection (DPFS) process on the prediction accuracy of the LSTM models. For this purpose, three different LSTM models, namely Vanilla, Bi-Directional, and Stacked, were developed. Then, a comprehensive data preprocessing stage is used to eliminate missing and erroneous data and outliers from real-world raw data, and a feature selection process is applied to extract unnecessary features. The LSTM models consider three air quality parameters, including SO, O, and CO, and three meteorological factors, including relative humidity, wind direction, and wind speed. The prediction performances of the LSTM models are compared using the RMSE, MAE and R performance index according to whether DPFS is used in the models or not. As a result, when the DPFS process was applied, the proposed LSTM models achieved high prediction performance and can be used to predict hourly PM concentrations. Overall, the DPFS process significantly enhanced the developed LSTM models' prediction performance. Furthermore, the proposed model might be a useful tool for city administrators to make decisions and improve air quality management efforts.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.envpol.2022.119973DOI Listing

Publication Analysis

Top Keywords

lstm models
24
data preprocessing
12
feature selection
12
dpfs process
12
preprocessing feature
8
selection process
8
process prediction
8
hourly concentration
8
long short-term
8
short-term memory
8

Similar Publications

Monitoring the quantity and quality of karst springs is essential for groundwater resource management. However, it is challenging to robustly forecast the karst spring discharge and pollutant concentration due to the high complexity and heterogeneity of karst aquifers. Few researchers have addressed the long-term prediction of hourly spring quantity and quality, which is crucial for emergency management.

View Article and Find Full Text PDF

The Gravity Recovery and Climate Experiment (GRACE) and its follow-on (GRACE-FO) missions have provided estimates of Terrestrial Water Storage Anomalies (TWSA) since 2002, enabling the monitoring of global hydrological changes. However, temporal gaps within these datasets and the lack of TWSA observations prior to 2002 limit our understanding of long-term freshwater variability. In this study, we develop GRAiCE, a set of four global monthly TWSA reconstructions from 1984 to 2021 at 0.

View Article and Find Full Text PDF

Intelligent Intrusion Detection System Against Various Attacks Based on a Hybrid Deep Learning Algorithm.

Sensors (Basel)

January 2025

Department of Electrical Engineering, Faculty of Engineering, Universitas Indonesia, Depok 16424, Indonesia.

The Internet of Things (IoT) has emerged as a crucial element in everyday life. The IoT environment is currently facing significant security concerns due to the numerous problems related to its architecture and supporting technology. In order to guarantee the complete security of the IoT, it is important to deal with these challenges.

View Article and Find Full Text PDF

Real-time and accurate traffic forecasting aids in traffic planning and design and helps to alleviate congestion. Addressing the negative impacts of partial data loss in traffic forecasting, and the challenge of simultaneously capturing short-term fluctuations and long-term trends, this paper presents a traffic forecasting model, D-MGDCN-CLSTM, based on Multi-Graph Gated Dilated Convolution and Conv-LSTM. The model uses the DTWN algorithm to fill in missing data.

View Article and Find Full Text PDF

A Deep Learning Approach for Mental Fatigue State Assessment.

Sensors (Basel)

January 2025

Institute of Artificial Intelligence in Sports, Capital University of Physical Education and Sports, Beijing 100191, China.

This study investigates mental fatigue in sports activities by leveraging deep learning techniques, deviating from the conventional use of heart rate variability (HRV) feature analysis found in previous research. The study utilizes a hybrid deep neural network model, which integrates Residual Networks (ResNet) and Bidirectional Long Short-Term Memory (Bi-LSTM) for feature extraction, and a transformer for feature fusion. The model achieves an impressive accuracy of 95.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!