Exploring potential machine learning application based on big data for prediction of wastewater quality from different full-scale wastewater treatment plants.

Sci Total Environ

Institute of Environmental Engineering & Nano-Technology, Tsinghua Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, Guangdong, China; Guangdong Provincial Engineering Research Center for Urban Water Recycling and Environmental Safety, Tsinghua-Shenzhen International Graduate School, Tsinghua University, Shenzhen 518055, Guangdong, China; School of Environment, Tsinghua University, Beijing 100084, China. Electronic address:

Published: August 2022

Water pollution generated from intensive anthropogenic activities has emerged as a critical issue concerning ecosystem balance and livelihoods worldwide. Although optimizing wastewater treatment efficiency is widely regarded as the foremost step to minimize pollutants released into the environment, this widespread application has encountered two major problems: firstly, the significant variation of influent wastewater constituents; secondly, complex treatment processes within wastewater treatment plants (WWTPs). Based on the data collected hourly using real-time sensors in three different full-scale WWTPs (24 h × 365 days × 3 WWTPs × 10 wastewater parameters), this work introduced the potential application of Machine Learning (ML) to predict wastewater quality. In this work, six different ML algorithms were examined and compared, varying from shallow to deep learning architectures including Seasonal Autoregressive Integrated Moving Average (SARIMAX), Random Forest (RF), Support Vector Machine (SVM), Gradient Tree Boosting (GTB), Adaptive Neuro-Fuzzy Inference System (ANFIS) and Long Short-Term Memory (LSTM). These models were developed to detect total phosphorus in the outlet (Outlet-TP), which served as an output variable due to the rising concerns about the eutrophication problem. Irrespective of WWTPs, SARIMAX consistently demonstrated the best performance for regression estimation as evidenced by the lowest values of Mean Square Error (MSE), Mean Absolute Error (MAE), Mean Absolute Percentage Error (MAPE) and the highest coefficient of determination (R). In terms of computation efficiency, SARIMAX exhibited acceptable time computation, acknowledging the successful application of this algorithm for Outlet-TP modeling. In contrast, the complex structure of LSTM made it time-consuming and unstable coupled with noise, while other shallower architectures, i.e., RF, SVM, GTB, and ANFIS were unable to address large datasets with nonlinear and nonstationary behavior. Consequently, this study provides a reliable and accurate approach to forecast wastewater effluent quality, which is pivotal in terms of the socio-economic aspects of wastewater management.

Download full-text PDF

Source
http://dx.doi.org/10.1016/j.scitotenv.2022.154930DOI Listing

Publication Analysis

Top Keywords

wastewater treatment
12
wastewater
9
machine learning
8
wastewater quality
8
treatment plants
8
exploring potential
4
potential machine
4
application
4
learning application
4
application based
4

Similar Publications

We hypothesized that a strategy employing tissue-specific endothelial cells (EC) might facilitate the identification of tissue- or organ-specific vascular functions of ubiquitous metabolites. An unbiased approach was employed to identify water-soluble small molecules with mitogenic activity on choroidal EC. We identified adenosine diphosphate (ADP) as a candidate, following biochemical purification from mouse EL4 lymphoma extracts.

View Article and Find Full Text PDF

Introduction: Enteric pathogens are a leading causes of diarrheal deaths in low-and middle-income countries. The Exposure Assessment of Infections in Rural Ethiopia (EXCAM) project, aims to identify potential sources of bacteria in the genus and, more generally, fecal contamination of infants during the first 1.5 years of life using as indicator.

View Article and Find Full Text PDF

A highly efficient and widely applicable adsorbent for the removal of methylene blue (MB) was created using nitrogen-doped and reduced graphene oxide (NRGO). The effects of NRGO mass, pH, contact time, and the initial MB concentration on the adsorption properties of MB onto NRGO were investigated. The results showed that the adsorption behavior remained stable within the pH range of 2.

View Article and Find Full Text PDF

Microbial Fuel Cells (MFCs) are innovative environmental engineering systems that harness the metabolic activities of microbial communities to convert chemical energy in waste into electrical energy. However, MFC performance optimization remains challenging due to limited understanding of microbial metabolic mechanisms, particularly with complex substrates under realistic environmental conditions. This study investigated the effects of substrate complexity (acetate vs.

View Article and Find Full Text PDF

Thiabendazole (TBZ), a recalcitrant fungicide, is frequently applied in postharvest fruit treatment and generates significant volumes of industrial wastewater (WW) that conventional treatment plants cannot handle. This explores a bioelectrochemical system (BES) for TBZ degradation using Tunisian hypersaline sediments (THSs) as inoculum. Four sets of BES, along with biological controls, were tested using THS subjected to different levels of TBZ biostimulation.

View Article and Find Full Text PDF

Want AI Summaries of new PubMed Abstracts delivered to your In-box?

Enter search terms and have AI summaries delivered each week - change queries or unsubscribe any time!