In the pursuit of understanding surface water quality for sustainable urban management, we created a machine learning modeling framework that utilized Random Forest (RF), Cubist, Extreme Gradient Boosting (XGB), Multivariate Adaptive Regression Splines (MARS), Gradient Boosting Machine (GBM), Support Vector Machine (SVM), and their hybrid stacking ensemble RF (SE-RF), as well as stacking Cubist (SE-Cubist), to predict the distribution of water quality in the Howrah Municipal Corporation (HMC) area in West Bengal, India. Additionally, we employed the ReliefF and Shapley Additive exPlanations (SHAP) methods to elucidate the underlying factors driving water quality. We first estimated the water quality index (WQI) to model seven water quality parameters: total hardness (TH), pH, total dissolved solids (TDS), dissolved oxygen (DO), biochemical oxygen demand (BOD), calcium (Ca), magnesium (Mg).
View Article and Find Full Text PDF