In the last decades, the quality of the surface-water bodies has been increasingly threatened mainly due to anthropogenic activities (i.e., urbanization and the consequent increase of impervious surfaces). This leads to the generation of runoff even more rich in pollutants and difficult to adequately treat in the specific first-flush treatment plant. Consequently, accurate water-quality predictions in urban areas and related detections of events that may generate first flush are the key to enhancing urban water management and pollution control. In this study, the ability of a supervised machine-learning technique (XGBoost) to predict the occurrence of first flush was demonstrated. The model showed outstanding performance in predicting first flush for three of the most detected pollutants in urban areas (total suspended solids, total nitrogen, and total phosphorus) by using rainfall-runoff variables as input. Furthermore, by exploiting a non-model-biased method based on game theory (SHAP), such variables were quantified and ranked based on their level of importance in pollutant first-flush predictions. The findings of this work proved that the XGBoost model is a functional tool for enhancing the accuracy of first-flush predictions in urban watersheds and, thus, contributes to the development of an effective design of first-flush treatment plants.

First Flush Occurrence Prediction and Ranking of Its Influential Variables in Urban Watersheds: Evaluation of XGBoost and SHAP Techniques

Gioia A.;Iacobellis V.;
2022

Abstract

In the last decades, the quality of the surface-water bodies has been increasingly threatened mainly due to anthropogenic activities (i.e., urbanization and the consequent increase of impervious surfaces). This leads to the generation of runoff even more rich in pollutants and difficult to adequately treat in the specific first-flush treatment plant. Consequently, accurate water-quality predictions in urban areas and related detections of events that may generate first flush are the key to enhancing urban water management and pollution control. In this study, the ability of a supervised machine-learning technique (XGBoost) to predict the occurrence of first flush was demonstrated. The model showed outstanding performance in predicting first flush for three of the most detected pollutants in urban areas (total suspended solids, total nitrogen, and total phosphorus) by using rainfall-runoff variables as input. Furthermore, by exploiting a non-model-biased method based on game theory (SHAP), such variables were quantified and ranked based on their level of importance in pollutant first-flush predictions. The findings of this work proved that the XGBoost model is a functional tool for enhancing the accuracy of first-flush predictions in urban watersheds and, thus, contributes to the development of an effective design of first-flush treatment plants.
22nd International Conference on Computational Science and Its Applications , ICCSA 2022
978-3-031-10544-9
978-3-031-10545-6
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11589/245040
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact