In Journal of environmental management
Evaporation is an important hydrological process in the water cycle, especially for water bodies. Machine Learning (ML) models have become accurate and powerful tools in predicting pan evaporation. Meanwhile, the "black-box" character and the consistency with the physical process can decrease the practical implication of ML models. To overcome such limitations, we attempt to develop an interpretable based-ML framework to predict daily pan evaporation using Extra Tree, XGBoost, SVR, and Deep Neural Network (DNN) ML models using hourly climate datasets. To that end, we integrated and employed the Shapely Additive explanations (SHAP), Sobol-based sensitivity analysis, and Local Interpretable Model-agnostic Explanations (LIME) to evaluate the interpretability of the models in predicting daily pan evaporation, at Sidi Mohammed Ben Abdellah (SMBA) weather station, in Morocco. The validation results of the models showed that the developed models are accurate in reproducing the daily pan evaporation with NSE ranging from 0.76 to 0.83 during the validation phase. Furthermore, the interpretability results of the ML models showed that the air temperature (Ta), solar radiation (Rs), followed by relative humidity (H) are the most important climate variables with inflection points of the Ta_median, Ta_mean, Rs_sum, H_mean, and w_std are 17.42 °C, 17.65 °C, 3.8 kw.m-2, 69.59%, and 1.25 m s-1, sequentially. Overall, the interpretability of the models showed a good consistency of the ML models with the real hydro-climatic process of evaporation in a semi-arid environment. Hence, the proposed methodology is powerful in enhancing the reliability and transparency of the developed models for predicting daily pan evaporation. Finally, the proposed approach is new insights to reduce the ''Black-Box'' character of ML models in hydrological studies.
El Bilali Ali, Abdeslam Taleb, Ayoub Nafii, Lamane Houda, Ezzaouini Mohamed Abdellah, Elbeltagi Ahmed
2022-Nov-29
Climate variables, Interpretable machine learning, LIME, SHAP, Sobol index