In Environmental science & technology ; h5-index 132.0
Recent advances in machine-learning methods offer the opportunity to improve risk assessment and to decipher factors influencing spatial variability of groundwater arsenic ([As]gw). A systematic comparison revealed that boosted regression trees (BRT) and random forest (RF) outperformed logistic regression. The probability of [As]gw exceeding 5 μg/L (approximate median value of Bangladesh [As]gw), 10 μg/L (WHO provisional guideline value), and 50 μg/L (Bangladesh drinking water standard) was modeled by BRT and RF methods for Bangladesh and its 4 sub-regions demarcated by major rivers. Of 109 geo-environmental and hydrochemical predictor variables, phosphorus and iron emerged as the most important across spatial scales, consistent with known As mobilization mechanisms. Well depth is significant only when hydrochemical parameters are not considered, consistent with prior studies. A peak of probability of [As]gw exceedance at ~ 30 m depth is evident in the partial dependence plots (PDPs) for spatial-parameter-only models but not in the equivalent all-parameter models, suggesting that sediment depositional history explains interdependent spatial patterns of groundwater As-P-Fe in Holocene aquifers. The South region exhibits a decrease of probability of [As]gw exceedance below 150 m depth in PDPs for spatial-parameter-only and all-parameter models, supporting that the deeper Pleistocene aquifer is a low-As water resource.
Tan Zhen, Yang Qiang, Zheng Yan