In Environmental monitoring and assessment
This study aims to compare three popular machine learning (ML) algorithms including random forest (RF), boosting regression tree (BRT), and multinomial logistic regression (MnLR) for spatial prediction of groundwater quality classes and mapping it for salinity hazard. Three hundred eighty-six groundwater samples were collected from an agriculturally intensive area in Fars Province, Iran, and nine hydro-chemical parameters were defined and interpreted. Variance inflation factor and Pearson's correlations were used to check collinearity between variables. Thereinafter, the performance of ML models was evaluated by statistical indices, namely, overall accuracy (OA) and Kappa index obtained from the confusion matrix. The results showed that the RF model was more accurate than other models with the slight difference. Moreover, the analysis of relative importance also indicated that sodium adsorption ratio (SAR) and pH have the most impact parameters in explaining groundwater quality classes, respectively. In this research, applied ML algorithms along with the hydro-chemical parameters affecting the quality of ground water can lead to produce spatial distribution maps with high accuracy for managing irrigation practice.
Masoudi Reyhaneh, Mousavi Seyed Roohollah, Rahimabadi Pouyan Dehghan, Panahi Mehdi, Rahmani Asghar
2023-Jan-23
Boosting regression tree, Data mining algorithms, Groundwater quality, Hydro-chemical, Multinomial logistic regression, Random forest