In Environmental science and ecotechnology
Data-driven approaches that make timely predictions about pollutant concentrations in the effluent of constructed wetlands are essential for improving the treatment performance of constructed wetlands. However, the effect of the meteorological condition and flow changes in a real scenario are generally neglected in water quality prediction. To address this problem, in this study, we propose an approach based on multi-source data fusion that considers the following indicators: water quality indicators, water quantity indicators, and meteorological indicators. In this study, we establish four representative methods to simultaneously predict the concentrations of three representative pollutants in the effluent of a practical large-scale constructed wetland: (1) multiple linear regression; (2) backpropagation neural network (BPNN); (3) genetic algorithm combined with the BPNN to solve the local minima problem; and (4) long short-term memory (LSTM) neural network to consider the influence of past results on the present. The results suggest that the LSTM-predicting model performed considerably better than the other deep neural network-based model or linear method, with a satisfactory R2. Additionally, given the huge fluctuation of different pollutant concentrations in the effluent, we used a moving average method to smooth the original data, which successfully improved the accuracy of traditional neural networks and hybrid neural networks. The results of this study indicate that the hybrid modeling concept that combines intelligent and scientific data preprocessing methods with deep learning algorithms is a feasible approach for forecasting water quality in the effluent of actual engineering.
Yang Bowen, Xiao Zijie, Meng Qingjie, Yuan Yuan, Wang Wenqian, Wang Haoyu, Wang Yongmei, Feng Xiaochi
Constructed wetlands, Deep learning, LSTM, Multi-source data fusion, Water quality prediction