Identification of groundwater pollution sources based on Bayes’ theorem
ZHANG Shuang-sheng1,3, QIANG Jing2, LIU Han-hu1, LIU Xi-kun3, ZHU Xue-qiang1
1. School of Environment Science and Spatial Informatics, China University of Mining and Technology, Xuzhou 221116, China;
2. School of Mathematics, China University of Mining and Technology, Xuzhou 221116, China;
3. Xuzhou City Water Resource Administrative Office, Xuzhou 221018, China
Coupling Bayes’ Theorem with a two-dimensional (2D) groundwater solute advection-diffusion transport equation, it is possible to establish an inverse model based on monitoring well data to identify a set of contamination source parameters including source intensity (M), release location (X0,Y0) and release time (T0). To address the issues of insufficient monitoring data from the wells or weak correlation between monitoring data and model parameters, a monitoring well design optimization approach was developed based on the Bayesian formula and information entropy. To demonstrate how the model works, an example with an instantaneous release of a contaminant in a confined groundwater aquifer was employed. Under the condition of single well monitoring and determined monitoring counts, with the target of optimization of monitoring location D and monitoring frequency Dt, both the single-objective monitoring scheme with the minimum information entropy of the model parameter posterior distribution and the multi-objective monitoring scheme with the smallest information entropy and the shortest monitoring time were optimized respectively. According to the optimized monitoring scheme, the delayed rejection adaptive Metropolis algorithm was used to identify the pollution source parameters. The case study results showed that under the condition of pre-set single well monitoring and 5monitoring times, the single-objective optimized monitoring scheme was D=(830.2,199.8),△t=2.7. Under this monitoring scheme, the mean errors of inverse 4pollution source parameters M,X0,Y0,T0 were 19.5%, 13.2%, 3.4%, and 1.3%, respectively. The multi-objective optimization monitoring scheme was D=(807.9,199.4),△t=1.2. Under the monitoring scheme, the mean errors of inverse 4parameters M,X0,Y0,T0 were 19.9%, 13.4%, 3.7%, and 4.2%, respectively. Compared with the monitoring scheme based on the single-objective optimization, while the inversion mean error of the pollution source parameters based on the multi-objective optimization monitoring scheme increased by 0.4%, 0.2%, 0.3%, 2.9% respectively, the monitoring time has been significantly reduced by 55.6%.
Chen M, Izady A, Abdalla O A, et al. A surrogate-based sensitivity quantification and Bayesian inversion of a regional groundwater flow model[J]. Journal of Hydrology, 2018,557:826-837.
[2]
Snodgrass M F, Kitanidis P K. A geostatistical approach to contaminant source identification[J]. Water Resources Research, 1997, 33(4):537-546.
[3]
Ruzek B, Kvasnicka M. Differential evolution algorithm in the earthquake hypocenter location[J]. Pure and Applied Geophysics, 2001,158:667-693.
[4]
Mahinthakumar G, Sayeed M. Hybrid genetic algorithm-local search methods for solving groundwater source identification inverse problems[J]. Journal of Water Resources Planning and Management, 2005,131(1):45-57.
[5]
Dougherty D E, Marryott R A. Optimal groundwater management:1. Simulated annealing[J]. Water Resources Research, 1991,27(10):2493-2508.
[6]
Tanner M A. Tools for statistical inference:methods for the expectation of posterior distribution and likelihood functions[M].New York:Springer-Verlag, 1996:1-52.
[7]
李久辉,卢文喜,常振波,等.基于不确定性分析的地下水污染超标风险预警[J]. 中国环境科学, 2017,37(6):2270-2277. Li J, Lu W, Chang Z, et al. Risk prediction of groundwater pollution based on uncertainty analysis[J]. China Environmental Science, 2017, 37(6):2270-2277.
[8]
Metropolis N, Rosenbluth A W, Rosenbluth M N, et al. Equation of state calculations by fast computing machines[J]. The Journal of Chemical Physics, 1953,21(6):1087-1092.
[9]
Hastings W K. Monte Carlo sampling methods using Markov chains and their applications[J]. Biometrika, 1970,57(1):97-109.
[10]
Mira A. Ordering and improving the performance of Monte Carlo Markov Chains[J]. Statistical Science, 2002,16:340-350.
[11]
Haario H, Saksman E, Tamminen J. An adaptive Metropolis algorithm[J]. Bernoulli, 2001,7(2):223-242.
[12]
Xu W, Jiang C, Yan L, et al. An adaptive metropolis-hastings optimization algorithm of Bayesian estimation in non-stationary flood frequency analysis[J]. Water Resources Management, 2018,32(4):1343-1366.
[13]
Haario H, Laine M, Mira A. DRAM:Efficient adaptive MCMC[J]. Statistics and Computing, 2006,16(4):339-354.
[14]
Huan X, Marzouk Y M. Simulation-based optimal Bayesian experimental design for nonlinear systems[J]. Journal of Computational Physics, 2013,232(1):288-317.
[15]
张江江.地下水污染源解析的贝叶斯监测设计与参数反演方法[D]. 杭州:浙江大学, 2017. Zhang J. Bayesian monitoring design and parameter inversion for groundwater contaminant source identification[D]. Hangzhou:Zhejiang University, 2017.
[16]
Carrera J, Neuman S P. Estimation of aquifer parameters under transient and steady state conditions:2. Uniqueness, stability, and solution algorithms[J]. Water Resources Research, 1986,22(2):211-227.
[17]
Czanner G, Sarma S V, Eden U T, et al. A signal-to-noise ratio estimator for generalized linear model systems[J]. Lecture Notes in Engineering & Computer Science, 2008,2171(1).
[18]
Lindley D V. On a measure of the information provided by an experiment[J]. The Annals of Mathematical Statistics, 1956,27(4):986-1005.
[19]
Shannon C E. A mathematical theory of communication[J]. The Bell System Technical Journal, 1948,27(3):379-423.
[20]
Deb K, Pratap A, Agarwal S, et al. A fast and elitist multiobjective genetic algorithm:NSGA-Ⅱ[J]. IEEE Transactions on Evolutionary Computation, 2002,6(2):182-197.
[21]
Berger J O. Recent development and applications of Bayesian analysis[D]. Proceedings 50th ISI, Book I. 1995.
[22]
戴英彪.基于拉丁超立方试验设计的事故再现结果不确定性分析[D]. 长沙:长沙理工大学, 2011. Dai Y. Uncertainty analysis of vehicle accident reconstruction results based on Latin hypercube sampling[D]. Changsha:Changsha University of Science and Technology, 2011.
[23]
Gelman A G, Rubin D B. Inference from iterative simulation using multiple sequences[J]. Statistical Science, 1992,7:457-472.
[24]
陈崇希,李国敏.地下水溶质运移理论及模型[M]. 武汉:中国地质大学出版社, 1996:46-49. Chen C, Li G. Theory and model of solute transport in groundwater[M]. Wuhan:China University of Geosciences Press, 1996:46-49.
[25]
周圣武,李金玉,周长新.概率论与数理统计(第二版)[M]. 北京:煤炭工业出版社, 2007:208-215. Zhou S, Li J, Zhou C. Probability theory and mathematical statistics (Second edition)[M]. Beijin:China Coal Industry Publishing House, 2007:208-215.