Risk assessment of PM2.5 pollution based on machine learning and nonparametric estimation
ZHOU Qi1,2, YU Yang3, LIU Miao-miao1, BI Jun1
1. State Key Laboratory of Pollution Control and Resource Reuse, School of the Environment, Nanjing University, Nanjing 210023, China; 2. School of Environment, Tsinghua University, Beijing 100084, China; 3. Institute for Interdisciplinary Information Sciences, Tsinghua University, Beijing 100084, China
Abstract:A systematic approach of regional PM2.5 risk and characterization assessment was developed in this study by integrating random forest model, Quantile-Quantile plot model, and K-mean model, based on multi-source data including mobile phone signals, meteorological data, geographic data, etc. This new approach was further applied in a case study of Nanjing at a 0.3km resolution grid. On the one hand, this new approach effectively simulated the temporal and spatial distribution of the PM2.5 concentration with an10-fold cross-validation R2 of 0.76 and screened out four major pollution characteristics. On the other hand, it effectively captured the short-term population mobility risk. Short-term population mobility increased the PM2.5 exposure risk by 0.30~0.97 times, even keeping PM2.5 concentration unchanged. After combining PM2.5 concentration and population mobility simultaneously, four major PM2.5 exposure risk modes were identified. 6.5% of the areas of Nanjing were at high risk, and 23.0% were at low risk. During the 14th Five Year Plan, it is suggested that the government should speed up the application of modern science and technology in environmental protection and implement gridding and differentiated policies on air pollution risk control to promote human health.
周琪, 于洋, 刘苗苗, 毕军. 基于机器学习和非参数估计的PM2.5风险评估[J]. 中国环境科学, 2022, 42(8): 3554-3560.
ZHOU Qi, YU Yang, LIU Miao-miao, BI Jun. Risk assessment of PM2.5 pollution based on machine learning and nonparametric estimation. CHINA ENVIRONMENTAL SCIENCECE, 2022, 42(8): 3554-3560.
Ma Z, Hu X, Huang L, et al. Estimating ground-level PM2.5 in China using satellite remote sensing[J]. Environmental Science & Technology, 2014,48(13):7436-7444.
[2]
朱翠云,何清,赵竹君,等.乌鲁木齐市区与南郊山区颗粒物污染特征对比分析[J]. 中国环境科学, 2022:1-17. Zhu C, He Q, Zhao Z, et al. Comparative analysis of particulate pollution characteristics between Urumqi urban area and southern mountainous area[J]. China Environmental Science, 2022:1-17.
[3]
阮芳芳,曾贤刚,段存儒.基于城市面板数据的PM2.5对公共健康的影响[J]. 中国环境科学, 2020,40(12):5451-5458. Ruan F, Zeng X, Duan C. Influence of PM2.5 pollution on public health based on urban panel data[J]. China Environmental Science, 2020, 40(12):5451-5458.
[4]
Liu M, Huang Y, Ma Z, et al. Spatial and temporal trends in the mortality burden of air pollution in China:2004~2012[J]. Environmental International, 2017,98:75-81.
[5]
He C Y, Han L J, Zhang R Q. More than 500 million Chinese urban residents (14% of the global urban population) are imperiled by fine particulate hazard[J]. Environmental Pollution, 2016,218:558-562.
[6]
Shen H, Tao S, Chen Y, et al. Urbanization-induced population migration has reduced ambient PM2.5 concentrations in China[J]. Science Advances, 2017,3(7):e1700300.
[7]
Silva R A, West J J, Lamarque J F, et al. Future global mortality from changes in air pollution attributable to climate change[J]. Nature Climate Change, 2017,7(9):647.
[8]
Deville P, Linard C, Martin S, et al. Dynamic population mapping using mobile phone data[J]. PNAS, 2014,111(45):15888-15893.
[9]
Dewulf B, Neutens T, Lefebvre W, et al. Dynamic assessment of exposure to air pollution using mobile phone data[J]. International Journal of Health Geographics, 2016,15:14.
[10]
Nyhan M, Grauwin S, Britter R, et al. "Exposure Track" The impact of mobile-device-based mobility patterns on quantifying population exposure to air pollution[J]. Environmental Science & Technology, 2016,50(17):9671-9681.
[11]
Wang F, Ren J, Liu J, et al. Spatial correlation network and population mobility effect of regional haze pollution:Empirical evidence from Pearl River Delta urban agglomeration in China[J]. Environment, Development and Sustainability, 2021,23(11):15881-15896.
[12]
Kloog I, Chudnovsky A A, Just A C, et al. A new hybrid spatio-temporal model for estimating daily multi-year PM2.5 concentrations across northeastern USA using high resolution aerosol optical depth data[J]. Atmospheric Environment, 2014,95:581-590.
[13]
Ma Z, Hu X, Sayer A M, et al. Satellite-based spatiotemporal trends in PM2.5 concentrations:China, 2004~2013[J]. Environmental Health Perspectives, 2016,124(2):184-192.
[14]
Gupta P, Christopher S A. Particulate matter air quality assessment using integrated surface, satellite, and meteorological products:A neural network approach[J]. Journal of Geophysical Research-Atmospheres, 2009,114(D20):205.
[15]
梁泽,王玥瑶,岳远紊,等.耦合遗传算法与RBF神经网络的PM2.5浓度预测模型[J]. 中国环境科学, 2020,40(2):523-529. Liang Z, Wang Y, Yue Y, et al. A coupling model of genetic algorithm and RBF neural network for the prediction of PM2.5 concentration[J]. China Environmental Science, 2020,40(2):523-529.
[16]
Reid C E, Jerrett M, Petersen M L, et al. Spatiotemporal prediction of fine particulate matter during the 2008 Northern California wildfires using machine learning[J]. Environmental Science & Technology, 2015,49(6):3887-3896.
[17]
Di Q, Koutrakis P, Schwartz J. A hybrid prediction model for PM2.5mass and components using a chemical transport model and land use regression[J]. Atmospheric Environment, 2016,131:390-399.
[18]
Hu X F, Belle J H, Meng X, et al. Estimating PM2.5 concentrations in the conterminous United States using the random forest approach[J]. Environmental Science & Technology, 2017,51(12):6936-6944.
[19]
Li R, Mei X, Wei L, et al. Study on the contribution of transport to PM2.5 in typical regions of China using the regional air quality model RAMS-CMAQ[J]. Atmospheric Environment, 2019,214:116856.
[20]
刘文军,郑国义,田学,等.西安市PM2.5相关因素多元回归分析模型[J]. 经济数学, 2015,32(1):85-88. Liu W, Zheng G, Tian X. PM2.5factors associated with multivariate regression analysis model in Xi'an[J]. Journal of Quantitative Economics, 2015,32(1):85-88.
[21]
卢月明,王亮,仇阿根,等.一种基于主成分分析的协同克里金插值方法[J]. 测绘通报, 2017(11):51,57-63. Lu M, Wang L, Qiu A. A CoKriging interpolation method based on principal component analysis[J]. Bulletin of Surveying and Mapping, 2017(11):51,57-63.
[22]
Vaidyanathan A, Dimmick W F, Kegler S R, et al. Statistical air quality predictions for public health surveillance:evaluation and generation of county level metrics of PM2.5 for the environmental public health tracking network[J]. International Journal of Health Geographics, 2013,12:12.
[23]
Pozdnoukhov A, Kanevski M. Monitoring network optimisation for spatial data classification using support vector machines[J]. International Journal of Environment and Pollution, 2006,28(3/4):465-484.
[24]
Kaburlasos V G, Athanasiadis I N, Mitkas P A. Fuzzy lattice reasoning (FLR) classifier and its application for ambient ozone estimation[J]. International Journal of Approximate Reasoning, 2007,45(1):152-188.
[25]
Sfetsos A, Vlachogiannis D. A new approach to discovering the causal relationship between meteorological patterns and PM10 exceedances[J]. Atmospheric Research, 2010,98(2-4):500-511.
[26]
中国环境监测总站.全国城市空气质量实时发布平台. China National Environmental Monitoring Centre. National air quality real-time release platform.
[27]
中国气象局.中国地面气象站逐小时观测资料. http://data.cma.cn/. China Meteorological Administration. Hourly observation data from stationary meteorological sites. http://data.cma.cn/.
[28]
Geofabrik. Open street map. https://www.openstreetmap.org/.
[29]
中国科学院地理科学与资源研究所.资源环境科学与数据中心. https://www.resdc.cn/. Institute of Geographic Sciences and Natural Resources Research. Resource and environment science and data center. https://www.resdc.cn/.
[30]
Svetnik V, Liaw A, Tong C, et al. Random forest:A classification and regression tool for compound classification and QSAR modeling[J]. Journal of Chemical Information and Computer Sciences, 2003,43(6):1947-1958.
[31]
Cutler D R, Edwards T C, Beard K H, et al. Random forests for classification in ecology[J]. Ecology, 2007,88(11):2783-2792.
[32]
Shao Y, Ma Z, Wang J, et al. Estimating daily ground-level PM2.5 in China with random-forest-based spatiotemporal kriging[J]. Science of the Total Environment, 2020,740:139761.
[33]
李名升,任晓霞,于洋,等.中国大陆城市PM2.5污染时空分布规律[J]. 中国环境科学, 2016,36(3):641-650. Li M, Ren X, Yu Y. Spatiotemporal pattern of ground-level fine particulate matter (PM2.5) pollution in mainland China[J]. China Environmental Science, 2016,36(3):641-650.
[34]
许珊,邹滨,胡晨霞.面向场景的城市PM2.5浓度空间分布精细模拟[J]. 中国环境科学, 2019,39(11):4570-4579. Xu S, Zou B, Hu C. Urban scene-oriented simulation of the spatial distribution of PM2.5concentration in an intra-urban area at fine scale[J]. China Environmental Science, 2019,39(11):4570-4579.