The fitting component selection model of PM2.5 based on fuzzy clustering
XU Heng-peng1,2, LI Yue1,2, SHI Guo-liang3, WANG Wei1,2, XUAN Shu-yan4
1. College of Computer and Control Engineering, NanKai University, Tianjin 300071, China;
2. College of Software, NanKai University, Tianjin 300071, China;
3. State Environmental Protection Key Laboratory of Urban Ambient Air Particulate Matter Pollution Prevention and Control, College of Environmental Science and Engineering, NanKai University, Tianjin 300071, China;
4. Yutian Environmental Protection Agency, Tangshan 064199, China
In current research, there is a lack of uniform standards for components selection in PM2.5 source profile apportionment. Researchers tend to choose the component manually and empirically, leading to a subsequent poor fitting result, or even failures. Concerning on this problem, this paper has proposed an innovative component selection model of PM2.5 source profiles apportionment. On the basis of the physical representative of each component, the proposed model calculates the accuracy of fuzzy clustering as the standard score for selection. The experiments prove that our model outperforms the traditional empirical models. The successful rate for fitting, measured by the fitting errors in 0 to 0.05, grows to 83% by implementing our model, in contrast to rate of 40% from the traditional selection model.
徐恒鹏, 李岳, 史国良, 王玮, 轩淑艳. 基于模糊聚类的PM2.5拟合组分选择模型的研究[J]. 中国环境科学, 2016, 36(1): 12-17.
XU Heng-peng, LI Yue, SHI Guo-liang, WANG Wei, XUAN Shu-yan. The fitting component selection model of PM2.5 based on fuzzy clustering. CHINA ENVIRONMENTAL SCIENCECE, 2016, 36(1): 12-17.
Shi G L, Tian Y Z, Zhang Y F, et al. Estimation of the concentrations of primary and secondary organic carbon in ambient particulate matter:Application of the CMB-Iteration method[J]. Atmospheric Environment, 2011,45(32):5692-5698.
Scott D W. Multivariate density estimation:theory, practice, and visualization[M]. Wiley. com, 2009.
[9]
Müller E, Günnemann S, Assent I, et al. Evaluating clustering in subspace projections of high dimensional data[J]. Proceedings of the VLDB Endowment, 2009,2(1):1270-1281.
[10]
Parsons L, Haque E, Liu H. Subspace clustering for high dimensional data:a review[J]. ACM SIGKDD Explorations Newsletter, 2004,6(1):90-105.
[11]
Gan G, Wu J. A convergence theorem for the fuzzy subspace clustering (FSC) algorithm[J]. Pattern Recognition, 2008,41(6):1939-1947.
[12]
Deng Z, Choi K S, Chung F L, et al. Enhanced soft subspace clustering integrating within-cluster and between-cluster information[J]. Pattern Recognition, 2010,43(3):767-781.
[13]
Jing L, Ng M K, Huang J Z. An Entropy Weighting k-Means Algorithm for Subspace Clustering of High-Dimensional Sparse Data[J]. IEEE Transactions on Knowledge & Data Engineering, 2007,19(8):1026-1041.
[14]
徐恒鹏.基于智能信息处理的PM2.5源解析问题的研究[D]. 天津:南开大学, 2014.
[15]
Habre R, Coull B, Koutrakis P. Impact of source collinearity in simulated PM2.5 data on the PMF receptor model solution[J]. Atmospheric Environment, 2011,45(38):6938-6946.
[16]
Christensen W F, Gunst R F. Measurement error models in chemical mass balance analysis of air quality data[J]. Atmospheric Environment, 2004,38(5):733-744.