首页  |  本刊简介  |  编委会  |  投稿须知  |  订阅与联系  |  微信  |  出版道德声明  |  Ei收录本刊数据  |  封面
基于机器学习和情景分析的我国焦化行业企业用地污染预测
摘要点击 2238  全文点击 825  投稿时间:2022-11-23  修订日期:2022-12-23
查看HTML全文 查看全文  查看/发表评论  下载PDF阅读器
中文关键词  焦化行业  模型性能  污染预测  机器学习  情景分析
英文关键词  coking industry  model performance  pollution prediction  machine learning  scenario analysis
作者单位E-mail
李凯 中国科学院地理科学与资源研究所, 北京 100101
中国科学院生态环境研究中心城市与区域生态国家重点实验室, 北京 100085 
kaili@rcees.ac.cn 
郭广慧 中国科学院地理科学与资源研究所, 北京 100101 guogh@igsnrr.ac.cn 
雷梅 中国科学院地理科学与资源研究所, 北京 100101 leim@igsnrr.ac.cn 
中文摘要
      针对焦化行业企业用地缺乏时序连续监测数据而无法预测其污染趋势的问题,从企业特征、企业管理水平、污染物特征和自然地理要素等4个方面选取13个影响企业用地污染的指标,识别焦化行业企业用地污染主控因子,在此基础上构建基于机器学习的焦化行业企业用地污染预测模型,并在不同情境下,对2025年和2030年焦化行业企业用地污染状况进行预测.结果表明,生产经营活动时间、建厂时间、企业环境监管记录、土壤黏粒和年均风速是焦化行业企业用地污染的主控因子;相对于支持向量机模型、BP神经网络模型和决策树模型,逻辑斯蒂模型预测价值高、性能指标稳健,其预测精度受试者工作曲线面积为0.91,模型准确率和召回率分别为84%和88%.在乐观情境下,2025年和2030年焦化行业高概率污染地块数量分别为1599块和1695块;在悲观情境下,2025年和2030年焦化行业高概率污染地块数量分别为1671块和1715块.研究结果可为焦化行业企业用地的修复治理和生态环境的宏观决策提供科学依据.
英文摘要
      Owing to the lack of sequential monitoring data of soil pollutants in coking industry enterprises, it is hard to accurately predict their soil pollution. To predict the trend of soil pollution of coking industry enterprises in the future, a prediction model should be developed using machine learning based on the influencing factors. A total of 13 potential factors were selected from the enterprise characteristics, enterprise management level, pollutant characteristics, and natural factors, and the main controlling factors were identified. On this basis, the prediction models were developed using a support vector machine, BP neural network model, decision tree model, and logistic model, and then the pollution situation of enterprises in the coking industry in 2025 and 2030 was predicted under different scenarios. The results indicated that time of service for the enterprise, time of establishment for the enterprise, the environmental illegal record, soil clay, and annual wind speed were the major controlling factors of soil pollution of enterprises in the coking industry. Compared with the support vector machine, BP neural network model, and decision tree model, the logistic model had a robust performance index, with an area under the receiver operating characteristic curve of 0.91. The accurate rate and recall rate were 84% and 88%, respectively. Under the optimistic scenario, there will be 1599 and 1695 plots with a high probability of pollution in the coking industry in 2025 and 2030, respectively; under the pessimistic scenario, there will be 1671 and 1715 plots with a high probability of pollution in the coking industry in 2025 and 2030, respectively. The results of this study provided a scientific basis for soil environmental remediation and eco-environmental strategy development for the coking industry.

您是第75746420位访客
主办单位:中国科学院生态环境研究中心 单位地址:北京市海淀区双清路18号
电话:010-62941102 邮编:100085 E-mail: hjkx@rcees.ac.cn
本系统由北京勤云科技发展有限公司设计  京ICP备05002858号-2