首页> 外文期刊>Cybernetics and information technologies: CIT >High Performance Machine Learning Models of Large Scale Air Pollution Data in Urban Area
【24h】

High Performance Machine Learning Models of Large Scale Air Pollution Data in Urban Area

机译:高性能机器学习模型大规模空气污染数据在市区

获取原文
           

摘要

Preserving the air quality in urban areas is crucial for the health of thepopulation as well as for the environment. The availability of large volumes ofmeasurement data on the concentrations of air pollutants enables their analysis andmodelling to establish trends and dependencies in order to forecast and preventfuture pollution. This study proposes a new approach for modelling air pollutantsdata using the powerful machine learning method Random Forest (RF) and Auto-Regressive Integrated Moving Average (ARIMA) methodology. Initially, a RF modelof the pollutant is built and analysed in relation to the meteorological variables. Thismodel is then corrected through subsequent modelling of its residuals using theunivariate ARIMA. The approach is demonstrated for hourly data on seven airpollutants (O 3 , NOx, NO, NO 2 , CO, SO 2 , PM 10 ) in the town of Dimitrovgrad,Bulgaria over 9 years and 3 months. Six meteorological and three time variables areused as predictors. High-performance models are obtained explaining the data withR 2 = 90%-98%.
机译:保持城市地区的空气质量对于健康的健康以及环境来说至关重要。关于空气污染物浓度的大量的大量的粪便数据可以分析和发出的趋势,以建立趋势和依赖性,以预测和预防污染。本研究提出了一种使用强大的机器学习方法使用强大的机器学习方法(RF)和自动回归综合移动平均(ARIMA)方法来建立空气污染物DATA的新方法。最初,污染物的RF模型与气象变量相关并分析。然后通过随后使用Theunivariate Arima进行剩余的残差来纠正该模型。在DIMITROVGRAD镇,保加利亚超过9年和3个月,对七个空气塑料(O 3,NOX,NO,NO 2,CO,SO 2,PM 10)的每小时数据进行了说明的方法。六个气象和三次变量被视为预测因子。获得高性能模型,解释了2 = 90%-98%的数据。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号