...
首页> 外文期刊>Journal of supercomputing >An implementation of cloud-based platform with R packages for spatiotemporal analysis of air pollution
【24h】

An implementation of cloud-based platform with R packages for spatiotemporal analysis of air pollution

机译:带有R包的基于云的平台的实现,用于时空分析空气污染

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Recently, the R package has become a popular tool for big data analysis due to its several matured software packages for the data analysis and visualization, including the analysis of air pollution. The air pollution problem is of increasing global concern as it has greatly impacts on the environment and human health. With the rapid development of IoT and the increase in the accuracy of geographical information collected by sensors, a huge amount of air pollution data were generated. Thus, it is difficult to analyze the air pollution data in a single machine environment effectively and reliably due to its inherent characteristic of memory design. In this work, we construct a distributed computing environment based on both the softwares of RHadoop and SparkR for performing the analysis and visualization of air pollution with the R more reliably and effectively. In the work, we firstly use the sensors, called EdiGreen AirBox to collect the air pollution data in Taichung, Taiwan. Then, we adopt the Inverse Distance Weighting method to transform the sensors' data into the density map. Finally, the experimental results show the accuracy of the short-term prediction results of PM2.5 by using the ARIMA model. In addition, the verification with respect to the prediction accuracy with the MAPE method is also presented in the experimental results.
机译:最近,R软件包由于其用于数据分析和可视化(包括空气污染分析)的几种成熟软件包而成为大数据分析的流行工具。空气污染问题日益受到全球关注,因为它对环境和人类健康产生了巨大影响。随着物联网的飞速发展和传感器收集的地理信息准确性的提高,产生了大量的空气污染数据。因此,由于其存储器设计的固有特性,难以有效且可靠地分析单个机器环境中的空气污染数据。在这项工作中,我们基于RHadoop和SparkR软件构建了一个分布式计算环境,以便更可靠,更有效地使用R进行空气污染的分析和可视化。在工作中,我们首先使用名为EdiGreen AirBox的传感器来收集台湾台中的空气污染数据。然后,我们采用反距离权重方法将传感器的数据转换为密度图。最后,实验结果表明,使用ARIMA模型可以预测PM2.5的短期预测结果的准确性。另外,在实验结果中还提出了利用MAPE方法对预测准确性的验证。

著录项

  • 来源
    《Journal of supercomputing》 |2020年第3期|1416-1437|共22页
  • 作者

  • 作者单位

    Tunghai Univ Dept Comp Sci 1727 Sect 4 Taiwan Blvd Taichung 40704 Taiwan;

    Providence Univ Coll Comp & Informat 200 Sect 7 Taiwan Blvd Taichung Taiwan;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Air pollution; R packages; IDW; RHadoop; SparkR;

    机译:空气污染;R包;IDW;RHadoop;星火;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号