...
首页> 外文期刊>Atmospheric environment >Single imputation method of missing values in environmental pollution data sets
【24h】

Single imputation method of missing values in environmental pollution data sets

机译:环境污染数据集中缺失值的单一估算方法

获取原文
获取原文并翻译 | 示例
           

摘要

Missing data represent a general problem in many scientific fields above all in environmental research. Several methods have been proposed in literature for handling missing data and the choice of an appropriate method depends, among others, on the missing data pattern and on the missing-data mechanism. One approach to the problem is to impute them to yield a complete data set. The goal of this paper is to propose a new single imputation method and to compare its performance to other single and multiple imputation methods known in literature. Considering a data set of PM10 concentration measured every 2 h by eight monitoring stations distributed over the metropolitan area of Palermo, Sicily, during 2003, simulated incomplete data have been generated, and the performance of the imputation methods have been compared on the correlation coefficient (rho), the index of agreement (d), the root mean square deviation (RMSD) and the mean absolute deviation (MAD). All the performance indicators agree to evaluate the proposed method as the best among the ones compared, independently on the gap length and on the number of stations with missing data. (c) 2006 Elsevier Ltd. All rights reserved.
机译:数据丢失是环境研究中许多科学领域的普遍问题。文献中已经提出了几种用于处理丢失数据的方法,并且适当方法的选择尤其取决于丢失数据模式和丢失数据机制。解决该问题的一种方法是估算它们以产生完整的数据集。本文的目的是提出一种新的单一插补方法,并将其性能与文献中已知的其他单一和多重插补方法进行比较。考虑到2003年期间分布在西西里岛巴勒莫市区的八个监测站每2小时测量一次PM10浓度的数据集,已生成模拟的不完整数据,并根据相关系数比较了估算方法的性能( rho),一致性指数(d),均方根偏差(RMSD)和平均绝对偏差(MAD)。所有性能指标均同意将所建议的方法评估为所比较方法中的最佳方法,而与间隙长度和缺少数据的站点数无关。 (c)2006 Elsevier Ltd.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号