...
首页> 外文期刊>Journal of Hydrology >CUTOFF: A spatio-temporal imputation method
【24h】

CUTOFF: A spatio-temporal imputation method

机译:CUTOFF:时空插补方法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Missing values occur frequently in many different statistical applications and need to be dealt with-carefully, especially when the data are collected spatio-temporally. We propose a method called CUTOFF imputation that utilizes the spatio-temporal nature of the data to accurately and efficiently impute missing values. The main feature of this method is that the estimate of a missing value is produced by incorporating similar observed temporal information from the value's nearest spatial neighbors. Extensions to this method are also developed to expand the method's ability to accommodate other data generating processes. We develop a cross-validation procedure that optimally chooses parameters for CUTOFF, which can be used by other imputation methods as well. We analyze some rainfall data from 78 gauging stations in the Murray-Darling Basin in Australia using the CUTOFF imputation method and compare its performance to four well-studied competing imputation methods, namely, k-nearest neighbors, singular value decomposition, multiple imputation and random forest. Empirical results show that our method captures the temporal patterns well and is effective at imputing large gaps in the data. Compared to the competing methods, CUTOFF is more accurate and much faster. We analyze further examples to demonstrate CUTOFF's applications to two different data sets and provide extra evidence of its validity and usefulness. We implement a simulation study based on the Murray-Darling Basin data to evaluate the method; the results show that our method performs well in both accuracy and computational efficiency. (C) 2014 Elsevier B.V. All rights reserved.
机译:缺失值在许多不同的统计应用程序中经常发生,需要谨慎处理,尤其是在时空收集数据时。我们提出一种称为CUTOFF插补的方法,该方法利用数据的时空性质来准确有效地插补缺失值。此方法的主要特征是,通过合并值的最近空间邻居的相似观测时间信息,可以得出缺失值的估计值。还开发了对该方法的扩展,以扩展该方法适应其他数据生成过程的能力。我们开发了一种交叉验证程序,该程序可以最佳地选择CUTOFF的参数,也可以由其他插补方法使用。我们使用CUTOFF插值方法分析了澳大利亚Murray-Darling盆地的78个测量站的一些降雨数据,并将其性能与四种经过充分研究的竞争插值方法进行了比较,即k最近邻,奇异值分解,多重插值和随机森林。实证结果表明,我们的方法很好地捕获了时间模式,并且有效地填补了数据中的大空白。与竞争方法相比,CUTOFF更准确,更快。我们将分析更多示例,以演示CUTOFF在两个不同数据集上的应用,并提供其有效性和有用性的额外证据。我们基于Murray-Darling盆地数据进行模拟研究以评估该方法;结果表明,该方法在准确性和计算效率上均表现良好。 (C)2014 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号