首页> 外文期刊>Computational statistics & data analysis >Imputation of missing values for compositional data using classical and robust methods
【24h】

Imputation of missing values for compositional data using classical and robust methods

机译:使用经典和鲁棒的方法为成分数据估算缺失值

获取原文
获取原文并翻译 | 示例
       

摘要

New imputation algorithms for estimating missing values in compositional data are introduced. A first proposal uses the k-nearest neighbor procedure based on the Aitchison distance, a distance measure especially designed for compositional data. It is important to adjust the estimated missing values to the overall size of the compositional parts of the neighbors. As a second proposal an iterative model-based imputation technique is introduced which initially starts from the result of the proposed k-nearest neighbor procedure. The method is based on iterative regressions, thereby accounting for the whole multivariate data information. The regressions have to be performed in a transformed space, and depending on the data quality classical or robust regression techniques can be employed. The proposed methods are tested on a real and on simulated data sets. The results show that the proposed methods outperform standard imputation methods. In the presence of outliers, the model-based method with robust regressions is preferable.
机译:介绍了用于估算成分数据中缺失值的新插补算法。第一个建议使用基于Aitchison距离的k最近邻程序,Aitchison距离是专门为成分数据设计的距离度量。重要的是将估计的缺失值调整为邻居组成部分的整体大小。作为第二个建议,引入了一种基于迭代模型的插补技术,该技术最初从建议的k最近邻过程开始。该方法基于迭代回归,从而考虑了整个多元数据信息。回归必须在变换后的空间中执行,并且可以根据数据质量采用经典的或鲁棒的回归技术。在真实数据集和模拟数据集上对提出的方法进行了测试。结果表明,所提出的方法优于标准插补方法。在存在异常值的情况下,具有鲁棒回归的基于模型的方法是可取的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号