...
首页> 外文期刊>Journal of the American statistical association >Robust Estimation of Multivariate Location and Scatter in the Presence of Missing Data
【24h】

Robust Estimation of Multivariate Location and Scatter in the Presence of Missing Data

机译:存在缺失数据时多元定位和散布的鲁棒估计

获取原文
获取原文并翻译 | 示例

摘要

Two main issues regarding data quality are data contamination (outliers) and data completion (missing data). These two problems have attracted much attention and research but surprisingly, they are seldom considered together. Popular robust methods such as S-estimators of multivariate location and scatter offer protection against outliers but cannot deal with missing data, except for the obviously inefficient approach of deleting all incomplete cases. We generalize the definition of S-estimators of multivariate location and scatter to simultaneously deal with missing data and outliers. We show that the proposed estimators are strongly consistent under elliptical models when data are missing completely at random. We derive an algorithm similar to the Expectation-Maximization algorithm for computing the proposed estimators. This algorithm is initialized by an extension for missing data of the minimum volume ellipsoid. We assess the performance of our proposal by Monte Carlo simulation and give some real data examples. This article has supplementary material online.
机译:有关数据质量的两个主要问题是数据污染(异常值)和数据完成(丢失数据)。这两个问题已经引起了很多关注和研究,但是令人惊讶的是,很少将它们一起考虑。流行的鲁棒方法(例如,多元位置和散布的S估计量)可提供针对异常值的保护,但不能处理缺失的数据,除非删除所有不完整案例的方法显然效率不高。我们概括了多元位置和分散的S估计量的定义,以同时处理丢失的数据和离群值。我们表明,当数据完全随机丢失时,在椭圆模型下,拟议的估计量具有很强的一致性。我们推导了一种与期望最大化算法相似的算法,用于计算所提出的估计量。通过扩展以最小体积椭圆体的丢失数据来初始化该算法。我们通过蒙特卡洛模拟评估我们的建议的性能,并给出一些真实的数据示例。本文在线提供了补充材料。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号