...
首页> 外文期刊>Journal of Multivariate Analysis: An International Journal >On masking and swamping robustness of leading nonparametric outlier identifiers for multivariate data
【24h】

On masking and swamping robustness of leading nonparametric outlier identifiers for multivariate data

机译:关于多变量数据领先非参数异常异常标识符的掩蔽和沼泽鲁棒性

获取原文
获取原文并翻译 | 示例

摘要

For any outlier detection procedure, a key concern is robustness with respect to possible misclassification errors, masking (Type I) and swamping (Type II). Although parametric model-based simulation results are informative, one also desires nonparametric masking and robustness measures that are more broadly applicable. To this effect, notions of finite sample masking and swamping breakdown points formulated abstractly for outlyingness functions in arbitrary data settings (Serfling and Wang, 2014) are introduced in the present paper into the multivariate data setting. Formulas for the measures are derived for three important affine invariant nonparametric multivariate outlyingness functions: Mahalanobis distance, Mahalanobis spatial, and projection. Using the formulas, favorable masking and swamping breakdown points, balanced equally, are seen for the Mahalanobis distance outlyingness using minimum covariance determinant (MCD) location and scatter estimators, and likewise for the projection outlyingness with median and MAD for univariate location and scale. Also, Mahalanobis spatial outlyingness with MCD standardization is competitive when swamping robustness is given higher priority than masking robustness. A small simulation study with bivariate contaminated standard normal and contaminated exponential models yields results consistent with the theoretical formulas. Some practical recommendations are discussed. (C) 2018 Elsevier Inc. All rights reserved.
机译:对于任何异常检测程序,关键问题对于可能的错误分类错误,屏蔽(类型I)和沼泽(II型)而言是鲁棒性。尽管基于参数模型的仿真结果是信息性的,但是一个也希望更广泛适用的非参数掩蔽和鲁棒性度量。为此,在本纸上将本文引入多变量数据设置,向非任意数据设置(Serfling和Wang,2014)中的广泛函数抽象地配制的有限样本屏蔽和沼泽分解点的概念。用于措施的公式旨在为三个重要的仿射不变非参数非参数多变量偏远函数:Mahalanobis距离,Mahalanobis空间和投影。使用最小协方差决定因素(MCD)位置和散射估计的Mahalanobis距离远方,且散射估计,同样地,使用相应的掩蔽和沼泽击穿点的平衡平衡,同样能够与单变量和尺度的中位数和疯狂的投影边界相同。此外,Mahalanobis空间外围与MCD标准化的空间远方在潮汐稳健性比掩蔽稳健性获得更高的优先级时具有竞争力。具有双变量污染标准正常和污染指数模型的小型模拟研究产生了与理论公式一致的结果。讨论了一些实际建议。 (c)2018年Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号