首页> 外文期刊>Statistics and computing >Outlier detection in non-elliptical data by kernel MRCD
【24h】

Outlier detection in non-elliptical data by kernel MRCD

机译:内核MRCD的非椭圆数据中的异常检测

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

The minimum regularized covariance determinant method (MRCD) is a robust estimator for multivariate location and scatter, which detects outliers by fitting a robust covariance matrix to the data. Its regularization ensures that the covariance matrix is well-conditioned in any dimension. The MRCD assumes that the non-outlying observations are roughly elliptically distributed, but many datasets are not of that form. Moreover, the computation time of MRCD increases substantially when the number of variables goes up, and nowadays datasets with many variables are common. The proposed kernel minimum regularized covariance determinant (KMRCD) estimator addresses both issues. It is not restricted to elliptical data because it implicitly computes the MRCD estimates in a kernel-induced feature space. A fast algorithm is constructed that starts from kernel-based initial estimates and exploits the kernel trick to speed up the subsequent computations. Based on the KMRCD estimates, a rule is proposed to flag outliers. The KMRCD algorithm performs well in simulations, and is illustrated on real-life data.
机译:最小正则化协方差决定因素(MRCD)是用于多变量位置和分散的强大估计器,其通过拟合强大的协方差矩阵到数据来检测异常值。其正则化确保协方差矩阵在任何维度下都很良好。 MRCD假设非外观观测大致椭圆分布,但许多数据集不是该形式的。此外,当变量的数量上升时,MRCD的计算时间基本上增加,而今具有许多变量的数据集是常见的。建议的内核最小正规化协方差决定因素(KMRCD)估计人解决了这两个问题。它不限于椭圆数据,因为它隐含地计算了内核引起的特征空间中的MRCD估计。构建快速算法,其从基于内核的初始估计开始,并利用内核技巧来加速后续计算。基于KMRCD估计,提出了规则来标记异常值。 KMRCD算法在模拟中执行良好,并在现实生活数据上说明。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号