首页> 外文期刊>Annals. Computer Science Series >Alternative Estimator for Multivariate Location and Scatter Matrix in the Presence of Outlier
【24h】

Alternative Estimator for Multivariate Location and Scatter Matrix in the Presence of Outlier

机译:存在离群值时多元位置和散布矩阵的替代估计

获取原文
           

摘要

It is generally known that in estimating location and scatter matrix of multivariate data when outliers are presents, the method of classical is not robust. The Maximum Likelihood Estimator (MLE) is always very sensitive to some deviations from the assumptions made on the data, especially, presence of outliers. To get over the above stated problem, many alternative estimators that are robust have been proposed in the last decades. Some of these estimators include the Minimum Covariance Determinant (MCD), the Minimum Volume Ellipsoid (MVE), S-Estimators, M-Estimators and Minimum Regularized Covariance Determinant (MRCD) among others. All the methods converged on tackling the problem of robust estimation by finding a sufficiently large subset of the data. In this paper, a robust method of estimating multivariate location and scatter matrix in the presence of outliers is proposed. The proposed estimator is obtained using the best units (samples) from the available data set that satisfied a set of three optimality criteria (CA,CH,CG).The performance of the proposed robust method was compared with two of the existing robust methods (MCD and MVE) and the classical method with their application in Principal component analysis data simulation. The measure of performance used was the Mean Square Errors (MSE) of the characteristic roots (eigen-values) of the variance-covariance matrix. Generally, the proposed alternative method is better than other robust methods and classical method, when the level of magnitude of outliers is small and also performed considerably well with MCD and MVE when the level of magnitude is high at all percentages of outliers.
机译:众所周知,当估计离群值时,在估计多元数据的位置和散布矩阵时,经典方法并不可靠。最大似然估计器(MLE)总是对数据假设的某些偏差非常敏感,尤其是存在异常值时。为了克服上述问题,在过去的几十年中已经提出了许多健壮的替代估计器。这些估计量中的一些包括最小协方差行列式(MCD),最小体积椭球(MVE),S估计量,M估计量和最小正则协方差确定量(MRCD)。所有方法都通过找到足够大的数据子集来集中解决鲁棒估计问题。本文提出了一种在异常值存在的情况下估计多元位置和散布矩阵的鲁棒方法。拟议的估计量是使用可得数据集中的最佳单位(样本)获得的,该数据集满足一组三个最优性标准(CA,CH,CG)。将拟议的鲁棒方法的性能与现有的两种鲁棒方法进行比较MCD和MVE)及其经典方法在主成分分析数据模拟中的应用。使用的性能度量是方差-协方差矩阵的特征根(特征值)的均方误差(MSE)。通常,当离群值的水平较小时,提出的替代方法优于其他鲁棒方法和经典方法,并且在离群值的所有百分比均较高的情况下,MCD和MVE的性能也相当好。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号