首页> 外文期刊>Computational statistics & data analysis >RelaxMCD: Smooth optimisation for the Minimum Covariance Determinant estimator
【24h】

RelaxMCD: Smooth optimisation for the Minimum Covariance Determinant estimator

机译:RelaxMCD:最小协方差行列式估计量的平滑优化

获取原文
获取原文并翻译 | 示例
           

摘要

The Minimum Covariance Determinant (MCD) estimator is a highly robust procedure for estimating the centre and shape of a high dimensional data set. It consists of determining a subsample of h points out of IT which minimises the generalised variance. By definition, the computation of this estimator gives rise to a combinatorial optimisation problem, for which several approximate algorithms have been developed. Some of these approximations are quite powerful, but they do not take advantage of any smoothness in the objective function. Recently, in a general framework, an approach transforming any discrete and high dimensional combinatorial problem of this type into a continuous and low-dimensional one has been developed and a general algorithm to solve the transformed problem has been designed. The idea is to build on that general algorithm in order to take into account particular features of the MCD methodology. More specifically, two main goals are considered: (a) adaptation of the algorithm to the specific MCD target function and (b) comparison of this 'tuned' algorithm with the usual competitors for computing MCD. The adaptation focuses on the design of 'clever' starting points in order to systematically investigate the search domain. Accordingly, a new and surprisingly efficient procedure based on a suitably equivariant modification of the well-known k-means algorithm is constructed. The adapted algorithm, called RelaxMCD, is then compared by means of simulations with FASTMCD and the Feasible Subset Algorithm, both benchmark algorithms for computing MCD. As a by-product, it is shown that RelaxMCD is a general technique encompassing the two others, yielding insight into their overall good performance.
机译:最小协方差决定因素(MCD)估计器是用于估计高维数据集的中心和形状的高度鲁棒的过程。它由确定IT的h点的子样本组成,这使广义方差最小。根据定义,该估计器的计算会引起组合优化问题,为此已经开发了几种近似算法。这些近似中的一些近似非常有力,但是它们没有利用目标函数中的任何平滑度。最近,在通用框架中,已经开发了一种将这种离散和高维组合问题转换为连续和低维问题的方法,并且已经设计了解决该转换问题的通用算法。想法是在该通用算法的基础上考虑MCD方法的特定功能。更具体地说,考虑了两个主要目标:(a)使算法适应特定的MCD目标功能,以及(b)将此“调整的”算法与计算MCD的通常竞争对手进行比较。改编侧重于“聪明”起点的设计,以便系统地调查搜索域。因此,基于众所周知的k-均值算法的适当等变修改,构造了新的且出人意料的有效过程。然后,通过使用FASTMCD和可行子集算法(这是用于计算MCD的基准算法)的仿真,将经过调整的算法称为RelaxMCD,进行比较。作为副产品,表明RelaxMCD是涵盖其他两个方面的通用技术,可深入了解它们的总体性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号