首页> 外文会议>International Symposium on Symbolic and Numeric Algorithms for Scientific Computing >Efficient and Robust Median-of-Means Algorithms for Location and Regression
【24h】

Efficient and Robust Median-of-Means Algorithms for Location and Regression

机译:高效,稳健的位置和回归均值算法

获取原文

摘要

We consider the computational problem to learn models from data that are possibly contaminated with outliers. We design and analyze algorithms for robust location and robust linear regression. Such algorithms are essential for solving central problems of robust statistics and outlier detection. We show that our algorithms, which are based on a novel extension of the Median-of-Means method by employing the discrete geometric median, are accurate, efficient and robust against many outliers in the data. The discrete geometric median has many desirable characteristics such as it works for general metric spaces and preserves combinatorial and statistical properties. Furthermore, there is an exact and efficient algorithm to compute it, and an even faster approximation algorithm. We present theoretical and experimental results. In particular, we emphasize the generality of Median-of-Means and its ability to speedup and parallelize algorithms which additionally are accurate and robust against many outliers in the data.
机译:我们考虑了计算问题,以从可能被异常值污染的数据中学习模型。我们设计并分析了用于稳健位置和稳健线性回归的算法。这样的算法对于解决鲁棒统计和离群值检测的中心问题至关重要。我们证明了基于离散几何中值的基于均值中值方法的新颖扩展的算法,对于数据中的许多离群值而言是准确,高效和鲁棒的。离散几何中值具有许多理想的特性,例如,它适用于一般度量空间并保留组合和统计属性。此外,还有一种精确而有效的算法可以对其进行计算,甚至还有一种更快的近似算法。我们提出理论和实验结果。特别是,我们强调均值中位数的一般性及其加速和并行化算法的能力,这些算法对于数据中的许多异常值也具有精确性和鲁棒性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号