...
首页> 外文期刊>Journal of computer and system sciences >Optimal outlier removal in high-dimensional spaces
【24h】

Optimal outlier removal in high-dimensional spaces

机译:高维空间中的最佳离群值去除

获取原文
获取原文并翻译 | 示例
           

摘要

We study the problem of finding an outlier-free subset of a set of points (or a probability distribution) in n-dimensional Euclidean space. As in [BFKV 99], a point x is defined to be a beta-outlier if there exists some direction it; in which its squared distance from the mean along w is greater than beta times the average squared distance from the mean along w. Our main theorem is that for any epsilon > 0, there exists a (1 - epsilon) fraction of the original distribution that has no O((n)/(epsilon)(b + log(n)/(epsilon)))-outliers, improving on the previous bound of 0(n(7) b/e). This is asymptotically the best possible, as shown by a matching lower bound. The theorem is constructive, and results in a (1)/(1-epsilon) approximation to the following optimization problem: given a distribution mu(i.e. the ability to sample from it), and a parameter epsilon > 0, find the minimum beta for which there exists a subset of probability at least (1 - epsilon) with no beta-outliers. (C) 2003 Elsevier Inc. All rights reserved. [References: 8]
机译:我们研究在n维欧几里得空间中找到一组点(或概率分布)的无离群子集的问题。与[BFKV 99]中一样,如果存在某个方向,则将点x定义为beta异常点;其中它与沿w的平均值的平方距离大于beta乘以与w的平均值的平均平方距离。我们的主要定理是,对于任何大于0的epsilon,原始分布中都有一个(1-epsilon)分数,而没有O((n)/(epsilon)(b + log(n)/(epsilon)))-离群值,改进了先前的0(n(7)b / e)范围。渐近地,这是最佳的,如匹配的下限所示。该定理是构造性的,并且导致以下优化问题的(1)/(1-ε)近似值:给定一个分布μu(即从中采样的能力),并且参数ε≥0,找到最小β因此,存在至少(1- epsilon)的概率子集,没有beta异常值。 (C)2003 Elsevier Inc.保留所有权利。 [参考:8]

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号