【24h】

Robust PCA and clustering in noisy mixtures

机译:强大的PCA和嘈杂混合物中的聚类

获取原文

摘要

This paper presents a polynomial algorithm for learning mixtures of logconcave distributions in Rn in the presence of malicious noise. That is, each sample is corrupted with some small probability, being replaced by a point about which we can make no assumptions. A key element of the algorithm is Robust Principle Components Analysis (PCA), which is less susceptible to corruption by noisy points. While noise may cause standard PCA to collapse well-separated mixture components so that they are indistinguishable, Robust PCA preserves the distance between some of the components, making a partition possible. It then recurses on each half of the mixture until every component is isolated. The success of this algorithm requires only a O(log n) factor increase in the required separation between components of the mixture compared to the noiseless case.
机译:本文提出了一种用于在存在恶意噪声的情况下学习Rn中对数凹面分布的混合的多项式算法。也就是说,每个样本都以很小的概率被破坏,取而代之的是我们无法作任何假设的点。该算法的关键要素是稳健的主成分分析(PCA),它不易受到噪声点的破坏。噪声可能会导致标准PCA压倒分离良好的混合成分,从而使它们难以区分,而健壮的PCA会保留某些成分之间的距离,从而使分隔成为可能。然后在混合物的每一半上重复进行,直到分离出每种成分。与无噪声情况相比,该算法的成功仅要求混合物成分之间所需间隔的O(log n)因子增加。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号