【24h】

Outlier Detection via Localized p-value Estimation

机译:通过局部p值估计进行异常值检测

获取原文

摘要

We propose a novel non-parametric adaptive outlier detection algorithm, called LPE, for high dimensional data based on score functions derived from nearest neighbor graphs on n-point nominal data. Outliers are predicted whenever the score of a test sample falls below α, which is supposed to be the desired false alarm level. The resulting outlier detector is shown to be asymptotically optimal in that it is uniformly most powerful for the specified false alarm level, α, for the case when the density associated with the outliers is a mixture of the nominal and a known density. Our algorithm is computationally efficient, being linear in dimension and quadratic in data size. The whole empirical Receiving Operating Characteristics (ROC) curve can be derived with almost no additional cost based on the estimated score function. It does not require choosing complicated tuning parameters or function approximation classes and it can adapt to local structure such as local change in dimensionality by incorporating the technique of manifold learning. We demonstrate the algorithm on both artificial and real data sets in high dimensional feature spaces.
机译:我们提出了一种新的非参数自适应离群值检测算法,称为LPE,用于基于从n点标称数据上的最近邻图得出的得分函数的高维数据。每当测试样本的分数下降到α以下(被认为是所需的虚警级别)时,就可以预测异常值。所显示的异常值检测器是渐近最优的,因为对于与异常值相关的密度是标称密度和已知密度的混合的情况,它对于指定的虚警级别α始终是最强大的。我们的算法计算效率高,维数线性,数据大小平方。基于估算的得分函数,几乎不需要额外的成本就可以得出整个经验接收操作特性(ROC)曲线。它不需要选择复杂的调整参数或函数逼近类,并且可以通过整合流形学习技术来适应局部结构,例如局部维数变化。我们在高维特征空间中的人工数据集和真实数据集上都演示了该算法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号