首页> 外文会议>IEEE international conference on data engineering >Discriminative features for identifying and interpreting outliers
【24h】

Discriminative features for identifying and interpreting outliers

机译:识别和解释离群值的判别功能

获取原文

摘要

We consider the problem of outlier detection and interpretation. While most existing studies focus on the first problem, we simultaneously address the equally important challenge of outlier interpretation. We propose an algorithm that uncovers outliers in subspaces of reduced dimensionality in which they are well discriminated from regular objects while at the same time retaining the natural local structure of the original data to ensure the quality of outlier explanation. Our algorithm takes a mathematically appealing approach from the spectral graph embedding theory and we show that it achieves the globally optimal solution for the objective of subspace learning. By using a number of real-world datasets, we demonstrate its appealing performance not only w.r.t. the outlier detection rate but also w.r.t. the discriminative human-interpretable features. This is the first approach to exploit discriminative features for both outlier detection and interpretation, leading to better understanding of how and why the hidden outliers are exceptional.
机译:我们考虑离群值检测和解释的问题。尽管大多数现有研究都集中在第一个问题上,但我们同时解决了离群值解释同样重要的挑战。我们提出一种算法,该算法可发现维数较小的子空间中的异常值,在这些维中,它们与常规对象有很好的区别,同时保留了原始数据的自然局部结构,以确保异常值的解释质量。我们的算法从频谱图嵌入理论中采用了一种数学上有吸引力的方法,并且我们证明了该算法实现了针对子空间学习目标的全局最优解。通过使用许多现实世界的数据集,我们不仅展示了其引人注目的性能。离群点检出率但也有w.r.t.具有歧视性的人类可解释特征。这是利用判别特征进行离群值检测和解释的第一种方法,从而可以更好地理解隐藏的离群值是如何以及为什么是异常的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号