首页> 外文会议>International conference on database systems for advanced applications;DASFAA 2011 >An Unbiased Distance-Based Outlier Detection Approach for High-Dimensional Data
【24h】

An Unbiased Distance-Based Outlier Detection Approach for High-Dimensional Data

机译:基于无偏距离的高维数据离群值检测方法

获取原文

摘要

Traditional outlier detection techniques usually fail to work efficiently on high-dimensional data due to the curse of dimensionality. This work proposes a novel method for subspace outlier detection, that specifically deals with multidimensional spaces where feature relevance is a local rather than a global property. Different from existing approaches, it is not grid-based and dimensionality unbiased. Thus, its performance is impervious to grid resolution as well as the curse of dimensionality. In addition, our approach ranks the outliers, allowing users to select the number of desired outliers, thus mitigating the issue of high false alarm rate. Extensive empirical studies on real datasets show that our approach efficiently and effectively detects outliers, even in high-dimensional spaces.
机译:由于维数的诅咒,传统的异常值检测技术通常无法有效地处理高维数据。这项工作提出了一种用于子空间离群值检测的新方法,该方法专门处理特征相关性是局部属性而不是全局属性的多维空间。与现有方法不同,它不是基于网格的,并且维数没有偏见。因此,它的性能不受网格分辨率以及维数诅咒的影响。此外,我们的方法对异常值进行排名,允许用户选择所需的异常值数量,从而减轻了误报率高的问题。对真实数据集的大量实证研究表明,即使在高维空间中,我们的方法也能有效地检测异常值。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号