首页> 外文会议>IEEE International Congress on Big Data >Local subspace-based outlier detection using global neighbourhoods
【24h】

Local subspace-based outlier detection using global neighbourhoods

机译:使用全局邻域的基于局部子空间的离群值检测

获取原文

摘要

Outlier detection in high-dimensional data is a challenging yet important task, as it has applications in, e.g., fraud detection and quality control. State-of-the-art density-based algorithms perform well because they 1) take the local neighbourhoods of data points into account and 2) consider feature subspaces. In highly complex and high-dimensional data, however, existing methods are likely to overlook important outliers because they do not explicitly take into account that the data is often a mixture distribution of multiple components. We therefore introduce GLOSS, an algorithm that performs local subspace outlier detection using global neighbourhoods. Experiments on synthetic data demonstrate that GLOSS more accurately detects local outliers in mixed data than its competitors. Moreover, experiments on real-world data show that our approach identifies relevant outliers overlooked by existing methods, confirming that one should keep an eye on the global perspective even when doing local outlier detection.
机译:高维数据中的异常检测是一项具有挑战性但重要的任务,因为它在例如欺诈检测和质量控制中具有应用。基于密度的最新算法表现出色,因为它们1)考虑了数据点的局部邻域,并且2)考虑了特征子空间。但是,在高度复杂和高维的数据中,现有方法可能会忽略重要的异常值,因为它们没有明确考虑到数据通常是多个组成部分的混合分布。因此,我们介绍了GLOSS,这是一种使用全局邻域执行局部子空间离群值检测的算法。综合数据的实验表明,GLOSS比其竞争对手更准确地检测混合数据中的局部异常值。此外,对现实世界数据的实验表明,我们的方法可以识别现有方法所忽略的相关异常值,从而确认即使进行局部异常值检测,也应该关注全局视角。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号