An Unbiased Distance-Based Outlier Detection Approach for High-Dimensional Data


Abstract

Traditional outlier detection techniques usually fail to work efficiently on high-dimensional data due to the curse of dimensionality. This work proposes a novel method for subspace outlier detection that specifically deals with multidimensional spaces where feature relevance is a local rather than a global property. Unlike existing approaches, it is not grid-based and is dimensionality-unbiased. Thus, its performance is impervious to grid resolution as well as the curse of dimensionality. In addition, our approach ranks the outliers, allowing users to select the number of desired outliers, thus mitigating the issue of high false alarm rates. Extensive empirical studies on real datasets show that our approach efficiently and effectively detects outliers, even in high-dimensional spaces.
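The ranking idea mentioned in the abstract (score every point, then let the user pick how many top-ranked outliers to inspect) can be illustrated with a minimal sketch. This is not the paper's subspace method: it assumes a plain full-space Euclidean distance and uses the distance to the k-th nearest neighbour as the outlier score, a common generic baseline. The function names `knn_outlier_scores` and `top_n_outliers`, the parameters `k` and `n`, and the sample data are all hypothetical.

```python
import math


def knn_outlier_scores(points, k):
    """Score each point by its Euclidean distance to its k-th nearest neighbour.

    Generic distance-based baseline (an assumption for illustration),
    not the subspace, dimensionality-unbiased score proposed in the paper.
    """
    if not 1 <= k < len(points):
        raise ValueError("k must satisfy 1 <= k < number of points")
    scores = []
    for i, p in enumerate(points):
        # Distances from p to every other point, smallest first.
        dists = sorted(math.dist(p, q) for j, q in enumerate(points) if j != i)
        scores.append(dists[k - 1])
    return scores


def top_n_outliers(points, k, n):
    """Return the indices of the n highest-scoring points, strongest outlier first."""
    scores = knn_outlier_scores(points, k)
    ranked = sorted(range(len(points)), key=lambda i: scores[i], reverse=True)
    return ranked[:n]


# Hypothetical data: a tight cluster near the origin plus one distant point.
points = [(0.0, 0.0), (0.1, 0.0), (0.0, 0.1), (0.1, 0.1), (5.0, 5.0)]
print(top_n_outliers(points, k=2, n=1))
```

Because the output is a ranking rather than a yes/no label, the user controls how many candidates to review by choosing `n`, which is the mechanism the abstract credits with reducing false alarms.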


Bibliographic details

Source
International Conference on Database Systems for Advanced Applications (DASFAA 2011), 2011, pp. 138-152 (15 pages)
Authors
Hoang Vu Nguyen; Vivekanand Gopalkrishnan; Ira Assent
Original format: PDF
Chinese Library Classification: TP311.13

