【24h】

ODDC: Outlier Detection Using Distance Distribution Clustering

机译:ODDC:使用距离分布聚类的异常值检测

获取原文
获取原文并翻译 | 示例

摘要

Outlier detection is an important issue in many industrial and financial applications. Most outlier detection methods suffer from two problems: First, they need parameter tuning in accord to domain knowledge. Second, they are incapable to scale up to high dimensional space. In this paper, we propose a distance-based outlier definition and a detection algorithm ODDC (Distribution Clustering Outlier Detection). We redefine the problem by clustering in the distribution difference space rather than the original feature space. As a result, the new algorithm is stable regardless of different input and scalable to the dimensionality. Experiments on both synthetic and real datasets show that ODDC outperforms the counterpart both in effectiveness and efficiency.
机译:离群检测是许多工业和金融应用中的重要问题。大多数异常值检测方法都存在两个问题:首先,它们需要根据领域知识进行参数调整。其次,它们无法扩展到高维空间。在本文中,我们提出了基于距离的离群值定义和检测算法ODDC(分布聚类离群值检测)。我们通过在分布差异空间而不是原始特征空间中进行聚类来重新定义问题。结果,新算法是稳定的,而与不同输入无关,并且可扩展到维数。在综合和真实数据集上进行的实验表明,ODDC在有效性和效率上均优于同类数据集。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号