首页> 外文期刊>IEEE intelligent systems >Isolation Forest Based Anomaly Detection Framework on Non-IID Data
【24h】

Isolation Forest Based Anomaly Detection Framework on Non-IID Data

机译:非IID数据的隔离林基异常检测框架

获取原文
获取原文并翻译 | 示例
           

摘要

Anomaly detection is a significant but challenging data mining task in a wide range of applications. Different domains usually use different ways to measure the characteristics of data and to define the anomaly types. As a result, it is a big challenge to develop a versatile anomaly detection framework that can be universally applied with satisfactory performance in most, if not all, applications. In this article, we propose a generic isolation forest based ensemble framework named EDBHiForest, which can be universally applied to data spaces with arbitrary distance measures. It is realized through embedding the isolation forest structure with extended distance-based hashing (EDBH), which can significantly enhance the versatility and applicability of isolation forest based anomaly detection. This framework overcomes the limitations of existing isolation forest based methods that can only be applied to datasets with a very limited range of distance measure types. Extensive experiments on various non-independent and identically distributed datasets demonstrate the effectiveness and efficiency of our approach.
机译:异常检测是在各种应用中的一个重要但具有挑战性的数据挖掘任务。不同的域通常使用不同的方式来测量数据的特征并定义异常类型。因此,开发一种多功能异常检测框架是一个很大的挑战,可以在大多数情况下通过令人满意地应用令人满意的性能,如果不是全部,应用程序。在本文中,我们提出了一个名为Edbhiforest的通用隔离林的集合框架,该框架可以普遍应用于具有任意距离测量的数据空间。通过将隔离林结构嵌入具有扩展距离的散列(EDBH),这可以显着提高分离林基异常检测的多功能性和适用性。该框架克服了现有的基于森林基于方法的限制,这些方法只能应用于具有非常有限的距离测量类型范围的数据集。各种非独立和相同分布的数据集的广泛实验证明了我们方法的有效性和效率。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号