首页> 外文期刊>Journal of information and computational science >Outlier Detection Using the Information Entropy of Neighborhood Rough Sets
【24h】

Outlier Detection Using the Information Entropy of Neighborhood Rough Sets

机译:使用邻域粗糙集信息熵的异常值检测

获取原文
获取原文并翻译 | 示例
           

摘要

As an important research direction in data mining, outlier detection has been drawing much attention from different communities. Much use has been made of rough set theory for outlier detection as it is completely data-driven and no other knowledge is required; most other methods require some additional information. However, traditional rough set-based methods in the literature are restricted to the requirement that all data must be discrete. It is therefore not possible to consider real-valued or noisy data. This is usually addressed by employing a discrete method, which can result in information loss. This paper proposes a new approach based on the neighborhood rough set model, which has the ability to deal with real-valued data whilst simultaneously retaining dataset semantics. More significantly, this paper describes the underlying mechanism for this new approach to utilize the information entropy for measuring uncertainty. The use of this measurement can improved outlier detection accuracy. These results are supported by an experimental evaluation which compares the proposed approach with a number of existing outlier detection techniques.
机译:作为数据挖掘的重要研究方向,离群值检测已引起了不同社区的广泛关注。粗糙集理论已被广泛用于异常值检测,因为它是完全由数据驱动的,不需要其他知识。大多数其他方法需要一些其他信息。但是,文献中传统的基于粗糙集的方法仅限于所有数据必须是离散的要求。因此,不可能考虑实值或嘈杂的数据。这通常通过采用离散方法来解决,这可能导致信息丢失。本文提出了一种基于邻域粗糙集模型的新方法,该方法具有处理实值数据并同时保留数据集语义的能力。更重要的是,本文描述了这种新方法利用信息熵测量不确定性的潜在机制。使用此测量可以提高异常值检测精度。这些结果得到实验评估的支持,该评估将提出的方法与许多现有的异常检测技术进行了比较。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号