首页> 外文期刊>Engineering Economics >Outlier Mining Based on Neighbor-Density-Deviation with Minimum Hyper-Sphere
【24h】

Outlier Mining Based on Neighbor-Density-Deviation with Minimum Hyper-Sphere

机译:基于最小超球面的邻近密度偏差的离群挖掘

获取原文
       

摘要

Outlier mining is to find exceptional behaviors of objects that deviate from the rest of the dataset or do not satisfy the common patterns. This paper introduces a density definition using the minimum hyper sphere and proposes an outlier mining algorithm based on neighbor-density-deviation. First, the definition of local space-density of an object is proposed by using the minimum hyper sphere. Second, the nearest neighbor sequence (NNS) based on the distance between an object and the neighbors of the object is established. After getting the space-density and the NNS of the object, the neighborhood density deviation (NDD) in NNS can be calculated based on the sum of density difference between the object and its neighbors. Finally, the neighbor-density-deviation-based outlier factor (NDDOF) is obtained to indicate the degree of the object being an outlier. To evaluate the effectiveness and the performance of the novel definition of space density and the NDDOF algorithm, we experiment on a synthetic dataset and three real UCI datasets. The results verify that the space-density is meaningful and the NDDOF algorithm has higher quality in outlier mining.DOI: http://dx.doi.org/10.5755/j01.itc.45.3.13164
机译:离群挖掘是为了发现偏离数据集其余部分或不满足通用模式的对象的异常行为。本文介绍了使用最小超球面的密度定义,并提出了一种基于邻域密度偏差的离群值挖掘算法。首先,利用最小超球面提出了物体局部空间密度的定义。其次,基于对象与对象的邻居之间的距离,建立最近邻居序列(NNS)。在获得对象的空间密度和NNS之后,可以基于对象与其邻居之间的密度差之和来计算NNS中的邻域密度偏差(NDD)。最终,获得基于邻域密度偏差的离群因子(NDDOF),以指示对象的离群程度。为了评估新颖的空间密度定义和NDDOF算法的有效性和性能,我们在一个合成数据集和三个真实UCI数据集上进行了实验。结果验证了空间密度是有意义的,并且NDDOF算法在离群挖掘中具有更高的质量.DOI:http://dx.doi.org/10.5755/j01.itc.45.3.13164

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号