首页> 外文期刊>Neurocomputing >A Neural Probabilistic outlier detection method for categorical data
【24h】

A Neural Probabilistic outlier detection method for categorical data

机译:分类数据的神经概率离群值检测方法

获取原文
获取原文并翻译 | 示例
           

摘要

Unsupervised outlier detection for categorical data is important and essential for broad applications in various domains. The complex interactions between attributes and the relevance of attributes make it a stem challenge. Existing methods, including patterns-based and couplings-based methods, either fail to capture the complex interactions or cannot handle the diverse attributes well. In this paper, we propose a novel Neural Probabilistic Outlier Detection method for categorical data, called NPOD. We present a new log-bilinear neural model to learn the categorical distributions, and we observe that the inliers and the outliers can be well separated according to learning loss. Based on this basic observation, we give both empirical and theoretical analysis and present a new neural network architecture that captures the interactions of attributes. Moreover, the discriminative information is used in the proposed bias training process to make the inliers and the outliers more separable. Lastly, to distinguish relevance of attributes, two indicators are proposed for computing ensemble outlier score to get a reliable result. Experimental results show that NPOD significantly outperforms the state-of-the-art competitors on 12 real-world data sets in terms of AUC and P@k. (C) 2019 Elsevier B.V. All rights reserved.
机译:对分类数据进行无监督的离群值检测对于在各个领域中的广泛应用非常重要和必不可少。属性之间复杂的交互作用以及属性的相关性使其成为一个挑战。现有的方法,包括基于模式的方法和基于耦合的方法,要么无法捕获复杂的交互作用,要么无法很好地处理各种属性。在本文中,我们提出了一种新的用于分类数据的神经概率离群值检测方法,称为NPOD。我们提出了一种新的对数-双线性神经模型来学习分类分布,并且我们发现,根据学习损失,可以将离群值和离群值很好地分开。基于此基本观察,我们提供了经验和理论分析,并提出了一种捕获属性交互作用的新神经网络体系结构。此外,在建议的偏差训练过程中使用区分性信息,以使内部值和异常值更可分离。最后,为区分属性的相关性,提出了两个指标用于计算整体离群值,以获得可靠的结果。实验结果表明,就AUC和P @ k而言,NPOD在12个真实数据集上明显优于最新的竞争对手。 (C)2019 Elsevier B.V.保留所有权利。

著录项

  • 来源
    《Neurocomputing》 |2019年第6期|325-335|共11页
  • 作者单位

    Natl Univ Def Technol Sci & Technol Parallel & Distributed Lab Changsha Hunan Peoples R China|Natl Univ Def Technol Coll Comp Changsha Hunan Peoples R China;

    Natl Univ Def Technol Coll Comp Changsha Hunan Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Unsupervised outlier detection; Categorical data; Neural probabilistic model;

    机译:无监督的异常值检测;分类数据;神经概率模型;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号