A Neural Probabilistic outlier detection method for categorical data

Cheng Li; Wang Yijie; Ma Xingkong

首页> 外文期刊>Neurocomputing >A Neural Probabilistic outlier detection method for categorical data

【24h】

A Neural Probabilistic outlier detection method for categorical data

机译：分类数据的神经概率离群值检测方法

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Unsupervised outlier detection for categorical data is important and essential for broad applications in various domains. The complex interactions between attributes and the relevance of attributes make it a stem challenge. Existing methods, including patterns-based and couplings-based methods, either fail to capture the complex interactions or cannot handle the diverse attributes well. In this paper, we propose a novel Neural Probabilistic Outlier Detection method for categorical data, called NPOD. We present a new log-bilinear neural model to learn the categorical distributions, and we observe that the inliers and the outliers can be well separated according to learning loss. Based on this basic observation, we give both empirical and theoretical analysis and present a new neural network architecture that captures the interactions of attributes. Moreover, the discriminative information is used in the proposed bias training process to make the inliers and the outliers more separable. Lastly, to distinguish relevance of attributes, two indicators are proposed for computing ensemble outlier score to get a reliable result. Experimental results show that NPOD significantly outperforms the state-of-the-art competitors on 12 real-world data sets in terms of AUC and P@k. (C) 2019 Elsevier B.V. All rights reserved.

机译：对分类数据进行无监督的离群值检测对于在各个领域中的广泛应用非常重要和必不可少。属性之间复杂的交互作用以及属性的相关性使其成为一个挑战。现有的方法，包括基于模式的方法和基于耦合的方法，要么无法捕获复杂的交互作用，要么无法很好地处理各种属性。在本文中，我们提出了一种新的用于分类数据的神经概率离群值检测方法，称为NPOD。我们提出了一种新的对数-双线性神经模型来学习分类分布，并且我们发现，根据学习损失，可以将离群值和离群值很好地分开。基于此基本观察，我们提供了经验和理论分析，并提出了一种捕获属性交互作用的新神经网络体系结构。此外，在建议的偏差训练过程中使用区分性信息，以使内部值和异常值更可分离。最后，为区分属性的相关性，提出了两个指标用于计算整体离群值，以获得可靠的结果。实验结果表明，就AUC和P @ k而言，NPOD在12个真实数据集上明显优于最新的竞争对手。（C）2019 Elsevier B.V.保留所有权利。

著录项

来源
《Neurocomputing》 |2019年第6期|325-335|共11页
作者
Cheng Li; Wang Yijie; Ma Xingkong;
展开▼
作者单位

Natl Univ Def Technol Sci & Technol Parallel & Distributed Lab Changsha Hunan Peoples R China|Natl Univ Def Technol Coll Comp Changsha Hunan Peoples R China;

Natl Univ Def Technol Coll Comp Changsha Hunan Peoples R China;

展开▼
收录信息美国《科学引文索引》(SCI);美国《工程索引》(EI);
原文格式 PDF
正文语种 eng
中图分类
关键词
Unsupervised outlier detection; Categorical data; Neural probabilistic model;

机译：无监督的异常值检测;分类数据;神经概率模型;

相似文献

外文文献
中文文献
专利

1. A Neural Probabilistic outlier detection method for categorical data [J] . Cheng Li, Wang Yijie, Ma Xingkong Neurocomputing . 2019,第NOVa6期

机译：分类数据的神经概率离群值检测方法
2. ACCELERATING THE OUTLIER DETECTION METHODS FOR CATEGORICAL DATA BY USING MATRIX OF ATTRIBUTE VALUE FREQUENCY [J] . NUR ROKHMAN, SUBANAR, EDI WINARKO Journal of Theoretical and Applied Information Technology . 2017,第21期

机译：利用属性值频率矩阵加速分类数据的边缘检测方法
3. ACCELERATING THE OUTLIER DETECTION METHODS FOR CATEGORICAL DATA BY USING MATRIX OF ATTRIBUTE VALUE FREQUENCY [J] . NUR ROKHMAN, SUBANAR, EDI WINARKO Journal of Theoretical and Applied Information Technology . 2017,第21期

机译：利用属性值频率矩阵加速分类数据的边缘检测方法
4. WMEVF: An outlier detection methods for categorical data [C] . Nur Rokhman, Subanar, Edi Winarko 2016 International Conference on Informatics and Computing . 2016

机译：WMEVF：分类数据的异常检测方法
5. Towards outlier detection for high-dimensional data streams using projected outlier analysis strategy. [D] . Zhang, Ji. 2009

机译：使用投影离群值分析策略实现对高维数据流的离群值检测。
6. Outlier Detection Transilience-Probabilistic Model for Wind Tunnels Based on Sensor Data [O] . Encarna Quesada, Juan J. Cuadrado-Gallego, Miguel Ángel Patricio, 2021

机译：基于传感器数据的风隧道风轨道探测跨电 - 概率模型
7. A Semi-Supervised Approach to the Detection and Characterization of Outliers in Categorical Data [O] . Ienco, Dino, Pensa, Ruggero, Meo, Rosa 2016

机译：分类数据中异常值的检测和表征的半监督方法
8. Probabilistic Neural Networks for Chemical Sensor Array Pattern Recognition:Comparison Studies, Improvements and Automated Outlier Rejection [R] . Shaffer, R., Rose-Pehrsson, S. L., McGill, R. A. 1998

机译：用于化学传感器阵列模式识别的概率神经网络：比较研究，改进和自动异常值抑制

A Neural Probabilistic outlier detection method for categorical data

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅