首页> 外文期刊>Information Sciences: An International Journal >Large scale anomaly detection in mixed numerical and categorical input spaces
【24h】

Large scale anomaly detection in mixed numerical and categorical input spaces

机译:混合数值和分类输入空间中的大规模异常检测

获取原文
获取原文并翻译 | 示例
           

摘要

This work presents the ADMNC method, designed to tackle anomaly detection for large-scale problems with a mixture of categorical and numerical input variables. A flexible parametric probability measure is adjusted to input data, allowing low likelihood values to be tracked as anomalies. The main contribution of this method is that, to cope with the variable nature of the variables, we factorize the joint probability measure into two parts, namely, the marginal density of the continuous variables and the conditional probability of the categorical variables given the continuous part of the feature vector. The result is a model trained through a maximum likelihood objective function optimized with stochastic gradient descent that yields an effective and scalable algorithm. Compared with other well-known anomaly detection algorithms over several datasets, ADMNC is observed to both offer top level accuracy in datasets that are out of reach for the most effective existing methods and to scale up well to processing very large datasets. This makes it a powerful tool for solving a problem growing in popularity that currently lacks suitable scalable algorithms. (C) 2019 Elsevier Inc. All rights reserved.
机译:这项工作介绍了ADMNC方法,旨在用分类和数值输入变量的混合来解决大规模问题的异常检测。将柔性参数概率测量​​调整为输入数据,允许将低似然值被跟踪为异常。这种方法的主要贡献是,为了应对变量的可变性质,我们将联合概率测量分解成两部分,即连续变量的边缘密度以及给出连续部分的分类变量的条件概率特征向量。结果是通过使用随机梯度下降优化的最大似然物镜函数训练的模型,其产生有效且可扩展的算法。与其他几个数据集的其他众所周知的异常检测算法相比,ADMNC将观察到在数据集中提供顶级准确性,以获得最有效的现有方法,并扩大到处理非常大的数据集。这使它成为解决当前缺乏合适的可扩展算法的普及问题的强大工具。 (c)2019 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号