【24h】

Anomaly Detection in Categorical Datasets Using Bayesian Networks

机译:使用贝叶斯网络的分类数据集中的异常检测

获取原文

摘要

In this paper we present a method for finding anomalous records in categorical or mixed datasets in an unsupervised fashion. Since the data in many problems consist of normal records with a small minority of anomalies, many approaches build a model from the training data and compare the test records against it. But instead of building a model, we keep track of the number of occurrences of different attribute value combinations. We also consider a more meaningful definition of anomalies and incorporate the Bayesian network structure in it. A scoring technique is defined for each test record. In this procedure we combine supports of different rules according to the Bayesian network structure in order to determine the label of the test instances. As it is shown in the results, our proposed method has a higher or similar f-measure and precision compared to a Bayesian network based approach in all cases.
机译:在本文中,我们提出了一种以无监督的方式在分类或混合数据集中查找异常记录的方法。由于许多问题中的数据都由正常记录组成,并且异常情况很少,因此许多方法都可以从训练数据中构建模型,并将测试记录与模型进行比较。但是,我们无需建立模型,而是跟踪不同属性值组合的出现次数。我们还考虑了对异常的更有意义的定义,并将贝叶斯网络结构纳入其中。为每个测试记录定义了一种评分技术。在此过程中,我们根据贝叶斯网络结构组合了不同规则的支持,以确定测试实例的标签。结果表明,在所有情况下,与基于贝叶斯网络的方法相比,我们提出的方法具有更高或相似的f值和精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号