首页> 外文会议> >A Variable Markovian Based Outlier Detection Method for Multi-Dimensional Sequence over Data Stream
【24h】

A Variable Markovian Based Outlier Detection Method for Multi-Dimensional Sequence over Data Stream

机译:数据流多维序列的基于变马尔可夫异常检测方法

获取原文

摘要

Nowadays sequence data tends to be multi-dimensional sequence over data stream, it has a large state space and arrives at unprecedented speed. It is a big challenge to design a multi-dimensional sequence outlier detection method to meet the accurate and high speed requirements. The traditional methods can't handle multi-dimensional sequence effectively as they have poor abilities for multi-dimensional sequence modeling, and can't detect outlier timely as they have high computational complexity. In this paper we propose a variable Markovian based outlier detection method for multi-dimensional sequence over data stream, VMOD, which consists of two algorithms: mutual information based feature selection algorithm (MIFS), variable Markovian based sequential analysis algorithm (VMSA). It uses MIFS algorithm to reduce the state space and redundant features, and uses VMSA algorithm to accelerate the outlier detection. Through VMOD method, we can improve the detection rate and detection speed. The MIFS algorithm uses mutual information as similarity measures and adopt clustering based strategy to select features, it can improve the abilities for sequence modeling through reducing the state space and redundant features, consequently, to improve the detection rate. The VMSA algorithm use random sample and index structure to accelerate the variable Markovian model construction and reduce the model complexity, consequently, to quicken the outlier detection. The experiments show that VMOD can detect outlier effectively, and reduce the detection time by at least 50% compared with the traditional methods.
机译:如今,序列数据往往是数据流上的多维序列,它具有很大的状态空间,并且以前所未有的速度到达。设计一种多维序列离群值检测方法来满足准确和高速的要求是一个巨大的挑战。传统方法不能有效地处理多维序列,因为它们对多维序列建模的能力很差,并且由于它们具有很高的计算复杂性而不能及时检测到异常值。本文针对数据流上的多维序列提出了一种基于变量马尔可夫的离群值检测方法VMOD,该方法由两种算法组成:基于互信息的特征选择算法(MIFS),基于变量马尔可夫的顺序分析算法(VMSA)。它使用MIFS算法来减少状态空间和冗余功能,并使用VMSA算法来加速离群值检测。通过VMOD方法,可以提高检测率和检测速度。 MIFS算法使用互信息作为相似性度量,并采用基于聚类的策略选择特征,通过减少状态空间和冗余特征来提高序列建模的能力,从而提高检测率。 VMSA算法使用随机样本和索引结构来加快变量马尔可夫模型的构建并降低模型复杂度,从而加快异常值的检测。实验表明,与传统方法相比,VMOD可以有效地检测离群值,并将检测时间减少至少50%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号