首页> 外文会议> >A Variable Markovian Based Outlier Detection Method for Multi-Dimensional Sequence over Data Stream

【24h】

A Variable Markovian Based Outlier Detection Method for Multi-Dimensional Sequence over Data Stream

机译：数据流多维序列的基于变马尔可夫异常检测方法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Nowadays sequence data tends to be multi-dimensional sequence over data stream, it has a large state space and arrives at unprecedented speed. It is a big challenge to design a multi-dimensional sequence outlier detection method to meet the accurate and high speed requirements. The traditional methods can't handle multi-dimensional sequence effectively as they have poor abilities for multi-dimensional sequence modeling, and can't detect outlier timely as they have high computational complexity. In this paper we propose a variable Markovian based outlier detection method for multi-dimensional sequence over data stream, VMOD, which consists of two algorithms: mutual information based feature selection algorithm (MIFS), variable Markovian based sequential analysis algorithm (VMSA). It uses MIFS algorithm to reduce the state space and redundant features, and uses VMSA algorithm to accelerate the outlier detection. Through VMOD method, we can improve the detection rate and detection speed. The MIFS algorithm uses mutual information as similarity measures and adopt clustering based strategy to select features, it can improve the abilities for sequence modeling through reducing the state space and redundant features, consequently, to improve the detection rate. The VMSA algorithm use random sample and index structure to accelerate the variable Markovian model construction and reduce the model complexity, consequently, to quicken the outlier detection. The experiments show that VMOD can detect outlier effectively, and reduce the detection time by at least 50% compared with the traditional methods.

机译：如今，序列数据往往是数据流上的多维序列，它具有很大的状态空间，并且以前所未有的速度到达。设计一种多维序列离群值检测方法来满足准确和高速的要求是一个巨大的挑战。传统方法不能有效地处理多维序列，因为它们对多维序列建模的能力很差，并且由于它们具有很高的计算复杂性而不能及时检测到异常值。本文针对数据流上的多维序列提出了一种基于变量马尔可夫的离群值检测方法VMOD，该方法由两种算法组成：基于互信息的特征选择算法（MIFS），基于变量马尔可夫的顺序分析算法（VMSA）。它使用MIFS算法来减少状态空间和冗余功能，并使用VMSA算法来加速离群值检测。通过VMOD方法，可以提高检测率和检测速度。 MIFS算法使用互信息作为相似性度量，并采用基于聚类的策略选择特征，通过减少状态空间和冗余特征来提高序列建模的能力，从而提高检测率。 VMSA算法使用随机样本和索引结构来加快变量马尔可夫模型的构建并降低模型复杂度，从而加快异常值的检测。实验表明，与传统方法相比，VMOD可以有效地检测离群值，并将检测时间减少至少50％。

著录项

来源
《》|2016年|183-188|共6页
会议地点
作者
Dongsheng Yang; Yijie Wang; Yongmou Li; Xingkong Ma;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Clustering algorithms; Algorithm design and analysis; Mutual information; Computational modeling; Data models; Feature extraction; Indexes;

机译：聚类算法;算法设计与分析;互信息;计算建模;数据模型;特征提取;索引;

相似文献

外文文献
中文文献
专利

1. UWFP-Outlier: an efficient frequent-pattern-based outlier detection method for uncertain weighted data streams [J] . Cai Saihua, Li Li, Li Qian, Applied Intelligence: The International Journal of Artificial Intelligence, Neural Networks, and Complex Problem-Solving Technologies . 2020,第10期

机译：UWFP - 异常值：基于有效的基于频繁模式的异常转速检测方法，用于不确定加权数据流
2. FAAD:an unsupervised fast and accurate anomaly detection method for a multi-dimensional sequence over data stream [J] . Bin LI, Yi-jie WANG, Dong-sheng YANG, 浙江大学学报（英文版）（C辑：计算机与电子） . 2019,第003期

机译：FAAD：一种用于数据流多维序列的无监督快速准确的异常检测方法
3. Generalised linear model-based algorithm for detection of outliers in environmental data and comparison with semi-parametric outlier detection methods [J] . Martina ?ampulová, Jaroslav Michálek, Ji?í Mou?ka Atmospheric Pollution Research . 2019,第4期

机译：基于线性模型的基于线性模型的算法，用于检测环境数据中的异常值和半参数异常检测方法的比较
4. A Variable Markovian Based Outlier Detection Method for Multi-Dimensional Sequence over Data Stream [C] . Dongsheng Yang, Yijie Wang, Yongmou Li, International Conference on Parallel and Distributed Computing, Applications and Technologies . 2016

机译：基于Marlovian基于Markovian的数据流多维序列的异常检测方法
5. Parallelized Cell-Based Outlier Detection for Data Streams [D] . Green, Wyatt. 2021

机译：基于链接的基于单元的异常检测数据流
6. Evaluation of Two Outlier-Detection-Based Methods for Detecting Tissue-Selective Genes from Microarray Data [O] . Koji Kadota, Tomokazu Konishi, Kentaro Shimizu 2007

机译：两种基于离群检测的方法从微阵列数据检测组织选择基因的评价。
7. Incremental Principal Component Analysis Based Outliers Detection Methods for Spatiotemporal Data Streams [O] . Bhushan Alka, Sharker Monir, Karimi Hassan A. 2015

机译：基于增量主成分分析的时空数据流离群值检测方法
8. Multiple Outliers in Linear Regression: Advances in Detection Methods, Robust Estimation, and Variable Selection [R] . Wisnowski, J. W. 1999

机译：线性回归中的多个异常值：检测方法，稳健估计和变量选择的进展

A Variable Markovian Based Outlier Detection Method for Multi-Dimensional Sequence over Data Stream

摘要

著录项

相似文献

相关主题

期刊订阅