MiFI-Outlier: Minimal infrequent itemset-based outlier detection approach on uncertain data stream

Abstract

Over the past twenty years, a large number of outlier detection approaches have been proposed for static datasets, and they have achieved good results. In real-world applications, uncertain data streams are increasingly common, but most existing outlier detection approaches are not suited to the uncertain data stream environment. In addition, many outlier detection approaches do not consider how frequently each element appears, so the outliers they detect do not match the definition of an outlier. Itemset-based outlier detection approaches offer a good solution to this problem and have attracted growing attention in recent years. In this paper, a novel two-step minimal infrequent itemset-based outlier detection approach called MiFI-Outlier is proposed to effectively detect outliers in uncertain data streams. In the itemset mining phase, a matrix-based method called MiFI-UDSM is proposed to mine the minimal infrequent itemsets (MiFIs) from an uncertain data stream, and an improved approach called MiFI-UDSM* is then proposed to mine these minimal infrequent itemsets more efficiently using the ideas of "item cap" and "support cap". In the outlier detection phase, based on the mined MiFIs, three deviation indices, namely the minimal infrequent itemset deviation index (MiFIDI), the similarity deviation index (SDI), and the transaction deviation index (TDI), are defined to measure the deviation degree of each transaction, and MiFI-Outlier is then used to identify the outliers in the uncertain data stream. Several experimental studies are conducted on public and synthetic datasets, and the results show that the proposed approaches achieve superior performance in both the infrequent itemset mining phase and the outlier detection phase. (C) 2019 Elsevier B.V. All rights reserved.
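
The notion of a minimal infrequent itemset in the abstract can be illustrated with a small Python sketch. This is not the paper's MiFI-UDSM or MiFI-UDSM* algorithm, and it does not implement the MiFIDI, SDI, or TDI indices, whose definitions the abstract does not give. It assumes the common expected-support model for uncertain data (each item carries an existence probability, and an itemset's support is the sum over transactions of the product of its items' probabilities) and a brute-force level-wise search. The function names, the sample transactions, and the threshold are all hypothetical.

```python
from itertools import combinations


def expected_support(itemset, transactions):
    """Expected support: sum over transactions of the product of the
    existence probabilities of the itemset's items (assumed model)."""
    total = 0.0
    for t in transactions:
        p = 1.0
        for item in itemset:
            p *= t.get(item, 0.0)  # an absent item has probability 0
        total += p
    return total


def minimal_infrequent_itemsets(transactions, min_support):
    """Return itemsets that are infrequent while every proper subset is
    frequent. Brute-force level-wise search, for illustration only."""
    items = sorted({i for t in transactions for i in t})
    frequent = {frozenset()}  # the empty itemset is trivially frequent
    minimal_infrequent = []
    for k in range(1, len(items) + 1):
        found_frequent = False
        for combo in combinations(items, k):
            # Expected support is anti-monotone, so checking the
            # (k-1)-subsets is enough to cover all proper subsets.
            if any(frozenset(s) not in frequent
                   for s in combinations(combo, k - 1)):
                continue
            if expected_support(combo, transactions) >= min_support:
                frequent.add(frozenset(combo))
                found_frequent = True
            else:
                minimal_infrequent.append(combo)
        if not found_frequent:
            break  # no frequent k-itemsets, so no larger candidates
    return minimal_infrequent


# Hypothetical uncertain transactions: item -> existence probability.
transactions = [
    {"a": 0.9, "b": 0.8, "c": 0.5},
    {"a": 0.8, "b": 0.9},
    {"a": 0.7, "c": 0.6},
    {"b": 0.6, "d": 0.5},
]
print(minimal_infrequent_itemsets(transactions, min_support=1.0))
# [('d',), ('a', 'c'), ('b', 'c')]
```

In this toy data, item d is infrequent on its own, while a, b, and c are each frequent but the pairs (a, c) and (b, c) fall below the threshold, so those pairs are minimal infrequent itemsets. A stream setting would additionally need a sliding window over the transactions, which this sketch omits.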

Bibliographic details

Source
Knowledge-Based Systems | 2020, Issue 5 | pp. 317-338 | 22 pages
Author affiliations

China Agr Univ Coll Informat & Elect Engn Beijing 100083 Peoples R China;

China Agr Univ Coll Informat & Elect Engn Beijing 100083 Peoples R China|Minist Agr Sci Res Base Integrated Technol Precis Agr Anim H Beijing 100083 Peoples R China;

Original format: PDF
Language: English
Keywords
Outlier detection; Minimal infrequent itemset mining; Uncertain data stream; Deviation indices; Data mining;

Similar literature

1. Minimal weighted infrequent itemset mining-based outlier detection approach on uncertain data stream [J]. Neural Computing & Applications. 2020, Issue 11

2. Minimal infrequent pattern based approach for mining outliers in data streams [J]. Hemalatha C. Sweetlin, Vaidehi V., Lakshmi R. Expert Systems with Applications. 2015, Issue 4

3. An efficient approach for outlier detection from uncertain data streams based on maximal frequent patterns [J]. Cai Saihua, Li Li, Li Sicong. Expert Systems with Applications. 2020, Issue Deca

4. Minimal Rare-Pattern-Based Outlier Detection Method for Data Streams by Considering Anti-monotonic Constraints [C] . Saihua Cai, Jinfu Chen, Xinru Li, Information Security Conference . 2020

5. Towards outlier detection for high-dimensional data streams using projected outlier analysis strategy. [D] . Zhang, Ji. 2009

6. Designing a Streaming Algorithm for Outlier Detection in Data Mining—An Incremental Approach [O] . Kangqing Yu, Wei Shi, Nicola Santoro 2020

