首页> 外文期刊>Neural computing & applications >Minimal weighted infrequent itemset mining-based outlier detection approach on uncertain data stream
【24h】

Minimal weighted infrequent itemset mining-based outlier detection approach on uncertain data stream

机译:最小加权不频繁的替代项目集基于不确定数据流的异常转速检测方法

获取原文
获取原文并翻译 | 示例
           

摘要

Outliers are a critical factor that affects the accuracy of data-based predictions and some other data-based processing; thus, outliers must be effectively detected as soon as possible to improve the credibility of the data. In recent years, massive outlier detection approaches have been proposed for static data and precise data; however, the uncertainty and weight information of each item was not considered in this prior work. Moreover, traditional outlier detection approaches only take the deviation degree of each data element as the standard for determining outliers; therefore, the detected outliers do not fit the definition of an outlier (i.e., rarely appearing and different from most of the other data). Aimed at these problems, a minimal weighted infrequent itemset mining-based outlier detection approach that can be applied to an uncertain data stream, called MWIFIM-OD-UDS, is proposed in this paper to effectively detect implicit outliers, which have a rarely occurring frequency, uncertainty and a certain weight of the itemset, while the characteristics of the data stream are considered. In particular, a matrix structure-based approach that is called MWIFIM-UDS is proposed to mine the minimal weighted infrequent itemsets (MWiFIs) from an uncertain data stream, and then, the MWIFIM-OD-UDS method is proposed based on the mined MWiFIs and the designed deviation indexes. Experimental results show that the proposed MWIFIM-OD-UDS method outperforms the frequent itemset mining-based outlier detection methods, FindFPOF and LFP, in terms of its runtime and detection accuracy.
机译:异常值是影响基于数据的预测和一些其他基于数据的准确性的关键因素;因此,必须尽快有效地检测到异常值以提高数据的可信度。近年来,已经提出了静态数据和精确数据的大规模异常检测方法;然而,在本前工作中未考虑每个项目的不确定性和重量信息。此外,传统的异常传统检测方法仅将每个数据元素的偏差程度作为确定异常值的标准;因此,检测到的异常值不适合异常值的定义(即,很少从大多数其他数据出现和不同)。旨在这些问题,在本文中提出了一种基于重量的不频繁的异常挖掘的基于不频繁的异常挖掘,可以应用于称为MWIFIM-OD-UDS的不确定数据流,以有效地检测隐式异常值,这些异常值具有很少发生的频率,不确定度和一定重量的项目集,而虽然考虑了数据流的特征。特别地,提出了一种被称为MWIFIM-UDS的基于基于矩阵结构的方法,以从不确定的数据流中挖掘最小加权的不频繁项目集(MWIFIS),然后,基于所开采的MWIFIS提出MWIFIM-OD-UDS方法和设计的偏差指标。实验结果表明,提出的MWIFIM-OD-UDS方法在其运行时和检测准确性方面优于频繁的项目集挖掘出口检测方法,FindFPOF和LFP。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号