...
首页> 外文期刊>Intelligent data analysis >Statistical supports for mining sequential patterns and improving the incremental update process on data streams
【24h】

Statistical supports for mining sequential patterns and improving the incremental update process on data streams

机译:统计支持,用于挖掘顺序模式并改进数据流上的增量更新过程

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Recently, the knowledge extraction community takes a closer, look at new models where data arrive in timely manner like a fast and continuous flow, i.e. data streams. As only a part of the stream can be stored, mining data streams for sequential patterns and updating previously found frequent patterns need to cope with uncertainty. In this paper, we introduce a new statistical approach which biases the initial support for sequential patterns. This approach holds the advantage to maximize either the precision or the recall, as chosen by the user, and limit the degradation of the other criterion. Moreover, these statistical supports help building statistical borders which are the relevant sets of frequent patterns to use into an incremental mining process. From the statistical standpoint, theoretical results show that the technique is not far from the optimum. Experiments performed on sequential patterns demonstrate the interest of this approach and the potential of such techniques.
机译:最近,知识提取社区更仔细地研究了新模型,在这些模型中,数据以快速连续的方式(即数据流)及时到达。由于只能存储一部分流,因此需要为顺序模式挖掘数据流并更新以前发现的频繁模式,以应对不确定性。在本文中,我们介绍了一种新的统计方法,该方法会偏向对顺序模式的初始支持。这种方法的优点是可以使用户选择的精度或召回率最大化,并限制其他标准的降级。此外,这些统计支持有助于建立统计边界,这些边界是在增量采矿过程中要使用的频繁模式的相关集合。从统计的角度来看,理论结果表明该技术与最佳方法相距不远。在顺序模式上进行的实验证明了这种方法的重要性以及这种技术的潜力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号