...
首页> 外文期刊>Journal of information science and engineering >PTree: Mining Sequential Patterns Efficiently in Multiple Data Streams Environment
【24h】

PTree: Mining Sequential Patterns Efficiently in Multiple Data Streams Environment

机译:PTree:在多个数据流环境中有效地挖掘顺序模式

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Although issues of data streams have been widely studied and utilized, it is nevertheless challenging to deal with sequential mining of data streams. In this paper, we assume that the transaction of a user is partially coming and that there is no auxiliary for buffering and integrating. We adopt the Path Tree for mining frequent sequential patterns over data streams and integrate the user's sequences efficiently. Algorithms with regards to accuracy (PAlgorithm) and space (PSAlgorithm) are proposed to meet the different aspects of users, although GAlgorithm for mining frequent sequential patterns with a gap limitation is proposed. Many pruning properties are used to further reduce the space usage and improve the accuracy of our algorithms. We also prove that PAlgorithm mine frequent sequential patterns with the approximate support of error guarantee. Through thoughtful experiments, synthetic and real datasets are utilized to verify the feasibility of our algorithms.
机译:尽管数据流问题已得到广泛研究和利用,但是处理数据流的顺序挖掘仍然具有挑战性。在本文中,我们假设用户的交易部分到来,并且没有用于缓冲和集成的辅助工具。我们采用路径树来挖掘数据流上的频繁顺序模式,并有效地集成用户的序列。尽管提出了用于挖掘具有间隙限制的频繁序列模式的GAlgorithm,但仍提出了有关准确性(PAlgorithm)和空间(PSAlgorithm)的算法来满足用户的不同方面。许多修剪属性用于进一步减少空间使用并提高我们算法的准确性。我们还证明了PAlgorithm可以在错误保证的近似支持下挖掘频繁的顺序模式。通过深思熟虑的实验,利用合成数据集和真实数据集来验证我们算法的可行性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号