首页> 外文期刊>IEEE Transactions on Knowledge and Data Engineering >Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases
【24h】

Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases

机译:在大型不确定数据库中挖掘概率频率顺序模式

获取原文
获取原文并翻译 | 示例

摘要

dData uncertainty is inherent in many real-world applications such as environmental surveillance and mobile tracking. Mining sequential patterns from inaccurate data, such as those data arising from sensor readings and GPS trajectories, is important for discovering hidden knowledge in such applications. In this paper, we propose to measure pattern frequentness based on the possible world semantics. We establish two uncertain sequence data models abstracted from many real-life applications involving uncertain sequence data, and formulate the problem of mining probabilistically frequent sequential patterns (or p-FSPs) from data that conform to our models. However, the number of possible worlds is extremely large, which makes the mining prohibitively expensive. Inspired by the famous PrefixSpan algorithm, we develop two new algorithms, collectively called U-PrefixSpan, for p-FSP mining. U-PrefixSpan effectively avoids the problem of “possible worlds explosion”, and when combined with our four pruning and validating methods, achieves even better performance. We also propose a fast validating method to further speed up our U-PrefixSpan algorithm. The efficiency and effectiveness of U-PrefixSpan are verified through extensive experiments on both real and synthetic datasets.
机译:dData不确定性在许多实际应用中都是固有的,例如环境监视和移动跟踪。从不准确的数据(例如由传感器读数和GPS轨迹产生的数据)中挖掘顺序模式对于在此类应用中发现隐藏的知识非常重要。在本文中,我们建议根据可能的世界语义来测量模式频繁性。我们建立了两个不确定序列数据模型,这些模型是从涉及不确定序列数据的许多现实应用中抽象出来的,并提出了从符合我们模型的数据中挖掘概率性频繁序列模式(或p-FSP)的问题。但是,可能的世界数量非常多,这使得采矿成本过高。受著名的PrefixSpan算法启发,我们开发了两种用于p-FSP挖掘的新算法,统称为U-PrefixSpan。 U-PrefixSpan有效地避免了“可能的世界爆炸”的问题,并且与我们的四种修剪和验证方法结合使用时,可以实现更好的性能。我们还提出了一种快速验证方法,以进一步加快我们的U-PrefixSpan算法。 U-PrefixSpan的效率和有效性通过在真实和合成数据集上进行的大量实验得到了验证。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号