Mining Sequential Patterns from Probabilistic Databases


Abstract

We consider sequential pattern mining in situations where there is uncertainty about which source an event is associated with. We model this in the probabilistic database framework and consider the problem of enumerating all sequences whose expected support is sufficiently large. Unlike frequent itemset mining in probabilistic databases [C. Aggarwal et al., KDD'09; Chui et al., PAKDD'07; Chui and Kao, PAKDD'08], we use dynamic programming (DP) to compute the probability that a source supports a sequence, and show that this suffices to compute the expected support of a sequential pattern. Next, we embed this DP algorithm into candidate generate-and-test approaches, and explore the pattern lattice both in a breadth-first (similar to GSP) and a depth-first (similar to SPAM) manner. We propose optimizations for efficiently computing the frequent 1-sequences, for re-using previously computed results through incremental support computation, and for eliminating candidate sequences without computing their support via probabilistic pruning. Preliminary experiments show that our optimizations are effective in reducing the CPU cost.
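
The DP idea in the abstract can be illustrated with a minimal sketch. It is not taken from the paper: it assumes that each event carries a probability distribution over sources, that events are assigned to sources independently, and that a source supports a pattern when the pattern's itemsets are contained, in order, in the events assigned to that source. The `Event` representation and the function names `support_probability` and `expected_support` are hypothetical. Under these assumptions the expected support is the sum, over sources, of each source's support probability (linearity of expectation).

```python
from typing import Dict, FrozenSet, List, Tuple

# Hypothetical representation (an assumption, not the paper's notation):
# an event is (itemset, {source: probability that the event belongs to it}).
Event = Tuple[FrozenSet[str], Dict[str, float]]
Pattern = List[FrozenSet[str]]


def support_probability(events: List[Event], pattern: Pattern, source: str) -> float:
    """Probability that `source` supports `pattern`, assuming events are
    assigned to sources independently of one another."""
    q = len(pattern)
    # prob[i] = probability that pattern[:i] is supported by the events
    # processed so far; the empty prefix is always supported.
    prob = [1.0] + [0.0] * q
    for items, dist in events:
        c = dist.get(source, 0.0)  # chance that this event belongs to `source`
        # Right to left, so prob[i - 1] still describes the state before this event.
        for i in range(q, 0, -1):
            if pattern[i - 1] <= items:  # i-th itemset is contained in the event
                prob[i] = (1.0 - c) * prob[i] + c * prob[i - 1]
    return prob[q]


def expected_support(events: List[Event], pattern: Pattern) -> float:
    """Sum of per-source support probabilities (linearity of expectation)."""
    sources = {s for _, dist in events for s in dist}
    return sum(support_probability(events, pattern, s) for s in sources)


# Toy data, invented for illustration only.
events: List[Event] = [
    (frozenset({"a"}), {"X": 0.5, "Y": 0.5}),
    (frozenset({"b"}), {"X": 1.0}),
    (frozenset({"a", "b"}), {"X": 0.25, "Y": 0.75}),
]
print(expected_support(events, [frozenset({"a"}), frozenset({"b"})]))  # 0.875
print(expected_support(events, [frozenset({"a"})]))                    # 1.5
```

Each call costs time proportional to the number of events times the pattern length. A miner in the generate-and-test style would compare the value returned by `expected_support` against a chosen threshold for every candidate pattern.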


Bibliographic details

Source
Pacific-Asia conference on knowledge discovery and data mining; PAKDD 2011; Workshop on behavior informatics; BI 2011; Workshop on advances and issues in traditional Chinese medicine clinical data mining; Workshop on quality issues, measures of interestingness and evaluation of data mining models; AI-TCM; QIMIE 2011; Workshop on biologically inspired techniques for data mining; BDM 2011; Workshop on data mining for healthcare management; DMHM 2011 | 2011 | pp. 210-221 | 12 pages
Authors
Muhammad Muzammal; Rajeev Raman
Original format: PDF
Chinese Library Classification: TP311.13
Keywords
mining uncertain data; mining complex sequential data; probabilistic databases; novel models and algorithms


Similar literature

1. Learning Trajectory Patterns by Sequential Pattern Mining from Probabilistic Databases [J]. Josky Aïzan, Cina Motamed, Eugene C. Ezin. Computer Science & Information Technology. 2018, No. 15
2. Mining sequential patterns from probabilistic databases [J]. Muzammal Muhammad, Raman Rajeev. Knowledge and information systems. 2015, No. 2
3. Mining Probabilistically Frequent Sequential Patterns in Large Uncertain Databases [J]. Zhao Z., Yan D., Ng W. IEEE Transactions on Knowledge and Data Engineering. 2014, No. 5
4. Mining Sequential Patterns from Probabilistic Databases by Pattern-Growth [C]. Muhammad Muzammal. Advances in databases. 2011
5. New algorithms for frequent sequential pattern and itemset data mining in certain and uncertain databases [D]. Peterson, Erich Allen. 2012
6. Mining of high utility-probability sequential patterns from uncertain databases [O]. Binbin Zhang, Jerry Chun-Wei Lin, Philippe Fournier-Viger, 2011
7. LEARNING TRAJECTORY PATTERNS BY SEQUENTIAL PATTERN MINING FROM PROBABILISTIC DATABASES [O]. Josky Aïzan, Cina Motamed, Eugene C. Ezin. 2018
