Permu-pattern: Discovery of Mutable Permutation Patterns with Proximity Constraint

Abstract

Pattern discovery in sequences is an important problem in many applications, especially in computational biology and text mining. However, because the data are noisy, the traditional sequential pattern model may fail to reflect the underlying characteristics of sequence data in these applications. There are two challenges. First, mutation noise exists in the data, so symbols may be misrepresented by other symbols. Second, the order of symbols in sequences may be permuted. To address these problems, we propose a new sequential pattern model called mutable permutation patterns. Since the Apriori property does not hold for this model, a novel Permu-pattern algorithm is devised to mine frequent mutable permutation patterns from sequence databases. A reachability property is identified to prune the candidate set. We apply the permutation pattern model to a real genome dataset to discover gene clusters, which shows the effectiveness of the model. A large amount of synthetic data is also used to demonstrate the efficiency of the Permu-pattern algorithm.
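
As a rough illustration of the "permutation with proximity" idea only, the sketch below counts how many sequences contain all symbols of a pattern, in any order, within a fixed-width window. It is not the Permu-pattern algorithm: the abstract does not give the paper's formal definitions, so the window-based proximity constraint, the function names `occurs_within_window` and `support`, and the omission of mutation noise are all assumptions made for this example.

```python
from collections import Counter


def occurs_within_window(sequence, pattern, window):
    """Return True if some run of at most `window` consecutive symbols of
    `sequence` contains every symbol of `pattern` (with multiplicity),
    in any order. Assumption: proximity means a fixed-width window."""
    need = Counter(pattern)
    if window < len(pattern):
        return False  # the window cannot hold all pattern symbols
    for start in range(len(sequence)):
        have = Counter(sequence[start:start + window])
        if all(have[symbol] >= count for symbol, count in need.items()):
            return True
    return False


def support(database, pattern, window):
    """Number of sequences in `database` in which `pattern` occurs.
    Brute force; mutation (symbol substitution) is NOT modeled here."""
    return sum(occurs_within_window(seq, pattern, window) for seq in database)
```

For example, with the hypothetical database `["abcxd", "xcabx", "axbxc"]` and pattern `"abc"`, `support` returns 2 for a window of 3 and 3 for a window of 5, since the third sequence only contains all three symbols once the window is widened.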


Bibliographic record

Source
ACM KDD International Conference on Knowledge Discovery and Data Mining (KDD 2008), 2008, pp. 300-308 (9 pages)
Authors
Meng Hu; Jiong Yang; Wei Su
Original format
PDF
CLC classification
Information and knowledge dissemination
Keywords
sequential pattern; permutation pattern; proximity pattern

