An efficient polynomial space and polynomial delay algorithm for enumeration of maximal motifs in a sequence

Arimura H; Uno T

首页> 外文期刊>Journal of combinatorial optimization >An efficient polynomial space and polynomial delay algorithm for enumeration of maximal motifs in a sequence

【24h】

An efficient polynomial space and polynomial delay algorithm for enumeration of maximal motifs in a sequence

机译：一种有效的多项式空间和多项式延迟算法，用于枚举序列中的最大图案

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we consider the problem of enumerating all maximal motifs in an input string for the class of repeated motifs with wild cards. A maximal motif is such a representative motif that is not properly contained in any larger motifs with the same location lists. Although the enumeration problem for maximal motifs with wild cards has been studied in Parida et al. (2001), Pisanti et al. (2003) and Pelfrene et al. (2003), its output-polynomial time computability has been still open. The main result of this paper is a polynomial space polynomial delay algorithm for the maximal motif enumeration problem for the repeated motifs with wild cards. This algorithm enumerates all maximal motifs in an input string of length n in O(n(3)) time per motif with O(n) space, in particular O(n(3)) delay. The key of the algorithm is depth-first search on a tree-shaped search route over all maximal motifs based on a technique called prefix-preserving closure extension. We also show an exponential lower bound and a succinctness result on the number of maximal motifs, which indicate the limit of a straightforward approach. The results of the computational experiments show that our algorithm can be applicable to huge string data such as genome data in practice, and does not take large additional computational cost compared to usual frequent motif mining algorithms.

机译：在本文中，我们考虑了使用通配符对重复主题类别枚举输入字符串中所有最大主题的问题。最大主题是这样的代表性主题，它没有正确包含在具有相同位置列表的任何较大主题中。尽管在Parida等人中已经研究了带有通配符的最大图案的枚举问题。（2001），Pisanti等。（2003）和Pelfrene等。（2003年），其输出多项式时间可计算性仍然是开放的。本文的主要结果是针对具有通配符的重复图案的最大图案枚举问题的多项式空间多项式延迟算法。此算法枚举每个主题的O（n（3））时间（特别是O（n（3））延迟）在长度为n的输入字符串中的所有最大主题，每个主题的O（n（3））时间。该算法的关键是基于一种称为前缀保留闭包扩展的技术，在所有最大主题的树形搜索路径上进行深度优先搜索。我们还显示了最大图案数的指数下界和简洁结果，这表明了简单方法的局限性。计算实验的结果表明，我们的算法在实践中可适用于巨大的字符串数据，如基因组数据，与常用的频繁基序挖掘算法相比，不需要花费额外的计算成本。

著录项

来源
《Journal of combinatorial optimization》 |2007年第3期|共20页
作者
Arimura H; Uno T;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类管理学;
关键词
motif; maximal motif; data mining; sequence mining; algorithm; delay; enumeration; polynomial time; closed itemset; closed pattern; pattern discovery; PATTERNS;

机译：主题;最大主题;数据挖掘;序列挖掘;算法;延迟;枚举;多项式时间;封闭项集;封闭模式;模式发现;模式;

相似文献

外文文献
中文文献
专利

1. An efficient polynomial space and polynomial delay algorithm for enumeration of maximal motifs in a sequence [J] . Arimura H, Uno T Journal of combinatorial optimization . 2007,第3期

机译：一种有效的多项式空间和多项式延迟算法，用于枚举序列中的最大图案
2. An efficient polynomial space and polynomial delay algorithm for enumeration of maximal motifs in a sequence [J] . Hiroki Arimura, Takeaki Uno Journal of Combinatorial Optimization . 2007,第3期

机译：一种有效的多项式空间和多项式延迟算法，用于枚举序列中的最大图案
3. A Polynomial Space Polynomial Delay Algorithm for Enumerating Maximal Motifs in a Sequence [J] . Hiroki ARIMURA, Takeaki UNO 電子情報通信学会技術研究報告. コンピュテ-ション. Theoretical Foundations of Computing . 2005,第273期

机译：序列中最大母题的多项式空间多项式延迟算法
4. A Polynomial Space and Polynomial Delay Algorithm for Enumeration of Maximal Motifs in a Sequence [C] . Hiroki Arimura, Takeaki Uno International Symposium on Algorithms and Computation(ISAAC 2005); 20051219-21; Sanya(CN) . 2005

机译：序列中最大母题枚举的多项式空间和多项式延迟算法
5. A transformation of orthogonal polynomial sequences into orthogonal Laurent polynomial sequences. [D] . Hagler, Brian Allan. 1997

机译：将正交多项式序列转换为正交Laurent多项式序列。
6. Polynomial algorithms for the Maximal Pairing Problem: efficient phylogenetic targeting on arbitrary trees [O] . Christian Arnold, Peter F Stadler 2010

机译：最大配对问题的多项式算法：针对任意树的有效系统发育目标
7. An efficient polynomial space and polynomial delay algorithm for enumeration of maximal motifs in a sequence [O] . Hiroki Arimura, Takeaki Uno 2007

机译：一种有效的多项式空间和多项式延迟算法，用于枚举序列中的最大基序
8. Polynomial space polynomial delay algorithms for listing families of graphs. [R] . Goldberg, L. A. 1992

机译：用于列出图族的多项式空间多项式时滞算法。

An efficient polynomial space and polynomial delay algorithm for enumeration of maximal motifs in a sequence

摘要

著录项

相似文献

相关主题

期刊订阅