New Algorithms for Finding Monad Patterns in DNA Sequences

机译：用于在DNA序列中查找Monad模式的新算法

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper, we present two new algorithms for discovering monad patterns in DNA sequences. Monad patterns are of the form (l,d)-k, where l is the length of the pattern, d is the maximum number of mismatches allowed, and k is the minimum number of times the pattern is repeated in the given sample. The time-complexity of some of the best known algorithms to date is O(nt~2 l~d σ~d), where t is the number of input sequences, n is the length of each input sequence, and σ = |Σ| is the size of the alphabet. The first algorithm that we present in this paper takes O(n~2 t~2 l~(d/2)) time and O(ntl~(d/2) σ~(d/2)) space, and the second algorithm takes O(n~3 t~3 l~(d/2) σ~(d/2)) time using O(l~(d/2) σ~(d/2)) space. In practice, our algorithms have much better performance provided the d/l ratio is small. The second algorithm performs very well even for large values l and d as long as the d/l ratio is small.

机译：在本文中，我们提出了两个用于在DNA序列中发现Monad模式的新算法。 Monad图案是形式（L，D）-K，其中L是图案的长度，D是允许的最大不匹配数，K是在给定样品中重复模式的最小次数。迄今为止一些最知名的算法的时间复杂性是O（nt〜2 l〜dσ〜d），其中t是输入序列的数量，n是每个输入序列的长度，σ= |σ |是字母表的大小。我们在本文中存在的第一算法需要O（n〜2 t〜2 l〜（d / 2））时间和o（ntl〜（d / 2）σ〜（d / 2））空间，第二个算法采用O（n〜3 t〜3 l〜（d / 2）σ〜（d / 2））时间使用o（l〜（d / 2）σ〜（d / 2））空间。在实践中，我们的算法提供了更好的性能，提供了D / L比例很小。即使对于大值L和D，第二算法也可以非常好，只要D / L比例很小即可。

著录项

来源
《International Conference on String Processing and Information Retrieval》|2004年||共13页
会议地点
作者
Ravi Vijaya Satya; Amar Mukherjee;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类数据备份与恢复;
关键词

相似文献

外文文献
中文文献
专利

1. Genetic algorithm for dyad pattern finding in DNA sequences [J] . Zare-Mirakabad Fatemeh, Ahrabian Hayedeh, Sadeghi Mehdi, Genes & Genetic Systems . 2009,第1期

机译：DNA序列中二分体模式发现的遗传算法
2. Genetic algorithm for dyad pattern finding in DNA sequences [J] . Abbas Nowzari-Dalini, Bahram Goliaei, Fatemeh Zare-Mirakabad, Genes & Genetic Systems . 2009,第1期

机译：DNA序列中二分体模式发现的遗传算法
3. Pattern locator: a new tool for finding local sequence patterns in genomic DNA sequences [J] . Mrazek J, Xie SH Bioinformatics . 2006,第24期

机译：模式定位器：一种用于寻找基因组DNA序列中局部序列模式的新工具
4. New Algorithms for Finding Monad Patterns in DNA Sequences [C] . Ravi Vijaya Satya, Amar Mukherjee International Conference on String Processing and Information Retrieval(SPIRE 2004); 20041005-08; Padova(IT) . 2004

机译：在DNA序列中寻找Monad模式的新算法
5. Finding patterns in DNA sequences through visualization with symbolic scatter plots. [D] . Cox, David N. 2010

机译：通过使用符号散点图进行可视化查找DNA序列中的模式。
6. WORDUP: an efficient algorithm for discovering statistically significant patterns in DNA sequences. [O] . G Pesole, N Prunella, S Liuni, 1992

机译：WORDUP：一种有效的算法用于发现DNA序列中具有统计学意义的模式。
7. PRUNER: Algorithms for Finding Monad Patterns in DNA Sequences [O] . Ravi Vijayasatya, Amar Mukherjee 2014

机译：pRUNER：在DNa序列中寻找monad模式的算法

New Algorithms for Finding Monad Patterns in DNA Sequences

摘要

著录项

相似文献

相关主题

期刊订阅