首页> 外文期刊>Theoretical computer science >Algorithms for computing variants of the longest common subsequence problem
【24h】

Algorithms for computing variants of the longest common subsequence problem

机译:计算最长公共子序列问题的变体的算法

获取原文
获取原文并翻译 | 示例
       

摘要

The longest common subsequence (LCS) problem is one of the classical and well-studied problems in computer science. The computation of the LCS is a frequent task in DNA sequence analysis, and has applications to genetics and molecular biology. In this paper we introduce new variants of LCS problem and present efficient algorithms to solve them. In particular we introduce the notion of gap constraints in the LCS problems. For the LCS problem with fixed gap, we first present a naive algorithm runs in O(n{sup}2 + R(K + 1){sup}2) time, where R is the total number of ordered pairs of positions at which the two strings match and K is the fixed gap constraint. We then improve the running time to O(n{sup}2 + RK + R log log n) using some novel techniques. Furthermore, we present an algorithm that is independent of K and runs in O(n{sup}2 + R log log n) time. Using these techniques, we also present a new O(n{sup}2) algorithm to solve the original LCS problem. Additionally, we modify our algorithms to handle elastic and rigid gaps. We also apply the notion of rigidness to the original LCS problem and modify the traditional dynamic programming solution to handle the rigidness presenting a O(n{sup}2) algorithm to solve the problem. Finally, we also improve the solution to Rigid Fixed Gap LCS to O(n{sup}2). Notably, in all of the above cases, we assume that the two given strings are of equal length i.e. n. But our results can be easily extended to handle two strings of different length.
机译:最长的公共子序列(LCS)问题是计算机科学中经过经典研究的问题之一。 LCS的计算是DNA序列分析中的常见任务,并已应用于遗传学和分子生物学。在本文中,我们介绍了LCS问题的新变体,并提出了有效的算法来解决它们。特别是,我们在LCS问题中引入了间隙约束的概念。对于具有固定间隔的LCS问题,我们首先提出一个在O(n {sup} 2 + R(K +1){sup} 2)时间中运行的朴素算法,其中R是在此位置的有序对的总数两个字符串匹配,并且K是固定间隙约束。然后,我们使用一些新颖的技术将运行时间缩短到O(n {sup} 2 + RK + R log log n)。此外,我们提出了一种独立于K且在O(n {sup} 2 + R log log n)时间中运行的算法。使用这些技术,我们还提出了一种新的O(n {sup} 2)算法来解决原始的LCS问题。此外,我们修改了算法以处理弹性和刚性间隙。我们还将刚度的概念应用于原始的LCS问题,并修改了传统的动态规划解决方案以处理刚度,提出了O(n {sup} 2)算法来解决该问题。最后,我们还将刚性固定间隙LCS的解决方案改进为O(n {sup} 2)。值得注意的是,在上述所有情况下,我们假设两个给定的字符串长度相等,即n。但是我们的结果可以轻松扩展为处理两个不同长度的字符串。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号