首页> 外文期刊>Journal of Algorithms >Transposition invariant string matching
【24h】

Transposition invariant string matching

机译:转置不变字符串匹配

获取原文
获取原文并翻译 | 示例
       

摘要

Given strings A = a(1)a(2)...a(m) and B=b(1)b(2)...b(n) over an alphabet Sigma subset of U, where U is some numerical universe closed under addition and subtraction, and a distance function d(A, B) that gives the score of the best (partial) matching of A and B, the transposition invariant distance is min(t is an element of U){d(A + t, B)}, where A + t = (a(1) + t)(a(2) + t)...(a(m) + t). We study the problem of computing the transposition invariant distance for various distance (and similarity) functions d, including Hamming distance, longest common subsequence (LCS), Levenshtein distance, and their versions where the exact matching condition is replaced by an approximate one. For all these problems we give algorithms whose time complexities are close to the known upper bounds without transposition invariance, and for some we achieve these upper bounds. In particular, we show how sparse dynamic programming can be used to solve transposition invariant problems, and its connection with multidimensional range-minimum search. As a byproduct, we give improved sparse dynamic programming algorithms to compute LCS and Levenshtein distance. (c) 2004 Elsevier Inc. All rights reserved.
机译:在U的字母Sigma子集上给定字符串A = a(1)a(2)... a(m)和B = b(1)b(2)... b(n),其中U是一些数字宇宙在加法和减法下闭合,并且距离函数d(A,B)给出A和B最佳(部分)匹配的分数,换位不变距离为min(t是U的元素){d( A + t,B)},其中A + t =(a(1)+ t)(a(2)+ t)...(a(m)+ t)。我们研究了计算各种距离(和相似性)函数d的换位不变距离的问题,这些函数包括汉明距离,最长公共子序列(LCS),Levenshtein距离,以及它们的版本,其中精确匹配条件被替换为近似值。对于所有这些问题,我们给出了时间复杂度接近已知上限且无转置不变性的算法,对于某些问题,我们实现了这些上限。特别是,我们展示了稀疏动态规划如何用于解决转置不变性问题及其与多维范围最小搜索的联系。作为副产品,我们提供了改进的稀疏动态规划算法来计算LCS和Levenshtein距离。 (c)2004 Elsevier Inc.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号