首页> 外文会议>IEEE International Conference on Computational Advances in Bio and Medical Sciences >Linear Space String Correction Algorithm Using The Damerau-Levenshtein Distance
【24h】

Linear Space String Correction Algorithm Using The Damerau-Levenshtein Distance

机译:使用Dameau-Levenshtein距离的线性空间字符串校正算法

获取原文

摘要

We develop linear space algorithms to compute the Damerau-Levenshtein (DL) distance between two strings as well as to find a sequence of edit operations of length equal to the DL distance. The Damerau-Levenshtein (DL) distance between two strings is the minimum number of substitutions, inserts, deletes, and transpositions of adjacent characters required to transform one string into the other. In our previous research, we developed algorithms that run in 0(mn) time using only 0(s * min {m, n} + m + n) space, where s is the size of the alphabet comprising the strings. In this paper, we develop algorithms to compute the DL distance and corresponding edit sequence using 0(m+n) space and 0(mn) time. Extensive experiments conducted on three computational platforms-Xeon E5 2603, I7 -x980 and Xeon E5 2695 -show that, our algorithms, in addition to using less space, are much faster than earlier algorithms. In fact, our fastest algorithm for the DL distance is up to 56.4% faster than the fastest algorithm when run on a single core. The single core speedup to find the corresponding edit sequence is up to 57.4%. Our algorithms may be adapted to run on multicores providing a speedup up to 59.3%.
机译:我们开发了线性空间算法来计算两个字符串之间的Damerau-Levenshtein(DL)距离,并找到长度等于DL距离的一系列编辑操作。两个字符串之间的Damerau-Levenshtein(DL)距离是将一个字符串转换为另一个字符串所需的相邻字符的替换,插入,删除和换位的最小数量。在我们之前的研究中,我们开发了仅在0(s * min {m,n} + m + n)空间中以0(mn)时间运行的算法,其中s是组成字符串的字母的大小。在本文中,我们开发了使用0(m + n)空间和0(mn)时间计算DL距离和相应编辑序列的算法。在三个计算平台Xeon E5 2603,I7 -x980和Xeon E5 2695上进行的广泛实验表明,我们的算法除了占用更少的空间外,还比早期算法快得多。实际上,与在单个内核上运行的最快算法相比,我们最快的DL距离算法要快56.4%。查找相应编辑序列的单核加速高达57.4%。我们的算法可能适合在多核上运行,从而提供高达59.3%的加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号