...
首页> 外文期刊>Mathematical structures in computer science >The longest common substring problem
【24h】

The longest common substring problem

机译:最长的普通子串问题

获取原文
获取原文并翻译 | 示例

摘要

Given a set D of q documents, the Longest Common Substring (LCS) problem asks, for anyrninteger 2 ≤ k ≤ q, the longest substring that appears in k documents. LCS is a well-studiedrnproblem having a wide range of applications in Bioinformatics: from microarrays to DNArnsequences alignments and analysis. This problem has been solved by Hui (2000 InternationalrnJournal of Computer Science and Engineering 15 73–76) by using a famous constant-timernsolution to the Lowest Common Ancestor (LCA) problem in trees coupled with the use ofrnsuffix trees.rnIn this article, we present a simple method for solving the LCS problem by using suffix treesrn(STs) and classical union-find data structures. In turn, we show how this simple algorithmrncan be adapted in order to work with other space efficient data structures such as thernenhanced suffix arrays (ESA) and the compressed suffix tree.
机译:给定一组q个文档,最长公共子字符串(LCS)问题要求任何整数2≤k≤q,出现在k个文档中的最长子字符串。 LCS是一个经过充分研究的问题,在生物信息学中有广泛的应用:从微阵列到DNA序列比对和分析。 Hui(2000 Internationalrn计算机科学与工程学报15 73–76)已通过使用著名的恒定时间解来解决树木中的最低共同祖先(LCA)问题,并使用了后缀树,从而解决了这个问题。提出了一种使用后缀treesrn(ST)和经典联合查找数据结构来解决LCS问题的简单方法。反过来,我们展示了如何修改此简单算法,以便与其他空间效率高的数据结构(如增强后缀数组(ESA)和压缩后缀树)一起使用。

著录项

  • 来源
    《Mathematical structures in computer science 》 |2017年第2期| 277-295| 共19页
  • 作者单位

    Department of Informatics, King’s College London, London, UK Université Paris-Est, Marne-La-Vallée, France;

    Department of Informatics, King’s College London, London, UK;

    Department of Informatics, King’s College London, London, UK DMI, Universit`a degli Studi di Palermo, Palermo, Italy;

    DISIM, Università dell’Aquila, L’Aquila, Italy;

  • 收录信息
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号