首页> 外文会议>String processing and information retrieval >Computing Matching Statistics and Maximal Exact Matches on Compressed Full-Text Indexes
【24h】

Computing Matching Statistics and Maximal Exact Matches on Compressed Full-Text Indexes

机译:在压缩的全文本索引上计算匹配统计信息和最大精确匹配

获取原文
获取原文并翻译 | 示例

摘要

Exact string matching is a problem that computer programmers face on a regular basis, and full-text indexes like the suffix tree or the suffix array provide fast string search over large texts. In the last decade, research on compressed indexes has flourished because the main problem in large-scale applications is the space consumption of the index. Nowadays, the most successful compressed indexes are able to obtain almost optimal space and search time simultaneously. It is known that a myriad of sequence analysis and comparison problems can be solved efficiently with established data structures like the suffix tree or the suffix array, but algorithms on compressed indexes that solve these problem are still lacking at present. Here, we show that matching statistics and maximal exact matches between two strings S_1 and S_2 can be computed efficiently by matching S2 backwards against a compressed index of S_1.
机译:精确的字符串匹配是计算机程序员经常要面对的一个问题,而诸如后缀树或后缀数组之类的全文本索引可以对大文本进行快速的字符串搜索。在过去的十年中,由于大型应用程序中的主要问题是索引的空间消耗,因此对压缩索引的研究蓬勃发展。如今,最成功的压缩索引能够同时获得几乎最佳的空间和搜索时间。众所周知,使用诸如后缀树或后缀数组之类的已建立数据结构可以有效地解决大量的序列分析和比较问题,但是目前仍然缺少解决这些问题的压缩索引算法。在这里,我们表明,通过针对压缩索引S_1向后匹配S2,可以有效地计算两个字符串S_1和S_2之间的匹配统计信息和最大精确匹配。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号