首页> 外文OA文献 >Algorithms and data structures for grammar-compressed strings
【2h】

Algorithms and data structures for grammar-compressed strings

机译:语法压缩字符串的算法和数据结构

摘要

Textual databases for e.g. biological or web-data are growing rapidly, and it is often only feasible to store the data in compressed form. However, compressing the data comes at a price. Traditional algorithms for e.g. pattern matching requires all data to be decompressed - a computationally demanding task. In this thesis we design data structures for accessing and searching compressed data efficiently.Our results can be divided into two categories. In the first category we study problems related to pattern matching. In particular, we present new algorithms for counting and comparing substrings, and a new algorithm for finding all occurrences of a pattern in which we may insert gaps. In the other category we deal with accessing and decompressing parts of the compressed string. We show how to quickly access a single character of the compressed string, and present a data structure that supports fast decompression of substrings from prespecified positions.
机译:文字数据库,例如生物或网络数据正在迅速增长,通常仅以压缩形式存储数据是可行的。但是,压缩数据是有代价的。传统算法,例如模式匹配要求将所有数据解压缩-这是一项计算量很大的任务。本文设计了一种有效访问和搜索压缩数据的数据结构。我们的研究结果可以分为两类。在第一类中,我们研究与模式匹配有关的问题。特别是,我们提供了用于计数和比较子字符串的新算法,以及用于查找所有可能插入空格的模式的新算法。在另一类中,我们处理访问和解压缩部分压缩字符串。我们展示了如何快速访问压缩字符串的单个字符,并展示了一种数据结构,该数据结构支持从预定位置快速解压缩子字符串。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号