【24h】

Processing Compressed Texts: A Tractability Border

机译:处理压缩文本:可操作性边界

获取原文
获取原文并翻译 | 示例

摘要

What kind of operations can we perform effectively (without full unpacking) with compressed texts? In this paper we consider three fundamental problems: (1) check the equality of two compressed texts, (2) check whether one compressed text is a substring of another compressed text, and (3) compute the number of different symbols (Hamming distance) between two compressed texts of the same length. We present an algorithm that solves the first problem in O(n~3) time and the second problem in O(n~2m) time. Here n is the size of compressed representation (we consider representations by straight-line programs) of the text and m is the size of compressed representation of the pattern. Next, we prove that the third problem is actually #P-complete. Thus, we indicate a pair of similar problems (equivalence checking, Hamming distance computation) that have radically different complexity on compressed texts. Our algorithmic technique used for problems (1) and (2) helps for computing minimal periods and covers of compressed texts.
机译:我们可以对压缩文本进行有效的操作(无需完全解压缩)?在本文中,我们考虑了三个基本问题:(1)检查两个压缩文本的相等性;(2)检查一个压缩文本是否是另一个压缩文本的子字符串;(3)计算不同符号的数量(汉明距离)在两个相同长度的压缩文本之间。我们提出了一种算法,可以解决O(n〜3)时间中的第一个问题和O(n〜2m)时间中的第二个问题。在这里,n是文本的压缩表示形式(我们考虑采用直线程序表示)的大小,m是模式的压缩表示形式的大小。接下来,我们证明第三个问题实际上是#P-complete。因此,我们指出了一对相似的问题(等效性检查,汉明距离计算),它们在压缩文本上具有根本不同的复杂性。我们用于问题(1)和(2)的算法技术有助于计算最小周期和压缩文本的覆盖范围。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号