首页> 外文会议>Annual Symposium on Combinatorial Pattern Matching(CPM 2007); 20070709-11; London(CA) >Move-to-Front, Distance Coding, and Inversion Frequencies Revisited
【24h】

Move-to-Front, Distance Coding, and Inversion Frequencies Revisited

机译:再谈前移,距离编码和反转频率

获取原文
获取原文并翻译 | 示例
获取外文期刊封面目录资料

摘要

Move-to-Front, Distance Coding and Inversion Frequencies are three somewhat related techniques used to process the output of the Burrows-Wheeler Transform. In this paper we analyze these techniques from the point of view of how effective they are in the task of compressing low-entropy strings, that is, strings which have many regularities and are therefore highly compressible. This is a non-trivial task since many compressors have non-constant overheads that become non-negligible when the input string is highly compressible. Because of the properties of the Burrows-Wheeler transform, being locally optimal ensures an algorithm compresses low-entropy strings effectively. Informally, local optimality implies that an algorithm is able to effectively compress an arbitrary partition of the input string. We show that in their original formulation neither Move-to-Front, nor Distance Coding, nor Inversion Frequencies is locally optimal. Then, we describe simple variants of the above algorithms which are locally optimal. To achieve local optimality with Move-to-Front it suffices to combine it with Run Length Encoding. To achieve local optimality with Distance Coding and Inversion Frequencies we use a novel "escape and re-enter" strategy. Since we build on previous results, our analyses are simple and shed new light on the inner workings of the three techniques considered in this paper.
机译:前移,距离编码和反转频率是三种用于处理Burrows-Wheeler变换输出的技术。在本文中,我们从压缩低熵字符串(即具有许多规则性并因此具有高度可压缩性的字符串)的压缩效率方面出发,分析了这些技术。这是一项不平凡的任务,因为许多压缩器具有非常高的开销,当输入字符串高度可压缩时,这些开销变得不可忽略。由于Burrows-Wheeler变换的特性,局部最优可确保算法有效压缩低熵字符串。非正式地,局部最优意味着算法能够有效地压缩输入字符串的任意分区。我们表明,在其原始公式中,“向前移动”,“距离编码”或“反转频率”都不是局部最优的。然后,我们描述了上述算法的局部最优的简单变体。为了通过“向前移动”实现局部最优,将其与“行程编码”组合就足够了。为了通过距离编码和反演频率实现局部最优,我们使用了一种新颖的“转义并重新输入”策略。由于我们基于先前的结果,因此我们的分析很简单,并且为本文考虑的三种技术的内部工作提供了新的思路。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号