【24h】

Efficient and Compact Representations of Some Non-canonical Prefix-Free Codes

机译:某些非规范无前缀代码的高效紧凑表示

获取原文

摘要

For many kinds of prefix-free codes there are efficient and compact alternatives to the traditional tree-based representation. Since these put the codes into canonical form, however, they can only be used when we can choose the order in which codewords are assigned to characters. In this paper we first show how, given a probability distribution over an alphabet of σ characters, we can store a nearly optimal alphabetic prefix-free code in o(σ) bits such that we can encode and decode any character in constant time. We then consider a kind of code introduced recently to reduce the space usage of wavelet matrices (Claude, Navarro, and Ordonez, Information Systems, 2015). They showed how to build an optimal prefix-free code such that the codewords' lengths are non-decreasing when they are arranged such that their reverses are in lexicographic order. We show how to store such a code in O(σ log L + 2~(∈L)) bits, where L is the maximum codeword length and e is any positive constant, such that we can encode and decode any character in constant time under reasonable assumptions. Otherwise, we can always encode and decode a codeword of £ bits in time O(ℓ) using O(σ log L) bits of space.
机译:对于许多类型的无前缀代码,有传统的基于树的表示形式的有效且紧凑的替代方法。但是,由于这些将代码置于规范形式,因此只有在我们可以选择将代码字分配给字符的顺序时,才可以使用它们。在本文中,我们首先展示在给定一个σ字符字母表上的概率分布的情况下,我们如何在o(σ)位中存储几乎最佳的无字母前缀代码,以便我们可以在恒定时间内对任何字符进行编码和解码。然后,我们考虑一种最近引入的减少小波矩阵空间使用的代码(Claude,Navarro和Ordonez,信息系统,2015年)。他们展示了如何构建最佳的无前缀代码,以使码字的排列顺序使得它们的倒序按字典顺序排列时,其长度不减小。我们展示了如何在O(σlog L + 2〜(∈L))位中存储这样的代码,其中L是最大码字长度,e是任何正常数,这样我们就可以在恒定时间内对任何字符进行编码和解码在合理的假设下。否则,我们总是可以使用空间的O(σlog L)位在时间O(ℓ)中对£位的码字进行编码和解码。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号