
A quasi word-based compression method of English text using byte-oriented coding scheme



Abstract

In this paper we present ERecode, a universal compression algorithm for English text. The proposed scheme highlights the importance of preprocessing for English text: it uses one- or two-byte code values to recode the 511 most commonly used English words, symbol sequences, and ASCII codes according to their occurrence frequency. Used as a preprocessing step ahead of popular compression utilities, ERecode improves their compression ratio by between 0.89% and 19.65%. The proposed method is also applicable to text files in other languages.
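To make the idea of byte-oriented word recoding concrete, here is a minimal Python sketch. It is not the ERecode algorithm itself: the abstract does not give the code layout, so everything here is an assumption chosen for illustration. That includes the function names (`build_codebook`, `encode`, `decode`), the use of byte values 0x80 to 0xFE as one-byte codes, the use of 0xFF as an escape prefix for two-byte codes, and the resulting table size (383 entries rather than 511). The sketch also assumes pure ASCII input, so that byte values of 0x80 and above are free to serve as codes.

```python
import re
import zlib
from collections import Counter

ONE_BYTE_SLOTS = 127   # assumed layout: codes 0x80..0xFE
TWO_BYTE_SLOTS = 256   # assumed layout: 0xFF followed by an index byte


def build_codebook(sample: str) -> dict[str, bytes]:
    """Rank words in `sample` by frequency and give the most frequent short codes."""
    words = re.findall(r"[A-Za-z]+", sample)
    # Skip words of one or two letters: recoding them saves nothing.
    ranked = [w for w, _ in Counter(words).most_common() if len(w) > 2]
    codebook: dict[str, bytes] = {}
    for rank, word in enumerate(ranked[: ONE_BYTE_SLOTS + TWO_BYTE_SLOTS]):
        if rank < ONE_BYTE_SLOTS:
            codebook[word] = bytes([0x80 + rank])
        else:
            codebook[word] = bytes([0xFF, rank - ONE_BYTE_SLOTS])
    return codebook


def encode(text: str, codebook: dict[str, bytes]) -> bytes:
    """Replace codebook words by their codes; copy everything else as ASCII."""
    out = bytearray()
    # Split into alternating runs of letters and non-letters.
    for token in re.findall(r"[A-Za-z]+|[^A-Za-z]+", text):
        code = codebook.get(token)
        out += code if code is not None else token.encode("ascii")
    return bytes(out)


def decode(data: bytes, codebook: dict[str, bytes]) -> str:
    """Invert `encode` using the same codebook."""
    inverse = {code: word for word, code in codebook.items()}
    out = []
    i = 0
    while i < len(data):
        b = data[i]
        if b < 0x80:            # plain ASCII literal
            out.append(chr(b))
            i += 1
        elif b == 0xFF:         # two-byte code
            out.append(inverse[data[i:i + 2]])
            i += 2
        else:                   # one-byte code
            out.append(inverse[data[i:i + 1]])
            i += 1
    return "".join(out)


if __name__ == "__main__":
    text = "the quick brown fox jumps over the lazy dog and the cat " * 20
    book = build_codebook(text)
    recoded = encode(text, book)
    assert decode(recoded, book) == text
    # Use the recoded bytes as input to a general-purpose compressor.
    print(len(zlib.compress(text.encode("ascii"))), len(zlib.compress(recoded)))
```

The final lines show the preprocessing role described in the abstract: the recoded byte stream, not the raw text, is handed to a general-purpose compressor (here `zlib`). Whether this helps, and by how much, depends on the text and the compressor, and the sketch makes no claim about reproducing the reported figures. A real codebook would be fixed in advance from a large corpus rather than built from the file being compressed.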
