首页> 外文会议> >Using inverted files to compress text
【24h】

Using inverted files to compress text

机译:使用倒排文件压缩文本

获取原文

摘要

This is the first report on a new approach to text compression. It consists of representing the text file with compressed inverted file index in conjunction with very compact lexicon, where lexicon includes every word in the text. The index is compressed using standard index compression techniques, and lexicon is compressed with original dictionary compression method that gives better compression results than existing procedures. Compression procedure is complex, but decompression time is linear with the file size, although it requires two passes and hence can not be performed online. First experiments show that this method, when refined, can be competitive for larger texts that only need to be decompressed in the real time.
机译:这是有关文本压缩新方法的第一份报告。它由压缩的倒排文件索引与非常紧凑的词典一起表示文本文件组成,其中词典包含文本中的每个单词。使用标准索引压缩技术压缩索引,并使用原始字典压缩方法压缩词典,与现有过程相比,原始字典压缩方法可提供更好的压缩结果。压缩过程很复杂,但是解压缩时间与文件大小成线性关系,尽管它需要两次通过,因此无法在线执行。最初的实验表明,这种方法经过改进后,可以与只需要实时解压缩的较大文本竞争。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号