...
首页> 外文期刊>Information technology and libraries >The Efficient Storage of Text Documents in Digital Libraries
【24h】

The Efficient Storage of Text Documents in Digital Libraries

机译:数字图书馆中文本文档的有效存储

获取原文
获取原文并翻译 | 示例
           

摘要

In this paper we investigate the possibility of improving the efficiency of data compression, and thus reducing storage requirements, for seven widely used text document formats. We propose an open-source text compression software library, featuring an advanced word-substitution scheme with static and semidynamic word dictionaries. The empirical results show an average storage space reduction as high as 78 percent compared to uncompressed documents, and as high as 30 percent compared to documents compressed with the free compression software gzip.
机译:在本文中,我们研究了七种广泛使用的文本文档格式提高数据压缩效率,从而降低存储需求的可能性。我们提出了一个开放源代码的文本压缩软件库,该库具有先进的单词替换方案以及静态和半动态单词词典。实验结果表明,与未压缩的文档相比,平均存储空间减少多达78%,与使用免费压缩软件gzip压缩的文档相比,减少了30%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号