首页> 外文会议>Advances in Information Systems >Database Compression Using an Offline Dictionary Method
【24h】

Database Compression Using an Offline Dictionary Method

机译:使用脱机字典方法进行数据库压缩

获取原文

摘要

Off-line dictionary compression is becoming more attractive for applications where compressed data are searched directly in compressed form. While there has been large body of related work describing specific database compression algorithms, the Hibase architecture is unique in processing queries in compressed data. However, this technique does not compress the representation of strings in the domain dictionaries. Primary keys, data with high cardinality and semi-structured data contribute very little or no compression. To achieve high performance irrespective of type of data, the string representation must be in compressed form. At the same time, the direct addressability of compressed data is maintained. Serial compression techniques cannot be used. In this paper, we present a prefix dictionary-based off-line method that can be incorporated with systems like Hibase where compressed data can be accessed directly without prior decompression. The complexity is O(n) in time and space.
机译:对于直接以压缩形式搜索压缩数据的应用程序,脱机词典压缩正变得越来越有吸引力。尽管有大量相关工作描述了特定的数据库压缩算法,但Hibase体系结构在处理压缩数据中的查询方面是独一无二的。但是,此技术不会压缩域词典中字符串的表示。主键,具有高基数的数据和半结构化数据的压缩很少或没有压缩。为了获得高性能,而与数据类型无关,字符串表示形式必须为压缩形式。同时,保持了压缩数据的直接寻址能力。不能使用串行压缩技术。在本文中,我们提出了一种基于前缀字典的离线方法,该方法可以与Hibase等系统结合使用,该系统可以直接访问压缩数据,而无需事先进行解压缩。复杂度在时间和空间上为O(n)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号