【24h】

A Benchmark Framework for Data Compression Techniques

机译:数据压缩技术基准框架

获取原文

摘要

Lightweight data compression is frequently applied in main memory database systems to improve query performance. The data processed by such systems is highly diverse. Moreover, there is a high number of existing lightweight compression techniques. Therefore, choosing the optimal technique for a given dataset is non-trivial. Existing approaches are based on simple rules, which do not suffice for such a complex decision. In contrast, our vision is a cost-based approach. However, this requires a detailed cost model, which can only be obtained from a systematic benchmarking of many compression algorithms on many different datasets. A naieve benchmark evaluates every algorithm under consideration separately. This yields many redundant steps and is thus inefficient. We propose an efficient and extensible benchmark framework for compression techniques. Given an ensemble of algorithms, it minimizes the overall run time of the evaluation. We experimentally show that our approach outperforms the naive approach.
机译:轻量级数据压缩通常用于主内存数据库系统中,以提高查询性能。这样的系统处理的数据是高度多样化的。此外,存在大量现有的轻量级压缩技术。因此,为给定的数据集选择最佳技术并非易事。现有的方法基于简单的规则,而这些规则不足以满足如此复杂的决定。相反,我们的愿景是基于成本的方法。但是,这需要一个详细的成本模型,该模型只能从许多压缩算法在许多不同数据集上的系统基准测试中获得。天真的基准会分别评估正在考虑的每种算法。这产生许多冗余步骤,因此效率低下。我们提出了一种有效且可扩展的压缩技术基准框架。给定一组算法,可以最大程度地减少评估的总运行时间。我们通过实验证明了我们的方法优于幼稚的方法。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号