首页> 外文会议>IEEE International Conference on Big Data >Error-Controlled Lossy Compression Optimized for High Compression Ratios of Scientific Datasets
【24h】

Error-Controlled Lossy Compression Optimized for High Compression Ratios of Scientific Datasets

机译:针对科学数据集的高压缩比而优化的误差控制有损压缩

获取原文

摘要

Today's scientific simulations require a significant reduction of the data size because of extremely large volumes of data they produce and the limitation of storage bandwidth and space. If the compression is set to reach a high compression ratio, however, the reconstructed data are often distorted too much to tolerate. In this paper, we explore a new compression strategy that can effectively control the data distortion when significantly reducing the data size. The contribution is threefold. (1) We propose an adaptive compression framework to select either our improved Lorenzo prediction method or our optimized linear regression method dynamically in different regions of the dataset. (2) We explore how to select them accurately based on the data features in each block to obtain the best compression quality. (3) We analyze the effectiveness of our solution in details using four real-world scientific datasets with 100+ fields. Evaluation results confirm that our new adaptive solution can significantly improve the rate distortion for the lossy compression with fairly high compression ratios. The compression ratio of our compressor is 1.5X~8X as high as that of two other leading lossy compressors (SZ and ZFP) with the same peak single-to-noise ratio (PSNR), in the high-compression cases. Parallel experiments with 8,192 cores and 24 TB of data shows that our solution obtains 1.86X dumping performance and 1.95X loading performance compared with the second-best lossy compressor, respectively.
机译:当今的科学模拟需要显着减小数据大小,因为它们产生的数据量非常大,并且存储带宽和空间受到限制。但是,如果将压缩率设置为达到较高的压缩率,则重构数据通常会失真得太大,无法容忍。在本文中,我们探索了一种新的压缩策略,该策略可以在显着减小数据大小时有效地控制数据失真。贡献是三倍。 (1)我们提出了一种自适应压缩框架,以在数据集的不同区域动态选择我们改进的Lorenzo预测方法或优化的线性回归方法。 (2)我们探索如何根据每个块中的数据特征准确地选择它们,以获得最佳压缩质量。 (3)我们使用四个具有100多个字段的真实科学数据集来详细分析解决方案的有效性。评估结果证实,我们的新的自适应解决方案可以以相当高的压缩比显着改善有损压缩的速率失真。在高压缩情况下,我们的压缩机的压缩率是其他两个具有相同峰值单噪声比(PSNR)的领先有损压缩机(SZ和ZFP)的压缩率的1.5X〜8X。与8,192个内核和24 TB数据的并行实验表明,与第二好的有损压缩器相比,我们的解决方案分别获得了1.86倍的转储性能和1.95倍的加载性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号