【24h】

A High Performance Compression Method for Climate Data

机译:气候数据的高性能压缩方法

获取原文

摘要

Climate modeling data are usually multidimen-sional arrays of floating-point numbers. These arrays typically have two or three spatial dimensions and one temporal dimension, describing the evolvement of climate variables in a time span. With the advances of high performance computing, the volume of climate data is expanding exponentially, bringing tough challenges for climate data archiving and sharing. In this paper, we propose a lossless compression algorithm for the time-spatial climate floating-point arrays. Our compression algorithm can eliminate more data redundancy efficiently through adaptive prediction, XOR-differencing, and multi-way compression. In addition, static regions, which are very common in climate data, can be identified and compressed more efficiently. Moreover, to utilize the multi-cores on modern computers, we proposed a method to parallelize our compression algorithm. Evaluations demonstrate that single thread version of our compression method can achieve the best balance in compression ratios, deflating throughputs and inflating throughputs. And the parallel version can achieve 800 MB/s deflating throughputs and over 2600 MB/s inflating throughputs on a 16-core server.
机译:气候建模数据通常是浮点数的多媒体阵列。这些阵列通常具有两个或三个空间尺寸和一个时间尺寸,描述了在时间跨度中的气候变量的演变。随着高性能计算的进步,气候数据的体积是指数级展示的,为气候数据归档和共享带来了艰难的挑战。在本文中,我们提出了一种用于时间空间气候浮点阵列的无损压缩算法。我们的压缩算法可以通过自适应预测,XOR差异和多路压缩有效地消除更多数据冗余。另外,可以更有效地识别和压缩在气候数据中非常常见的静态区域。此外,要利用现代计算机上的多核,我们提出了一种并行化压缩算法的方法。评估表明,我们的压缩方法的单线程版本可以实现压缩比的最佳平衡,缩小吞吐量和充气吞吐量。并行版本可以在16核服务器上实现800 MB / s的缩小吞吐量,超过2600 MB / s膨胀吞吐量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号