首页> 外文期刊>IEICE Transactions on Information and Systems >Lossless Compression of Double-Precision Floating-Point Data for Numerical Simulations: Highly Parallelizable Algorithms for GPU Computing
【24h】

Lossless Compression of Double-Precision Floating-Point Data for Numerical Simulations: Highly Parallelizable Algorithms for GPU Computing

机译:用于数值模拟的双精度浮点数据的无损压缩:GPU计算的高度并行化算法

获取原文
获取原文并翻译 | 示例
       

摘要

In numerical simulations using massively parallel computers like GPGPU (General-Purpose computing on Graphics Processing Units), we often need to transfer computational results from external devices such as GPUs to the main memory or secondary storage of the host machine. Since size of the computation results is sometimes unacceptably large to hold them, it is desired that the data is compressed and stored. In addition, considering overheads for transferring data between the devices and host memories, it is preferable that the data is compressed in a part of parallel computation performed on the devices. Traditional compression methods for floating-point numbers do not always show good parallelism. In this paper, we propose a new compression method for massively-parallel simulations running on GPUs, in which we combine a few successive floating-point numbers and interleave them to improve compression efficiency. We also present numerical examples of compression ratio and throughput obtained from experimental implementations of the proposed method runnig on CPUs and GPUs.
机译:在使用大型并行计算机(例如GPGPU)(在图形处理单元上进行通用计算)的数值模拟中,我们经常需要将计算结果从外部设备(例如GPU)传输到主机的主存储器或辅助存储器。由于计算结果的大小有时难以容纳而无法接受,因此希望对数据进行压缩和存储。另外,考虑到在设备和主机存储器之间传输数据的开销,优选地,在对设备执行的并行计算的一部分中压缩数据。传统的浮点数压缩方法并不总是显示出良好的并行性。在本文中,我们为在GPU上运行的大规模并行仿真提出了一种新的压缩方法,该方法将几个连续的浮点数合并并交织以提高压缩效率。我们还提供了压缩比和吞吐量的数值示例,这些示例是通过在CPU和GPU上运行建议的方法runnig的实验实现获得的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号