...
首页> 外文期刊>Computers & geosciences >Accelerating universal Kriging interpolation algorithm using CUDA-enabled GPU
【24h】

Accelerating universal Kriging interpolation algorithm using CUDA-enabled GPU

机译:使用支持CUDA的GPU加速通用克里格插值算法

获取原文
获取原文并翻译 | 示例
           

摘要

Kriging algorithms are a group of important interpolation methods, which are very useful in many geological applications. However, the algorithm based on traditional general purpose processors can be computationally expensive, especially when the problem scale expands. Inspired by the current trend in graphics processing technology, we proposed an efficient parallel scheme to accelerate the universal Kriging algorithm on the NVIDIA CUDA platform. Some high-performance mathematical functions have been introduced to calculate the compute-intensive steps in the Kriging algorithm, such as matrix-vector multiplication and matrix-matrix multiplication. To further optimize performance, we reduced the memory transfer overhead by reconstructing the time-consuming loops, specifically for the execution on GPU. In the numerical experiment, we compared the performances among different multi-core CPU and GPU implementations to interpolate a geological site. The improved CUDA implementation shows a nearly 18 x speedup with respect to the sequential program and is 6.32 times faster compared to the OpenMP-based version running on Intel Xeon E5320 quad-cores CPU and scales well with the size of the system.
机译:克里金算法是一组重要的插值方法,在许多地质应用中非常有用。但是,基于传统通用处理器的算法在计算上可能会非常昂贵,尤其是在问题规模扩大时。受到当前图形处理技术趋势的启发,我们提出了一种有效的并行方案来加速NVIDIA CUDA平台上的通用Kriging算法。引入了一些高性能的数学函数来计算Kriging算法中的计算密集型步骤,例如矩阵矢量乘法和矩阵矩阵乘法。为了进一步优化性能,我们通过重建耗时的循环(特别是在GPU上执行)来减少内存传输开销。在数值实验中,我们比较了在不同的多核CPU和GPU实施方案之间进行插值的性能。改进的CUDA实施相对于顺序程序而言显示了近18倍的加速,并且比在Intel Xeon E5320四核CPU上运行的基于OpenMP的版本快6.32倍,并且可以很好地随系统大小扩展。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号