首页> 美国卫生研究院文献>Royal Society Open Science >Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit
【2h】

Accelerating adaptive inverse distance weighting interpolation algorithm on a graphics processing unit

机译:在图形处理单元上加速自适应逆距离加权插值算法

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

This paper focuses on designing and implementing parallel adaptive inverse distance weighting (AIDW) interpolation algorithms by using the graphics processing unit (GPU). The AIDW is an improved version of the standard IDW, which can adaptively determine the power parameter according to the data points’ spatial distribution pattern and achieve more accurate predictions than those predicted by IDW. In this paper, we first present two versions of the GPU-accelerated AIDW, i.e. the naive version without profiting from the shared memory and the tiled version taking advantage of the shared memory. We also implement the naive version and the tiled version using two data layouts, structure of arrays and array of aligned structures, on both single and double precision. We then evaluate the performance of parallel AIDW by comparing it with its corresponding serial algorithm on three different machines equipped with the GPUs GT730M, M5000 and K40c. The experimental results indicate that: (i) there is no significant difference in the computational efficiency when different data layouts are employed; (ii) the tiled version is always slightly faster than the naive version; and (iii) on single precision the achieved speed-up can be up to 763 (on the GPU M5000), while on double precision the obtained highest speed-up is 197 (on the GPU K40c). To benefit the community, all source code and testing data related to the presented parallel AIDW algorithm are publicly available.
机译:本文着重于利用图形处理单元(GPU)设计和实现并行自适应逆距离加权(AIDW)插值算法。 AIDW是标准IDW的改进版本,它可以根据数据点的空间分布模式自适应地确定功率参数,并且比IDW预测的功率预测更准确。在本文中,我们首先介绍了GPU加速的AIDW的两个版本,即未从共享内存中获利的朴素版本和利用共享内存的平铺版本。我们还使用单精度和双精度两个数据布局(数组结构和对齐结构数组)来实现朴素版本和平铺版本。然后,我们通过在配备GPU GT730M,M5000和K40c的三台不同机器上将并行AIDW与相应的串行算法进行比较来评估并行AIDW的性能。实验结果表明:(i)当使用不同的数据布局时,计算效率没有显着差异; (ii)平铺版本始终比纯朴版本快一些; (iii)在单精度下,达到的最高加速可以达到763(在GPU M5000上),而在双精度下,可以达到的最高加速达到197(在GPU K40c上)。为了使社区受益,与提出的并行AIDW算法相关的所有源代码和测试数据都是公开可用的。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号