首页> 外文期刊>Concurrency, practice and experience >Evaluating optimizations that reduce globalmemory accesses of stencil computations in GPGPUs
【24h】

Evaluating optimizations that reduce globalmemory accesses of stencil computations in GPGPUs

机译:评估减少GPGPU中的模板计算的GlobalMemory访问的优化

获取原文
获取原文并翻译 | 示例

摘要

This work compares the performance of optimizations that transform replicated global memoryaccesses into localmemoryaccesses on3Dstencil computations in theNVIDIATeslaK80GPGPU.The optimizations reduce global memory contention caused by the set of multiprocessors. Evaluatedoptimizations are grid tiling, inserting spatial and temporal loops into kernels, register reuse,andsomeof their combinations.Astandardized experiment evaluates performance variationwithgrid size and stencil size for each optimization. Experimental data show that codes that use theseoptimizations are up to 3.3 times faster than the classical stencil formulation. It also shows thatthemost profitable optimization varieswith grid and stencil sizes.
机译:这项工作比较了转换复制的全局内存的优化的性能在当时的in3dstencencess in 3dstencencesslak80GPGPU中访问LocalMemoryAccesses。优化减少了由该组多处理器引起的全局内存争用。评估优化是网格平铺,将空间和时间循环插入核,注册重用,他们的组合和组合.Astandardized实验评估了性能变化每个优化的网格尺寸和模板尺寸。实验数据显示使用这些的代码优化比经典模版配方快33.3倍。它也表明了TheMOST有利可图优化变化栅格和模板尺寸。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号