首页> 外文会议>Symposium on Application Accelerators in High Performance Computing >Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method
【24h】

Automatically Optimized GPU Acceleration of Element Subroutines in Finite Element Method

机译:有限元方法自动优化元素子程序的GPU加速度

获取原文

摘要

The element subroutines in finite element method (FEM) provides enough parallelism to be successfully accelerated by contemporary GPUs. However, their efficient implementation is not straightforward and requires time-consuming exploration of numerous implementation variants. In this paper, we present kernel fusion as an optimization technique and its application for element subroutines. Moreover, we show how the kernel fusion is automated using our source-to-source compiler. We demonstrate the optimization of the element subroutines for FEM model using St., Venant-Kirchhoff material. The performance of code generated by our compiler outperforms our previously published hand-tuned implementation by factor of 1.32 -- 1.54 depending on used GPU architecture. Although the optimization techniques are demonstrated on element subroutines for using St., Venant-Kirchhoff material, they are generally usable for wider area of computationally-demanding problems.
机译:有限元方法(FEM)中的元素子程序提供了足够的并行性,以便通过当代GPU成功加速。然而,它们的有效实施并不简单,并且需要耗时对许多实施变体的探索。在本文中,我们将内核融合作为优化技术及其对元素子程序的应用。此外,我们展示了内核融合是如何使用我们的源代码编译器自动进行自动化的。我们展示了使用St.,Venant-Kirchhoff材料的FEM模型的元素子程序优化。由我们的编译器生成的代码的性能优于我们之前发布的手续执行,这取决于使用的GPU架构的因子为1.32 - 1.54。尽管在使用St.,Venant-Kirchhoff材料的元件子程序上对优化技术进行了说明,但它们通常可以用于更广泛的计算苛刻问题的领域。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号