首页> 外文会议>2012 19th International Conference on High Performance Computing >Automatic efficient data layout for multithreaded stencil codes on CPU sand GPUs
【24h】

Automatic efficient data layout for multithreaded stencil codes on CPU sand GPUs

机译:自动高效的数据布局,用于CPU沙GPU上的多线程模板代码

获取原文
获取原文并翻译 | 示例

摘要

Stencil based computation on structured grids is a kernel at the heart of a large number of scientific applications. The variety of stencil kernels used in practice make this computation pattern difficult to assemble into a high performance computing library. With the multiplication of cores on a single chip, answering architectural alignment requirements became an even more important key to high performance. Along with vector accesses, data layout optimization must also consider concurrent parallel accesses. In this paper, we develop a strategy to automatically generate stencil codes for multicore vector architectures, searching for the best data layout possible to answer architectural alignment problems. We introduce a new method for aligning multidimensional data structures, called multipadding, that can be adapted to specificities of multicores and GPUs architectures. We present multiple methods with different level of complexity. We show on different stencil patterns that generated codes with multipadding display better performance than existing optimizations.
机译:在结构化网格上基于模板的计算是许多科学应用程序的核心。实践中使用的各种模版内核使此计算模式难以组装到高性能计算库中。随着单个芯片上内核数量的增加,满足架构对齐要求成为高性能的一个更为重要的关键。除了矢量访问,数据布局优化还必须考虑并发并行访问。在本文中,我们开发了一种策略,该策略可自动为多核向量架构生成模板代码,并寻找可能的最佳数据布局来回答架构对齐问题。我们介绍了一种用于对齐多维数据结构的新方法,称为多填充,可以适应多核和GPU架构的特殊性。我们提出了具有不同复杂程度的多种方法。我们在不同的模板图案上显示,使用多填充生成的代码显示的性能要优于现有的优化。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号