首页> 外文会议>Compiler construction >Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures
【24h】

Data Layout Transformation for Stencil Computations on Short-Vector SIMD Architectures

机译:短向量SIMD架构上模板计算的数据布局转换

获取原文
获取原文并翻译 | 示例

摘要

Stencil computations are at the core of applications in many domains such as computational electromagnetics, image processing, and partial differential equation solvers used in a variety of scientific and engineering applications. Short-vector SIMD instruction sets such as SSE and VMX provide a promising and widely available avenue for enhancing performance on modern processors. However a fundamental memory stream alignment issue limits achieved performance with stencil computations on modern short SIMD architectures. In this paper, we propose a novel data layout transformation that avoids the stream alignment conflict, along with a static analysis technique for determining where this transformation is applicable. Significant performance increases are demonstrated for a variety of stencil codes on three modern SIMD-capable processors.
机译:模具计算是许多领域中应用程序的核心,例如电磁计算,图像处理和偏微分方程求解器等在各种科学和工程应用中使用。短向量SIMD指令集(例如SSE和VMX)为增强现代处理器的性能提供了一种有希望且广泛可用的途径。但是,基本的内存流对齐问题限制了在现代短SIMD架构上使用模板计算实现的性能。在本文中,我们提出了一种新颖的数据布局转换方法,该方法避免了流对齐冲突,并提供了一种静态分析技术来确定该转换方法在何处适用。在三个具有现代SIMD功能的处理器上,各种模板代码的性能得到了显着提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号