首页> 外文期刊>ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages >Data transformations enabling loop vectorization on multithreaded data parallel architectures
【24h】

Data transformations enabling loop vectorization on multithreaded data parallel architectures

机译:数据转换可在多线程数据并行体系结构上实现循环矢量化

获取原文
获取原文并翻译 | 示例
           

摘要

Loop vectorization, a key feature exploited to obtain high performance on Single Instruction Multiple Data (SIMD) vector architectures, is significantly hindered by irregular memory access patterns in the data stream. This paper describes data transformations that allow us to vectorize loops targeting massively multithreaded data parallel architectures. We present a mathematical model that captures loop-based memory access patterns and computes the most appropriate data transformations in order to enable vectorization. Our experimental results show that the proposed data transformations can significantly increase the number of loops that can be vectorized and enhance the data-level parallelism of applications. Our results also show that the overhead associated with our data transformations can be easily amortized as the size of the input data set increases. For the set of high performance benchmark kernels studied, we achieve consistent and significant performance improve-ments (up to 11.4X) by applying vectorization using our data trans-formation approach.
机译:循环矢量化是为在单指令多数据(SIMD)矢量架构上获得高性能而开发的一项关键功能,由于数据流中不规则的内存访问模式而受到严重阻碍。本文介绍了数据转换,这些数据转换使我们可以向量化针对大规模多线程数据并行体系结构的循环。我们提出了一个数学模型,该模型捕获基于循环的内存访问模式并计算最合适的数据转换,以实现矢量化。我们的实验结果表明,提出的数据转换可以显着增加可矢量化的循环数,并增强应用程序的数据级并行性。我们的结果还表明,随着输入数据集大小的增加,可以轻松地摊销与数据转换相关的开销。对于所研究的一组高性能基准内核,我们通过使用数据转换方法应用矢量化来实现一致且显着的性能改进(最高11.4倍)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号