...
首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Compiler techniques for the distribution of data and computation
【24h】

Compiler techniques for the distribution of data and computation

机译:用于数据分配和计算的编译器技术

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper presents a new method that can be applied by a parallelizing compiler to find, without user intervention, the iteration and data decompositions that minimize communication and load imbalance overheads in parallel programs targeted at NUMA architectures. One of the key ingredients in our approach is the representation of locality as a locality-communication graph (ICG) and the formulation of the compiler technique as a mixed integer nonlinear programming (MINLP) optimization problem on this graph. The objective function and constraints of the optimization problem model communication costs and load imbalance. The solution to this optimization problem is a decomposition that minimizes the parallel execution overhead. This paper summarizes the process of how the compiler extracts the locality information from a nonannotated code and focuses on how this compiler can derive the optimization problem, solve it, and generate the parallel code with the automatically selected iteration and data distributions. In addition, we include a discussion about our model and the solutions - the decompositions - that it provides. The approach presented in the paper is evaluated using several benchmarks. The experimental results demonstrate that the MINLP formulation does not increase compilation time significantly and that our framework generates very efficient iteration/data distributions for a variety of NUMA machines.
机译:本文提出了一种新方法,可以由并行化编译器应用,该方法无需用户干预即可查找迭代和数据分解,从而最大程度地减少针对NUMA架构的并行程序中的通信和负载不平衡开销。我们方法中的关键要素之一是将局部性表示为局部性通信图(ICG),并将编译器技术表达为该图上的混合整数非线性规划(MINLP)优化问题。优化问题的目标函数和约束条件可模型化通信成本和负载不平衡。该优化问题的解决方案是最大程度地减少并行执行开销的分解。本文总结了编译器如何从非注释代码中提取位置信息的过程,并着重于该编译器如何得出优化问题,解决该问题以及使用自动选择的迭代和数据分布生成并行代码的过程。此外,我们还讨论了我们的模型及其提供的解决方案-分解。本文中提出的方法是使用多个基准进行评估的。实验结果表明,MINLP公式不会显着增加编译时间,并且我们的框架为各种NUMA机器生成了非常有效的迭代/数据分布。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号