首页> 外文会议>ACM/IEEE conference on Supercomputing >Techniques for speculative run-time parallelization of loops
【24h】

Techniques for speculative run-time parallelization of loops

机译:循环的推测运行时并行化技术

获取原文
获取外文期刊封面目录资料

摘要

This paper presents a set of new run-time tests for speculative parallelization of loops that defy parallelization based on static analysis alone. It presents a novel method for speculative array privatization that is not only more efficient than previous methods when the speculation is correct, but also does not require rolling back the computation in case the variable is found not to be privatizable. We present another method for speculative parallelization which can overcome all loop-carried anti and output dependences, with even lower overhead than previous techniques which could not break such dependences. Again, in order to ameliorate the problem of paying a heavy penalty for speculatively parallelizing loops that turn out to be serial, we present a technique that enables early detection of loop-carried dependences. Our experimental results from a preliminary implementation of these tests on an IBM G30 SMP machine show a significant reduction in the penalty paid for mis-speculation, from roughly 50% to between 2% and 18% of the serial execution time. For parallel loops, we obtain about the same, and often, even better performance relative to the previous methods, making our techniques extremely attractive.
机译:本文针对环路的推测性并行化提出了一组新的运行时测试,这些测试无视仅基于静态分析的并行化。它提出了一种用于推测性数组私有化的新颖方法,该方法不仅在推测正确时比以前的方法更有效,而且在发现变量不可私有化的情况下也不需要回滚计算。我们提出了一种推测性并行化的另一种方法,该方法可以克服所有循环进行的反输出依赖性,并且其开销比无法打破这种依赖性的先前技术还要低。再次,为了缓解因推测性并行化而变成串行的循环而要付出沉重代价的问题,我们提出了一种能够及早发现循环携带的依赖关系的技术。我们在IBM G30 SMP机器上初步实施这些测试的实验结果表明,因错误推测而付出的代价大大降低了,从串行执行时间的大约50%减少到2%至18%。对于并行循环,与以前的方法相比,我们获得的性能大致相同,而且往往甚至更高,这使我们的技术极具吸引力。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号