【24h】

A Dynamically Adaptive Approach for Speculative Loop Execution in SMT Architectures

机译:SMT体系结构中的一种动态自适应的推测性循环执行方法

获取原文
获取原文并翻译 | 示例

摘要

Simultaneous multithreading allows the exploitation of thread-level speculation on the same processor. Due to the contention for shared processor resources, the performance of speculative threads often suffers from the potential of inter-thread interference, which is hard to be statically estimated by the compiler. Thus we propose an approach to dynamically determine and extract speculative threads from parallel regions until runtime. It relies on a cycle counter architecture to collect the performance profiles of each parallelized loop and uncover the potential of loop-level parallelism. These performance profiles are obtained from the relative single-threaded execution time prediction for speculative threads using thread execution cycle breakdown. The performance of different loop levels is dynamically evaluated by the prediction and only the best loop level will be chosen to parallelize. Several performance tuning policies are also examined. The best policy can achieve an average speedup of 1.45 using SPEC CPU2000 benchmarks, and it outperforms the static loop selection by 33%.
机译:同时多线程允许在同一处理器上利用线程级推测。由于对共享处理器资源的争夺,推测性线程的性能经常遭受潜在的线程间干扰,这很难由编译器静态估计。因此,我们提出了一种动态确定并从并行区域提取推测线程直到运行时的方法。它依靠循环计数器体系结构来收集每个并行化循环的性能概况,并发现潜在的循环级并行性。这些性能配置文件是从使用线程执行周期细分的推测性线程的相对单线程执行时间预测中获得的。预测会动态评估不同环路级别的性能,并且只会选择最佳环路级别进行并行化。还检查了几种性能调整策略。使用SPEC CPU2000基准测试,最佳策略可以使平均速度提高1.45,并且比静态循环选择高出33%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号