【24h】

The performance potential of data dependence speculation amp; collapsing

机译:数据依赖推测和崩溃的性能潜力

获取原文

摘要

Two hardware methods for remedying the effects of true data dependences are studied. The first method dependence speculation, is used to eliminate address generation-load dependences. This is enabled by address prediction that permits load instructions to proceed speculatively without waiting for their address operands. The second technique, dependence collapsing, is used to eliminate data dependences by combining a dependence among multiple instructions into one instruction. The potential of these techniques for improving processor performance is demonstrated via trace-driven simulation. When both techniques are used with maximum issue widths of 4, 8, 16, and 32, the overall speedups in comparison to a base instruction level parallel machine are 1.20, 1.35, 1.51, and 1.66, respectively. In general, dependence collapsing contributes the majority of the improvement in performance. Under the dependence collapsing model, 298 to 478 of the total number of instructions in a trace may be collapsed. Thedistance separating the collapsed instructions is nearly always less than 8. Our experimentation also suggests that further performance improvements can be achieved by incorporating mechanisms that increase the address prediction rate.
机译:研究了两种纠正真实数据依赖影响的硬件方法。第一种方法依赖性推测,用于消除地址生成-负载依赖性。这是通过地址预测启用的,该地址预测允许装入指令以推测方式进行而无需等待其地址操作数。第二种技术,依赖性崩溃,用于通过将多个指令之间的依赖性组合为一个指令来消除数据依赖性。通过跟踪驱动的仿真证明了这些技术在提高处理器性能方面的潜力。当两种技术的最大发布宽度分别为4、8、16和32时,与基本指令级并行机相比,总体加速比分别为1.20、1.35、1.51和1.66。通常,依赖关系崩溃是性能提升的主要部分。在依赖崩溃模型下,迹线中指令总数的298至478可能崩溃。折叠的指令之间的距离几乎总是小于8。我们的实验还表明,通过合并增加地址预测率的机制,可以进一步提高性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号