...
首页> 外文期刊>Computer architecture news >EOLE: Paving the Way for an Effective Implementation of Value Prediction
【24h】

EOLE: Paving the Way for an Effective Implementation of Value Prediction

机译:EOLE:为有效实施价值预测铺平道路

获取原文
获取原文并翻译 | 示例

摘要

Even in the multicore era, there is a continuous demand to increase the performance of single-threaded applications. However, the conventional path of increasing both issue width and instruction window size inevitably leads to the power wall. Value prediction (VP) was proposed in the mid 90's as an alternative path to further enhance the performance of wide-issue superscalar processors. Still, it was considered up to recently that a performance-effective implementation of Value Prediction would add tremendous complexity and power consumption in almost every stage of the pipeline. Nonetheless, recent work in the field of VP has shown that given an efficient confidence estimation mechanism, prediction validation could be removed from the out-of-order engine and delayed until commit time. As a result, recovering from mispredictions via selective replay can be avoided and a much simpler mechanism - pipeline squashing - can be used, while the out-of-order engine remains mostly unmodified. Yet, VP and validation at commit time entails strong constraints on the Physical Register File. Write ports are needed to write predicted results and read ports are needed in order to validate them at commit time, potentially rendering the overall number of ports unbearable. Fortunately, VP also implies that many single-cycle ALU instructions have their operands predicted in the front-end and can be executed in-place, in-order. Similarly, the execution of single-cycle instructions whose result has been predicted can be delayed until commit time since predictions are validated at commit time. Consequently, a significant number of instructions - 10% to 60% in our experiments - can bypass the out-of-order engine, allowing the reduction of the issue width, which is a major contributor to both out-of-order engine complexity and register file port requirement. This reduction paves the way for a truly practical implementation of Value Prediction. Furthermore, since Value Prediction in itself usually increases performance, our resulting {Early Out-of-Order Late} Execution architecture, EOLE, is often more efficient than a baseline VP-augmented 6-issue superscalar while having a significantly narrower 4-issue out-of-order engine.
机译:即使在多核时代,仍然存在不断提高单线程应用程序性能的需求。然而,增加发行宽度和指示窗口大小的常规途径不可避免地导致了权力壁垒。值预测(VP)是在90年代中期提出的,它是进一步提高宽问题超标量处理器性能的替代途径。尽管如此,直到最近才考虑到,对价值预测进行性能有效的实施会在管道的几乎每个阶段中增加巨大的复杂性和功耗。但是,VP领域中的最新工作表明,只要有一个有效的置信度估计机制,就可以从乱序引擎中删除预测验证,并推迟到提交时间。结果,可以避免通过选择性重播而从错误预测中恢复,并且可以使用更简单的机制-管线压缩-,而乱序引擎基本上保持不变。但是,提交时的VP和验证对物理寄存器文件提出了严格的约束。需要写入端口来写入预测结果,并且需要读取端口才能在提交时对其进行验证,从而可能导致端口总数无法承受。幸运的是,VP还暗示着许多单周期ALU指令的操作数在前端进行了预测,并且可以按顺序执行。同样,由于预测已在提交时进行验证,因此可以将其结果已被预测的单周期指令的执行延迟到提交时间为止。因此,大量指令(在我们的实验中为10%到60%)可以绕过乱序引擎,从而减小问题宽度,这是乱序引擎复杂性和注册文件端口要求。这种减少为真正实现价值预测铺平了道路。此外,由于价值预测本身通常可以提高性能,因此我们得到的{Early Out-of-Order Late} Execution体系结构EOLE通常比基线VP增强的6问题超标量更有效率,而4问题的缩小幅度却更大订单引擎。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号