
Reducing the Delay for Decoding Instructions by Predicting Their Source Register Operands



Abstract

In the pipelined execution of a processor, fetched instructions may have data dependencies on in-flight instructions, and to guarantee program correctness the processor must not execute an incoming instruction until those dependencies are resolved. Register and memory dependencies are detected in the decode and memory stages, respectively. In a small embedded processor that supports as many ISAs as possible to reduce code size, decoding an instruction to identify its register usage and then checking for dependences generally incurs a long delay, and it sometimes forms the critical path of the implementation. To reduce this delay, this paper proposes two methods. The first method assumes that the source register operands occupy the most widely used bit-fields, without fully decoding the instruction. However, an incorrect prediction under this assumption causes additional stalls and therefore degrades performance. To solve this problem, the second method uses a table to store the dependence history and later uses this information to predict dependences more precisely. We applied our methods to the commercial EISC embedded processor, implemented in the Samsung 65nm process. As a result, we reduced the critical path delay, increased the maximum operating frequency by 12.5%, and achieved an average speed-up of 11.4% in the execution time of the EEMBC applications. We also improved static power consumption, dynamic power consumption, and the energy-delay product (EDP) by 7.2%, 8.5%, and 13.6%, respectively, despite an implementation area overhead of 2.5%.
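The two methods in the abstract can be illustrated with a toy software model. The sketch below is not the paper's hardware design: the 16-bit encoding, the bit-field positions, the 64-entry table, the one-bit history, and all names (`guess_sources`, `DependencePredictor`, `false_alarm`) are assumptions made up for illustration. It only shows the general idea of guessing source registers from fixed bit-fields and using a per-address history table to suppress stalls that earlier proved unnecessary.

```python
from __future__ import annotations

from dataclasses import dataclass, field

# Assumed encoding (hypothetical): 16 registers, with source operands
# usually located in bits [7:4] and [3:0] of a 16-bit instruction word.
REG_MASK = 0xF
SRC_FIELDS = (4, 0)


def guess_sources(raw: int) -> set[int]:
    """Read register numbers from the assumed bit-fields without decoding the opcode."""
    return {(raw >> shift) & REG_MASK for shift in SRC_FIELDS}


@dataclass
class DependencePredictor:
    """Toy history table: one bit per entry, indexed by instruction address."""

    entries: int = 64  # assumed table size
    false_alarm: list[bool] = field(init=False)

    def __post_init__(self) -> None:
        self.false_alarm = [False] * self.entries

    def _index(self, pc: int) -> int:
        return (pc >> 1) % self.entries  # assumes 2-byte instructions

    def predict_stall(self, pc: int, raw: int, busy: set[int]) -> bool:
        """Stall if a guessed source register is busy, unless history marks this address as a false alarm."""
        field_hit = bool(guess_sources(raw) & busy)
        return field_hit and not self.false_alarm[self._index(pc)]

    def update(self, pc: int, raw: int, busy: set[int], true_sources: set[int]) -> None:
        """Record whether the bit-field guess signalled a dependence that the full decode did not confirm."""
        field_hit = bool(guess_sources(raw) & busy)
        really_dependent = bool(true_sources & busy)
        self.false_alarm[self._index(pc)] = field_hit and not really_dependent


predictor = DependencePredictor()
raw = 0x1234   # the assumed bit-fields yield r3 and r4
busy = {3}     # r3 is the destination of an in-flight instruction

first = predictor.predict_stall(0x100, raw, busy)
# Suppose the full decode later shows this instruction reads no registers.
predictor.update(0x100, raw, busy, true_sources=set())
second = predictor.predict_stall(0x100, raw, busy)
```

In this model the first visit to address `0x100` stalls because the guessed field matches a busy register, and the second visit does not, because the table recorded the earlier false alarm. The model omits what a real design would need when a suppressed stall turns out to be required; how the paper handles that case is not stated in the abstract.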
