首页> 外文会议>Design, Automation Test in Europe Conference Exhibition >A Mechanism for energy-efficient reuse of decoding and scheduling of x86 instruction streams
【24h】

A Mechanism for energy-efficient reuse of decoding and scheduling of x86 instruction streams

机译:一种高效利用x86指令流的解码和调度的机制

获取原文

摘要

Current superscalar x86 processors decompose each CISC instruction (variable-length and with multiple addressing modes) into multiple RISC-like pops at runtime so they can be pipelined and scheduled for concurrent execution. This challenging and power-hungry process, however, is usually repeated several times on the same instruction sequence, inefficiently producing the very same decoded and scheduled pops. Therefore, we propose a transparent mechanism to save the decoding and scheduling transformation for later reuse, so that next time the same instruction sequence is found it can automatically bypass the costly pipeline stages involved. We use a coarse-grained reconfigurable array as a means to save this transformation, since its structure enables the recovery of pops already allocated in time and space, and also larger ILP exploitation than superscalar processors. The technique can reduce the energy consumption of a powerful 8-issue superscalar by 31.4% at low area costs, while also improving performance by 32.6%.
机译:当前的超标量x86处理器在运行时将每个CISC指令(可变长度并具有多种寻址模式)分解为多个类似RISC的pop,因此可以对它们进行流水线处理并安排并行执行。但是,通常在同一指令序列上重复执行此具有挑战性和耗能的过程,但效率很低,无法有效地产生完全相同的解码和调度弹出消息。因此,我们提出了一种透明的机制来保存解码和调度转换,以供以后重用,以便下次找到相同的指令序列时,它可以自动绕开所涉及的昂贵流水线阶段。我们使用粗粒度可重配置数组作为保存此转换的一种方法,因为它的结构允许恢复已在时间和空间上分配的pop,并且比超标量处理器还具有更大的ILP利用率。该技术可以以较低的面积成本将功能强大的8问题超标量的能耗降低31.4%,同时还可以将性能提高32.6%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号