Journal: IEEE Transactions on Computers

A speculative control scheme for an energy-efficient banked register file

Abstract

Multiported register files are critical components of modern superscalar and simultaneously multithreaded (SMT) processors, but conventional designs consume considerable die area and power as register counts and issue widths grow. Banked multiported register files, consisting of multiple interleaved banks of lesser-ported cells, can be used to reduce area, power, and access time, and previous work has shown that such designs can provide sufficient bandwidth for a superscalar machine. These previous banked designs, however, have complex control structures to avoid bank conflicts or to buffer conflicting requests, which add to design complexity and would likely limit cycle time. This paper presents a much simpler and faster control scheme that speculatively issues potentially conflicting instructions and then quickly repairs the pipeline if conflicts occur. We show that, once optimizations to avoid register file reads are employed, the remaining read accesses observed in detailed simulations are close to randomly distributed, and this contributes to the effectiveness of our speculative control scheme. For a four-issue superscalar processor with 64 physical registers, we show that we can reduce area by a factor of three, access time by 25 percent, and energy by 40 percent, while decreasing IPC by less than 5 percent. For an eight-issue SMT processor with 512 physical registers, area is reduced by a factor of seven, access time by 30 percent, and energy by 60 percent, while decreasing IPC by less than 2 percent.
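The idea of a bank conflict in an interleaved register file can be illustrated with a minimal sketch. This is not the paper's control logic: the modulo mapping, the bank count, the ports per bank, and the names `bank_of` and `has_read_conflict` are all assumptions chosen for illustration, since the abstract does not specify them.

```python
from collections import Counter
from typing import Iterable


def bank_of(reg: int, num_banks: int) -> int:
    """Assumed interleaved mapping: consecutive registers fall in consecutive banks."""
    return reg % num_banks


def has_read_conflict(read_regs: Iterable[int],
                      num_banks: int,
                      read_ports_per_bank: int) -> bool:
    """Return True if one cycle's reads oversubscribe any bank.

    Assumption: duplicate reads of the same register share a single port,
    so they are counted once.
    """
    per_bank = Counter(bank_of(r, num_banks) for r in set(read_regs))
    return any(count > read_ports_per_bank for count in per_bank.values())
```

Under these assumptions, with four banks and two read ports per bank, reads of registers 0, 4, and 8 all map to bank 0 and would count as a conflict, while reads of registers 0, 1, 2, and 3 would not. In a speculative scheme of the kind the abstract describes, a check like this would run after the instructions have been issued, and a positive result would trigger the pipeline repair rather than block issue up front.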
