...
首页> 外文期刊>Computer-Aided Design of Integrated Circuits and Systems, IEEE Transactions on >Performance, Area, and Power Evaluations of Ultrafine-Grained Run-Time Power-Gating Routers for CMPs
【24h】

Performance, Area, and Power Evaluations of Ultrafine-Grained Run-Time Power-Gating Routers for CMPs

机译:用于CMP的超细粒度运行时功率门控路由器的性能,面积和功率评估

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

This paper proposes the ultrafine-grained run-time power gating of on-chip routers, in which the power supply to each router component (e.g., virtual-channel buffer, virtual-channel multiplexer, and crossbar multiplexer and output latch) can be individually controlled based on the applied workload. Since only the router components that are transferring a packet are activated, the leakage power of the on-chip network can be reduced to a near-optimal level. However, such techniques inherently increase the communication latency and degrade the application performance, since a certain amount of wakeup latency is required to activate the sleeping components. To mitigate this wakeup latency, an early wakeup method that can preliminarily detect the next packet arrival and activate the corresponding components is essential. We designed and implemented an ultrafine-grained power-gating router using a commercial 65 nm process. We propose four early wakeup methods and combine them with the power-gating router. The proposed router with the early wakeup methods is evaluated in terms of its application performance, area overhead, and leakage power reduction taking into account the on/off energy overhead. The simulation results showed that it reduces the leakage power by 54.4–59.9% on average even when the application programs are fully running, at the expense of 4.6% of the area and 0.7–3.7% of the performance overheads when we assume a 1 GHz operation.
机译:本文提出了片上路由器的超细粒度运行时门控,其中可以单独给每个路由器组件(例如,虚拟通道缓冲器,虚拟通道多路复用器,交叉开关多路复用器和输出锁存器)供电。根据应用的工作负载进行控制。由于仅激活正在传输数据包的路由器组件,因此片上网络的泄漏功率可以降低到接近最佳水平。然而,由于需要一定量的唤醒等待时间来激活睡眠组件,因此这些技术固有地增加了通信等待时间并降低了应用性能。为了减轻这种唤醒等待时间,必须能够早期检测下一个数据包到达并激活相应组件的早期唤醒方法。我们使用商业65 nm工艺设计并实现了超细粒度的功率门控路由器。我们提出了四种早期唤醒方法,并将它们与电源门控路由器结合在一起。拟议的采用早期唤醒方法的路由器会根据其应用性能,面积开销和泄漏功率的降低进行评估,并考虑开/关能量开销。仿真结果表明,即使在应用程序完全运行时,它也可以平均降低54.4–59.9%的泄漏功率,而在假设1 GHz的情况下,则以占面积4.6%和性能开销0.7-3.7%的代价为代价操作。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号