首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >On Runtime Communication and Thermal-Aware Application Mapping and Defragmentation in 3D NoC Systems
【24h】

On Runtime Communication and Thermal-Aware Application Mapping and Defragmentation in 3D NoC Systems

机译:3D NoC系统中的运行时通信和热感知应用程序映射和碎片整理

获取原文
获取原文并翻译 | 示例

摘要

. Mapping applications at runtime to 3D NoCs is the key to maintain high throughput of the overall chip under a thermal/power constraint. However, the goals of optimizing both the communication latency and chip peak temperature are contradicting due to several reasons. First, exploiting the vertical TSV links can accelerate communications, while low peak temperature prefers that the tasks to be mapped closer to the heat sink, instead of using the vertical links. Second, mapping tasks in close proximity can reduce communication latency, but at the cost of poor heat dissipation. To address these issues, in this paper, we propose an efficient runtime mapping algorithm to reduce both communication latency and overall application running time under thermal constraint. In essence, this algorithm first selects a 3D cuboid core region of a specific shape for each incoming application by setting the region & x0027;s number of occupied vertical layers and its distance to the heat sink, in order to optimize its communication performance and peak temperature. Next, the exact locations of the core regions in the chip are determined, followed by a task-to-core mapping. A defragmentation algorithm is also proposed to keep free core regions contiguous. The experimental results have confirmed that, compared to two recently proposed runtime mapping algorithms, our proposed approach can reduce the total running time by up to 48% and communication cost by up to 44%, with a low runtime overhead.
机译:。在运行时将应用程序映射到3D NoC是在热/功率限制下维持整个芯片高吞吐量的关键。但是,由于多种原因,优化通信延迟和芯片峰值温度的目标是矛盾的。首先,利用垂直TSV链接可以加快通信速度,而较低的峰值温度则希望将任务映射到更靠近散热器的位置,而不是使用垂直链接。其次,在附近绘制地图任务可以减少通信延迟,但以散热不佳为代价。为了解决这些问题,在本文中,我们提出了一种有效的运行时映射算法,以减少在热约束下的通信延迟和整个应用程序的运行时间。从本质上讲,该算法首先通过设置区域所占的垂直层数及其与散热器的距离,来为每个传入的应用选择特定形状的3D长方体核心区域,以优化其通信性能和峰值温度。接下来,确定芯片中核心区域的确切位置,然后确定任务到核心的映射。还提出了碎片整理算法以保持自由核心区域连续。实验结果证实,与最近提出的两种运行时映射算法相比,我们提出的方法可以将总运行时间减少多达48%,将通信成本减少多达44%,并且运行时开销较低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号