Conference: IEEE International Parallel and Distributed Processing Symposium

Accelerated Reply Injection for Removing NoC Bottleneck in GPGPUs

Abstract

The high degree of parallelism in GPGPUs has significantly changed on-chip data traffic behavior, which calls for new research to identify and address the limiting factors of networks-on-chip (NoCs) in GPGPUs. In this paper, we quantitatively analyze the performance of on-chip networks in GPGPUs and address a serious NoC bottleneck: reply data from memory controllers experience heavy contention when being injected into the reply network. To remove this reply injection bottleneck, we propose Accelerated Reply Injection (ARI), an effective scheme that supplies data traffic from the memory controllers at a high rate to feed the reply injection points, and that accelerates the consumption of injected packets by quickly transferring them out of the injection points, thus increasing both the supply and the consumption of reply traffic injection. Simulation results on a wide range of benchmarks show that ARI reduces the data stall time in memory controllers by 67.8% on average and increases IPC by more than 15.4% on average, with less than 1% area overhead.
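To make the reply injection bottleneck concrete, the following is a minimal toy sketch, not the ARI mechanism or the paper's simulator. It models a single memory controller feeding a single injection buffer, and counts the cycles in which a ready reply cannot enter the buffer because packets are drained too slowly. The function name `simulate` and all parameters (reply size in flits, buffer size, drain rate, reply period) are hypothetical values chosen for illustration only.

```python
def simulate(cycles, reply_flits, inject_rate, buffer_flits, reply_period):
    """Toy model: one memory controller feeding one reply-injection buffer.

    All parameters are illustrative assumptions, not values from the paper.
    Returns (stall_cycles, delivered_flits).
    """
    occupancy = 0      # flits currently held in the injection buffer
    pending = 0        # replies waiting inside the memory controller
    stall_cycles = 0   # cycles in which a ready reply could not be injected
    delivered = 0      # flits that left the injection point for the network

    for cycle in range(cycles):
        # Consumption side: the network drains up to inject_rate flits per cycle.
        drained = min(occupancy, inject_rate)
        occupancy -= drained
        delivered += drained

        # Supply side: a new reply becomes ready every reply_period cycles.
        if cycle % reply_period == 0:
            pending += 1

        # The controller injects one whole reply if the buffer has room,
        # otherwise it stalls for this cycle.
        if pending > 0:
            if occupancy + reply_flits <= buffer_flits:
                occupancy += reply_flits
                pending -= 1
            else:
                stall_cycles += 1

    return stall_cycles, delivered


if __name__ == "__main__":
    # Same supply (4 flits every 2 cycles), two different drain rates.
    slow = simulate(1000, reply_flits=4, inject_rate=1, buffer_flits=8, reply_period=2)
    fast = simulate(1000, reply_flits=4, inject_rate=2, buffer_flits=8, reply_period=2)
    print("slow drain: stalls=%d delivered=%d" % slow)
    print("fast drain: stalls=%d delivered=%d" % fast)
```

In this toy setting, the controller stalls whenever the drain rate falls below the supply rate and stops stalling once the two match. That is only the intuition behind accelerating consumption at the injection point. The scheme in the paper also raises the supply rate from the memory controllers, which this sketch does not model.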
