首页> 外文会议>IEEE Computer Society Annual Symposium on VLSI >Exploiting On-Chip Routers to Store Dirty Cache Blocks in Tiled Chip Multi-processors
【24h】

Exploiting On-Chip Routers to Store Dirty Cache Blocks in Tiled Chip Multi-processors

机译:利用片上路由器将脏缓存块存储在平铺芯片多处理器中

获取原文

摘要

To meet the worst-case performance requirements, on-chip routers in Tiled Chip Multi-Processors (TCMPs) are provisioned with input port buffers. However, in real applications, the average buffer utilisation in routers is significantly low except during peak network congestion. In this work, we exploit the idle buffers in Network-on-Chip (NoC) routers to store dirty cache blocks evicted from the L1 cache. Future requests for such recently evicted cache blocks are serviced from the router buffer, thereby significantly reducing the cache miss latency. We propose two variations on how long we can store these evicted cache blocks on NoC router buffers; one up to a time threshold and another up to the arrival of a demand from an L2 cache bank. We make architectural modifications on the buffer management circuitry of NoC routers to make sure that the evicted, dirty cache blocks are stored in the local routers for the longest duration possible to facilitate local reply. Experimental results show that our proposed modifications achieve a maximum system speedup of up to 15% and an average system speedup of 12%.
机译:为了满足最坏情况下的性能要求,在图块式多处理器(TCMP)中的片上路由器配备了输入端口缓冲区。但是,在实际应用中,除了峰值网络拥塞期间,路由器中的平均缓冲区利用率非常低。在这项工作中,我们利用片上网络(NoC)路由器中的空闲缓冲区来存储从L1缓存中逐出的脏缓存块。从路由器缓冲区满足对此类最近退出的高速缓存块的将来请求,从而显着减少了高速缓存未命中延迟。对于将被逐出的缓存块可以存储在NoC路由器缓冲区中的时间,我们提出了两种变体。一个达到时间阈值,另一个达到L2缓存组的需求到达。我们在NoC路由器的缓冲区管理电路上进行了体系结构修改,以确保被驱逐的脏缓存块在尽可能长的时间内存储在本地路由器中,以利于本地回复。实验结果表明,我们提出的修改可实现高达15%的最大系统加速和12%的平均系统加速。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号