Re-NUCA: Boosting CMP Performance Through Block Replication

Abstract

Chip Multiprocessor (CMP) systems have become the reference architecture for microprocessor design, thanks to improvements in semiconductor nanotechnology that have continuously provided a growing number of faster and smaller transistors per chip. Interest in CMPs has grown because classical techniques for boosting performance, e.g. increasing the clock frequency and the amount of work performed in each clock cycle, can no longer deliver significant improvements due to energy constraints and wire-delay effects. CMP systems generally adopt a large last-level cache (LLC), typically L2 or L3, shared among all cores, together with private L1 caches. Because the miss resolution time of the private caches depends on the response time of the LLC, which is wire-delay dominated, overall performance is affected by wire delay. NUCA (Non-Uniform Cache Architecture) caches have been proposed for single-core and multi-core systems as a mechanism for tolerating wire-delay effects on overall performance. In this paper, we introduce a novel NUCA architecture, called Re-NUCA, specifically suited for (but not limited to) CMPs in which cores are placed at different sides of the shared cache. The idea is to allow shared blocks to be replicated inside the shared cache, in order to avoid the limits on performance improvement that arise in classical D-NUCA caches due to the conflict hit problem. Our results show that Re-NUCA outperforms D-NUCA by more than 5% on average, while for applications that suffer strongly from the conflict hit problem we observe performance improvements of up to 15%.
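The abstract names the conflict hit problem without spelling out its mechanics. The toy model below is a sketch of one common reading of it, not the paper's simulator: a single shared block in a row of cache banks, two cores at opposite ends, and a D-NUCA-style promotion that moves the block one bank toward whichever core just hit it. Everything here is an assumption made for illustration: the function name `simulate`, the bank count, the hop-count latency model, and the simplification that replication gives each core its own independently migrating copy. Coherence traffic and the capacity cost of replicas are ignored.

```python
def simulate(accesses, n_banks=8, replicate=False):
    """Return the total access cost (in bank hops) for one shared block.

    accesses  : sequence of 'A' or 'B'; core A sits next to bank 0,
                core B sits next to bank n_banks - 1 (hypothetical layout).
    replicate : False models a single migrating copy (D-NUCA-like),
                True models one copy per core (a simplified Re-NUCA-like case).
    """
    home = {"A": 0, "B": n_banks - 1}   # bank closest to each core
    start = n_banks // 2                # block initially in a central bank
    pos = {"A": start, "B": start}      # bank holding the copy each core uses
    total = 0
    for core in accesses:
        # Cost model (assumption): one hop per bank of distance, plus the bank access.
        total += abs(pos[core] - home[core]) + 1
        if pos[core] != home[core]:
            # Promote the accessed copy one bank toward the requesting core.
            step = -1 if home[core] < pos[core] else 1
            new_pos = pos[core] + step
            if replicate:
                pos[core] = new_pos              # only this core's replica moves
            else:
                pos["A"] = pos["B"] = new_pos    # the single copy moves for both
    return total


if __name__ == "__main__":
    pattern = "AB" * 8  # two cores alternately hitting the same block
    print("single copy :", simulate(pattern))
    print("replicated  :", simulate(pattern, replicate=True))
```

With alternating accesses, the single copy is pulled back and forth and stays near the center, far from both cores, whereas the per-core replicas each settle in the bank nearest their requester. This is only meant to show why replication can help in the opposite-sides layout the abstract describes; it says nothing about the magnitude of the gains reported there.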
