【24h】

L2-Cache Hierarchical Organizations for Multi-core Architectures

机译:适用于多核体系结构的L2缓存分层组织

获取原文
获取原文并翻译 | 示例

摘要

Nowadays the market is moving to have multiple cores on the same chip (Chip Multiprocessors - CMP) with a multi-sliced L2 which is shared by 2 cores. CMPs with 8 cores can already be found, and future CMPs will have more than 8 cores. Typical implementations of CMPs share the L2 cache among the processors and have 2 cores sharing the same L2. We are interested in investigating the behavior of the pair: L2 sharing × L2 cache size. So, we construct models of two different organizations of CMPs: (ⅰ) tiles, with LI and L2 private, interconnected through a router; (ⅱ) tiles with LI private and L2 shared among processors. The (ⅱ) organization is evaluated with different numbers (2, 4) of cores sharing the same L2 slice and also, the L2 shared slice size is changed (1 MB, 2MB and 4 MB). With a total number of 32 cores, the proposed configurations of (ii) organization are evaluated with a full-system simulation under SPLASH-2 benchmarks. By applying both techniques, results show that the execution time is improved of about 18.9% for Ocean, 88.8% for Raytrace,and 31.8% for Volrend.
机译:如今,市场正在朝着同一芯片(芯片多处理器-CMP)上具有多个内核的方向发展,该芯片具有由2个内核共享的多层L2。具有8个核心的CMP已经可以找到,将来的CMP将具有8个以上的核心。 CMP的典型实现在处理器之间共享L2缓存,并具有2个共享同一L2的内核。我们有兴趣调查该对的行为:L2共享×L2缓存大小。因此,我们构建了CMP的两个不同组织的模型:(ⅰ)具有L1和L2私有的图块,它们通过路由器互连。 (ⅱ)具有处理器之间的L1专用和L2共享的图块。 (ⅱ)组织使用共享同一L2片的不同数量(2、4)的内核进行评估,并且L2共享片的大小也发生了更改(1 MB,2MB和4 MB)。在SPLASH-2基准下,使用总共32个核心的(ii)组织的建议配置进行了全系统仿真评估。通过应用这两种技术,结果表明,Ocean的执行时间缩短了约18.9%,Raytrace的执行时间缩短了88.8%,Volrend的执行时间缩短了31.8%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号