Exploring the relationship between architectures and management policies in the design of NUCA-based chip multicore systems


Abstract

The last-level on-chip cache (LLC) is becoming larger and more complex in order to support the varied locality patterns of the multiple cores and threads that run concurrently in modern processors. Furthermore, LLC design can be subject to constraints that limit freedom in its organization, for example in the relative positioning and clustering of processing cores and cache banks. Non-Uniform Cache Architectures (NUCAs) offer a hierarchy of access times, which NUCA management policies (i.e., the ways in which data are mapped to cache banks and/or moved among them upon access) can exploit to achieve high performance and low power consumption. The objective of this work is to identify the best-performing combination of data management policies and cache-core layouts. To this end, we compare two basic layouts for NUCA-based systems: the first with all cores connected to only one side of the shared NUCA cache (one-side), the second with half of the cores on one side and the other half on the opposite side of the NUCA (two-sides). For all configurations, we evaluate the effectiveness of both static and dynamic NUCAs and, where applicable, we also consider optimizations based on profile-guided bank remapping and replication of shared copies. As overall design guidelines, our results show that the one-side layout achieves the best performance and the lowest power consumption with the considered hardware-software optimizations. Similar results can be achieved in the two-sides layout only by introducing more sophisticated copy replication. Lastly, software-based, profile-driven optimization allows the system to achieve the lowest usage of network resources.
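To make the terms in the abstract concrete, the following is a minimal toy sketch, not the authors' simulator or methodology. It assumes a hypothetical 4x8 grid of cache banks, 64-byte blocks, eight cores, and Manhattan hop count as a stand-in for access latency; none of these values come from the paper. It shows a static NUCA mapping, in which the block address alone fixes the bank, and the one-side and two-sides core placements.

```python
from statistics import mean

# All constants below are illustrative assumptions, not values from the paper.
ROWS, COLS = 4, 8      # hypothetical grid of NUCA cache banks
BLOCK_BITS = 6         # assumed 64-byte cache blocks


def static_bank(addr: int) -> tuple[int, int]:
    """Static NUCA-style mapping: block-address bits select a fixed bank."""
    bank = (addr >> BLOCK_BITS) % (ROWS * COLS)
    return divmod(bank, COLS)  # (row, col) of the bank in the grid


def core_positions(layout: str, n_cores: int = 8) -> list[tuple[int, int]]:
    """Place cores just outside the grid: row -1 (top) or row ROWS (bottom)."""
    if layout == "one-side":
        return [(-1, c * COLS // n_cores) for c in range(n_cores)]
    if layout == "two-sides":
        half = n_cores // 2
        top = [(-1, c * COLS // half) for c in range(half)]
        bottom = [(ROWS, c * COLS // half) for c in range(half)]
        return top + bottom
    raise ValueError(f"unknown layout: {layout}")


def hops(core: tuple[int, int], bank: tuple[int, int]) -> int:
    """Manhattan distance, used here as a crude proxy for access time."""
    return abs(core[0] - bank[0]) + abs(core[1] - bank[1])


def mean_hops(layout: str, addrs: list[int]) -> float:
    """Average core-to-bank distance when every core touches every address."""
    cores = core_positions(layout)
    return mean(hops(core, static_bank(a)) for core in cores for a in addrs)
```

Calling `mean_hops("one-side", addrs)` and `mean_hops("two-sides", addrs)` on the same address trace gives a rough feel for how placement interacts with a fixed mapping. A dynamic NUCA would additionally move blocks between banks upon access, which this sketch does not model, and it says nothing about the power or network-traffic results reported in the abstract.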
