首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Cache Line Aware Algorithm Design for Cache-Coherent Architectures
【24h】

Cache Line Aware Algorithm Design for Cache-Coherent Architectures

机译:高速缓存一致性体系结构的高速缓存行感知算法设计

获取原文
获取原文并翻译 | 示例

摘要

The increase in the number of cores per processor and the complexity of memory hierarchies make cache coherence key for programmability of current shared memory systems. However, ignoring its detailed architectural characteristics can harm performance significantly. In order to assist performance-centric programming, we propose a methodology to allow semi-automatic performance tuning with the systematic translation from an algorithm to an analytic performance model for cache line transfers. For this, we design a simple interface for cache line aware optimization, a translation methodology, and a full performance model that exposes the block-based design of caches to middleware designers. We investigate two different architectures to show the applicability of our techniques and methods: the many-core accelerator Intel Xeon Phi and a multi-core processor with a NUMA configuration (Intel Sandy Bridge). We use mathematical optimization techniques to tune synchronization algorithms to the microarchitectures, identifying three techniques to design and optimize data transfers in our model: single-use, single-step broadcast, and private cache lines.
机译:每个处理器的内核数量的增加以及内存层次结构的复杂性,使高速缓存一致性成为当前共享内存系统可编程性的关键。但是,忽略其详细的体系结构特征可能会严重损害性能。为了协助以性能为中心的编程,我们提出了一种方法,该方法允许半自动性能调整,以及从算法到缓存行传输的分析性能模型的系统转换。为此,我们设计了一个用于缓存行感知优化的简单界面,一种转换方法以及一个完整的性能模型,该模型向中间件设计人员公开了基于块的缓存设计。我们研究了两种不同的体系结构,以展示我们的技术和方法的适用性:多核加速器Intel Xeon Phi和具有NUMA配置的多核处理器(Intel Sandy Bridge)。我们使用数学优化技术将同步算法调整到微体系结构,确定了设计和优化模型中数据传输的三种技术:一次性使用,单步广播和专用缓存行。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号