...
首页> 外文期刊>Computers, IEEE Transactions on >A Cache Hierarchy Aware Thread Mapping Methodology for GPGPUs
【24h】

A Cache Hierarchy Aware Thread Mapping Methodology for GPGPUs

机译:GPGPU的缓存层次结构感知线程映射方法

获取原文
获取原文并翻译 | 示例
           

摘要

The recently proposed GPGPU architecture has added a multi-level hierarchy of shared cache to better exploit the data locality of general purpose applications. The GPGPU design philosophy allocates most of the chip area to processing cores, and thus results in a relatively small cache shared by a large number of cores when compared with conventional multi-core CPUs. Applying a proper thread mapping scheme is crucial for gaining from constructive cache sharing and avoiding resource contention among thousands of threads. However, due to the significant differences on architectures and programming models, the existing thread mapping approaches for multi-core CPUs do not perform as effective on GPGPUs. This paper proposes a formal model to capture both the characteristics of threads as well as the cache sharing behavior of multi-level shared cache. With appropriate proofs, the model forms a solid theoretical foundation beneath the proposed cache hierarchy aware thread mapping methodology for multi-level shared cache GPGPUs. The experiments reveal that the three-staged thread mapping methodology can successfully improve the data reuse on each cache level of GPGPUs and achieve an average of 2.3× to 4.3× runtime enhancement when compared with existing approaches.
机译:最近提出的GPGPU架构增加了共享缓存的多层结构,以更好地利用通用应用程序的数据局部性。 GPGPU设计理念将大部分芯片区域分配给处理内核,因此与传统的多核CPU相比,导致大量内核共享相对较小的缓存。应用适当的线程映射方案对于从构造性缓存共享中获得收益以及避免数千个线程之间的资源争用至关重要。但是,由于架构和编程模型上的重大差异,现有的多核CPU线程映射方法在GPGPU上的效果不佳。本文提出了一个正式的模型来捕获线程的特征以及多级共享缓存的缓存共享行为。有了适当的证明,该模型为多层共享缓存GPGPU的建议的缓存层次结构感知线程映射方法奠定了坚实的理论基础。实验表明,与现有方法相比,三阶段线程映射方法可以成功地改善GPGPU的每个缓存级别上的数据重用性,并实现平均2.3倍至4.3倍的运行时间增强。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号