首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management
【24h】

Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management

机译:通过集成的自适应缓存管理提高GPGPU计算的性能和能效

获取原文
获取原文并翻译 | 示例

摘要

Hardware caches are widely employed in GPGPUs to achieve higher performance and energy efficiency. Incorporating hardware caches in GPGPUs, however, does not immediately guarantee enhanced performance and energy efficiency due to high cache contention and thrashing. To address the inefficiency of GPGPU caches, various adaptive techniques (e.g., warp limiting) have been proposed. However, relatively little work has been done in the context of creating an architectural framework that tightly integrates adaptive cache management techniques and investigating their effectiveness and interaction. To bridge this gap, we propose IACM, integrated adaptive cache management for high-performance and energy-efficient GPGPU computing. IACM integrates the state-of-the-art adaptive cache management techniques (i.e., cache indexing, bypassing, and warp limiting) in a unified architectural framework. Our quantitative evaluation demonstrates that IACM significantly improves the performance and energy efficiency of various GPGPU workloads over the baseline architecture (i.e., 98.1 and 61.9 percent on average, respectively), achieves considerably higher performance than the state-of-the-art technique (i.e., 361.4 percent at maximum and 7.7 percent on average), and delivers significant performance and energy-efficiency gains over the baseline GPGPU architecture enhanced with advanced architectural technologies.
机译:硬件缓存广泛用于GPGPU中,以实现更高的性能和能效。但是,由于高缓存争用和抖动,在GPGPU中集成硬件缓存并不能立即保证增强的性能和能效。为了解决GPGPU高速缓存的低效率问题,已经提出了各种自适应技术(例如,翘曲限制)。但是,在创建紧密集成自适应缓存管理技术并研究其有效性和交互作用的体系结构框架的背景下,所做的工作相对较少。为了弥合这一差距,我们提出了IACM,用于高性能和高能效GPGPU计算的集成自适应缓存管理。 IACM在一个统一的体系结构框架中集成了最新的自适应高速缓存管理技术(即高速缓存索引,旁路和扭曲限制)。我们的定量评估表明,IACM在基准架构上显着提高了各种GPGPU工作负载的性能和能效(即分别平均为98.1%和61.9%),其性能比最新技术(即(最高为361.4%,平均为7.7%),并且与通过先进架构技术增强的基准GPGPU架构相比,具有显着的性能和能效提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号