Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management

Kim Kyu Yeun; Park Jinsu; Baek Woongki

首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management

【24h】

Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management

机译：通过集成的自适应缓存管理提高GPGPU计算的性能和能效

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Hardware caches are widely employed in GPGPUs to achieve higher performance and energy efficiency. Incorporating hardware caches in GPGPUs, however, does not immediately guarantee enhanced performance and energy efficiency due to high cache contention and thrashing. To address the inefficiency of GPGPU caches, various adaptive techniques (e.g., warp limiting) have been proposed. However, relatively little work has been done in the context of creating an architectural framework that tightly integrates adaptive cache management techniques and investigating their effectiveness and interaction. To bridge this gap, we propose IACM, integrated adaptive cache management for high-performance and energy-efficient GPGPU computing. IACM integrates the state-of-the-art adaptive cache management techniques (i.e., cache indexing, bypassing, and warp limiting) in a unified architectural framework. Our quantitative evaluation demonstrates that IACM significantly improves the performance and energy efficiency of various GPGPU workloads over the baseline architecture (i.e., 98.1 and 61.9 percent on average, respectively), achieves considerably higher performance than the state-of-the-art technique (i.e., 361.4 percent at maximum and 7.7 percent on average), and delivers significant performance and energy-efficiency gains over the baseline GPGPU architecture enhanced with advanced architectural technologies.

机译：硬件缓存广泛用于GPGPU中，以实现更高的性能和能效。但是，由于高缓存争用和抖动，在GPGPU中集成硬件缓存并不能立即保证增强的性能和能效。为了解决GPGPU高速缓存的低效率问题，已经提出了各种自适应技术（例如，翘曲限制）。但是，在创建紧密集成自适应缓存管理技术并研究其有效性和交互作用的体系结构框架的背景下，所做的工作相对较少。为了弥合这一差距，我们提出了IACM，用于高性能和高能效GPGPU计算的集成自适应缓存管理。 IACM在一个统一的体系结构框架中集成了最新的自适应高速缓存管理技术（即高速缓存索引，旁路和扭曲限制）。我们的定量评估表明，IACM在基准架构上显着提高了各种GPGPU工作负载的性能和能效（即分别平均为98.1％和61.9％），其性能比最新技术（即（最高为361.4％，平均为7.7％），并且与通过先进架构技术增强的基准GPGPU架构相比，具有显着的性能和能效提升。

著录项

来源
《IEEE Transactions on Parallel and Distributed Systems》 |2019年第3期|630-645|共16页
作者
Kim Kyu Yeun; Park Jinsu; Baek Woongki;
展开▼
作者单位

UNIST, Sch Elect & Comp Engn, Ulsan 44919, South Korea;

UNIST, Sch Elect & Comp Engn, Ulsan 44919, South Korea;

UNIST, Sch Elect & Comp Engn, Ulsan 44919, South Korea;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Integrated adaptive cache management; GPGPU computing; high performance; energy efficiency;

机译：集成式自适应缓存管理;GPGPU计算;高性能;能源效率;

相似文献

外文文献
中文文献
专利

1. Quantifying the performance and energy efficiency of advanced cache indexing for GPGPU computing [J] . Chris Lupo Computing reviews . 2016,第12期

机译：量化用于GPGPU计算的高级缓存索引的性能和能效
2. Quantifying the performance and energy efficiency of advanced cache indexing for GPGPU computing [J] . Kim Kyu Yeun, Baek Woongki Microprocessors and microsystems . 2016,第JUNa期

机译：量化用于GPGPU计算的高级缓存索引的性能和能效
3. Designing a Practical Data Filter Cache to Improve Both Energy Efficiency and Performance [J] . ALEN BARDIZBANYAN, MAGNUS SJALANDER, DAVID WHALLEY, ACM Transactions on Architecture and Code Optimization . 2013,第4期

机译：设计实用的数据过滤器缓存以提高能源效率和性能
4. IACM: Integrated adaptive cache management for high-performance and energy-efficient GPGPU computing [C] . Kyu Yeun Kim, Jinsu Park, Woongki Baek International conference on computer design . 2016

机译：IACM：集成的自适应高速缓存管理，用于高性能和高能效的GPGPU计算
5. Exploring Hybrid SPM-Cache Architectures to Improve Performance and Energy Efficiency for Real-time Computing. [D] . Wu, Lan. 2013

机译：探索混合SPM缓存体系结构，以提高实时计算的性能和能效。
6. Research on Energy Management of Hybrid Unmanned Aerial Vehicles to Improve Energy-Saving and Emission Reduction Performance [O] . Mingliang Bai, Wenjiang Yang, Dongbin Song, 2020

机译：混合动力无人机能量管理改善节能减排性能的研究
7. Exploring Hybrid SPM-Cache Architectures to Improve Performance and Energy Efficiency for Real-time Computing [O] . Wu Lan 2013

机译：探索混合SPM缓存架构以提高实时计算的性能和能效

Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅