LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs

机译：LATTE-CC：延迟公差感知的自适应缓存压缩管理，用于节能型GPU

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

General-purpose GPU applications are significantly constrained by the efficiency of the memory subsystem and the availability of data cache capacity on GPUs. Cache compression, while is able to expand the effective cache capacity and improve cache efficiency, comes with the cost of increased hit latency. This has constrained the application of cache compression to mostly lower level caches, leaving it unexplored for L1 caches and for GPUs. Directly applying state-of-the-art high performance cache compression schemes on GPUs results in a wide performance variation from -52% to 48%. To maximize the performance and energy benefits of cache compression for GPUs, we propose a new compression management scheme, called LATTE-CC. LATTE-CC is designed to exploit the dynamically-varying latency tolerance feature of GPUs. LATTE-CC compresses cache lines based on its prediction of the degree of latency tolerance of GPU streaming multiprocessors and by choosing between three distinct compression modes: no compression, low-latency, and high-capacity. LATTE-CC improves the performance of cache sensitive GPGPU applications by as much as 48.4% and by an average of 19.2%, outperforming the static application of compression algorithms. LATTE-CC also reduces GPU energy consumption by an average of 10%, which is twice as much as that of the state-of-the-art compression scheme.

机译：通用GPU应用受到内存子系统的效率以及GPU上数据缓存容量可用性的极大限制。高速缓存压缩虽然可以扩展有效的高速缓存容量并提高高速缓存效率，但会增加命中延迟时间。这已将高速缓存压缩的应用限制在大多数较低级别的高速缓存中，从而使得L1高速缓存和GPU尚未探索它。在GPU上直接应用最新的高性能高速缓存压缩方案会导致从-52％到48％的巨大性能差异。为了最大程度地提高GPU缓存压缩的性能和能源效益，我们提出了一种称为LATTE-CC的新压缩管理方案。 LATTE-CC旨在利用GPU动态变化的延迟容限功能。 LATTE-CC基于对GPU流多处理器延迟容忍度的预测，并通过在三种不同的压缩模式之间进行选择来压缩高速缓存行：无压缩，低延迟和高容量。 LATTE-CC将高速缓存敏感的GPGPU应用程序的性能提高了48.4％，平均提高了19.2％，优于压缩算法的静态应用程序。 LATTE-CC还可以将GPU能耗平均降低10％，是最新压缩方案的两倍。

著录项

来源
《IEEE International Symposium on High Performance Computer Architecture》|2018年|221-234|共14页
会议地点
作者
Akhil Arunkumar; Shin-Ying Lee; Vignesh Soundararajan; Carole-Jean Wu;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类
关键词
Computer architecture;

机译：计算机架构;

相似文献

外文文献
中文文献
专利

1. An Energy Efficient Sensor Network Processor with Latency-Aware Adaptive Compression [J] . Yongpan Liu, Shuangchen Li, Jue Wang, IEICE Transactions on Electronics . 2011,第7期

机译：具有延迟感知自适应压缩的节能型传感器网络处理器
2. Efficient cooperative cache management for latency-aware data intelligent processing in edge environment [J] . Chunlin Li, Jun Liu, Qingchuan Zhang, Future generation computer systems . 2021,第Octa期

机译：高效的合作缓存管理，用于边缘环境中的延迟感知数据智能处理
3. Improving the Performance and Energy Efficiency of GPGPU Computing through Integrated Adaptive Cache Management [J] . Kim Kyu Yeun, Park Jinsu, Baek Woongki IEEE Transactions on Parallel and Distributed Systems . 2019,第3期

机译：通过集成的自适应缓存管理提高GPGPU计算的性能和能效
4. LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs [C] . Akhil Arunkumar, Shin-Ying Lee, Vignesh Soundararajan, IEEE International Symposium on High Performance Computer Architecture . 2018

机译：Latte-CC：延迟宽容意识到节能GPU的自适应缓存压缩管理
5. Energy efficient and latency aware adaptive compression in wireless sensor networks. [D] . Szalapski, Thomas Mark Daniel. 2014

机译：无线传感器网络中的节能和延迟感知自适应压缩。
6. CPAC: Energy-Efficient Data Collection through Adaptive Selection of Compression Algorithms for Sensor Networks [O] . HyungJune Lee, HyunSeok Kim, Ik Joon Chang 2014

机译：CPAC：通过自适应选择传感器网络的压缩算法来进行节能数据收集
7. Adaptive Cache Management for Energy-efficient GPU Computing [O] . 2015

机译：用于节能GpU计算的自适应高速缓存管理

LATTE-CC: Latency Tolerance Aware Adaptive Cache Compression Management for Energy Efficient GPUs

摘要

著录项

相似文献

相关主题

期刊订阅