Instruction Prefetch for Improving GPGPU Performance

Cao Jianli; Chen Zhikui; Wang Yuxin; Guo He; Wang Pengcheng

首页> 外文期刊>IEICE Transactions on fundamentals of electronics, communications & computer sciences >Instruction Prefetch for Improving GPGPU Performance

【24h】

Instruction Prefetch for Improving GPGPU Performance

机译：用于提高GPGPU性能的指令预取

获取原文

获取原文并翻译 | 示例

获取外文期刊封面目录资料

开具论文收录证明 >>

文献代查 >>

文献数据库（团队版） >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Like many processors, GPGPU suffers from memory wall. The traditional solution for this issue is to use efficient schedulers to hide long memory access latency or use data prefetch mech-anism to reduce the latency caused by data transfer. In this paper, we study the instruction fetch stage of GPU's pipeline and analyze the relationship between the capacity of GPU kernel and instruction miss rate. We improve the next line prefetch mechanism to fit the SIMT model of GPU and determine the optimal parameters of prefetch mechanism on GPU through experiments. The experimental result shows that the prefetch mechanism can achieve 12.17% performance improvement on average. Compared with the solution of enlarging I-Cache, prefetch mechanism has the advantages of more beneficiaries and lower cost.

机译：像许多处理器一样，GPGPU遭受了记忆墙。此问题的传统解决方案是使用有效的调度程序来隐藏长内存访问延迟或使用数据预取机制 - anism来减少数据传输引起的延迟。本文研究了GPU管道指令获取阶段，分析了GPU内核能力与指令错号之间的关系。我们提高了下一行预取机制，以适应GPU的SIMT模型，并通过实验确定GPU上的预取机制的最佳参数。实验结果表明，预取机理可平均达到12.17％的性能提高。与扩大i高速缓存的解决方案相比，预取机制具有更多受益者和更低的成本。

著录项

来源
《IEICE Transactions on fundamentals of electronics, communications & computer sciences》 |2021年第5期|773-785|共13页
作者
Cao Jianli; Chen Zhikui; Wang Yuxin; Guo He; Wang Pengcheng;
展开▼
作者单位

Dalian Univ Technol Sch Software Technol Dalian Peoples R China;

Dalian Univ Technol Sch Software Technol Dalian Peoples R China;

Dalian Univ Technol Sch Comp Sci & Technol Dalian Peoples R China;

Dalian Univ Technol Sch Software Technol Dalian Peoples R China;

Ahui Univ Jianghuai Coll Hefei Peoples R China;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
GPGPU; I-Cache; warp scheduler; instruction prefetch;

机译：GPGPU;i-cache;翘曲调度程序;指令预取;

相似文献

外文文献
中文文献
专利

1. Methods to improve performance of instruction prefetching through balanced improvement of two primary performance factors [J] . Gi-Ho Park, Oh-Young Kwon, Tack-Don Han, Journal of systems architecture . 1998,第9a10期

机译：通过平衡两个主要性能因素来提高指令预取性能的方法
2. A performance study of instruction cache prefetching methods [J] . Hsu W.-C., Smith J.E. IEEE Transactions on Computers . 1998,第5期

机译：指令缓存预取方法的性能研究
3. Non-referenced prefetch (NRP) cache for instruction prefetching [J] . Park G.-H., Kwon O.-Y. IEE proceedings. Part E . 1996,第1期

机译：非参考预取（NRP）高速缓存，用于指令预取
4. A Branch Target Instruction Prefetching Technique for Improved Performance [C] . Gade, P.R., Paily, Advanced Computing and Communications (ADCOM), 2007 15th International Conference on . 2007

机译：分支目标指令预取技术可提高性能
5. Improving memory hierarchy performance with hardware prefetching and cache replacement. [D] . Lin, Wei-Fen. 2002

机译：通过硬件预取和缓存替换来提高内存层次结构的性能。
6. Combining Instruction Prefetching with Partial Cache Locking to Improve WCET in Real-Time Systems [O] . Fan Ni, Xiang Long, Han Wan, -1

机译：将指令预取与部分缓存锁定相结合以改善实时系统中的WCET
7. Combining instruction prefetching with partial cache locking to improve WCET in real-time systems. [O] . Fan Ni, Xiang Long, Han Wan, 2013

机译：将指令预取与部分高速缓存锁定相结合，以改善实时系统中的WCET。

Instruction Prefetch for Improving GPGPU Performance

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅