首页> 外文期刊>IEEE Transactions on Computers >Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
【24h】

Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs

机译:GPU上预取和扭曲调度的自适应合作

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a new architecture, called Adaptive PREfetching and Scheduling (APRES), which improves cache efficiency of GPUs. APRES relies on the observation that GPU loads tend to have either high locality or strided access patterns across warps. APRES schedules warps so that as many cache hits are generated as possible before the generation of any cache miss. Without directly predicting future cache hits/misses for each warp, APRES creates a warp group that will execute the same static load shortly and prioritizes the grouped warps. If the first executed warp in the group hits the cache, grouped warps are likely to access the same cache lines. Unless, APRES considers the load as a strided type and generates prefetch requests for the grouped warps. In addition, APRES includes a new dynamic L1 prefetch and data cache partitioning to reduce contentions between demand-fetched and prefetched lines. In our evaluation, APRES achieves 27.8 percent performance improvement.
机译:本文提出了一种新的架构,称为自适应预取和调度(APRES),从而提高了GPU的缓存效率。 APRES依赖于观察到GPU负载倾向于具有跨越Warps的高地位或进入访问模式。 APRES计划扭曲,以便在生成任何缓存未命中时,尽可能多地生成缓存命中。如果不直接预测每个扭曲的未来缓存命中/未命中,则APRES会创建一个扭曲组,即将执行相同的静态负载并优先考虑分组的扭曲。如果组中的第一个执行的翘曲命中缓存,则分组的扭曲可能会访问相同的缓存行。除非,APRES将负载视为string类型,并为分组的扭曲生成预取请求。此外,APRES包括一个新的动态L1预取和数据缓存分区,以减少需求获取和预取行之间的竞争。在我们的评估中,APRES实现了27.8%的绩效改进。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号