首页> 外文期刊>IEEE Transactions on Computers >Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs
【24h】

Adaptive Cooperation of Prefetching and Warp Scheduling on GPUs

机译:GPU上预取和翘曲调度的自适应协作

获取原文
获取原文并翻译 | 示例

摘要

This paper proposes a new architecture, called Adaptive PREfetching and Scheduling (APRES), which improves cache efficiency of GPUs. APRES relies on the observation that GPU loads tend to have either high locality or strided access patterns across warps. APRES schedules warps so that as many cache hits are generated as possible before the generation of any cache miss. Without directly predicting future cache hits/misses for each warp, APRES creates a warp group that will execute the same static load shortly and prioritizes the grouped warps. If the first executed warp in the group hits the cache, grouped warps are likely to access the same cache lines. Unless, APRES considers the load as a strided type and generates prefetch requests for the grouped warps. In addition, APRES includes a new dynamic L1 prefetch and data cache partitioning to reduce contentions between demand-fetched and prefetched lines. In our evaluation, APRES achieves 27.8 percent performance improvement.
机译:本文提出了一种称为自适应预取和调度(APRES)的新架构,该架构可提高GPU的缓存效率。 APRES依赖于以下观察结果:GPU负载倾向于具有较高的局部性或跨扭曲的跨越式访问模式。 APRES调度扭曲,以便在生成任何高速缓存未命中之前尽可能多地生成高速缓存命中。在不直接预测每个扭曲的将来缓存命中/未命中的情况下,APRES创建了一个扭曲组,该扭曲组将很快执行相同的静态负载并确定分组扭曲的优先级。如果组中第一个执行的翘曲命中缓存,则分组的翘曲可能会访问相同的缓存行。除非,否则APRES会将负载视为跨步类型,并为分组的经线生成预取请求。此外,APRES包括一个新的动态L1预取和数据缓存分区,以减少按需提取和预取行之间的争用。在我们的评估中,APRES实现了27.8%的性能提升。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号