首页> 外文会议>IEEE International Symposium on Real-Time Distributed Computing >Warp-Based Load/Store Reordering to Improve GPU Data Cache Time Predictability and Performance
【24h】

Warp-Based Load/Store Reordering to Improve GPU Data Cache Time Predictability and Performance

机译:基于扭曲的加载/存储重新排序,以改善GPU数据缓存的时间可预测性和性能

获取原文

摘要

Graphics Processing Units (GPUs) have great potential to improve the performance and energy efficiency for data-parallel real-time applications. However, it is very difficult to compute worst-case execution time (WCET) for current GPUs that are design for improving the average-case throughput, not for time predictability. In this paper, we propose a warp-based load/store reordering mechanism to improve the time predictability of GPU data caching without incurring much performance overhead. This mechanism can be used in conjunction with dynamic warp scheduling to achieve better performance than a pure round-robin based scheduling while enabling accurate static timing analysis to bound the worst-case GPU L1 data cache misses.
机译:图形处理单元(GPU)在提高数据并行实时应用程序的性能和能效方面具有巨大潜力。但是,对于当前用于设计平均GPU吞吐量而不是时间可预测性的GPU,很难计算最坏情况执行时间(WCET)。在本文中,我们提出了一种基于扭曲的加载/存储重排序机制,以提高GPU数据缓存的时间可预测性,而不会产生太多性能开销。与基于纯循环调度的调度相比,该机制可与动态扭曲调度结合使用,以实现更好的性能,同时能够进行精确的静态时序分析以限制最坏情况下的GPU L1数据高速缓存未命中。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号