Journal: Journal of Parallel and Distributed Computing
A general purpose contention manager for software transactions on the GPU

Abstract

The Graphics Processing Unit (GPU) is now used extensively for general-purpose GPU programming (GPGPU), allowing greater exploitation of the multi-core model across many application domains. This is particularly true in cloud/edge/fog computing, where multiple GPU-enabled servers support many different end-user services. This move away from the naturally parallel domain of graphics can incur significant performance issues. Unlike on the CPU, code that is hindered from executing by blocking/waiting on the GPU can affect thousands of threads, rendering the advantages of a GPU irrelevant and, in the worst case, reducing a highly parallel environment to a serial one. In this paper we present a solution that minimises blocking/waiting in GPGPU computing using a contention manager that offsets memory conflicts across threads through thread re-ordering. We consider memory conflicts not only to avoid corruption (standard for transactional memory) but also in the semantic layer of application logic (e.g., enforcing ordering to ensure that money is withdrawn from a bank account only after all deposits have occurred). We demonstrate that our approach is successful across a number of industry benchmarks and compare it to the only other related solution. We also demonstrate that our approach is scalable in terms of thread numbers (a key requirement on the GPU). We believe this is the first work of its kind demonstrating a generalised conflict and semantic contention manager suitable for the scale of parallel execution found on a GPU.
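
The bank-account example in the abstract can be illustrated with a toy, CPU-side sketch. It is not the paper's contention manager and makes no claim about how re-ordering is done on the GPU; it only shows why placing deposits before withdrawals changes the outcome. The names `Txn`, `reorder`, and `run_serially`, the "reject on insufficient funds" rule, and the sample data are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Txn:
    """Hypothetical transaction record (not taken from the paper)."""
    thread_id: int  # id of the thread that issued the transaction
    account: str
    kind: str       # "deposit" or "withdraw"
    amount: int


def reorder(txns: List[Txn]) -> List[Txn]:
    """Group by account and place deposits before withdrawals.

    sorted() is stable, so transactions of the same kind on the same
    account keep their original relative order.
    """
    return sorted(txns, key=lambda t: (t.account, t.kind != "deposit"))


def run_serially(txns: List[Txn], balances: Dict[str, int]) -> Tuple[Dict[str, int], List[Txn]]:
    """Apply transactions one by one; reject withdrawals that would overdraw.

    The rejection rule is an assumption chosen to make the effect of
    ordering visible.
    """
    result = dict(balances)
    rejected: List[Txn] = []
    for t in txns:
        current = result.get(t.account, 0)
        if t.kind == "deposit":
            result[t.account] = current + t.amount
        elif current >= t.amount:
            result[t.account] = current - t.amount
        else:
            rejected.append(t)
    return result, rejected


# Sample data: the withdrawal arrives before the deposits that fund it.
arrival = [
    Txn(0, "A", "withdraw", 50),
    Txn(1, "A", "deposit", 30),
    Txn(2, "A", "deposit", 40),
]

naive_balances, naive_rejected = run_serially(arrival, {"A": 0})
ordered_balances, ordered_rejected = run_serially(reorder(arrival), {"A": 0})
```

In arrival order the withdrawal is rejected because the account is still empty; after re-ordering, both deposits land first and the withdrawal succeeds. A real GPU implementation would also have to handle memory conflicts between concurrently running threads, which this serial sketch does not attempt.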
