首页> 外文会议>International conference on algorithms and architectures for parallel processing >MC: A Software Solution to Reduce Errors Caused by Incoherent Caches on GPUs in Unstructured Graphic Algorithm
【24h】

MC: A Software Solution to Reduce Errors Caused by Incoherent Caches on GPUs in Unstructured Graphic Algorithm

机译:M&C:一种软件解决方案,可减少非结构化图形算法中GPU上的不相关缓存导致的错误

获取原文

摘要

Recently, researchers have focused on addressing incoherent caches on GPUs as current GPUs lack hardware to support that. Moreover, the support for inter-block communication also lacks which limits the scalability of parallel programming especially in the unstructured algorithm in which program would share data between different threads. Barrier synchronization can be a solution but it becomes invalid because of incoherent caches. In this paper, we propose a set of rules for programming on current GPUs to avoid the errors caused by incoherent caches when applying barrier synchronization. We also leverage these rules into an unstructured graphic algorithm - constrained Delaunay triangulation. In comparison with traditional ways such as (1) disabling L1 cache and (2) using keyword volatile, we find that when vertices is over 300K the error rate is lesser than (1) by 91.19% and than (2) by 84.2% on average.
机译:最近,研究人员专注于解决GPU上的非一致性缓存,因为当前的GPU缺乏支持该缓存的硬件。此外,还缺少对块间通信的支持,这限制了并行编程的可伸缩性,尤其是在非结构化算法中,在该非结构化算法中,程序将在不同线程之间共享数据。屏障同步可以是一种解决方案,但是由于缓存不连贯,它变得无效。在本文中,我们提出了一套用于在当前GPU上进行编程的规则,以避免在应用障碍同步时避免由非相干缓存导致的错误。我们还将这些规则用于非结构化图形算法-约束Delaunay三角剖分。与传统方法(如(1)禁用L1缓存和(2)使用关键字volatile)相比,我们发现,当顶点超过300K时,错误率比(1)小91.19%,比(2)小84.2%平均。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号