Conference: International Conference on Parallel and Distributed Computing

GT-Race: Graph Traversal Based Data Race Detection for Asynchronous Many-Task Parallelism

Abstract

Asynchronous Many-Task (AMT) parallelism is growing in popularity because of its promise to support future platforms with new heterogeneity and resiliency requirements. It supports the construction of parallel programs with fine-grained tasks, thereby enabling portability across a wide range of platforms. However, applications written for AMT parallelism remain vulnerable to data races, and existing data race detection tools are unsuitable for AMT programs because they either incur intractably large overheads or are limited to restricted task structures such as fork-join parallelism. In this paper, we propose GT-Race, a new graph-traversal-based data race detector for AMT parallelism. It leverages the computation graph data structure, which encodes the general happens-before structures in AMT programs. After introducing a baseline algorithm for data race detection, we propose key optimizations to reduce its time and space complexity: the epoch adjacency list, which compresses the computation graph representation; the reachability cache combined with depth filtering, which reduces the number of unnecessary traversals; and bounded race detection, which limits the range of data that is monitored. The impact of these optimizations is demonstrated for nine benchmark programs written for the Open Community Runtime (OCR), an open source AMT runtime that supports point-to-point synchronization and disjoint data blocks.
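
The abstract names the ingredients of the detector without showing how they fit together. The following is a minimal Python sketch of the general idea only, not the GT-Race implementation: two conflicting accesses are flagged when no happens-before path connects the tasks that issued them. The class names (ComputationGraph, RaceDetector), the memoized cache, and the depth rule are illustrative assumptions. In particular, "depth" here means longest path length from a root, which is only one plausible reading of the depth filtering mentioned above, and the epoch adjacency list and bounded race detection are not modeled.

```python
from collections import defaultdict


class ComputationGraph:
    """Happens-before DAG: an edge u -> v means task u completes before v starts."""

    def __init__(self):
        self.succs = defaultdict(list)  # task -> direct successors
        self.depth = {}                 # task -> longest path length from a root
        self.cache = {}                 # (src, dst) -> bool, memoized reachability

    def add_task(self, task, preds=()):
        # Assumption: every predecessor is already in the graph when the task
        # is created, so depths and cached answers never need to be revised.
        self.depth[task] = 1 + max((self.depth[p] for p in preds), default=-1)
        for p in preds:
            self.succs[p].append(task)

    def reaches(self, src, dst):
        """Return True if a path src -> ... -> dst exists (src happens-before dst)."""
        if src == dst:
            return True
        # Depth filter (assumed rule): each edge strictly increases depth, so a
        # node that is not shallower than dst cannot reach it.
        if self.depth[src] >= self.depth[dst]:
            return False
        key = (src, dst)
        if key not in self.cache:
            # Recursive traversal; a real tool would use an explicit stack.
            self.cache[key] = any(self.reaches(n, dst) for n in self.succs[src])
        return self.cache[key]


class RaceDetector:
    def __init__(self, graph):
        self.graph = graph
        self.history = defaultdict(list)  # location -> [(task, is_write)]
        self.races = []                   # (location, earlier task, later task)

    def access(self, task, loc, is_write):
        # Assumption: accesses are reported in observed execution order, so only
        # "earlier task reaches current task" needs to be checked.
        for prev, prev_write in self.history[loc]:
            conflicting = is_write or prev_write
            if prev != task and conflicting and not self.graph.reaches(prev, task):
                self.races.append((loc, prev, task))
        self.history[loc].append((task, is_write))


# Diamond-shaped graph: A precedes B and C, which both precede D.
g = ComputationGraph()
g.add_task("A")
g.add_task("B", ["A"])
g.add_task("C", ["A"])
g.add_task("D", ["B", "C"])

d = RaceDetector(g)
d.access("A", "x", True)   # write by A
d.access("B", "x", False)  # read by B, ordered after A
d.access("C", "x", True)   # write by C, unordered with B's read
d.access("D", "x", False)  # read by D, ordered after everything
print(d.races)
```

In this run only the read in B and the write in C are unordered, so a single race on x is reported. The sketch keeps the full access history per location, which is the kind of space cost the optimizations described in the abstract are meant to reduce.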