Data race avoidance and replay scheme for developing and debugging parallel programs on distributed shared memory systems

Abstract

Distributed shared memory (DSM) allows parallel programs to run on distributed computers by simulating a global virtual shared memory, but data race bugs can easily occur when the threads of a multi-threaded process concurrently access the physically distributed memory. Earlier tools that help programmers locate data race bugs in non-DSM parallel programs are not easily applied to DSM systems. This study presents the data race avoidance and replay scheme (DRARS) to assist in debugging parallel programs on DSM or multi-core systems. DRARS is a novel tool that controls the consistency protocol of the target program and automatically prevents a large class of data race bugs when the parallel program is subsequently run, obviating much of the need for manual debugging. For data race bugs that cannot be avoided automatically, DRARS provides deterministic replay on DSM systems, faithfully reproducing the run-time behavior of the parallel program. Because one class of data race bugs has already been eliminated, the remaining manual debugging task is greatly simplified. Unlike previous debugging methods, DRARS does not require the parallel program to be written in a specific style or programming language. Moreover, DRARS can be implemented in most consistency protocols. In this paper, DRARS is realized and verified in real experiments with various applications, using the eager release consistency protocol on a DSM system.
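
The general idea behind deterministic replay can be illustrated with a minimal single-machine sketch. This is not the DRARS implementation, which according to the abstract works at the level of the DSM consistency protocol; the class names `OrderRecorder` and `OrderReplayer` and the helper `execute` are hypothetical and exist only for this illustration. The first run logs the order in which threads enter a critical section, and a second run forces the same order, so order-dependent behavior is reproduced exactly.

```python
import threading


class OrderRecorder:
    """Record mode: log the order in which threads enter the critical section."""

    def __init__(self):
        self._lock = threading.Lock()
        self.log = []

    def run(self, tid, action):
        with self._lock:
            self.log.append(tid)
            action()


class OrderReplayer:
    """Replay mode: force threads to enter in the previously logged order."""

    def __init__(self, log):
        self._log = list(log)
        self._pos = 0
        self._cond = threading.Condition()

    def _my_turn(self, tid):
        return self._pos < len(self._log) and self._log[self._pos] == tid

    def run(self, tid, action):
        with self._cond:
            # Block until the log says this thread is next.
            self._cond.wait_for(lambda: self._my_turn(tid))
            action()
            self._pos += 1
            self._cond.notify_all()


def execute(scheduler, n_threads=4, steps=3):
    """Run workers that append to an order-sensitive shared list."""
    shared = []

    def worker(tid):
        for i in range(steps):
            scheduler.run(tid, lambda: shared.append((tid, i)))

    threads = [threading.Thread(target=worker, args=(t,)) for t in range(n_threads)]
    for t in threads:
        t.start()
    for t in threads:
        t.join()
    return shared


recorder = OrderRecorder()
original = execute(recorder)                      # nondeterministic interleaving
replayed = execute(OrderReplayer(recorder.log))   # same interleaving, enforced
assert replayed == original
```

The replay run must use the same thread count and step count as the recorded run; otherwise a thread could wait for a turn that never arrives.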

Bibliographic details

  • Source
    Parallel Computing, 2011, No. 1, pp. 11-25 (15 pages)
  • Author affiliations

    Department of Electrical Engineering, National Cheng Kung University, No. 1, Ta-Hsueh Rd, Tainan 701, Taiwan;

    Department of Electrical Engineering, National Cheng Kung University, No. 1, Ta-Hsueh Rd, Tainan 701, Taiwan;

    Department of Electronic Engineering, Lunghwa University of Science and Technology, Taoyuan 33306, Taiwan;

    Department of Electrical Engineering, National Kaohsiung University of Applied Sciences, No. 415, Chien Kung Rd, Sanmin District, Kaohsiung 80778, Taiwan;

    Department of Electronic Engineering, Lunghwa University of Science and Technology, Taoyuan 33306, Taiwan;

  • Indexed in: Science Citation Index (SCI), USA; Engineering Index (EI), USA
  • Original format: PDF
  • Language of text: English
  • Keywords

    parallel program; distributed shared memory; dsm; debugger; data race; access conflict; deterministic replay;
