Conference: International conference on embedded software

Can we put concurrency back into redundant multithreading?

Abstract

Software-implemented fault tolerance (SIFT) mechanisms make it possible to tolerate transient hardware faults in commercial off-the-shelf (COTS) systems without using specialized resilient hardware. Unfortunately, existing SIFT methods at both the compiler and the operating system levels are often restricted to single-threaded applications and hence do not apply to multithreaded software on modern multicore platforms. We present RomainMT, an operating system service that provides replication for unmodified multithreaded applications. Replicating these programs is challenging because scheduling-induced non-determinism may cause replicated threads to execute different valid code paths. This complicates the distinction between valid behavior and the effects of hardware errors. RomainMT solves these problems by transparently making multithreaded execution deterministic. We present two alternative mechanisms that differ in the assumptions made about the respective applications and investigate their performance implications. Our evaluation using the SPLASH2 benchmark suite shows that the overhead for triple-modular redundancy (TMR) is 24% for applications with two application threads and 65% for applications with four application threads.
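The replication problem described in the abstract can be illustrated with a minimal sketch. It is not RomainMT's mechanism: `tmr_vote` and `run_replica` are hypothetical names, and the "threads" are simulated by a fixed lock-acquisition order rather than real scheduling. The sketch only shows why a majority voter cannot tell a valid but differently scheduled replica from a faulty one unless all replicas are forced onto the same order.

```python
from collections import Counter
from typing import Hashable, List, Optional, Sequence, Tuple


def tmr_vote(outputs: Sequence[Hashable]) -> Tuple[Optional[Hashable], List[int]]:
    """Majority-vote over replica outputs (hypothetical helper).

    Returns (majority value, indices of replicas that disagree with it).
    The majority value is None when no strict majority exists; in that
    case every replica index is reported.
    """
    counts = Counter(outputs)
    value, hits = counts.most_common(1)[0]
    if hits * 2 <= len(outputs):
        return None, list(range(len(outputs)))
    faulty = [i for i, out in enumerate(outputs) if out != value]
    return value, faulty


def run_replica(lock_order: Sequence[int]) -> Tuple[int, ...]:
    """Simulate one replica (assumption: two threads, one shared list).

    Each thread appends its id to shared state while holding a lock, so
    the final state depends only on the order in which the lock is taken.
    """
    state: List[int] = []
    for thread_id in lock_order:
        state.append(thread_id)
    return tuple(state)


# Unconstrained scheduling: replica 1 takes the lock in a different but
# equally valid order, and the voter flags it as if it were faulty.
free_running = [run_replica((0, 1)), run_replica((1, 0)), run_replica((0, 1))]

# Enforced order: every replica follows the same lock-acquisition order,
# so any remaining disagreement would point at a hardware error.
deterministic = [run_replica((0, 1)) for _ in range(3)]
```

Calling `tmr_vote(free_running)` reports replica 1 as disagreeing even though no fault occurred, while `tmr_vote(deterministic)` reports no disagreement. How RomainMT actually enforces a common order is described in the paper, not here.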