【24h】

Modeling Soft-Error Propagation in Programs

机译:在程序中对软错误传播建模

获取原文

摘要

As technology scales to lower feature sizes, devices become more susceptible to soft errors. Soft errors can lead to silent data corruptions (SDCs), seriously compromising the reliability of a system. Traditional hardware-only techniques to avoid SDCs are energy hungry, and hence not suitable for commodity systems. Researchers have proposed selective software-based protection techniques to tolerate hardware faults at lower costs. However, these techniques either use expensive fault injection or inaccurate analytical models to determine which parts of a program must be protected for preventing SDCs. In this work, we construct a three-level model, TRIDENT, that captures error propagation at the static data dependency, control-flow and memory levels, based on empirical observations of error propagations in programs. TRIDENT is implemented as a compiler module, and it can predict both the overall SDC probability of a given program and the SDC probabilities of individual instructions, without fault injection. We find that TRIDENT is nearly as accurate as fault injection and it is much faster and more scalable. We also demonstrate the use of TRIDENT to guide selective instruction duplication to efficiently mitigate SDCs under a given performance overhead bound.
机译:随着技术扩展到较小的特征尺寸,设备变得更容易受到软错误的影响。软错误可能导致静默数据损坏(SDC),从而严重损害系统的可靠性。避免SDC的传统纯硬件技术非常耗能,因此不适用于商品系统。研究人员提出了选择性的基于软件的保护技术,以较低的成本容忍硬件故障。但是,这些技术使用昂贵的故障注入或不准确的分析模型来确定必须保护程序的哪些部分以防止SDC。在这项工作中,我们构建了一个三级模型TRIDENT,该模型基于对程序中错误传播的经验观察,捕获了静态数据依赖项,控制流和内存级别的错误传播。 TRIDENT是作为编译器模块实现的,它可以预测给定程序的总体SDC概率和单个指令的SDC概率,而无需注入错误。我们发现TRIDENT几乎与故障注入一样准确,并且速度更快,可扩展性更高。我们还演示了使用TRIDENT来指导选择性指令复制,以在给定的性能开销限制下有效缓解SDC。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号