【24h】

Modeling Soft-Error Propagation in Programs

机译:在程序中建模软错误传播

获取原文

摘要

As technology scales to lower feature sizes, devices become more susceptible to soft errors. Soft errors can lead to silent data corruptions (SDCs), seriously compromising the reliability of a system. Traditional hardware-only techniques to avoid SDCs are energy hungry, and hence not suitable for commodity systems. Researchers have proposed selective software-based protection techniques to tolerate hardware faults at lower costs. However, these techniques either use expensive fault injection or inaccurate analytical models to determine which parts of a program must be protected for preventing SDCs. In this work, we construct a three-level model, TRIDENT, that captures error propagation at the static data dependency, control-flow and memory levels, based on empirical observations of error propagations in programs. TRIDENT is implemented as a compiler module, and it can predict both the overall SDC probability of a given program and the SDC probabilities of individual instructions, without fault injection. We find that TRIDENT is nearly as accurate as fault injection and it is much faster and more scalable. We also demonstrate the use of TRIDENT to guide selective instruction duplication to efficiently mitigate SDCs under a given performance overhead bound.
机译:随着技术缩小到更低的特征尺寸,设备变得更容易受到软错误。软错误可能导致静默数据损坏(SDC),严重损害系统的可靠性。避免SDCS的传统硬件仅限能源饥饿,因此不适合商品系统。研究人员提出了基于选择性的软件的保护技术,以较低的成本宽容。但是,这些技术使用昂贵的故障注入或不准确的分析模型来确定必须保护哪些部分以防止SDC。在这项工作中,我们基于程序中的误差传播的经验观察,我们构建了三级模型,三级模型,即捕获静态数据依赖性,控制流和内存级别的错误传播。三叉戟以编译器模块实现,它可以预测给定程序的总体SDC概率和单个指令的SDC概率,而不会出现故障注射。我们发现三叉戟几乎可以作为故障注入准确,它更快,更可扩展。我们还演示了使用Trident来指导选择性指导复制,以便在给定的性能开销下有效地缓解SDC。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号