首页> 外文期刊>IEEE Transactions on Computers >Self-stabilizing microprocessor: analyzing and overcoming soft errors
【24h】

Self-stabilizing microprocessor: analyzing and overcoming soft errors

机译:自稳定微处理器:分析和克服软错误

获取原文
获取原文并翻译 | 示例

摘要

Soft errors are changes in memory value caused by external radiation or electrical noise. Decreases in computing feature sizes and power usages and shorting the microcycle period enhance the influence of soft errors. Self-stabilizing systems are designed to be started in an arbitrary, possibly a corrupted, state due to, say, soft errors, and to converge to a desired behavior. Self-stabilization is defined by the state space of the components and is essentially a well-founded, clearly defined form of the terms self-healing, automatic-recovery, automatic-repair, and autonomic-computing. To implement a self-stabilizing system, one needs to ensure that the microprocessor that executes the program is self-stabilizing. A self-stabilizing microprocessor copes with any combination of soft errors, converging to perform fetch-decode-execute in fault-free periods. Still, it is important that the microprocessor will avoid convergence periods if possible by masking the effect of soft errors immediately. In this work, we present design schemes for a self-stabilizing microprocessor and a new technique for analyzing the effect of soft errors. Previous schemes for analyzing the effect of soft errors were based on simulations. In contrast, our scheme computes a lower bound on microprocessor reliability and enables the microprocessor designer to evaluate the reliability of the design and to identify reliability bottlenecks. When analyzing the resiliency of digital circuits to soft errors, we examine the logical masking, i.e., errors in internal nodes of the circuits that are masked later by the computation. We show that the problem of computing the reliability of a circuit such that logical masking is taken into account is an NP-hard problem.
机译:软错误是由外部辐射或电气噪声引起的存储值更改。计算功能大小和功耗的减少以及微周期的缩短会增加软错误的影响。自稳定系统被设计为由于软错误而在任意(可能是损坏的)状态下启动,并收敛到所需的行为。自稳定由组件的状态空间定义,并且本质上是术语“自愈”,“自动恢复”,“自动修复”和“自主计算”的充分明确定义的形式。为了实现自稳定系统,需要确保执行程序的微处理器是自稳定的。一个自我稳定的微处理器可以应对软错误的任意组合,从而可以在无故障时段内执行提取-解码-执行。仍然重要的是,如果可能的话,微处理器将通过立即掩盖软错误的影响来避免收敛期。在这项工作中,我们提出了一种自稳定微处理器的设计方案以及一种分析软错误影响的新技术。用于分析软错误影响的先前方案是基于仿真的。相反,我们的方案计算了微处理器可靠性的下限,并使微处理器设计者能够评估设计的可靠性并确定可靠性瓶颈。在分析数字电路对软错误的恢复能力时,我们检查了逻辑掩蔽,即电路内部节点中的错误,这些错误随后会被计算所掩盖。我们表明,计算电路可靠性从而考虑逻辑掩蔽的问题是一个NP难题。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号