...
首页> 外文期刊>ACM Journal on Emerging Technologies in Computing Systems >Exploiting Idle Hardware to Provide Low Overhead Fault Tolerance for VLIW Processors
【24h】

Exploiting Idle Hardware to Provide Low Overhead Fault Tolerance for VLIW Processors

机译:利用空闲硬件为VLIW处理器提供低开销容错

获取原文
获取原文并翻译 | 示例
           

摘要

Because of technology scaling, the soft error rate has been increasing in digital circuits, which affects system reliability. Therefore, modern processors, including VLIW architectures, must have means to mitigate such effects to guarantee reliable computing. In this scenario, our work proposes three low overhead fault tolerance approaches based on instruction duplication with zero latency detection, which uses a rollback mechanism to correct soft errors in the pipelanes of a configurable VLIW processor. The first uses idle issue slots within a period of time to execute extra instructions considering distinct application phases. The second works at a finer grain, adaptively exploiting idle functional units at run-time. However, some applications present high instruction-level parallelism (ILP), so the ability to provide fault tolerance is reduced: less functional units will be idle, decreasing the number of potential duplicated instructions. The third approach attacks this issue by dynamically reducing ILP according to a configurable threshold, increasing fault tolerance at the cost of performance. While the first two approaches achieve significant fault coverage with minimal area and power overhead for applications with low ILP, the latter improves fault tolerance with low performance degradation. All approaches are evaluated considering area, performance, power dissipation, and error coverage.
机译:由于技术缩放,柔软的错误率在数字电路中一直在增加,这影响了系统可靠性。因此,包括VLIW架构在内的现代处理器必须具有减轻此类效果的方法,以保证可靠的计算。在这种情况下,我们的工作提出了三种基于具有零延迟检测的指令复制的低开销容错方法,它使用回滚机制来校正可配置VLIW处理器的管道中的软错误。第一个在一段时间内使用空闲问题插槽来执行考虑不同的应用程序阶段的额外指令。第二种在更精细的谷物中的工作,在运行时自适应地利用空闲功能单元。然而,一些应用程序存在高指令级并行性(ILP),因此减少了提供容错能力的能力:较少的功能单元将是空闲的,减少潜在的重复指令的数量。第三种方法通过根据可配置阈值动态减少ILP来攻击该问题,提高性能成本的容错性。虽然前两种方法具有显着的故障覆盖,但对于具有低ILP的应用,虽然具有最小面积和电源开销,但后者提高了具有低性能下降的容错性。所有方法都考虑了区域,性能,功耗和错误覆盖。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号