首页> 外文期刊>Journal of circuits, systems and computers >IMPLEMENTING LOW-COST FAULT TOLERANCE VIA HYBRID SYNCHRONOUS/ASYNCHRONOUS CHECKS
【24h】

IMPLEMENTING LOW-COST FAULT TOLERANCE VIA HYBRID SYNCHRONOUS/ASYNCHRONOUS CHECKS

机译:通过混合同步/异步检查实现低成本的故障容限

获取原文
获取原文并翻译 | 示例
           

摘要

As semiconductor technologies scale down to deep sub-micron dimensions, transient faults will soon become a critical reliability concern. Due to their prohibitive costs, traditional high-end solutions are unacceptable for the mainstream commodity market. This paper presents Ftpipe, a hybrid software/hardware solution, which provides sufficient fault coverage with affordable overhead for single-threaded programs running on commodity systems. Leveraging existing exception mechanisms with minor modifications to handle exception-causing faults, Ftpipe focuses on tolerating silent data corruptions by using compile-time analysis and performing selective instruction replication in a modern superscalar pipeline extended with minimal hardware overhead. Unlike existing instruction replication-based solutions, which detect faults by synchronous checks, the Ftpipe platform has exploited a novel hybrid synchronous/asynchronous check method for the replicated instructions. In this manner, better performance can be obtained without degradation of fault coverage. By synchronous checks, the validation of the result of a replicated instruction must be finished before it is committed, whereas such a guarantee is not required by an asynchronous check. Evaluation using a set of nine programs from the Mibench benchmark suite demonstrates that Ftpipe can tolerate 89.8% of transient faults under a modest performance overhead of 20.1%.
机译:随着半导体技术缩小到深亚微米尺寸,瞬态故障将很快成为关键的可靠性问题。由于成本高昂,传统的高端解决方案对于主流商品市场而言是无法接受的。本文介绍了Ftpipe,它是一种混合的软件/硬件解决方案,它为商品系统上运行的单线程程序提供了足够的故障覆盖范围和负担得起的开销。利用现有的异常机制进行少量修改以处理引起异常的错误,Ftpipe专注于通过使用编译时分析并在以最少的硬件开销扩展的现代超标量管道中执行选择性指令复制来容忍静默数据损坏。与现有的基于指令复制的解决方案可以通过同步检查来检测故障不同,Ftpipe平台已为复制的指令开发了一种新颖的混合同步/异步检查方法。以此方式,可以获得更好的性能而不会降低故障覆盖率。通过同步检查,必须在提交复制指令之前完成对复制指令结果的验证,而异步检查不需要这种保证。使用Mibench基准套件中的九个程序进行评估,结果表明Ftpipe在20.1%的适度性能开销下可以承受89.8%的瞬时故障。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号