首页> 外文会议>2011 9th International Conference on High Performance Computing Simulation >CPU-aware, process-level redundancy to tolerate faults in multi-core
【24h】

CPU-aware, process-level redundancy to tolerate faults in multi-core

机译:支持CPU的进程级冗余以容忍多核中的故障

获取原文

摘要

This paper proposes: 1) A dynamically scheduled Process-Level Redundancy (PLR) for enhancing reliability of multi-core systems, 2) A comparison between PLR and Thread-Level Redundancy (TLR), and 3) A fault study on the thread selector unit of a modern processor. The proposed technique employs underutilized CPU resources to improve fault tolerance ability of a system. The evaluation on PLR reliability proves that it performs better than Thread-Level Redundancy (TLR) when the reliability of sub modules in a system is higher than almost 0.8. In this technique, a set of redundant processes are created per application process. The number of replicas is then modified dynamically to achieve better performance. The experimental results on some standard benchmarks show that on average, the CPU is utilized less than 20% during the execution time of applications which can be used to provide 100% fault detection and recovery with almost 10% performance overhead using the proposed technique. Also, the fault study proves that among 7000 faults injected into the thread selector module using OpenSPARC simulator, 83.5% of faults are benign faults, and 16.5% of faults lead to system failure which affect either hardware (13.7%), or program outputs (2.8%). These faults can be all detected using this technique.
机译:本文提出:1)动态调度的进程级冗余(PLR)以增强多核系统的可靠性; 2)PLR和线程级冗余(TLR)之间的比较;以及3)关于线程选择器的故障研究现代处理器的单位。所提出的技术利用未充分利用的CPU资源来提高系统的容错能力。对PLR可靠性的评估证明,当系统中子模块的可靠性几乎高于0.8时,其性能优于线程级冗余(TLR)。在此技术中,每个应用程序进程都会创建一组冗余进程。然后可以动态修改副本的数量,以获得更好的性能。在某些标准基准上的实验结果表明,在应用程序执行期间,平均CPU利用率不到20%,使用所提出的技术可用于提供100%的故障检测和恢复以及近10%的性能开销。此外,故障研究证明,在使用OpenSPARC模拟器注入到线程选择器模块的7000个故障中,有83.5%的故障是良性故障,而16.5%的故障导致系统故障,这会影响硬件(13.7%)或程序输出( 2.8%)。使用此技术可以检测所有这些故障。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号