Annual IEEE/ACM International Symposium on Microarchitecture
Low-cost soft error resilience with unified data verification and fine-grained recovery for acoustic sensor based detection

Abstract

This paper presents Turnstile, a hardware/software cooperative technique for low-cost soft error resilience. Leveraging recent advances in acoustic sensor based soft error detection, Turnstile achieves guaranteed recovery by taking the bounded detection latency into account. The compiler forms verifiable regions and selectively inserts store instructions that checkpoint each region's register inputs, so that Turnstile can verify the register and memory states at a region boundary in a unified way, without expensive register file protection. At runtime, Turnstile treats every store in a region (both ordinary memory stores and register-checkpointing stores) as unverified and holds it in the store queue until the region has ended and the error detection latency has elapsed. If no error is detected during that time, the verified stores are merged into the memory system and the registers are checkpointed. Once all stores prior to a region boundary, including checkpointing stores, are verified, the architectural and memory states at that boundary are verified, so the boundary can serve as a recovery point. In this way, Turnstile contains errors within the core without extra memory buffering. When an error is detected, Turnstile invalidates the unverified entries in the store queue and restores the checkpointed register values, returning the architectural and memory states to what they were at the most recently verified region boundary. Turnstile then redirects program control to that boundary and continues execution. Experimental results show that Turnstile offers guaranteed soft error recovery with low performance overhead (<8% on average).
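
The gating and recovery behavior described in the abstract can be illustrated with a small software model. This is only a sketch under stated assumptions, not the paper's hardware design: the names `GatedStoreQueue`, `close_region`, `tick`, `recover`, and `demo` are hypothetical, the latency of 3 cycles is an arbitrary placeholder, and register checkpoints are modeled as ordinary stores to addresses carrying a made-up `ckpt:` prefix. The model also assumes `tick` is called every cycle, so any region still pending when an error is reported is treated as unverified.

```python
from dataclasses import dataclass, field

# Hypothetical worst-case detection latency, in cycles (not a figure from the paper).
DETECTION_LATENCY = 3
CKPT = "ckpt:"  # hypothetical address prefix marking register-checkpoint stores


@dataclass
class GatedStoreQueue:
    memory: dict = field(default_factory=dict)   # verified state only
    pending: list = field(default_factory=list)  # (end_cycle, stores), oldest first

    def close_region(self, stores, cycle):
        """A region boundary was reached at `cycle`; quarantine the region's stores."""
        self.pending.append((cycle, dict(stores)))

    def tick(self, cycle):
        """Call once per cycle: merge regions whose detection window passed error-free."""
        while self.pending and cycle - self.pending[0][0] >= DETECTION_LATENCY:
            _, stores = self.pending.pop(0)
            self.memory.update(stores)

    def recover(self):
        """An error was detected: drop unverified stores, return checkpointed registers."""
        self.pending.clear()
        return {addr[len(CKPT):]: val
                for addr, val in self.memory.items() if addr.startswith(CKPT)}


def demo():
    sq = GatedStoreQueue()
    sq.close_region({"0x100": 1, CKPT + "r1": 10}, cycle=0)
    for c in range(1, 4):
        sq.tick(c)  # region 0 becomes verified once cycle 3 is reached
    sq.close_region({"0x100": 99, CKPT + "r1": 77}, cycle=5)
    sq.tick(6)
    regs = sq.recover()  # error reported at cycle 6, inside region 1's window
    return sq.memory, regs
```

In `demo`, the second region's stores never reach `memory` because the error is reported before its detection window closes, so both memory and the restored register value correspond to the first, verified region boundary. Treating checkpoints as stores is what lets a single queue cover register and memory state in this model.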