...
首页> 外文期刊>Microelectronics reliability >Analyzing and increasing soft error resilience of Deep Neural Networks on ARM processors
【24h】

Analyzing and increasing soft error resilience of Deep Neural Networks on ARM processors

机译:扶手处理器深神经网络的软误差弹性分析和提高

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

Deep Neural Networks (DNNs) have been successfully deployed in safety-critical applications due to the capability of computing in complex tasks. Because of low energy, ARM (Advanced RISC Machine) processors are used for DNNs in embedded applications. However, in harsh environments, soft errors induced by radiation strikes may cause Silent Data Corruptions (SDCs) and Detected Unrecoverable Errors (DUEs). In this work, for DNNs, we evaluate the soft error resilience of the register file and analyze the impact of compiler optimizations. The results show that compiler optimization significantly degrades the reliability of DNNs. Furthermore, we track SDC propagation and record execution time for each layer. The results indicate that for most DNNs, convolutional layers are the most vulnerable because they are the most time-consuming parts. For instructions, we evaluate Program Vulnerability Factor (PVF) contributions of instructions and identify the vulnerable instructions that may cause critical SDCs. To mitigate critical SDCs, we propose two efficient approaches: 1) selective kernel hardening and 2) Symptom-based Duplication with Comparison (SDWC). The former reduces SDCs by an order of magnitude and incurs 33.56% time overhead. The second approach reduces critical SDCs to 0 and incurs less than 10% time overhead. For DUEs, we propose an idempotency-based recovery. Our approach mitigates more than 92.2% DUEs and incurs 3.43% latency overhead on average.
机译:由于复杂任务中的计算能力,在安全关键应用中已成功部署了深度神经网络(DNN)。由于能量低,ARM(高级RISC机)处理器用于嵌入式应用中的DNN。然而,在恶劣的环境中,由辐射撞击引起的软误差可能导致静默数据损坏(SDC)和检测到的不可恢复的错误(会费)。在这项工作中,对于DNN,我们评估寄存器文件的软错误弹性,并分析编译器优化的影响。结果表明,编译器优化显着降低了DNN的可靠性。此外,我们跟踪每个图层的SDC传播和记录执行时间。结果表明,对于大多数DNN,卷积层是最脆弱的,因为它们是最耗时的部件。有关说明,我们评估程序漏洞因素(PVF)指令的贡献,并确定可能导致关键SDC的易损指令。为了缓解关键的SDC,我们提出了两种有效的方法:1)选择性内核硬化和2)基于症状的复制(SDWC)。前者通过数量级减少了SDC,并突出了33.56%的开销。第二种方法将临界SDC减少为0,并且突出超过10%的开销。对于会费,我们提出了一种基于IDEM的恢复。我们的方法平均减轻了92.2%以上的会费,平均突出了3.43%的延迟开销。

著录项

  • 来源
    《Microelectronics reliability》 |2021年第9期|114331.1-114331.11|共11页
  • 作者单位

    Hohai Univ Informat Div Nanjing 210098 Jiangsu Peoples R China;

    Hohai Univ Informat Div Nanjing 210098 Jiangsu Peoples R China;

    Hohai Univ Informat Div Nanjing 210098 Jiangsu Peoples R China;

    China Inst Atom Energy Beijing 100000 Peoples R China;

    Hohai Univ Informat Div Nanjing 210098 Jiangsu Peoples R China|Beijing Microelect Technol Inst Beijing 100076 Peoples R China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Soft error; Error resilience; DNN; ARM;

    机译:软错误;错误弹性;DNN;臂;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号