首页> 外文期刊>Computer Architecture Letters >Architectural Reliability: Lifetime Reliability Characterization and Management ofMany-Core Processors
【24h】

Architectural Reliability: Lifetime Reliability Characterization and Management ofMany-Core Processors

机译:体系结构可靠性:多核处理器的终生可靠性表征和管理

获取原文
获取原文并翻译 | 示例
       

摘要

This paper presents a lifetime reliability characterization of many-core processors based on a full-system simulation of integrated microarchitecture, power, thermal, and reliability models. Under normal operating conditions, our model and analysis reveal that the mean-time-to-failure of cores on the die show normal distribution. From the processor-level perspective, the key insight is that reducing the variance of the distribution can improve lifetime reliability by avoiding early failures. Based on this understanding, we present two variance reduction techniques for proactive reliability management; i) proportional dynamic voltage-frequency scaling (DVFS) and ii) coordinated thread swapping. A major advantage of using variance reduction techniques is that the improvement of system lifetime reliability can be achieved without adding design margins or spare components.
机译:本文基于集成微体系结构,功率,热和可靠性模型的全系统仿真,提出了多核处理器的终生可靠性表征。在正常工作条件下,我们的模型和分析表明,芯片上芯的平均失效时间表现出正态分布。从处理器级别的角度来看,关键的见解是减少分布的差异可以通过避免早期故障来提高使用寿命。基于这种理解,我们提出了两种用于主动可靠性管理的方差减少技术; i)比例动态电压频率缩放(DVFS)和ii)协调线程交换。使用方差减少技术的主要优点是无需增加设计裕量或备用组件即可实现系统使用寿命可靠性的提高。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号