A Framework for Correction of Multi-Bit Soft Errors in L2 Caches Based on Redundancy

Bhattacharya K.; Ranganathan N.; Kim S.

首页> 外文期刊>IEEE transactions on very large scale integration (VLSI) systems >A Framework for Correction of Multi-Bit Soft Errors in L2 Caches Based on Redundancy

【24h】

A Framework for Correction of Multi-Bit Soft Errors in L2 Caches Based on Redundancy

机译：基于冗余的L2缓存中多位软错误校正框架

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

With the continuous decrease in the minimum feature size and increase in the chip density due to technology scaling, on-chip L2 caches are becoming increasingly susceptible to multi-bit soft errors. The increase in multi-bit errors could lead to higher risk of data corruption and potentially result in the crashing of application programs. Traditionally, the L2 caches have been protected from soft errors using techniques such as: 1) error detection/correction codes; 2) physical interleaving of cache bit lines to convert multi-bit errors into single-bit errors; and 3) cache scrubbing. While the first two methods incur large area overheads for multi-bit errors, identifying the time interval for scrubbing could be tricky. In this paper, we investigate in detail the multi-bit soft error rates in large L2 caches and propose a framework of solutions for their correction based on the amount of redundancy present in the memory hierarchy. We investigate several new techniques for reducing multi-bit errors in large L2 caches, in which, the multi-bit errors are detected using simple error detection codes and corrected using the data redundancy in the memory hierarchy. We also propose several techniques to control/mine the redundancy in the memory hierarchy to further improve the reliability of the L2 cache. The proposed techniques were implemented in the Simplescalar framework and validated using the SPEC 2000 integer and floating point benchmarks for L2 cache vulnerability, global cache miss-rate, average cycle count and main memory write back rate, considering the area and power overheads. Experimental results indicate that the vulnerability of L2 caches can be decreased by 40% on the average for integer benchmarks and 32% on the average for floating point benchmarks, with an average multi-bit error coverage of about 96%, with significantly less area and power overheads and with virtually no performance penalty. The proposed techniques are applicable to both single and-n-n multi-core processor-based systems.

机译：随着技术规模的扩大，最小特征尺寸的不断减小和芯片密度的增加，片上L2高速缓存越来越容易受到多位软错误的影响。多位错误的增加可能导致更高的数据损坏风险，并可能导致应用程序崩溃。传统上，使用以下技术来保护L2高速缓存不受软错误的影响：1）错误检测/纠正码； 2）高速缓存位线的物理交织，以将多位错误转换为单位错误；和3）缓存清理。尽管前两种方法会产生多位错误的大面积开销，但确定清理时间间隔可能很棘手。在本文中，我们将详细研究大型L2高速缓存中的多位软错误率，并根据内存层次结构中存在的冗余量提出一种用于校正它们的解决方案框架。我们研究了几种用于减少大型L2高速缓存中多位错误的新技术，其中，使用简单的错误检测代码检测多位错误，并使用内存层次结构中的数据冗余进行纠正。我们还提出了几种技术来控制/消除内存层次结构中的冗余，以进一步提高L2缓存的可靠性。拟议的技术在Simplescalar框架中实现，并使用SPEC 2000整数和浮点基准对L2缓存漏洞，全局缓存未命中率，平均周期数和主内存回写率进行了验证，并考虑了面积和功耗。实验结果表明，对于整数基准而言，L2缓存的漏洞平均可以降低40％，对于浮点基准而言，其平均漏洞可以降低32％，平均多位错误覆盖率约为96％，并且面积和占用空间明显更少。电源开销，几乎没有性能损失。所提出的技术适用于基于单核和n-n多核处理器的系统。

著录项

来源
《IEEE transactions on very large scale integration (VLSI) systems》 |2009年第2期|p.194-206|共13页
作者
Bhattacharya K.; Ranganathan N.; Kim S.;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类微电子学、集成电路（IC）;
关键词
Control/mine redundancy; error detection and correction; l2 caches; multi-bit errors; soft errors;

机译：控制/地雷冗余;错误检测和纠正;二级缓存;多位错误;软错误;

相似文献

外文文献
中文文献
专利

1. Multi-bit soft error tolerable L1 data cache based on characteristic of data value [J] . WANG Dang-hui, LIU He-peng, CHEN Yi-ran 中南大学学报（英文版） . 2015,第005期

机译：基于数据值特征的多位可容错的L1数据高速缓存
2. Soft Error Benchmarking of L2 Caches with PARMA [J] . Jinho Suh, Mehrtash Manoochehri, Murali Annavaram, Performance evaluation review . 2011,第1期

机译：使用PARMA对L2缓存进行软错误基准测试
3. Multi-bit upset aware hybrid error-correction for cache in embedded processors [J] . Dong Jiaqi, Qiu Keni, Zhang Weigong, Journal of Semiconductors . 2015,第11期

机译：嵌入式处理器中用于缓存的多位不安感知混合错误校正
4. Online Correction of Hard Errors and Soft Errors via One-Step Decodable OLS Codes for Emerging Last Level Caches [C] . Abhishek Das, Nur A. Touba Latin American Test Symposium . 2019

机译：通过一步可解码OLS代码在线纠正硬错误和软错误，以生成最新的末级高速缓存
5. A fully digital technique for the estimation and correction of the DAC error in multi-bit delta sigma ADCs. [D] . Wang, Xuesheng. 2004

机译：一种用于评估和校正多位delta sigma ADC中DAC误差的全数字技术。
6. Midbrain dopamine neurons compute inferred and cached value prediction errors in a common framework [O] . Brian F Sadacca, Joshua L Jones, Geoffrey Schoenbaum 2016

机译：中脑多巴胺神经元在一个通用框架中计算推断和缓存的值预测误差
7. An Error Correction Scheme through Time Redundancy for Enhancing Persistent Soft-Error Tolerance of CGRAs [O] . Takashi IMAGAWA, Masayuki HIROMOTO, Hiroyuki OCHI, 2015

机译：一种通过时间冗余的纠错方案，用于增强CGRas的持久软错误容差

A Framework for Correction of Multi-Bit Soft Errors in L2 Caches Based on Redundancy

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅