首页> 外文会议>Annual IEEE/IFIP International Conference on Dependable Systems and Networks >PARBOR: An Efficient System-Level Technique to Detect Data-Dependent Failures in DRAM
【24h】

PARBOR: An Efficient System-Level Technique to Detect Data-Dependent Failures in DRAM

机译:PARBOR:一种有效的系统级技术,可检测DRAM中与数据有关的故障

获取原文

摘要

System-level detection and mitigation of DRAM failures offer a variety of system enhancements, such as better reliability, scalability, energy, and performance. Unfortunately, system-level detection is challenging for DRAM failures that depend on the data content of neighboring cells (data-dependent failures). DRAM vendors internally scramble/remap the system-level address space. Therefore, testing data-dependent failures using neighboring system-level addresses does not actually test the cells that are physically adjacent. In this work, we argue that one promising way to uncover data-dependent failures in the system is to determine the location of physically neighboring cells in the system address space. Unfortunately, if done naively, such a test takes 49 days to detect neighboring addresses even in a single memory row, making it infeasible in real systems. We develop PARBOR, an efficient system-level technique that determines the locations of the physically neighboring DRAM cells in the system address space and uses this information to detect data-dependent failures. To our knowledge, this is the first work that solves the challenge of detecting data-dependent failures in DRAM in the presence of DRAM-internal scrambling of system-level addresses. We experimentally demonstrate the effectiveness of PARBOR using 144 real DRAM chips from three major vendors. Our experimental evaluation shows that PARBOR 1) detects neighboring cell locations with only 66-90 tests, a 745,654X reduction compared to the naive test, and 2) uncovers 21.9% more failures compared to a random-pattern test that is unaware of the neighbor cell locations. We introduce a new mechanism that utilizes PARBOR to reduce refresh rate based on the data content of memory locations, thereby improving system performance and efficiency. We hope that our fast and efficient system-level detection technique enables other new ideas and mechanisms that improve the reliability, performance, and energy efficiency of DRAM-based memory systems.
机译:系统级的检测和减轻DRAM故障可提供各种系统增强功能,例如更好的可靠性,可伸缩性,能源和性能。不幸的是,对于依赖于相邻单元的数据内容的DRAM故障(依赖数据的故障),系统级检测具有挑战性。 DRAM供应商在内部对系统级地址空间进行加扰/重新映射。因此,使用相邻的系统级地址测试与数据相关的故障实际上并不会测试物理上相邻的单元。在这项工作中,我们认为揭示系统中与数据相关的故障的一种有前途的方法是确定系统地址空间中物理上相邻单元的位置。不幸的是,如果天真地执行此测试,即使在单个内存行中,也要花费49天的时间才能检测到相邻地址,这使其在实际系统中不可行。我们开发了PARBOR,这是一种有效的系统级技术,可确定系统地址空间中物理上相邻的DRAM单元的位置,并使用此信息来检测与数据相关的故障。据我们所知,这是解决在系统级地址的DRAM内部加扰的情况下检测DRAM中数据相关故障的挑战的第一项工作。我们通过实验证明了使用来自三个主要供应商的144个真实DRAM芯片进行PARBOR的有效性。我们的实验评估表明,PARBOR 1)仅通过66-90次测试即可检测到相邻的单元位置,与朴素的测试相比,减少了745,654X,并且2)与不知道相邻模式的随机模式测试相比,发现了21.9%的失败单元位置。我们引入了一种新机制,该机制利用PARBOR来基于内存位置的数据内容降低刷新率,从而提高系统性能和效率。我们希望我们快速高效的系统级检测技术能够带来其他新的思想和机制,从而提高基于DRAM的存储系统的可靠性,性能和能效。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号