首页> 外文会议>IEEE International Symposium on On-Line Testing and Robust System Design >DRAM Characterization under Relaxed Refresh Period Considering System Level Effects within a Commodity Server
【24h】

DRAM Characterization under Relaxed Refresh Period Considering System Level Effects within a Commodity Server

机译:考虑商品服务器内系统级影响的刷新周期下的DRAM特性

获取原文

摘要

Today’s rapid generation of data and the increased need for higher memory capacity has triggered a lot of studies on aggressive scaling of refresh period, which is currently set according to rare worst case conditions. Such studies analysed in detail the data-dependent circuit level factors and indicated the need for online DRAM characterization due to the variable cell retention time. They have done so by executing few test data patterns on FPGAs under controlled temperatures by using thermal testbeds, which however cannot be available in the field. Moreover, the existing studies were not able to reveal any system level effects, which may be excited under the execution of workloads on real systems and directly or indirectly affect DRAM reliability. In this paper, we develop an experimental framework based on a state-of-the-art 64-bit ARM based server with Linux OS, in which we enabled the DRAM characterization under relaxed refresh period by executing conventional test data patterns as well as popular HPC and Cloud workloads. Our results indicate that common test patterns are ineffective in identifying error-prone locations at low DRAM temperatures. Furthermore, we reveal that there is a strong correlation between the SOC utilization and DRAM reliability. By exploiting such findings, we developed a benchmark, which can indirectly stress the DRAM temperature and thus used for characterization in the field without needing any complicated thermal equipment. Our study shows that the refresh period can be relaxed by 35 times on such a commodity system with all errors being corrected by the available error correcting codes, resulting in 11.5% power savings on average.
机译:如今,数据的快速生成以及对更高存储容量的需求不断增长,引发了许多有关积极调整刷新周期的研究,而刷新周期是根据罕见的最坏情况而设置的。此类研究详细分析了与数据相关的电路级因素,并指出由于可变的单元保留时间,需要在线进行DRAM表征。他们这样做是通过使用热测试台在受控温度下在FPGA上执行少量测试数据模式来实现的,但是在现场却无法获得。而且,现有的研究无法揭示任何系统级的影响,这种影响可能会在实际系统上执行工作负载时被激发,并直接或间接影响DRAM的可靠性。在本文中,我们基于具有Linux OS的最新64位基于ARM的服务器开发了一个实验框架,该框架通过执行常规测试数据模式以及流行的测试模式,在轻松的刷新周期内实现了DRAM表征。 HPC和云工作负载。我们的结果表明,在低DRAM温度下,常见的测试模式无法有效识别容易出错的位置。此外,我们发现SOC利用率和DRAM可靠性之间存在很强的相关性。通过利用这些发现,我们开发了一个基准,该基准可以间接施加DRAM温度,从而无需任何复杂的热设备即可用于现场表征。我们的研究表明,在这种商品系统上,刷新周期可以减少35倍,并且所有错误都可以通过可用的纠错码进行纠正,从而平均节省了11.5%的功耗。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号