首页> 外文期刊>IEEE Transactions on Parallel and Distributed Systems >HL-PCM: MLC PCM Main Memory with Accelerated Read
【24h】

HL-PCM: MLC PCM Main Memory with Accelerated Read

机译:HL-PCM:具有加速读取功能的MLC PCM主存储器

获取原文
获取原文并翻译 | 示例

摘要

Multi-Level Cell Phase Change Memory (MLC PCM) is a promising candidate technology for DRAM replacement in main memory of modern computers. Despite of its high density and low power advantages, this technology seriously suffers from slow read and write operations. While prior works extensively studied the problem of slow write, this paper targets high read latency problem in MLC PCM and introduces an architecture mechanism to overcome it. To this end, we rely on the fact that reading different bits from an MLC cell takes different latencies, i.e., for a 2-bit MLC, reading its Most-Significant Bit (MSB) is fast, while reading its Least-Significant Bits (LSBs) is slower. We then propose Half-Line PCM (HL-PCM), a novel memory architecture that leverages this non-uniformity in reading MLC PCM’s content to send a requested memory block to the processor in different cycles–it sends half of a memory block to the processor ahead of the other half. If the processor requested a word belonging to the first half, it can resume its execution on receiving the first half, while the other half has not sent yet and scheduled to be received by the memory controller later. HL-PCM is easy and simple to implement, i.e., it needs minor modifications at memory controller, the search/evict policies at last level cache, as well as data layout in main memory. Our experimental results show that the proposed design improves the average memory access latency by 33–43 percent and program’s execution time by 23 percent, on average, while incurring negligible overhead at memory controller and PCM DIMM, in a 16-core chip multiprocessor (CMP) running memory-intensive benchmarks.
机译:多层单元相变存储器(MLC PCM)是一种有前途的候选技术,可替代现代计算机主存储器中的DRAM。尽管具有高密度和低功耗的优点,但该技术严重地受到读写操作缓慢的困扰。在先前的工作广泛研究慢写问题的同时,本文针对MLC PCM中的高读取延迟问题,并介绍了一种克服它的体系结构机制。为此,我们依赖这样一个事实,即从MLC单元中读取不同的位会产生不同的延迟,即对于2位MLC,读取其最高有效位(MSB)很快,而读取其最低有效位( LSB)比较慢。然后,我们提出Half-Line PCM(HL-PCM),这是一种新颖的内存体系结构,利用这种不一致性来读取MLC PCM的内容,以便在不同的周期内将请求的内存块发送给处理器,它将一半的内存块发送给处理器。处理器领先于另一半。如果处理器请求了属于前半部分的单词,则它可以在接收到前半部分后恢复执行,而另一半尚未发送并计划稍后由存储控制器接收。 HL-PCM易于实现且易于实现,即它需要在存​​储控制器上进行少量修改,在最后一级缓存中执行搜索/逐出策略以及在主存储器中进行数据布局。我们的实验结果表明,在16核芯片多处理器(CMP)中,所提出的设计将平均内存访问延迟平均提高了33%至43%,将程序的执行时间平均缩短了23%,同时在内存控制器和PCM DIMM方面的开销可忽略不计)运行内存密集型基准测试。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号