【24h】

Pattern-Aware Staging for Hybrid Memory Systems

机译:混合内存系统的模式感知过渡

获取原文

摘要

The ever increasing demand for higher memory performance and-at the same time-larger memory capacity is leading the industry towards hybrid main memory designs, i.e., memory systems that consist of multiple different memory technologies. This trend, however, naturally leads to one important question: how can we efficiently utilize such hybrid memories? Our paper proposes a software-based approach to solve this challenge by deploying a pattern-aware staging technique. Our work is based on the following observations: (a) the high-bandwidth fast memory outperforms the large memory for memory intensive tasks; (b) but those tasks can run for much longer than a bulk data copy to/from the fast memory, especially when the access pattern is more irregular/sparse. We exploit these observations by applying the following staging technique if the accesses are irregular and sparse: (1) copying a chunk (few GB of sequential data) from large to fast memory; (2) performing a memory intensive task on the chunk; and (3) writing it back to the large memory. To check the regularity/sparseness of the accesses at runtime with negligible performance impact, we develop a lightweight pattern detection mechanism using a helper threading inspired approach with two different Bloom filters. Our case study using various scientific codes on a real system shows that our approach achieves significant speed-ups compared to executions with using only the large memory or hardware caching: 3× or 41% speedups in the best, respectively.
机译:对更高的存储器性能以及同时更大的存储器容量的不断增长的需求正引导工业朝着混合主存储器设计,即由多种不同的存储器技术组成的存储器系统。但是,这种趋势自然会引出一个重要的问题:我们如何才能有效地利用这种混合存储器?我们的论文提出了一种基于软件的方法,通过部署模式感知的登台技术来解决这一难题。我们的工作基于以下观察:(a)对于需要大量内存的任务,高带宽快速内存的性能优于大型内存; (b),但这些任务的运行时间比向/从快速存储器的大容量数据复制要长得多,尤其是在访问模式更不规则/稀疏的情况下。如果访问不规则且稀疏,我们将通过以下分段技术来利用这些观察结果:(1)从大内存复制块(几GB的连续数据)到大容量内存; (2)在块上执行内存密集型任务; (3)将其写回到大内存中。为了在运行时检查访问的规律性/稀疏性,而对性能的影响可以忽略不计,我们使用启发式方法和两个不同的Bloom过滤器,开发了一种轻量级的模式检测机制。我们在真实系统上使用各种科学代码的案例研究表明,与仅使用大容量内存或硬件缓存的执行相比,我们的方法可实现显着的速度提升:最佳速度分别提高3倍或41%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号