首页> 外文期刊>Parallel Computing >Characterizing the performance benefit of hybrid memory system for HPC applications
【24h】

Characterizing the performance benefit of hybrid memory system for HPC applications

机译:表征针对HPC应用的混合存储系统的性能优势

获取原文
获取原文并翻译 | 示例

摘要

Heterogenous memory systems that consist of multiple memory technologies are becoming common in high-performance computing environments. Modern processors and accelerators, such as the Intel Knights Landing (KNL) CPU and NVIDIA Volta GPU, feature small size high-bandwidth memory near the compute cores and large-size normal-bandwidth memory that is connected off-chip. Theoretically, HBM can provide about four times higher bandwidth than conventional DRAM. However, many factors impact the actual performance improvement that an application can achieve on such system. In this paper, we focus on the Intel KNL system and identify the most important factors on the application performance, including the application memory access pattern, the problem size, the threading level and the actual memory configuration. We use a set of representative applications from both scientific and data-analytics domains. Our results show that applications with regular memory access benefit from MCDRAM, achieving up to three times performance when compared to the performance obtained using only DRAM. On the contrary, applications with irregular memory access pattern are latency-bound and may suffer from performance degradation when using only MCDRAM. Also, we provide memory-centric analysis of four applications, identify their major data objects, correlate their characteristics to the performance improvement on the testbed. (C) 2018 Published by Elsevier B.V.
机译:由多种存储技术组成的异构存储系统在高性能计算环境中正变得越来越普遍。诸如Intel Knights Landing(KNL)CPU和NVIDIA Volta GPU之类的现代处理器和加速器在计算内核附近具有小尺寸高带宽内存,并在芯片外连接了大尺寸正常带宽内存。从理论上讲,HBM可以提供比传统DRAM高大约四倍的带宽。但是,许多因素影响应用程序在此类系统上可以实现的实际性能改进。在本文中,我们重点研究英特尔KNL系统,并确定影响应用程序性能的最重要因素,包括应用程序内存访问模式,问题大小,线程级别和实际内存配置。我们使用了一组来自科学和数据分析领域的代表性应用程序。我们的结果表明,具有常规内存访问权限的应用程序受益于MCDRAM,与仅使用DRAM获得的性能相比,其性能提高了三倍。相反,具有不规则内存访问模式的应用程序受延迟限制,仅使用MCDRAM时,性能可能会下降。此外,我们还提供了以内存为中心的四个应用程序分析,确定了它们的主要数据对象,并将它们的特性与测试平台上的性能改进相关联。 (C)2018由Elsevier B.V.发布

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号