首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >Profiling Directed NUMA Optimization on Linux Systems: A Case Study of the Gaussian Computational Chemistry Code
【24h】

Profiling Directed NUMA Optimization on Linux Systems: A Case Study of the Gaussian Computational Chemistry Code

机译:Proping Linux系统上的指示Numa优化:一种高斯计算化学代码的案例研究

获取原文
获取外文期刊封面目录资料

摘要

The parallel performance of applications running on Non-Uniform Memory Access (NUMA) platforms is strongly influenced by the relative placement of memory pages to the threads that access them. As a consequence there are Linux application programmer interfaces (APIs) to control this. For large parallel codes it can, however, be difficult to determine how and when to use these APIs. In this paper we introduce the NUMAgrind profiling tool which can be used to simplify this process. It extends the Valgrind binary translation framework to include a model which incorporates cache coherency, memory locality domains and interconnect traffic for arbitrary NUMA topologies. Using NUMAgrind, cache misses can be mapped to memory locality domains, page access modes determined, and pages that are referenced by multiple threads quickly determined. We show how the NUMAgrind tool can be used to guide the use of Linux memory and thread placement APIs in the Gaussian computational chemistry code. The performance of the code before and after use of these APIs is also presented for three different commodity NUMA platforms.
机译:在非统一存储器访问(NUMA)平台上运行的应用程序的并行性能受到存储器页面的相对放置到访问它们的线程的相对放置。因此,存在Linux应用程序程序员接口(API)来控制此。然而,对于大的并行代码,它可以难以确定如何以及何时使用这些API。在本文中,我们介绍了不可用于简化此过程的Numagrind分析工具。它扩展了Valgrind二进制转换框架,包括包含高速缓存一致性,存储位置域和任意NUMA拓扑的互连流量的模型。使用NUMAGRIND,缓存未命中可以映射到内存位置域,确定的页面访问模式,以及快速确定多个线程引用的页面。我们展示了Numagrind工具如何用于指导Linux内存和线程放置API在高斯计算化学代码中的使用。在使用这些API之前和之后的代码的性能也用于三种不同的商品NUMA平台。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号