首页> 外文会议>Parallel Processing Workshops, 2009. ICPPW '09 >Characterizing the Performance of “Big Memory” on Blue Gene Linux
【24h】

Characterizing the Performance of “Big Memory” on Blue Gene Linux

机译:表征Blue Gene Linux上“大内存”的性能

获取原文

摘要

Efficient use of Linux for high-performance applications on Blue Gene/P (BG/P) compute nodes is challenging because of severe performance hits resulting from translation lookaside buffer (TLB) misses and a hard-to-program torus network DMA controller. To address these difficulties, we present the design and implementation of ȁC;Big MemoryȁD;ȁ4; an alternative, transparent memory space for computational processes. Big Memory uses extremely large memory pages available on PowerPC CPUs to create a TLB-miss-free, flat memory area that can be used for application code and data and is easier to use for DMA operations. One of our singlenode memory benchmarks shows that the performance gap between regular PowerPC Linux with 4KB pages and IBM BG/P compute node kernel (CNK) is about 68% in the worst case. Big Memory narrows the worst case performance gap to just 0.04%. We verify this result on 1024 nodes of Blue Gene/P using the NAS Parallel Benchmarks and find the performance under Linux with Big Memory to fluctuate within 0.7% of CNK. Originally intended exclusively for compute node tasks, our new memory subsystem turns out to dramatically improve the performance of certain I/O node applications as well. We demonstrate this performance using the central processor of the LOw Frequency ARray (LOFAR) radio telescope as an example.
机译:有效地将Linux用于Blue Gene / P(BG / P)计算节点上的高性能应用程序具有挑战性,因为翻译后备缓冲区(TLB)未命中和难以编程的环形网络DMA控制器会严重打击性能。为了解决这些困难,我们介绍了ȁC; BigMemoryȁD;ȁ4;的设计和实现。用于计算过程的替代透明内存空间。大内存使用PowerPC CPU上可用的非常大的内存页面来创建无TLB缺失的平面内存区域,该区域可用于应用程序代码和数据,并且更易于用于DMA操作。我们的单节点内存基准测试之一表明,在最坏的情况下,具有4KB页面的常规PowerPC Linux和IBM BG / P计算节点内核(CNK)之间的性能差距约为68%。大内存将最坏情况下的性能差距缩小到0.04%。我们使用NAS并行基准测试在Blue Gene / P的1024个节点上验证了此结果,并发现在具有大内存的Linux下,性能波动在CNK的0.7%以内。我们的新内存子系统最初专门用于计算节点任务,事实证明也可以显着提高某些I / O节点应用程序的性能。我们以低频率ARray(LOFAR)射电望远镜的中央处理器为例来演示这种性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号