首页> 外文期刊>Concurrency and Computation >OpenMP-oriented applications for distributed shared memory architectures
【24h】

OpenMP-oriented applications for distributed shared memory architectures

机译:面向面向OpenMP的应用程序,用于分布式共享内存体系结构

获取原文
获取原文并翻译 | 示例

摘要

The rapid rise of OpenMP as the preferred parallel programming paradigm for small-to-medium scale parallelism could slow unless OpenMP can show capabilities for becoming the model-of-choice for large scale high-performance parallel computing in the coming decade. The main stumbling block for the adaptation of OpenMP to distributed shared memory (DSM) machines, which are based on architectures like cc-NUMA, stems from the lack of capabilities for data placement among processors and threads for achieving data locality. The absence of such a mechanism causes remote memory accesses and inefficient cache memory use, both of which lead to poor performance. This paper presents a simple software programming approach called copy-inside-copy-back (CC) that exploits the data privatization mechanism of OpenMP for data placement and replacement. This technique enables one to distribute data manually without taking away control and flexibility from the programmer and is thus an alternative to the automat and implicit approaches. Moreover, the CC approach improves on the OpenMP-SPMD style of programming that makes the development process of an OpenMP application more structured and simpler. The CC technique was tested and analyzed using the NAS Parallel Benchmarks on SGI Origin 2000 multiprocessor machines. This study shows that OpenMP improves performance of coarse-grained parallelism, although a fast copy mechanism is essential.
机译:除非OpenMP能够在未来十年内展现出成为大型高性能并行计算的首选模型的能力,否则OpenMP作为中小型并行性的首选并行编程范例的迅速崛起可能会放缓。基于cc-NUMA之类的体系结构,OpenMP适应分布式共享内存(DSM)机器的主要绊脚石是由于缺乏用于实现数据局部性的处理器和线程之间的数据放置功能。缺少这种机制会导致远程内存访问和高速缓存内存使用效率低下,这两者都会导致性能下降。本文提出了一种简单的软件编程方法,称为复制内复制回(CC),该方法利用OpenMP的数据私有化机制进行数据放置和替换。这种技术使人们能够手动分配数据,而不会失去程序员的控制权和灵活性,因此是自动和隐式方法的替代方法。此外,CC方法改进了OpenMP-SPMD编程风格,使OpenMP应用程序的开发过程更加结构化和简单。使用SGI Origin 2000多处理器计算机上的NAS并行基准测试和分析了CC技术。这项研究表明,OpenMP可以提高粗粒度并行性的性能,尽管快速复制机制至关重要。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号