...
首页> 外文期刊>Parallel and Distributed Computing Practices >MPI AND OPENMP PARADIGMS ON CLUSTERS OF SMP ARCHITECTURES: THE VACANCY TRACKING ALGORITHM FOR MULTI-DIMENSIONAL ARRAY TRANSPOSITION
【24h】

MPI AND OPENMP PARADIGMS ON CLUSTERS OF SMP ARCHITECTURES: THE VACANCY TRACKING ALGORITHM FOR MULTI-DIMENSIONAL ARRAY TRANSPOSITION

机译:SMP体系结构集群上的MPI和OPENMP范例:多维数组转换的空缺跟踪算法

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

We evaluate remapping multi-dimensional arrays on cluster of SMP architectures under OpenMP, MPI, and hybrid paradigms. Traditional method of multi-dimensional array transpose needs an auxiliary array of the same size and a copy back stage. We recently developed an in-place method using vacancy tracking cycles, The vacancy tracking algorithm outperforms the traditional 2-array method as demonstrated by extensive comparisons, Performance of multi-threaded parallelism using OpenMP are first tested with different scheduling methods and different number of threads. Both methods are then parallelized using several parallel paradigms. At node level, pure OpenMP outperforms pure MPI by a factor of 2.76 for vacancy tracking method. Across entire cluster of SMP nodes, by carefully choosing thread numbers, the hybrid MPI/OpenMP implementation outperforms pure MPI by a factor of 3.79 for traditional method and 4.44 for vacancy tracking method, demonstrating the validity of the parallel paradigm of mixing MPI with OpenMP.
机译:我们在OpenMP,MPI和混合范例下评估SMP体系结构集群上的多维数组重映射。传统的多维数组转置方法需要具有相同大小的辅助数组和复制后台。我们最近开发了一种使用空位跟踪周期的就地方法,通过大量比较证明,空位跟踪算法的性能优于传统的2数组方法。首先使用不同的调度方法和不同的线程数测试使用OpenMP的多线程并行性能。然后,使用几种并行范例对这两种方法进行并行化。在节点级别,对于空缺跟踪方法,纯OpenMP优于纯MPI 2.76倍。在整个SMP节点群集中,通过仔细选择线程号,对于传统方法,MPI / OpenMP混合实现的性能要比纯MPI高3.79倍,对于空位跟踪方法,则要达到4.44倍,这证明了将MPI与OpenMP混合的并行范例的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号