首页> 外文会议>International Conference on Parallel and Distributed Computing >Progress Thread Placement for Overlapping MPI Non-blocking Collectives Using Simultaneous Multi-threading
【24h】

Progress Thread Placement for Overlapping MPI Non-blocking Collectives Using Simultaneous Multi-threading

机译:使用同步多线程重叠MPI非阻塞集集的进度线程放置

获取原文

摘要

Non-blocking collectives have been proposed so as to allow communications to be overlapped with computation in order to amortize the cost of MPI collective operations. To obtain a good overlap ratio, communications and computation have to run in parallel. To achieve this, different hardware and software techniques exists. Dedicated some cores to run progress threads is one of them. However, some CPUs provide Simultaneous Multi-Threading, which is the ability for a core to have multiple hardware threads running simultaneously, sharing the same arithmetic units. Our idea is to use them to run progress threads to avoid dedicated cores allocation. We have run benchmarks on Haswell processors, using its Hyper-Threading capability, and get good results for both performance and overlap only when inter-node communications are used by MPI processes. However, we also show that enabling Simultaneous Multi-Threading for intra-communications leads to bad performances due to cache effects.
机译:已经提出了非阻塞集集团,以便允许通过计算重叠的通信,以便摊销MPI集体操作的成本。为了获得良好的重叠比率,通信和计算必须并行运行。为实现这一目标,存在不同的硬件和软件技术。专用一些运行进度线程的核心是其中之一。但是,一些CPU提供了同步多线程,这是核心具有同时运行多个硬件线程的核心的能力,共享相同的算术单元。我们的想法是使用它们来运行进度线程以避免专用核心分配。我们使用其超线程能力运行基准测试,使用其超线程能力,并且只有在MPI进程使用间互通通信时,才能对性能进行良好的结果。但是,我们还表明,由于高速缓存效果,可以为帧内通信提供同时多线程对不良性能。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号