Conference: International Conference for High Performance Computing, Networking, Storage and Analysis

A Unified Programming Model for Intra- and Inter-Node Offloading on Xeon Phi Clusters


Abstract

Standard offload programming models for the Xeon Phi, e.g., Intel LEO and OpenMP 4.0, are restricted to a single compute node and hence to a limited number of coprocessors. Scaling applications across a Xeon Phi cluster or supercomputer thus requires hybrid programming approaches, usually MPI+X. In this work, we present a framework based on heterogeneous active messages (HAM-Offload) that provides the means to offload work to local and remote (co)processors using a unified offload API. Since HAM-Offload provides primitives similar to those of current local offload frameworks, existing applications can be easily ported to overcome the single-node limitation while keeping the convenient offload programming model. We demonstrate the effectiveness of the framework by using it to enable a real-world molecular dynamics application to use multiple local and remote Xeon Phis. The evaluation shows good scaling behavior. Compared with LEO, performance is equal for large offloads and significantly better for small offloads.
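
The idea of a unified offload call can be illustrated with a toy model. The sketch below is not the HAM-Offload API and is not taken from the paper: `Node`, `offload`, and `partial_energy` are hypothetical names, and worker threads stand in for local and remote (co)processors. It only shows the shape of the programming model, where an active message carries a function and its arguments, and the caller issues the same call whether the target is local or remote.

```python
from concurrent.futures import ThreadPoolExecutor


class Node:
    """Hypothetical offload target; a worker thread stands in for a (co)processor."""

    def __init__(self, node_id, remote):
        self.node_id = node_id
        self.remote = remote  # informational only: the caller never branches on it
        self._pool = ThreadPoolExecutor(max_workers=1)

    def handle(self, message):
        # An active message names the function to run and carries its arguments,
        # so the receiving side only has to execute it.
        func, args = message
        return self._pool.submit(func, *args)

    def shutdown(self):
        self._pool.shutdown()


def offload(node, func, *args):
    """Send func(*args) to `node` and return a future for the result."""
    return node.handle((func, args))


def partial_energy(positions):
    # Placeholder kernel (an assumption), not the paper's molecular dynamics code.
    return sum(x * x for x in positions)


# One "local" and one "remote" target, addressed through the same call.
nodes = [Node(0, remote=False), Node(1, remote=True)]
chunks = [[1.0, 2.0], [3.0, 4.0]]

futures = [offload(node, partial_energy, chunk) for node, chunk in zip(nodes, chunks)]
total = sum(f.result() for f in futures)

for node in nodes:
    node.shutdown()
```

In a real framework the transport behind `handle` would differ between local and remote targets; the point of the sketch is that this difference stays hidden from the calling code.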
