首页> 外文会议>Annual Computing and Communication Workshop and Conference >Partitioning Mesh for Workload Balance According to the Capability of Each Computing Node
【24h】

Partitioning Mesh for Workload Balance According to the Capability of Each Computing Node

机译:根据每个计算节点的能力进行划分网格。

获取原文

摘要

Mesh-based application uses mesh to represent a physical system discretely and performs iterative computation over the mesh data to solve the equation for simulation. It generally runs in parallel on a supercomputer or a cluster with many computing nodes. During execution of the application, each node holds one partition of the mesh and performs iterative computation over the corresponding mesh data. In this paper, we propose a mesh partitioning algorithm that partitions the mesh according to the computing capabilities of the nodes in the target system while minimizing the amount of data exchanged between partitions. We also propose a scheme of using HDF5 (Hierarchical Data Format v5) file to store the partitioned mesh and define a storage format of the partitioned mesh in HDF5 file. Thus, we can partition the mesh and store the partitioned mesh in the HDF5 file in application development phase. When executing the application, each node reads in concurrently its corresponding partition of the mesh from the HDF5 file directly. The experimental result shows that when executing the application over the partitioned mesh produced by our algorithm, the workload on the target system among its nodes is more balanced and thus the execution time is reduced. Furthermore, our approach of partitioning mesh in advance can save much execution time in target system and thus can reduce cost of executing the application on the target cluster or supercomputer.
机译:基于网格的应用程序使用网格离散地表示物理系统,并通过网状数据执行迭代计算以解决模拟的方程。它通常在超级计算机或具有许多计算节点的群集中并行运行。在执行应用程序期间,每个节点包含网格的一个分区,并通过相应的网格数据执行迭代计算。在本文中,我们提出了一种网格划分算法,其根据目标系统中的节点的计算能力分区网格,同时最小化分区之间交换的数据量。我们还提出了一种使用HDF5(分层数据格式V5)文件来存储分区网格的方案,并在HDF5文件中定义分区网格的存储格式。因此,我们可以分区网格并将分区网格存储在应用程序开发阶段中的HDF5文件中。执行应用程序时,每个节点在直接从HDF5文件中同时读取其对应的网格分区。实验结果表明,当在我们的算法产生的分区网格上执行应用时,其节点之间的目标系统上的工作负载更加平衡,因此执行时间减少。此外,我们预先分区网格的方法可以节省目标系统中的大量执行时间,从而可以降低在目标群集或超级计算机上执行应用程序的成本。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号