Conference: Supercomputing, 1998. SC98. IEEE/ACM Conference on

S-HARP: A Scalable Parallel Dynamic Partitioner for Adaptive Mesh-based Computations


Abstract

Computational science problems on adaptive meshes require dynamic load balancing when implemented on parallel machines, which in turn requires fast partitioning of the computational mesh at run time. We present in this report a scalable parallel dynamic partitioner called S-HARP. S-HARP combines the speed of inertial partitioning with the quality of spectral partitioning. It is a universal dynamic partitioner with three distinctive features: (a) fast partitioning from scratch with a global view, requiring no information from previous iterations; (b) no restriction to one partition per processor; and (c) no imbalance factor, because bisection is made precise by sorting. S-HARP exploits two types of parallelism: fine-grain loop-level parallelism and coarse-grain recursive parallelism. For portability, the parallel partitioner has been implemented with the Message Passing Interface (MPI) on the Cray T3E and IBM SP2. Experimental results indicate that S-HARP can partition a mesh of over 100,000 vertices into 256 partitions in 0.18 seconds on a 64-processor Cray T3E. S-HARP is much more scalable than other dynamic partitioners, giving over 17-fold speedup on 64 processors, whereas ParaMeTiS1.0 gives a few-fold speedup. On computational meshes of over 100,000 vertices, S-HARP is 3 to 15 times faster than the other dynamic partitioners while giving comparable edge cuts.
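The idea of recursive inertial bisection with an exact, sort-based split can be illustrated with a minimal serial sketch. This is not the S-HARP implementation: it assumes each vertex already has 2D coordinates, omits the spectral step and all parallelism, and assumes the number of partitions is a power of two. The function names `principal_axis`, `bisect`, and `partition` are hypothetical and chosen only for illustration.

```python
import math


def principal_axis(points):
    """Return a unit vector along the direction of greatest spread of 2D points."""
    n = len(points)
    cx = sum(p[0] for p in points) / n
    cy = sum(p[1] for p in points) / n
    # Entries of the 2x2 scatter matrix about the centroid.
    sxx = sum((p[0] - cx) ** 2 for p in points)
    syy = sum((p[1] - cy) ** 2 for p in points)
    sxy = sum((p[0] - cx) * (p[1] - cy) for p in points)
    # Orientation of the dominant eigenvector of a symmetric 2x2 matrix.
    theta = 0.5 * math.atan2(2.0 * sxy, sxx - syy)
    return math.cos(theta), math.sin(theta)


def bisect(ids, coords):
    """Split vertex ids into two halves along the principal axis.

    Sorting the projections and cutting at the middle index makes the
    two halves differ in size by at most one vertex.
    """
    ax, ay = principal_axis([coords[i] for i in ids])
    # Ties in the projection are broken by vertex id to keep the order deterministic.
    ordered = sorted(ids, key=lambda i: (coords[i][0] * ax + coords[i][1] * ay, i))
    half = len(ordered) // 2
    return ordered[:half], ordered[half:]


def partition(ids, coords, num_parts):
    """Recursively bisect until num_parts parts remain (assumed a power of two)."""
    if num_parts == 1:
        return [list(ids)]
    left, right = bisect(ids, coords)
    return (partition(left, coords, num_parts // 2)
            + partition(right, coords, num_parts // 2))
```

Because each split is taken at the middle of a sorted order rather than at a geometric threshold, no imbalance tolerance needs to be tuned, which is the property the abstract attributes to sorting. The two recursive calls are independent of each other, which is where coarse-grain recursive parallelism could apply in a parallel version.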
