Journal of Supercomputing

On a high-order compact scheme and its utilization in parallel solution of a time-dependent system on a distributed memory processor


Abstract

The focus of this study is the design of a parallel solution method that utilizes a fourth-order compact scheme. The applicability of the method is demonstrated on a time-dependent parabolic system with Neumann boundaries. The core of the parallel computing facilities used in the study is a 2-head-node, 224-compute-node Apple Xserve G5 multiprocessor. The system is first discretized in both time and space such that it remains in its stability regimes, before being solved with the method. The solution requires time marching in which every time step, h, calls for a single parallel solve of the intermediary subsystems generated. The solution uses p processors, with p ranging from 3 to 63. The speedups, S_p, approach their limiting value of p only when p is small. The solution produces good computational results at large p, but poor results as p becomes progressively small. Also, the parallel solution produces accurate results yielding good speedups and efficiencies only when p is within some reasonable range of values. The intermediary systems generated by this method are linear and fine-grained; therefore, they are best suited for solution on massively-parallel processors. The solution method proposed in this study is, therefore, expected to yield more impressive results if applied in a massively-parallel computing environment.
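The abstract reports speedups S_p and efficiencies without stating how they are computed. A minimal sketch follows, assuming the conventional definitions S_p = T_1 / T_p and E_p = S_p / p, where T_1 is the single-processor run time and T_p the run time on p processors. The function names and the timings in the usage note are hypothetical and are not taken from the study.

```python
def speedup(t_serial: float, t_parallel: float) -> float:
    """Return S_p = T_1 / T_p (conventional definition, assumed here)."""
    if t_serial <= 0 or t_parallel <= 0:
        raise ValueError("run times must be positive")
    return t_serial / t_parallel


def efficiency(t_serial: float, t_parallel: float, p: int) -> float:
    """Return E_p = S_p / p, the fraction of the ideal speedup p achieved."""
    if p < 1:
        raise ValueError("p must be at least 1")
    return speedup(t_serial, t_parallel) / p


if __name__ == "__main__":
    # Hypothetical timings in seconds, for illustration only;
    # these are NOT measurements from the study.
    t1 = 12.0
    timings = {3: 4.0, 6: 3.0}
    for p, tp in timings.items():
        print(p, speedup(t1, tp), efficiency(t1, tp, p))
```

Under these definitions, S_p equal to p (efficiency 1) is the limiting value the abstract refers to. An efficiency well below 1 indicates that communication or idle time dominates at that processor count.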
