首页> 外文期刊>Journal of supercomputing >Efficient parallelization of multilevel fast multipole algorithm for electromagnetic simulation on many-core SW26010 processor
【24h】

Efficient parallelization of multilevel fast multipole algorithm for electromagnetic simulation on many-core SW26010 processor

机译:多级快速多极算法对多核SW26010处理器电磁仿真的高效并行化

获取原文
获取原文并翻译 | 示例

摘要

A many-core parallel approach of the multilevel fast multipole algorithm (MLFMA) based on the Athread parallel programming model is presented on the homegrown many-core SW26010 CPU of China. In the proposed many-core implementation of MLFMA, the data access efficiency is improved by using data structures based on the structure of array. The adaptive workload distribution strategies are adopted on different MLFMA tree levels to ensure full utilization of computing capability and the scratchpad memory. A double buffering scheme is specially designed to make communication overlapped computation. The resulting Athread-based many-core implementation of the MLFMA is capable of solving real-life problems with over one million unknowns with a remarkable speedup. The capability and efficiency of the proposed method are analyzed through the examples of computing scattering by spheres and a practical aerocraft. Numerical results show that with the proposed parallel scheme, the total speedup ratios from 6.4 to 8.0 can be achieved, compared with the CPU master core.
机译:基于ATHREAD并行编程模型的多级快速多极算法(MLFMA)的许多核心并行方法介绍了中国的本土多核SW26010 CPU。在提出的MLFMA的许多核心实施中,通过使用基于阵列结构的数据结构来提高数据访问效率。在不同的MLFMA树级采用自适应工作负载分发策略,以确保充分利用计算能力和临时存储器。双缓冲方案专门设计用于使通信重叠计算。由此产生的基于ATHREAD的MLFMA的许多核心实施能够解决具有超过100万未知数的现实生活问题,具有显着的加速。通过使用球体和实用的空气活动的散射的示例分析所提出的方法的能力和效率。数值结果表明,通过提出的并行方案,与CPU主核心相比,可以实现6.4至8.0的总加速比。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号