International Conference on High Performance Computing for Computational Science

Performance Optimization of the 3D FDM Simulation of Seismic Wave Propagation on the Intel Xeon Phi Coprocessor Using the ppOpen-APPL/FDM Library


Abstract

We evaluate the performance of a parallel 3D finite-difference method (FDM) simulation of seismic wave propagation on the Intel Xeon Phi coprocessor. Because the byte/flop ratio of future machines is forecast to keep decreasing, we optimized the program to lower its required byte/flop ratio by fusing the original major kernel and omitting the storing and loading of intermediate variables. We confirm that 1) MPI/OpenMP hybrid parallel computing with hyper-threading is more efficient than pure MPI parallel computing, and 2) the FDM simulation in which the triple DO loops are split runs 1.3 times faster than the modified code with unsplit triple DO loops, whereas fusing the loops into a double DO loop yields no speedup. We consider loop distribution to be an effective optimization for prefetching and for the thread parallelization of each loop, because each loop uses and reuses data held in cache.
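
The three code shapes the abstract compares can be illustrated with a toy example. The following is a minimal sketch in Python, not the ppOpen-APPL/FDM kernels: it assumes a simple second-order central difference, and every name in it (`make_grid`, `update_unfused`, `update_fused`, `update_split`, `run`, `vx`, `vy`, `sxx`, `syy`) is hypothetical. It only shows that the unfused, fused, and loop-distributed forms compute the same values; it does not reproduce any timing result.

```python
# Illustrative only: a toy 3D update, not the ppOpen-APPL/FDM kernels.
# All function and array names here are hypothetical.

def make_grid(n, fill):
    """Return an n*n*n nested list whose entries are fill(i, j, k)."""
    return [[[fill(i, j, k) for k in range(n)] for j in range(n)]
            for i in range(n)]

def update_unfused(vx, vy, sxx, syy, c):
    """Two passes: store derivatives in intermediate arrays, then load them."""
    n = len(vx)
    dxvx = make_grid(n, lambda i, j, k: 0.0)
    dyvy = make_grid(n, lambda i, j, k: 0.0)
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            for k in range(1, n - 1):
                dxvx[i][j][k] = 0.5 * (vx[i + 1][j][k] - vx[i - 1][j][k])
                dyvy[i][j][k] = 0.5 * (vy[i][j + 1][k] - vy[i][j - 1][k])
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            for k in range(1, n - 1):
                sxx[i][j][k] += c * dxvx[i][j][k]
                syy[i][j][k] += c * dyvy[i][j][k]

def update_fused(vx, vy, sxx, syy, c):
    """Kernel fusion: one triple loop, derivatives kept in local scalars."""
    n = len(vx)
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            for k in range(1, n - 1):
                dxvx = 0.5 * (vx[i + 1][j][k] - vx[i - 1][j][k])
                dyvy = 0.5 * (vy[i][j + 1][k] - vy[i][j - 1][k])
                sxx[i][j][k] += c * dxvx
                syy[i][j][k] += c * dyvy

def update_split(vx, vy, sxx, syy, c):
    """Loop distribution: the fused body split into one triple loop per output."""
    n = len(vx)
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            for k in range(1, n - 1):
                dxvx = 0.5 * (vx[i + 1][j][k] - vx[i - 1][j][k])
                sxx[i][j][k] += c * dxvx
    for i in range(1, n - 1):
        for j in range(1, n - 1):
            for k in range(1, n - 1):
                dyvy = 0.5 * (vy[i][j + 1][k] - vy[i][j - 1][k])
                syy[i][j][k] += c * dyvy

def run(update, n=5, c=0.25):
    """Build small test fields, apply one update, and return (sxx, syy)."""
    vx = make_grid(n, lambda i, j, k: float(i * i + j))
    vy = make_grid(n, lambda i, j, k: float(j * j + k))
    sxx = make_grid(n, lambda i, j, k: 0.0)
    syy = make_grid(n, lambda i, j, k: 0.0)
    update(vx, vy, sxx, syy, c)
    return sxx, syy

if __name__ == "__main__":
    results = [run(u) for u in (update_unfused, update_fused, update_split)]
    print(results[0] == results[1] == results[2])
```

In a Fortran or C version, each of the split triple loops in `update_split` would presumably carry its own thread-parallel directive and touch fewer arrays per iteration, which is the property the abstract credits for better prefetching and cache reuse; that mapping is an assumption here, not something the abstract spells out.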
