首页> 外文期刊>Concurrency and computation: practice and experience >Energy efficiency and portability of oil and gas simulations on multicore and graphics processing unit architectures
【24h】

Energy efficiency and portability of oil and gas simulations on multicore and graphics processing unit architectures

机译:多核和图形处理单元架构的石油和气体模拟能效及便携性

获取原文
获取原文并翻译 | 示例

摘要

Reverse time migration (RTM) simulation is the basis of the seismic imaging tools used by the oil and gas industry. Developers have been porting their simulations to the new high-performance computing architectures, providing faster and more accurate results at each new generation. However, several challenges arrive when trying to achieve high performance on these new architectures. The first one is to choose the architecture that best fits the kind of simulation. After that, researchers should choose the API used to implement the simulation code. These two decisions are strongly related to the effort, performance, and energy efficiency of the simulations. In this article, we propose three optimizations for an oil and gas application, which reduce the floating-point operations by changing the equation derivatives. We evaluate these optimizations in different multicore and GPU architectures, investigating the impact of different APIs on the performance, energy efficiency, and portability of the code. Our experimental results show that the dedicated CUDA implementation running on the NVIDIA Volta architecture has the best performance and energy efficiency for RTM on GPUs, while the OpenMP version is the best for Intel Broadwell in the multicore. Also, the OpenACC version, which has a lower programming effort and executes on both architectures, has an up to 20% better performance and energy efficiency than the nonportable ones.
机译:相反时间迁移(RTM)仿真是石油和天然气工业使用的地震成像工具的基础。开发人员一直将其模拟移植到新的高性能计算架构,从而提供更快,更准确的结果。然而,在尝试在这些新架构上实现高性能时,有几个挑战会到达。第一个是选择最适合这种模拟的架构。之后,研究人员应该选择用于实现仿真代码的API。这两个决定与模拟的努力,性能和能源效率密切相关。在本文中,我们为石油和天然气应用提出了三种优化,这通过改变等式衍生物来减少浮点操作。我们评估了不同多核和GPU架构中的这些优化,研究了不同API对代码的性能,能源效率和可移植性的影响。我们的实验结果表明,NVIDIA Volta架构上的专用CUDA实现具有最佳的性能和高RTM在GPU上的能效,而OpenMP版本是MOSTICORE中英特尔Broadwell的最佳状态。此外,OpenACC版本具有较低的编程工作并在两个架构上执行,而且比非易性的性能和能量效率高达20%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号