Energy efficiency and portability of oil and gas simulations on multicore and graphics processing unit architectures

Serpa Matheus S.; Pavan Pablo J.; Cruz Eduardo H. M.; Machado Rodrigo L.; Panetta Jairo; Azambuja Antonio; Carissimi Alexandre S.; Navaux Philippe O. A.

首页> 外文期刊>Concurrency and computation: practice and experience >Energy efficiency and portability of oil and gas simulations on multicore and graphics processing unit architectures

【24h】

Energy efficiency and portability of oil and gas simulations on multicore and graphics processing unit architectures

机译：多核和图形处理单元架构的石油和气体模拟能效及便携性

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

Reverse time migration (RTM) simulation is the basis of the seismic imaging tools used by the oil and gas industry. Developers have been porting their simulations to the new high-performance computing architectures, providing faster and more accurate results at each new generation. However, several challenges arrive when trying to achieve high performance on these new architectures. The first one is to choose the architecture that best fits the kind of simulation. After that, researchers should choose the API used to implement the simulation code. These two decisions are strongly related to the effort, performance, and energy efficiency of the simulations. In this article, we propose three optimizations for an oil and gas application, which reduce the floating-point operations by changing the equation derivatives. We evaluate these optimizations in different multicore and GPU architectures, investigating the impact of different APIs on the performance, energy efficiency, and portability of the code. Our experimental results show that the dedicated CUDA implementation running on the NVIDIA Volta architecture has the best performance and energy efficiency for RTM on GPUs, while the OpenMP version is the best for Intel Broadwell in the multicore. Also, the OpenACC version, which has a lower programming effort and executes on both architectures, has an up to 20% better performance and energy efficiency than the nonportable ones.

机译：相反时间迁移（RTM）仿真是石油和天然气工业使用的地震成像工具的基础。开发人员一直将其模拟移植到新的高性能计算架构，从而提供更快，更准确的结果。然而，在尝试在这些新架构上实现高性能时，有几个挑战会到达。第一个是选择最适合这种模拟的架构。之后，研究人员应该选择用于实现仿真代码的API。这两个决定与模拟的努力，性能和能源效率密切相关。在本文中，我们为石油和天然气应用提出了三种优化，这通过改变等式衍生物来减少浮点操作。我们评估了不同多核和GPU架构中的这些优化，研究了不同API对代码的性能，能源效率和可移植性的影响。我们的实验结果表明，NVIDIA Volta架构上的专用CUDA实现具有最佳的性能和高RTM在GPU上的能效，而OpenMP版本是MOSTICORE中英特尔Broadwell的最佳状态。此外，OpenACC版本具有较低的编程工作并在两个架构上执行，而且比非易性的性能和能量效率高达20％。

著录项

来源
《Concurrency and computation: practice and experience》 |2021年第18期|e6212.1-e6212.14|共14页
作者
Serpa Matheus S.; Pavan Pablo J.; Cruz Eduardo H. M.; Machado Rodrigo L.; Panetta Jairo; Azambuja Antonio; Carissimi Alexandre S.; Navaux Philippe O. A.;
展开▼
作者单位

Fed Univ Rio Grande do Sul UFRGS Inst Informat Porto Alegre RS Brazil;

Fed Univ Rio Grande do Sul UFRGS Inst Informat Porto Alegre RS Brazil;

Fed Inst Parana IFPR Paranavai Brazil;

Aeronaut Inst Technol ITA Comp Sci Div Sao Jose Dos Campos Brazil;

Aeronaut Inst Technol ITA Comp Sci Div Sao Jose Dos Campos Brazil;

Petrobras Petr Brasileiro SA Rio De Janeiro Brazil;

Fed Univ Rio Grande do Sul UFRGS Inst Informat Porto Alegre RS Brazil;

Fed Univ Rio Grande do Sul UFRGS Inst Informat Porto Alegre RS Brazil;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
code portability; HPC; oil and gas simulation; performance optimization; reverse time migration;

机译：代码便携性;HPC;石油和天然气模拟;性能优化;相反时间迁移;

相似文献

外文文献
中文文献
专利

1. Multicore Processors and Graphics Processing Unit Accelerators for Parallel Retrieval of Aerosol Optical Depth From Satellite Data: Implementation, Performance, and Energy Efficiency [J] . Liu Jia, Feld Dustin, Xue Yong, Selected Topics in Applied Earth Observations and Remote Sensing, IEEE Journal of . 2015,第5期

机译：从卫星数据并行检索气溶胶光学深度的多核处理器和图形处理单元加速器：实现，性能和能效
2. A fast band-Krylov eigensolver for macromolecular functional motion simulation on multicore architectures and graphics processors [J] . Aliaga Jose I., Alonso Pedro, Badia Jose M., Journal of Computational Physics . 2016,第Null期

机译：用于多核体系结构和图形处理器上的高分子功能运动仿真的快速Band-Krylov本征求解器
3. N-body computations using skeletal frameworks on multicore CPU/graphics processing unit architectures: an empirical performance evaluation [J] . Mehdi Goli, Horacio González–Vélez Concurrency and computation: practice and experience . 2014,第4期

机译：在多核CPU /图形处理单元体系结构上使用骨架框架进行N体计算：经验性能评估
4. On the energy efficiency of graphics processing units for scientific computing [C] . Huang S., Xiao S., Feng W. IEEE International Symposium on Parallel Distributed Processing;IPDPS 2009 . 2009

机译：论用于科学计算的图形处理单元的能源效率
5. Quantum Simulations with Graphics Processing Units [D] . Smith, Steven. 2020

机译：Quantum仿真与图形处理单元
6. Are Gaming-Enabled Graphic Processing Unit Cards Convenient for Molecular Dynamics Simulation? [O] . Tommaso Biagini, Francesco Petrizzelli, Mauro Truglio, 2019

机译：具有游戏功能的图形处理单元卡是否方便进行分子动力学模拟？
7. Multicore processors and graphics processing unit accelerators for parallel retrieval of aerosol optical depth from satellite data: Implementation, performance, and energy efficiency [O] . Liu, Jia, Feld, Dustin, Xue, Yong, 2015

机译：用于从卫星数据并行检索气溶胶光学深度的多核处理器和图形处理单元加速器：实现，性能和能效
8. Guidance, Navigation, and Control System Simulations via Graphics Processor Unit [R] . Ilg, M. 2011

机译：通过图形处理器单元进行指导，导航和控制系统仿真

Energy efficiency and portability of oil and gas simulations on multicore and graphics processing unit architectures

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅