Online auto-tuning for the time-step-based parallel solution of ODEs on shared-memory systems

Natalia Kalinnik; Matthias Korch; Thomas Rauber

首页> 外文期刊>Journal of Parallel and Distributed Computing >Online auto-tuning for the time-step-based parallel solution of ODEs on shared-memory systems

【24h】

Online auto-tuning for the time-step-based parallel solution of ODEs on shared-memory systems

机译：在线自动调整共享内存系统上基于时间的ODE并行解决方案

获取原文

获取原文并翻译 | 示例

开具论文收录证明 >>

页面导航

摘要
著录项
引文网络
相似文献
相关主题

摘要

This article considers automatic performance tuning of time-step-based parallel solution methods for initial value problems (IVPs) of systems of ordinary differential equations (ODEs). We apply auto-tuning to the parallel execution of a class of explicit predictor-corrector (PC) methods of Runge-Kutta (RK) type on shared-memory architectures. The performance of parallel multi-threaded implementation variants of these methods depends on various factors only known at runtime, for example, the coupling structure of the ODE system to be solved, the memory access pattern resulting from this coupling structure, and the number of threads executing the program. We propose an online auto-tuning approach that exploits the time-stepping nature of ODE methods by selecting the best parallel implementation variant from a set of candidate implementations at runtime during the first time steps. Thus, the auto-tuning process is not isolated from the computation, but rather contributes to the progress of the solution process. The search space of candidate implementations is a priori reduced by estimating the synchronization overhead of each implementation variant. For implementation variants containing tiled loops, suitable tile sizes are selected using a heuristic empirical search guided by an analytical model. Runtime experiments with two different test problems show the efficiency of the online auto-tuning approach on two different shared-memory systems equipped with 48 and 1040 cores.

机译：本文考虑针对常微分方程（ODE）系统的初始值问题（IVP）的基于时间步的并行求解方法的自动性能调整。我们将自动调整应用于共享内存体系结构上一类Runge-Kutta（RK）类型的显式预测器-校正器（PC）方法的并行执行。这些方法的并行多线程实现变体的性能取决于仅在运行时才知道的各种因素，例如，要解决的ODE系统的耦合结构，由此耦合结构产生的内存访问模式以及线程数执行程序。我们提出了一种在线自动调整方法，该方法通过在运行时的第一步中从一组候选实现中选择最佳的并行实现变体来利用ODE方法的时间步长特性。因此，自动调整过程并非与计算隔离，而是有助于求解过程的进行。通过估计每个实现变体的同步开销，可以预先减少候选实现的搜索空间。对于包含平铺循环的实现变体，使用由解析模型指导的启发式经验搜索来选择合适的平铺大小。带有两个不同测试问题的运行时实验表明，在线自动调整方法在配备有48个和1040个内核的两个不同的共享内存系统上的效率很高。

著录项

来源
《Journal of Parallel and Distributed Computing》 |2014年第8期|2722-2744|共23页
作者
Natalia Kalinnik; Matthias Korch; Thomas Rauber;
展开▼
作者单位

University of Bayreuth, Department of Computer Science, 95440 Bayreuth, Germany;

University of Bayreuth, Department of Computer Science, 95440 Bayreuth, Germany;

University of Bayreuth, Department of Computer Science, 95440 Bayreuth, Germany;

展开▼
收录信息
原文格式 PDF
正文语种 eng
中图分类
关键词
Auto-tuning; Parallelization; Tile size selection; Ordinary differential equations; Predictor-corrector methods;

机译：自动调节;并行化;瓷砖尺寸选择;常微分方程;预测校正方法;

相似文献

外文文献
中文文献
专利

1. Empirical Installation of Linear Algebra Shared-Memory Subroutines for Auto-Tuning [J] . Jesus Camara, Javier Cuenca, Domingo Gimenez, International journal of parallel programming . 2014,第3期

机译：用于自动调整的线性代数共享内存子例程的经验性安装
2. Partitioned time discretization for parallel solution of coupled ODE systems [J] . Jeffrey M. Connors, Attou Miloua BIT numerical mathematics . 2011,第2期

机译：耦合ODE系统并行解决方案的分区时间离散化
3. A parallel shared-memory implementation of a high-order accurate solution technique for variable coefficient Helmholtz problems [J] . Computers & mathematics with applications . 2020,第4期

机译：可变系数亥姆霍兹问题的高阶精确解技术的并行共享内存实现
4. Parallel Solution of Cascaded ODE Systems Applied to ~(13)C-Labeling Experiments [C] . Katharina Noeh, Wolfgang Wiechert International Conference on Computational Science pt.2; 20040606-20040609; Krakow; PL . 2004

机译：级联ODE系统的并行解决方案应用于〜（13）C标签实验
5. Performance portability of parallel kernels on shared-memory systems. [D] . Stratton, John Andrew. 2013

机译：共享内存系统上并行内核的性能可移植性。
6. Rational general solutions of planar rational systems of autonomous ODEs [O] . L.X. Châu Ngô, Franz Winkler -1

机译：自治ODE平面有理系统的有理通用解。
7. Experiments with an ordinary differential equation solver in the parallel solution of method of lines problems on a shared-memory parallel computer [O] . Kahaner D.K., Ng E., Schiesser W.E., 1991

机译：在共享内存并行计算机上用线性微分方程求解器并行求解线问题方法的实验
8. Parallels between Control PDE's (Partial Differential Equations) and Systems of ODE's (Ordinary Differential Equations) [R] . Hunt, L. R., Villarreal, R. 1987

机译：Control pDE（偏微分方程）与ODE系统（常微分方程）之间的平行关系

Online auto-tuning for the time-step-based parallel solution of ODEs on shared-memory systems

摘要

著录项

引文网络

相似文献

相关主题

期刊订阅