A Predictive Performance Model for Stencil Codes on Multicore CPUs

机译：多核CPU上的模板代码预测性能模型

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

In this paper we present an analytical performance model which yields estimates for the performance of stencil based simulations. Unlike previous models, we do neither rely on prototype implementations, nor do we examine the computational intensity only. Our model allows for memory optimizations such as cache blocking and non-temporal stores. Multi-threading, loop-unrolling, and vectorization are covered, too. The model is built from a sequence of 1D loops. For each loop we map the different parts of the instruction stream to the corresponding CPU pipelines and estimate their throughput. The load/store streams may be affected not only by their destination (the cache level or NUMA domain they target), but also by concurrent access of other threads. Evaluation of a Jacobi solver and the Himeno benchmark shows that the model is accurate enough to capture real live kernels.

机译：在本文中，我们提出了一种分析性能模型，其产生了基于模板模拟的性能的估计。与以前的模型不同，我们既不依靠原型实现，也不是我们仅检查计算强度。我们的模型允许内存优化，例如缓存阻塞和非时间商店。多线程，循环展开和矢量化也被覆盖。该模型由1D循环序列构建。对于每个循环，我们将指令流的不同部分映射到相应的CPU管道并估计其吞吐量。负载/商店流可能不仅受到目的地（高速缓存级别或它们目标的NUMA域）的影响，还可以影响其他线程的并发访问。评估Jacobi求解器和Himeno基准显示该模型足以足以捕获真实的活核。

著录项

来源
《Tutorial on High Performance Numerical Tools for the Development and Scalability of High-End Computer Applications Conference》|2013年||共16页
会议地点
作者
Andreas Schafer; Dietmar Fey;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP301-53;
关键词

相似文献

外文文献
中文文献
专利

1. Improving the accuracy of energy predictive models for multicore CPUs by combining utilization and performance events model variables [J] . Arsalan Shahid, Muhammad Fahad, Ravi Reddy Manumachu, Journal of Parallel and Distributed Computing . 2021,第May期

机译：通过组合利用和性能事件模型变量来提高多核CPU的能量预测模型的准确性
2. Moving Scientific Codes to Multicore Microprocessor CPUs [J] . Woodward P.R., Jayaraj J., Pei-Hung Lin, Computing in science & engineering . 2008,第6期

机译：将科学代码转移到多核微处理器CPU
3. MODELING THE PERFORMANCE OF GEOMETRIC MULTIGRID STENCILS ON MULTICORE COMPUTER ARCHITECTURES [J] . Ghysels Pieter, Vanroose Wim SIAM Journal on Scientific Computing . 2015,第2期

机译：在多核计算机体系结构上建模几何多重网格的性能
4. A Predictive Performance Model for Stencil Codes on Multicore CPUs [C] . Andreas Schaefer, Dietmar Fey International conference on high performance computing for computational science . 2013

机译：多核CPU上模具代码的预测性能模型
5. Auto-tuning stencil codes for cache-based multicore platforms. [D] . Datta, Kaushik. 2009

机译：自动调整基于缓存的多核平台的模具代码。
6. Using clinical data to predict high-cost performance coding issues associated with pressure ulcers: a multilevel cohort model [O] . William V Padula, Robert D Gibbons, Peter J Pronovost, 2017

机译：使用临床数据预测与压力溃疡相关的高成本性能编码问题：多级队列模型
7. Block-Relaxation Methods for 3D Constant-Coefficient Stencils on GPUs and Multicore CPUs [O] . Rodriguez, Manuel Rodriguez, Philip, Bobby, Wang, Zhen, 2013

机译：GpU上三维常系数模板的块松弛方法和多核CpU
8. Block-Iterative Methods for 3D Constant- Coefficient Stencils on GPUs and Multicore CPUs. [R] . Rodriguez, M., Philip, B., Wang, Z., 2014

机译：GpU和多核CpU上3D恒定系数模板的块迭代方法。

A Predictive Performance Model for Stencil Codes on Multicore CPUs

摘要

著录项

相似文献

相关主题

期刊订阅