Optimization of the Brillouin operator on the KNL architecture

Stephan Dürr

首页> 外文期刊>EPJ Web of Conferences >Optimization of the Brillouin operator on the KNL architecture

【24h】

Optimization of the Brillouin operator on the KNL architecture

机译：在KNL架构上优化Brillouin运算符

获取原文

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

页面导航

摘要
著录项
相似文献
相关主题

摘要

Experiences with optimizing the matrix-times-vector application of the Brillouin operator on the Intel KNL processor are reported. Without adjustments to the memory layout, performance figures of 360 Gflop/s in single and 270 Gflop/s in double precision are observed. This is with N_(c)= 3 colors, N_(v)= 12 right-hand-sides, N_(thr)= 256 threads, on lattices of size 32~(3)× 64, using exclusively OMP pragmas. Interestingly, the same routine performs quite well on Intel Core i7 architectures, too. Some observations on the much harderWilson fermion matrix-times-vector optimization problem are added.

机译：报告了在英特尔KNL处理器上优化Brillouin运算符的矩阵时间矢量应用程序的经验。在不调整内存布局的情况下，可以观察到单精度360 Gflop / s和双精度270 Gflop / s的性能指标。这是在N_（c）= 3种颜色，N_（v）= 12个右侧，N_（thr）= 256个线程，尺寸为32〜（3）×64的网格上使用的唯一OMP编译指示。有趣的是，同一例程在Intel Core i7架构上也能很好地执行。增加了对更加困难的威尔逊费米子矩阵-时间-向量优化问题的一些观察。

著录项

来源
《EPJ Web of Conferences》 |2018年第1期|共8页
作者
Stephan Dürr;
展开▼
作者单位

展开▼
收录信息
原文格式 PDF
正文语种
中图分类物理学;
关键词

相似文献

外文文献
中文文献
专利

1. Optimization of the Brillouin operator on the KNL architecture [J] . Stephan Dürr EPJ Web of Conferences . 2018,第2期

机译：在KNL架构上优化Brillouin运算符
2. Performance improvement options of scientific applications on XeonPhi KNL architectures [J] . Shajulin Benedict International Journal of Knowledge Engineering and Data Mining . 2018,第1a2期

机译：XeonPhi KNL架构上科学应用程序的性能改进选项
3. Numerical quadrature in the Brillouin zone for periodic Schrodinger operators [J] . Numerische Mathematik . 2020,第3期

机译：周期施罗德格算子布里渊区的数量正交
4. Optimizing Wilson-Dirac Operator and Linear Solvers for Intel® KNL [C] . Balint Joo, Dhiraj D. Kalamkar, Thorsten Kurth, International supercomputing conference international workshops;International Workshop on OpenPOWER for HPC;Workshop on performance scalability of storage systems;International Workshop on performance portable programming models for accelerators;Workshop on application performance on intel xeon phi - being prepared for KNL beyond;Workshop on HPC I/O in the data center;International Workshop on communication architectures at extreme scale;Workshop on exascale multi/many core computing systems;Workshop on virtualization in high-performance cloud computing . 2016

机译：针对英特尔®KNL优化Wilson-Dirac运算符和线性求解器
5. Universal Neural Memory Architectures: Multigrid Connectivity, Domain-Agnostic Geometry, and Local Operators [D] . Huynh, Tri Quoc. 2021

机译：通用神经内存架构：MultiGrid连接，域名无话学几何和本地运算符
6. A Novel Hybrid Clonal Selection Algorithm with Combinatorial Recombination and Modified Hypermutation Operators for Global Optimization [O] . Weiwei Zhang, Jingjing Lin, Honglei Jing, 2016

机译：全局重组的组合重组和修正超变异算子的混合克隆选择算法
7. Optimization of the Brillouin operator on the KNL architecture [O] . Durr, Stephan 2017

机译：在KNL架构上优化布里渊算子

Optimization of the Brillouin operator on the KNL architecture

摘要

著录项

相似文献

相关主题

期刊订阅