首页> 外文期刊>EPJ Web of Conferences >Optimization of the Brillouin operator on the KNL architecture
【24h】

Optimization of the Brillouin operator on the KNL architecture

机译:在KNL架构上优化Brillouin运算符

获取原文
       

摘要

Experiences with optimizing the matrix-times-vector application of the Brillouin operator on the Intel KNL processor are reported. Without adjustments to the memory layout, performance figures of 360 Gflop/s in single and 270 Gflop/s in double precision are observed. This is with N_(c)= 3 colors, N_(v)= 12 right-hand-sides, N_(thr)= 256 threads, on lattices of size 32~(3)× 64, using exclusively OMP pragmas. Interestingly, the same routine performs quite well on Intel Core i7 architectures, too. Some observations on the much harderWilson fermion matrix-times-vector optimization problem are added.
机译:报告了在英特尔KNL处理器上优化Brillouin运算符的矩阵时间矢量应用程序的经验。在不调整内存布局的情况下,可以观察到单精度360 Gflop / s和双精度270 Gflop / s的性能指标。这是在N_(c)= 3种颜色,N_(v)= 12个右侧,N_(thr)= 256个线程,尺寸为32〜(3)×64的网格上使用的唯一OMP编译指示。有趣的是,同一例程在Intel Core i7架构上也能很好地执行。增加了对更加困难的威尔逊费米子矩阵-时间-向量优化问题的一些观察。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号