Exploring SIMD for Molecular Dynamics, Using Intel~?Xeon~?Processors and Intel~?Xeon Phi~(TM) Coprocessors

机译：使用英特尔探索SIMD的SIMD，使用英特尔〜Xeon〜？处理器和英特尔〜？Xeon Phi〜（TM）协处理器

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

We analyse gather-scatter performance bottle-necks in molecular dynamics codes and the challenges that they pose for obtaining benefits from SIMD execution. This analysis informs a number of novel code-level and algorithmic improvements to Sandia's miniMD benchmark, which we demonstrate using three SIMD widths (128-, 256- and 512- bit). The applicability of these optimisations to wider SIMD is discussed, and we show that the conventional approach of exposing more parallelism through redundant computation is not necessarily best In single precision, our optimised implementation is up to 5x faster than the original scalar code running on Intel~?Xeon~?processors with 256-bit SIMD, and adding a single Intel~? Xeon Phi~(?) coprocessor provides up to an additional 2x performance increase. These results demonstrate: (i) the importance of effective SIMD utilisation for molecular dynamics codes on current and future hardware; and (ii) the considerable performance increase afforded by the use of Intel~?Xeon Phi~(TM) coprocessors for highly parallel workloads.

机译：我们分析了分子动力学代码中的聚集散射性能瓶颈及其姿势从SIMD执行中获益的挑战。此分析向Sandia的最小基准通知了许多新的代码级和算法改进，我们使用三个SIMD宽度（128-，256-和512位）展示。这些最佳化，以更宽的SIMD的适用性进行了讨论，我们表明，通过冗余计算暴露更多并行的传统方法不一定最好在单精度，我们的优化实现高达5倍比运行在Intel原来的标量代码快〜？Xeon〜？处理器256位SIMD，并添加单个英特尔〜？ Xeon Phi〜（？）协处理器提供额外的2倍性能增加。这些结果表明：（i）对当前和未来硬件的分子动力学代码有效SIMD利用的重要性; （ii）使用英特尔〜Xeon Phi〜（TM）协处理器提供了相当大的性能增加，用于高度平行的工作负载。

著录项

来源
《IEEE International Parallel Distributed Processing Symposium》|2013年||共13页
会议地点
作者
S. J. Pennycook; C. J. Hughes; M. Smelyanskiy; S. A. Jarvis;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.133-53;
关键词
scientific computing; accelerator architectures; parallel programming; performance analysis; high performance computing;

机译：科学计算;加速器架构;并行编程;性能分析;高性能计算;
入库时间 2022-08-20 19:59:58

相似文献

外文文献
中文文献
专利

1. Parallel BRDF-based infrared radiation simulation of aerial targets implemented on Intel Xeon processor and Xeon Phi coprocessor [J] . Guo Xing, Wu Zhensen, Wu Jiaji, Journal of Real-Time Image Processing . 2019,第1期

机译：在英特尔至强处理器和至强融核协处理器上实现的基于BRDF的空中目标的并行红外辐射仿真
2. Effective SIMD Vectorization for Intel Xeon Phi Coprocessors [J] . XinminTian, HidekiSaito, Serguei V.Preis, Scientific programming . 2015,第4期

机译：适用于英特尔至强融核协处理器的有效SIMD矢量化
3. Effective SIMD Vectorization for Intel Xeon Phi Coprocessors [J] . Tian Xinmin, Saito Hideki, Preis Serguei V., Scientific programming . 2015,第期

机译：适用于英特尔至强融核协处理器的有效SIMD矢量化
4. Exploring SIMD for Molecular Dynamics, Using Intel~?Xeon~?Processors and Intel~?Xeon Phi~(TM) Coprocessors [C] . S. J. Pennycook, C. J. Hughes, M. Smelyanskiy, IEEE International Parallel Distributed Processing Symposium . 2013

机译：使用英特尔探索SIMD的SIMD，使用英特尔〜Xeon〜？处理器和英特尔〜？Xeon Phi〜（TM）协处理器
5. Advancing LAMMPS Performance on Intel Xeon Phi Processors Coprocessors [D] . Vorsu, Sandeep Kumar. 2017

机译：在英特尔Xeon Phi处理器协处理器上推进LAMMPS性能
6. Efficient irregular wavefront propagation algorithms on Intel® Xeon Phi™ [O] . Jeremias M. Gomes, George Teodoro, Alba de Melo, -1

机译：英特尔®至强融核™上的高效不规则波前传播算法
7. Exploring SIMD for molecular dynamics, using Intel Xeon processors and Intel Xeon Phi coprocessors [O] . Pennycook, Simon J., Hughes, C. J., Smelyanskiy, M., 2013

机译：使用Intel Xeon处理器和Intel Xeon Phi协处理器探索SIMD的分子动力学

Exploring SIMD for Molecular Dynamics, Using Intel~?Xeon~?Processors and Intel~?Xeon Phi~(TM) Coprocessors

摘要

著录项

相似文献

相关主题

期刊订阅