首页> 外文会议>IEEE International Parallel Distributed Processing Symposium >Exploring SIMD for Molecular Dynamics, Using Intel~?Xeon~?Processors and Intel~?Xeon Phi~(TM) Coprocessors
【24h】

Exploring SIMD for Molecular Dynamics, Using Intel~?Xeon~?Processors and Intel~?Xeon Phi~(TM) Coprocessors

机译:使用英特尔探索SIMD的SIMD,使用英特尔〜Xeon〜?处理器和英特尔〜?Xeon Phi〜(TM)协处理器

获取原文

摘要

We analyse gather-scatter performance bottle-necks in molecular dynamics codes and the challenges that they pose for obtaining benefits from SIMD execution. This analysis informs a number of novel code-level and algorithmic improvements to Sandia's miniMD benchmark, which we demonstrate using three SIMD widths (128-, 256- and 512- bit). The applicability of these optimisations to wider SIMD is discussed, and we show that the conventional approach of exposing more parallelism through redundant computation is not necessarily best In single precision, our optimised implementation is up to 5x faster than the original scalar code running on Intel~?Xeon~?processors with 256-bit SIMD, and adding a single Intel~? Xeon Phi~(?) coprocessor provides up to an additional 2x performance increase. These results demonstrate: (i) the importance of effective SIMD utilisation for molecular dynamics codes on current and future hardware; and (ii) the considerable performance increase afforded by the use of Intel~?Xeon Phi~(TM) coprocessors for highly parallel workloads.
机译:我们分析了分子动力学代码中的聚集散射性能瓶颈及其姿势从SIMD执行中获益的挑战。此分析向Sandia的最小基准通知了许多新的代码级和算法改进,我们使用三个SIMD宽度(128-,256-和512位)展示。这些最佳化,以更宽的SIMD的适用性进行了讨论,我们表明,通过冗余计算暴露更多并行的传统方法不一定最好在单精度,我们的优化实现高达5倍比运行在Intel原来的标量代码快〜 ?Xeon〜?处理器256位SIMD,并添加单个英特尔〜? Xeon Phi〜(?)协处理器提供额外的2倍性能增加。这些结果表明:(i)对当前和未来硬件的分子动力学代码有效SIMD利用的重要性; (ii)使用英特尔〜Xeon Phi〜(TM)协处理器提供了相当大的性能增加,用于高度平行的工作负载。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号