首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >Double-Precision FPUs in High-Performance Computing: An Embarrassment of Riches?
【24h】

Double-Precision FPUs in High-Performance Computing: An Embarrassment of Riches?

机译:高性能计算中的双精度FPU:尴尬吗?

获取原文

摘要

Among the (uncontended) common wisdom in High-Performance Computing (HPC) is the applications' need for large amount of double-precision support in hardware. Hardware manufacturers, the TOP500 list, and (rarely revisited) legacy software have without doubt followed and contributed to this view. In this paper, we challenge that wisdom, and we do so by exhaustively comparing a large number of HPC proxy applications on two processors: Intel's Knights Landing (KNL) and Knights Mill (KNM). Although similar, the KNL and KNM architecturally deviate at one important point: the silicon area devoted to double-precision arithmetics. This fortunate discrepancy allows us to empirically quantify the performance impact in reducing the amount of hardware double-precision arithmetic. Our analysis shows that this common wisdom might not always be right. We find that the investigated HPC proxy applications do allow for a (significant) reduction in double-precision with little-to-no performance implications. With the advent of a failing of Moore's law, our results partially reinforce the view taken by modern industry (e.g., upcoming Fujitsu ARM64FX) to integrate hybrid-precision hardware units.
机译:高性能计算(HPC)的(无与伦比)常识中,应用程序需要在硬件中提供大量的双精度支持。毫无疑问,硬件制造商,TOP500列表和(很少重新访问)旧版软件都为这种观点做出了贡献。在本文中,我们对这种智慧提出了挑战,我们通过详尽比较两个处理器上的大量HPC代理应用程序来做到这一点:英特尔的Knights Landing(KNL)和Knights Mill(KNM)。尽管相似,但KNL和KNM在架构上有一个重要的区别:专用于双精度算术的硅面积。这种幸运的差异使我们可以凭经验量化减少硬件双精度算法数量方面的性能影响。我们的分析表明,这种常识可能并不总是正确的。我们发现,调查的HPC代理应用程序确实允许(显着)降低双精度,而对性能几乎没有影响。随着摩尔定律失效的出现,我们的结果在一定程度上加强了现代工业界(例如即将推出的富士通ARM64FX)整合混合精度硬件单元的观点。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号