首页> 外文会议>IEEE International Parallel and Distributed Processing Symposium >Order-Invariant Real Number Summation: Circumventing Accuracy Loss for Multimillion Summands on Multiple Parallel Architectures
【24h】

Order-Invariant Real Number Summation: Circumventing Accuracy Loss for Multimillion Summands on Multiple Parallel Architectures

机译:订单不变的实数求和:多征在多个并行架构上的千万汇总的精度损耗

获取原文

摘要

Achieving reproducibility of scientific results in parallel computing is both a challenge and a source of active research. A significant contribution to non-reproducibility is rounding error introduced into calculations by the non-associativity of floating point addition. Scientific applications that rely on accumulation of many small values, such as climate and N-body simulations, are susceptible to this type of error. This paper proposes a variant of an existing fixed-point method for real number summation that yields sums with perfect precision, and which are invariant to summation order and system architecture. The new method improves upon the existing technique by exhibiting improved performance for large numbers of summands, introducing tunable fractional precision to place precision where it is needed, and eliminating the aliasing problem of the original method. The proposed technique is described and its performance is demonstrated in the OpenMP, MPI, CUDA, and Xeon Phi parallel programming environments. In particular, the proposed method outperforms the previous state-of-the-art for larger problems involving over one million summands at high precision. With the anticipated convergence of exascale high-performance computing and big data analytics on hybrid architectures, computational reproducibility will become an even more difficult problem than it is today. Use of numerical techniques such as the method proposed here can help to mitigate the impact of error and variation within simulations at these large scales.
机译:在平行计算中实现科学结果的再现性是一项挑战和积极研究的来源。对不可重复性的重大贡献是通过浮点的非关联性引入计算的舍入误差。依赖于许多小值的积累的科学应用,例如气候和正文模拟,易于这种错误。本文提出了一种用于实际数量求和的现有定点方法的变种,其产生具有完美精度的总和,并且是不变的求和顺序和系统架构。通过表现出大量总结的改进性能来提高现有方法,引入可调分数精度,以在需要的情况下放置精度,并消除原始方法的混叠问题。描述了所提出的技术,其性能在OpenMP,MPI,CUDA和Xeon Phi并行编程环境中展示。特别地,所提出的方法优于前一种最先进的最先进的问题,以获得高精度超过一百万的概括。随着Exascale高性能计算和混合架构的大数据分析的预期融合,计算再现性将成为比今天更困难的问题。使用诸如提出的方法的使用数值技术可以有助于减轻这些大尺度的模拟中的误差和变化的影响。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号