Order-Invariant Real Number Summation: Circumventing Accuracy Loss for Multimillion Summands on Multiple Parallel Architectures

机译：订单不变的实数求和：多征在多个并行架构上的千万汇总的精度损耗

获取原文

页面导航

摘要
著录项
相似文献
相关主题

摘要

Achieving reproducibility of scientific results in parallel computing is both a challenge and a source of active research. A significant contribution to non-reproducibility is rounding error introduced into calculations by the non-associativity of floating point addition. Scientific applications that rely on accumulation of many small values, such as climate and N-body simulations, are susceptible to this type of error. This paper proposes a variant of an existing fixed-point method for real number summation that yields sums with perfect precision, and which are invariant to summation order and system architecture. The new method improves upon the existing technique by exhibiting improved performance for large numbers of summands, introducing tunable fractional precision to place precision where it is needed, and eliminating the aliasing problem of the original method. The proposed technique is described and its performance is demonstrated in the OpenMP, MPI, CUDA, and Xeon Phi parallel programming environments. In particular, the proposed method outperforms the previous state-of-the-art for larger problems involving over one million summands at high precision. With the anticipated convergence of exascale high-performance computing and big data analytics on hybrid architectures, computational reproducibility will become an even more difficult problem than it is today. Use of numerical techniques such as the method proposed here can help to mitigate the impact of error and variation within simulations at these large scales.

机译：在平行计算中实现科学结果的再现性是一项挑战和积极研究的来源。对不可重复性的重大贡献是通过浮点的非关联性引入计算的舍入误差。依赖于许多小值的积累的科学应用，例如气候和正文模拟，易于这种错误。本文提出了一种用于实际数量求和的现有定点方法的变种，其产生具有完美精度的总和，并且是不变的求和顺序和系统架构。通过表现出大量总结的改进性能来提高现有方法，引入可调分数精度，以在需要的情况下放置精度，并消除原始方法的混叠问题。描述了所提出的技术，其性能在OpenMP，MPI，CUDA和Xeon Phi并行编程环境中展示。特别地，所提出的方法优于前一种最先进的最先进的问题，以获得高精度超过一百万的概括。随着Exascale高性能计算和混合架构的大数据分析的预期融合，计算再现性将成为比今天更困难的问题。使用诸如提出的方法的使用数值技术可以有助于减轻这些大尺度的模拟中的误差和变化的影响。

著录项

来源
《IEEE International Parallel and Distributed Processing Symposium》|2016年|575p|共9页
会议地点
作者
Patrick E. Small; Rajiv K. Kalia; Aiichiro Nakano; Priya Vashishta;
展开▼
作者单位

展开▼
会议组织
原文格式 PDF
正文语种
中图分类 TP311.13-53;
关键词
Standards; Computational modeling; Parallel processing; Computers; Hardware; Floating-point arithmetic; Computer architecture;

机译：标准;计算建模;并行处理;计算机;硬件;浮点算术;计算机架构;

相似文献

外文文献
中文文献
专利

1. A Multiple-FPGA Parallel Computing Architecture for Real-Time Simulation of Soft-Object Deformation [J] . Mahdavikhah Behzad, Mafi Ramin, Sirouspour Shahin, ACM Transactions on Embedded Computing Systems . 2014,第4期

机译：用于软对象变形实时仿真的多FPGA并行计算架构
2. Stage-dependent minimum bit resolution maps of full-parallel pipelined FFT/IFFT architectures incorporated in real-time optical orthogonal frequency division multiplexing transceivers [J] . Junjie Zhang, Wenyan Yuan, Kai Wang, The Journal of Engineering . 2014,第8期

机译：实时光正交频分复用收发器中包含的全并行流水线FFT / IFFT架构的与阶段有关的最小位分辨率图
3. Real-time anomaly detection using parallelized intrusion detection architecture for streaming data [J] . Chellammal P., Malarchelvi Sheba Kezia P. D. Concurrency, practice and experience . 2020,第4期

机译：使用并行入侵检测架构对流数据进行实时异常检测
4. Order-Invariant Real Number Summation: Circumventing Accuracy Loss for Multimillion Summands on Multiple Parallel Architectures [C] . Patrick E. Small, Rajiv K. Kalia, Aiichiro Nakano, IEEE International Parallel and Distributed Processing Symposium . 2016

机译：不变阶实数求和：规避多个并行体系结构上数以百万计的求和的精度损失
5. Indium arsenide/gallium arsenide quantum dots and nanomesas: Multimillion-atom molecular dynamics solutions on parallel architectures. [D] . Su, Xiaotao. 2001

机译：砷化铟/砷化镓量子点和纳米台面：并行体系结构上的数百万个原子的分子动力学解决方案。
6. Real-Time Parallel-Serial LiDAR-Based Localization Algorithm with Centimeter Accuracy for GPS-Denied Environments [O] . Jakub Niedzwiedzki, Adam Niewola, Piotr Lipinski, 2020

机译：基于实时并行串行激光雷达的定位算法具有GPS拒绝环境的厘米精度
7. Parallel Large Eddy Simulation Technique using Tetrahedral Finite-Element. 1st Report. Accuracy Evaluation for Pressure-Loss Prediction in a Realistic Problem. [O] . Masayuki KAIHO, Takashi YOKOHARI, Masahiro IKEGAWA, 2000

机译：平行大型涡流仿真技术采用四面体有限元。第一个报告。逼真问题压力损失预测的准确性评价。
8. High Order Accuracy Computational Methods in Aerodynamics Using Parallel Architectures [R] . Gottlieb, D. , Shu, C. 1998

机译：采用并行结构的空气动力学高阶精度计算方法

Order-Invariant Real Number Summation: Circumventing Accuracy Loss for Multimillion Summands on Multiple Parallel Architectures

摘要

著录项

相似文献

相关主题

期刊订阅