Methods, systems, and apparatus, including computer programs encoded on computer storage media, for computing a multivalue reduction using a parallel processing device. One of the methods includes performing a parallel M-value reduction by parallel processing units of a parallel processing device. A plurality of initial reductions are performed in serial, each initial reduction operating on data in a different respective register space of at least M register spaces. Data is moved from the M register spaces so that all results from the plurality of initial reductions are in a same first register space. One or more subsequent reductions are performed in parallel to compute M final values, each subsequent reduction operating only on data in the first register space.
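The steps in the abstract can be illustrated with a small simulation for M = 2 on a single 32-lane warp. This is a minimal sketch, not the claimed implementation: the helper names (`shfl_down`, `shfl_up`, `two_value_reduction`), the choice of addition as the reduction operator, and the use of a shuffle-down for the first register space and a shuffle-up for the second are all assumptions made for illustration. The helpers imitate the behavior of warp shuffle instructions, where a lane whose source falls outside its segment keeps its own value.

```python
WARP = 32  # assumed warp size; lanes are modeled as list indices


def shfl_down(reg, delta, width=WARP):
    # Lane i reads lane i + delta if that stays inside its width-sized
    # segment; otherwise it keeps its own value.
    return [reg[i + delta] if (i % width) + delta < width else reg[i]
            for i in range(len(reg))]


def shfl_up(reg, delta, width=WARP):
    # Lane i reads lane i - delta if that stays inside its width-sized
    # segment; otherwise it keeps its own value.
    return [reg[i - delta] if (i % width) - delta >= 0 else reg[i]
            for i in range(len(reg))]


def two_value_reduction(x, y):
    """Sum x and y across one warp (hypothetical M = 2 example)."""
    assert len(x) == len(y) == WARP
    half = WARP // 2

    # Initial reductions, performed one after the other, each on a
    # different register space. Valid x partials land in lanes 0..15,
    # valid y partials land in lanes 16..31.
    x = [a + b for a, b in zip(x, shfl_down(x, half))]
    y = [a + b for a, b in zip(y, shfl_up(y, half))]

    # Data movement: lanes 16..31 copy their y partial into the x
    # register space, so all partial results share one register space.
    x = [x[i] if i < half else y[i] for i in range(WARP)]

    # Subsequent reductions touch only the x register space. Each shuffle
    # advances both halves of the warp at once.
    delta = half // 2
    while delta >= 1:
        x = [a + b for a, b in zip(x, shfl_down(x, delta, width=half))]
        delta //= 2

    # Lane 0 holds the reduction of x; lane 16 holds the reduction of y.
    return x[0], x[half]


xs = list(range(WARP))
ys = [2 * i + 1 for i in range(WARP)]
result = two_value_reduction(xs, ys)
```

Under these assumptions the sketch issues 2 initial shuffles plus 4 shared shuffles, 6 in total, whereas reducing each of the two values with its own 5-step warp reduction would take 10.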