【24h】

Efficient Parallel Random Rearrange

机译:高效并行随机重排

获取原文
获取原文并翻译 | 示例

摘要

Classic shuffling algorithms have linear complexity, but they have the disadvantage of accessing memory with unpredictable patterns, which cause a large numbers of cache misses. In consequence, their execution times are not determined by computation complexity, but by the latency of the memory system. For parallel systems, this penalty gets worse, because of the overheads associated to atomic accesses to data that must be rearranged. This paper gives an overview of the best known serial and parallel shuffling algorithms, and proposes a new one that minimizes the number of memory accesses and thus, the processors' power consumption. Comparisons among these algorithms and some results are presented for graphic architectures.
机译:经典的改组算法具有线性复杂性,但是它们具有以不可预测的模式访问内存的缺点,这会导致大量的高速缓存未命中。结果,它们的执行时间不是由计算复杂度决定的,而是由存储系统的等待时间决定的。对于并行系统,由于与原子访问必须重新排列的数据相关的开销,这种损失会变得更糟。本文概述了最著名的串行和并行改组算法,并提出了一种新的算法,该算法最大程度地减少了内存访问次数,从而最大程度地减少了处理器的功耗。这些算法之间的比较以及一些针对图形体系结构的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号