【24h】

Efficient Parallel Random Rearrange

机译:高效并行随机重新排列

获取原文
获取外文期刊封面目录资料

摘要

Classic shuffling algorithms have linear complexity, but they have the disadvantage of accessing memory with unpredictable patterns, which cause a large numbers of cache misses. In consequence, their execution times are not determined by computation complexity, but by the latency of the memory system. For parallel systems, this penalty gets worse, because of the overheads associated to atomic ac-cesses to data that must be rearranged. This paper gives an overview of the best known serial and parallel shuffling al-gorithms, and proposes a new one that minimizes the number of memory accesses and thus, the processors' power consumption. Comparisons among these algorithms and some results are presented for graphic architectures.
机译:经典的Shuffling算法具有线性复杂性,但它们具有访问内存的缺点,不可预测的模式,这导致大量的高速缓存未命中。结果,它们的执行时间不是通过计算复杂度确定的,而是通过存储器系统的延迟来确定。对于并行系统,由于与必须重新排列的数据的原子ac-cesses相关联,这种惩罚会变得更糟。本文概述了最着名的串行和并联Shuffling Al-Gorithms,并提出了一种最大限度地减少存储器访问数量的新序列,因此,处理器的功耗。这些算法中的比较和一些结果用于图形架构。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号