...
首页> 外文期刊>電子情報通信学会技術研究報告. コンピュ-タシステム. Computer Systems >Parallel Algorithms for the Summed Area Table on the Asynchronous Hierarchical Memory Machine, with GPU implementations
【24h】

Parallel Algorithms for the Summed Area Table on the Asynchronous Hierarchical Memory Machine, with GPU implementations

机译:异步分层存储机器上求和面积表的并行算法,带有GPU实现

获取原文
获取原文并翻译 | 示例
   

获取外文期刊封面封底 >>

       

摘要

The main contribution of this paper is to introduce the asynchronous Hierarchical Memory Machine (asynchronous HMM), which supports asynchronous execution of CUDA blocks, and show a global-memory-access-optimal parallel algorithm for computing the SAT on the asynchronous HMM. We also show a combined algorithm ((1 + r)R1W SAT algorithm) of 2R1W and 1R1W SAT algorithms that may have better performance. We have implemented several algorithms on GeForce GTX 780 Ti. The experimental results show that our (1 + r)R1W SAT algorithm runs faster than any other SAT algorithms for large input matrices. Also, it runs more than 100 times faster than the best SAT algorithm using a single CPU.
机译:本文的主要贡献是介绍了异步分层存储机(异步HMM),它支持CUDA块的异步执行,并展示了一种用于在异步HMM上计算SAT的全局内存访问最优并行算法。我们还展示了2R1W和1R1W SAT算法的组合算法((1 + r)R1W SAT算法),它们可能具有更好的性能。我们已经在GeForce GTX 780 Ti上实现了几种算法。实验结果表明,对于大输入矩阵,我们的(1 + r)R1W SAT算法比其他SAT算法运行得更快。而且,它的运行速度比使用单个CPU的最佳SAT算法快100倍以上。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号