首页> 外文会议>International Symposium INFOTEH-JAHORINA >Comparison of the Non-Blocked and Blocked Floyd-Warshall Algorithm with Regard to Speedup and Energy Saving on an Embedded GPU
【24h】

Comparison of the Non-Blocked and Blocked Floyd-Warshall Algorithm with Regard to Speedup and Energy Saving on an Embedded GPU

机译:在嵌入式GPU上加速和节能的非阻塞和阻塞弗洛伊德战争算法的比较

获取原文

摘要

In this paper, three variants of the Floyd-Warshall (FW) All Pairs Shortest Path (APSP) algorithm are presented and compared - the sequential implementation, the parallel implementation using the Nvidia CUDA API, and the blocked parallel version of the FW algorithm. A performance analysis between these three approaches, as well as between the individual phases of the parallel algorithm is provided. The performance of these algorithms has been measured on regular as well as on embedded GPU hardware, and a significant speedup has been achieved. Additionally, this paper shows that a blocked data access results in significant energy savings of up to 72% on embedded hardware.
机译:在本文中,提出和比较了弗洛伊德 - 脉动(FW)的三个变体 - 所有对最短路径(APSP)算法 - 顺序实现,使用NVIDIA CUDA API的并行实现,以及FW算法的阻塞并行版本。提供了这三种方法的性能分析,以及并行算法的各个阶段之间的性能分析。这些算法的性能已经在常规以及嵌入式GPU硬件上进行测量,并且已经实现了显着的加速。此外,本文表明,嵌入式硬件上堵塞的数据访问显着节省高达72%。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号