...
首页> 外文期刊>ACM SIGPLAN Notices: A Monthly Publication of the Special Interest Group on Programming Languages >ASPIRE: Exploiting Asynchronous Parallelism in Iterative Algorithms using a Relaxed Consistency based DSM
【24h】

ASPIRE: Exploiting Asynchronous Parallelism in Iterative Algorithms using a Relaxed Consistency based DSM

机译:ASPIRE:使用基于松弛一致性的DSM在迭代算法中利用异步并行性

获取原文
获取原文并翻译 | 示例
           

摘要

Many vertex-centric graph algorithms can be expressed using asynchronous parallelism by relaxing certain read-after-write data dependences and allowing threads to compute vertex values using stale (i.e., not the most recent) values of their neighboring vertices. We observe that on distributed shared memory systems, by converting synchronous algorithms into their asynchronous counterparts, algorithms can be made tolerant to high inter-node communication latency. However, high inter-node communication latency can lead to excessive use of stale values causing an increase in the number of iterations required by the algorithms to converge. Although by using bounded staleness we can restrict the slow-down in the rate of convergence, this also restricts the ability to tolerate communication latency. In this paper we design a relaxed memory consistency model and consistency protocol that simultaneously tolerate communication latency and minimize the use of stale values. This is achieved via a coordinated use of best effort refresh policy and bounded staleness. We demonstrate that for a range of asynchronous graph algorithms and PDE solvers, on an average, our approach outperforms algorithms based upon: prior relaxed memory models that allow stale values by at least 2.27x; and Bulk Synchronous Parallel (BSP) model by 4.2x. We also show that our approach frequently outperforms GraphLab, a popular distributed graph processing framework.
机译:通过放宽某些写入后读取数据的依赖关系,并允许线程使用其相邻顶点的陈旧(即不是最新)值来计算顶点值,可以使用异步并行性来表达许多以顶点为中心的图算法。我们观察到,在分布式共享内存系统上,通过将同步算法转换为异步算法,可以使算法能够承受较高的节点间通信延迟。但是,高的节点间通信等待时间可能导致过时值的过度使用,从而导致算法收敛所需的迭代次数增加。尽管通过使用有限的陈旧性可以限制收敛速度的降低,但这也限制了容忍通信延迟的能力。在本文中,我们设计了一个宽松的内存一致性模型和一致性协议,该模型同时可以容忍通信延迟并最大程度地减少使用过时的值。这是通过协调使用尽力而为刷新策略和有限的陈旧性来实现的。我们证明,平均而言,对于一系列异步图算法和PDE求解器,我们的方法优于基于以下条件的算法:先前的宽松内存模型,其允许的陈旧值至少为2.27倍;和批量同步并行(BSP)模型的4.2倍。我们还表明,我们的方法通常优于GraphLab(一种流行的分布式图形处理框架)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号