首页> 外文会议>International Symposium on Parallel Distributed Processing >Transitive Closure on the Cell Broadband Engine: A study on Self-Scheduling in a Multicore Processor
【24h】

Transitive Closure on the Cell Broadband Engine: A study on Self-Scheduling in a Multicore Processor

机译:电池宽带引擎的传递关闭:多核处理器中自我调度研究

获取原文

摘要

In this paper, we present a mapping methodology and optimizations for solving transitive closure on the Cell multi-core processor. Using our approach, it is possible to achieve near peak performance for transitive closure on the Cell processor. We first parallelize the Standard Floyd Warshall algorithm and show through analysis and experimental results that data communication is a bottleneck for performance and scalability. We parallelize a cache optimized version of Floyd Warshall algorithm to remove the memory bottleneck. As is the case with several scientific computing and industrial applications on a multicore processor, synchronization and scheduling of the cores plays a crucial role in determining the performance of this algorithm. We define a self-scheduling mechanism for the cores of a multicore processor and design a self-scheduler for Blocked Floyd Warshall algorithm on the Cell multicore processor to remove the scheduling bottleneck. We also present optimizations in scheduling order to remove synchronization points. Our implementations achieved up to 78GFLOPS.
机译:在本文中,我们提出了一种用于在单元多核处理器上解决传递闭合的映射方法和优化。使用我们的方法,可以在电池处理器上实现用于在电池处理器上的传递关闭的近峰值性能。我们首先通过分析和实验结果并行化标准Floyd Warshall算法并显示数据通信是性能和可扩展性的瓶颈。我们将缓存优化版本的弗洛伊德Warshall算法并行化以删除内存瓶颈。与多核处理器上的多个科学计算和工业应用一样,核心的同步和调度在确定该算法的性能方面起着至关重要的作用。我们为多核处理器的核定义了一种自我调度机制,并在小区多核处理器上设计用于封锁的Floyd Warshall算法的自我调度器,以删除调度瓶颈。我们还通过调度顺序提供优化以删除同步点。我们的实现最多可实现78GFlock。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号