首页> 外文会议>International conference on supercomputing >Problem and Machine Sensitive Communication Optimization
【24h】

Problem and Machine Sensitive Communication Optimization

机译:问题和机器敏感通信优化

获取原文

摘要

Reducing communication costs can significantly improve the execution time of a parallel program. This paper presents a new approach for communication optimization in data parallel programs that is based on global data flow analysis and performance prediction. Our techniques are based on simple yet highly effective data flow equations which are solved iteratively for arbitrary control flow graphs. Previous techniques are based on fixed communication optimization strategies whose quality can be very sensible to changes of problem and machine sizes. Our algorithm is novel in that we carefully examine tradeoffs between communication latency hiding and reducing the number and volume of messages (e.g. message coalescing and aggregating) by systematically evaluating a reasonable set of promising communication placements for a given program covering several (possibly conflicting) communication guiding profit motives. P~3T, a state-of-the-art performance estimator that carefully models problem and machine characteristics, is used to ensure communication buffer-safety and to find the best communication placement of all created ones. Employing an accurate performance estimator opens ground for more aggressive communication optimization opportunities that carefully examine performance gains and losses among applicable optimization strategies which is not achievable with most existing approaches. Our techniques are based on data flow analyses that enable vectorizing, coalescing and aggregating communication, and overlapping communication with computation both within and across loop nests all in a unified framework. We present results for the Meiko CS-2 and a network of Sparc workstations based on a preliminary implementation which show that our method implies a significant reduction in communication costs and demonstrate the effectiveness of this analysis in improving the overall performance of data parallel programs.
机译:降低通信成本可以显着提高并行程序的执行时间。本文介绍了基于全局数据流分析和性能预测的数据并行程序中的通信优化的新方法。我们的技术基于简单但高效的数据流方程,其迭代地解决任意控制流程图。以前的技术基于固定的通信优化策略,其质量可以对问题和机器尺寸的变化非常明智。我们的算法是新颖的,因为我们仔细检查了通信延迟覆盖和减少了消息的数量和减少消息的数量和体积(例如消息聚合和聚合),以便为给定程序覆盖多个(可能冲突)通信的合理的通信介绍指导利润动机。 P〜3T,最先进的性能估算器,用于仔细模拟问题和机器特性,用于确保通信缓冲安全性,并找到所有创建的沟通安全性。采用准确的绩效估算器将打开地面以获得更积极的通信优化机会,仔细检查适用的优化策略之间的性能收益和损失,这些策略与大多数现有方法无法实现。我们的技术基于数据流分析,该数据流分析使得可以在统一的框架中实现与循环内部和跨越循环内部和跨越循环嵌套的传播的传播,结合和聚合通信。我们根据初步实施,我们向Meiko CS-2和SPARC工作站网络提供了结果,表明我们的方法意味着沟通成本显着降低,并证明了该分析在提高数据并行程序的整体性能方面的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号