首页> 外文会议>International Conference for High Performance Computing, Networking, Storage and Analysis >Optimizing Data Locality for Fork/Join Programs Using Constrained Work Stealing
【24h】

Optimizing Data Locality for Fork/Join Programs Using Constrained Work Stealing

机译:使用受限工作窃取来优化Fork / Join程序的数据局部性

获取原文

摘要

We present an approach to improving data locality across different phases of fork/join programs scheduled using work stealing. The approach consists of: (1) user-specified and automated approaches to constructing a steal tree, the schedule of steal operations, and (2) constrained work-stealing algorithms that constrain the actions of the scheduler to mirror a given steal tree. These are combined to construct work-stealing schedules that maximize data locality across computation phases while ensuring load balance within each phase. These algorithms are also used to demonstrate dynamic coarsening, an optimization to improve spatial locality and sequential overheads by combining many finer-grained tasks into coarser tasks while ensuring sufficient concurrency for locality-optimized load balance. Implementation and evaluation in Cilk demonstrate performance improvements of up to 2.5x on 80 cores. We also demonstrate that dynamic coarsening can combine the performance benefits of coarse task specification with the adaptability of finer tasks.
机译:我们提出了一种方法,用于在使用工作窃取计划的fork / join程序的不同阶段中改善数据局部性。该方法包括:(1)用户指定的自动方法来构造窃取树,窃取操作的时间表以及(2)约束的工作窃取算法,这些算法会限制调度程序的行为以反映给定的窃取树。将它们组合在一起,以构造工作窃取计划,以最大程度地确保整个计算阶段的数据局部性,同时确保每个阶段内的负载平衡。这些算法还用于演示动态粗化,通过将许多更细粒度的任务合并为较粗的任务来优化空间局部性和顺序开销的优化,同时确保足够的并发性以进行局部优化的负载平衡。 Cilk的实施和评估表明,在80个内核上的性能提高了2.5倍。我们还证明了动态粗化可以将粗略任务规范的性能优势与细粒度任务的适应性相结合。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号