...
首页> 外文期刊>Parallel Computing >Resource aggregation for task-based Cholesky Factorization on top of modern architectures
【24h】

Resource aggregation for task-based Cholesky Factorization on top of modern architectures

机译:基于现代架构的基于任务的Cholesky分解的资源聚合

获取原文
获取原文并翻译 | 示例
           

摘要

Hybrid computing platforms are now commonplace, featuring a large number of CPU cores and accelerators. This trend makes balancing computations between these heterogeneous resources performance critical. In this paper we propose aggregating several CPU cores in order to execute larger parallel tasks and improve load balancing between CPUs and accelerators. Additionally, we present our approach to exploit internal parallelism within tasks, by combining two runtime system schedulers: a global runtime system to schedule the main task graph and a local one to cope with internal task parallelism. We demonstrate the relevance of our approach in the context of the dense Cholesky factorization kernel implemented on top of the StarPU task-based runtime system. We present experimental results showing that our solution outperforms state-of-the-art implementations on two architectures: a heterogeneous CPU+GPU machine and the Intel Xeon Phi Knights Landing processor. (C) 2018 Elsevier B.V. All rights reserved.
机译:混合计算平台现在很普遍,具有大量的CPU内核和加速器。这种趋势使得这些异构资源性能之间的平衡计算变得至关重要。在本文中,我们建议聚合几个CPU内核,以执行更大的并行任务并改善CPU和加速器之间的负载平衡。此外,我们通过组合两个运行时系统调度程序,介绍了一种利用任务内部内部并行性的方法:一个用于调度主任务图的全局运行时系统,一个用于处理内部任务并行性的局部运行时系统。我们在基于StarPU任务的运行时系统之上实现的密集Cholesky因式分解内核的背景下,证明了我们方法的相关性。我们提供的实验结果表明,我们的解决方案在两种体系结构上均优于最新的实现:异构CPU + GPU机器和Intel Xeon Phi Knights Landing处理器。 (C)2018 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号