...
首页> 外文期刊>Parallel Computing >Resource aggregation for task-based Cholesky Factorization on top of modern architectures
【24h】

Resource aggregation for task-based Cholesky Factorization on top of modern architectures

机译:基于任务的Cholesky分解资源聚合在现代架构上的顶部

获取原文
获取原文并翻译 | 示例
           

摘要

Hybrid computing platforms are now commonplace, featuring a large number of CPU cores and accelerators. This trend makes balancing computations between these heterogeneous resources performance critical. In this paper we propose aggregating several CPU cores in order to execute larger parallel tasks and improve load balancing between CPUs and accelerators. Additionally, we present our approach to exploit internal parallelism within tasks, by combining two runtime system schedulers: a global runtime system to schedule the main task graph and a local one to cope with internal task parallelism. We demonstrate the relevance of our approach in the context of the dense Cholesky factorization kernel implemented on top of the StarPU task-based runtime system. We present experimental results showing that our solution outperforms state-of-the-art implementations on two architectures: a heterogeneous CPU+GPU machine and the Intel Xeon Phi Knights Landing processor. (C) 2018 Elsevier B.V. All rights reserved.
机译:混合计算平台现在是普遍的,具有大量CPU核心和加速器。这种趋势使得这些异构资源性能之间的平衡计算至关重要。在本文中,我们提出了聚合了多个CPU核心,以便执行更大的并行任务,并提高CPU和加速器之间的负载平衡。此外,我们通过组合两个运行时系统调度程序介绍我们在任务中利用内部并行性的方法:全局运行时系统调度主要任务图和本地任务并行性。我们展示了我们在基于Starpu任务的运行时系统之上实现的密集孔基分组内核的方法的相关性。我们提出了实验结果,表明我们的解决方案优于两种架构的最先进的实施:异构CPU + GPU机器和英特尔Xeon Phi Knights着陆处理器。 (c)2018年elestvier b.v.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号