首页> 外文期刊>Parallel Computing >Exploiting task and data parallelism in ILUPACK's preconditioned CG solver on NUMA architectures and many-core accelerators
【24h】

Exploiting task and data parallelism in ILUPACK's preconditioned CG solver on NUMA architectures and many-core accelerators

机译:在NUMA架构和多核加速器上的ILUPACK预处理CG解算器中利用任务和数据并行性

获取原文
获取原文并翻译 | 示例
           

摘要

We present specialized implementations of the preconditioned iterative linear system solver in ILUPACK for Non-Uniform Memory Access (NUMA) platforms and many-core hardware co-protessors based on the Intel Xeon Phi and graphics accelerators. For the conventional x86 architectures, our approach exploits task parallelism via the OmpSs runtime as well as a message-passing implementation based on MPI, respectively yielding a dynamic and static schedule of the work to the cores, with different numeric semantics to those of the sequential ILUPACK. For the graphics processor we exploit data parallelism by off-loading the computationally expensive kernels to the accelerator while keeping the numeric semantics of the sequential case. (C) 2015 Elsevier B.V. All rights reserved.
机译:我们在ILUPACK中为非均匀内存访问(NUMA)平台以及基于Intel Xeon Phi和图形加速器的多核硬件协处理器提供了预处理迭代线性系统求解器的专用实现。对于传统的x86架构,我们的方法通过OmpSs运行时以及基于MPI的消息传递实现来利用任务并行性,从而分别向内核生成工作的动态和静态时间表,其数字语义与顺序语义不同。 ILUPACK。对于图形处理器,我们通过将计算量大的内核卸载到加速器中,同时保留顺序大小写的数字语义,从而利用数据并行性。 (C)2015 Elsevier B.V.保留所有权利。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号