Journal: Parallel Computing

Static scheduling of the LU factorization with look-ahead on asymmetric multicore processors

Abstract

We analyze the benefits of look-ahead in the parallel execution of the LU factorization with partial pivoting (LUpp) in two distinct "asymmetric" multicore scenarios. The first one corresponds to an actual hardware-asymmetric architecture such as the Samsung Exynos 5422 system-on-chip (SoC), equipped with an ARM big.LITTLE processor consisting of a quad-core Cortex-A15 cluster plus a quad-core Cortex-A7 cluster. For this scenario, we propose a careful mapping of the different types of tasks appearing in LUpp to the computational resources, in order to produce an efficient architecture-aware exploitation of the computational resources integrated in this SoC. The second asymmetric configuration appears in a hardware-symmetric multicore architecture where the cores can individually operate at different frequency levels. In this scenario, we show how to employ the frequency slack to accelerate the tasks in the critical path of LUpp in order to produce a faster global execution as well as a lower energy consumption. (C) 2018 Elsevier B.V. All rights reserved.
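
For readers unfamiliar with look-ahead, the following is a minimal sequential sketch of the general idea in a blocked LUpp, assuming NumPy and a nonsingular square matrix. It is not the paper's implementation and says nothing about its mapping of tasks to big/LITTLE cores or its frequency scaling. All function names (`factor_panel`, `apply_swaps`, `update`, `lu_lookahead`) are hypothetical. Within each iteration, "branch 1" (update and factorize the next panel, which lies on the critical path) and "branch 2" (update the remaining trailing columns) touch disjoint columns, so a parallel code could run them on different groups of cores; here they simply run one after the other.

```python
import numpy as np

def factor_panel(A, k, kb):
    """Unblocked LUpp of the panel A[k:, k:k+kb], in place.
    Row swaps are applied inside the panel only; pivot rows are returned."""
    piv = []
    for j in range(k, k + kb):
        p = j + int(np.argmax(np.abs(A[j:, j])))   # partial pivoting
        piv.append(p)
        if p != j:
            A[[j, p], k:k + kb] = A[[p, j], k:k + kb]
        A[j + 1:, j] /= A[j, j]                     # assumes a nonzero pivot
        A[j + 1:, j + 1:k + kb] -= np.outer(A[j + 1:, j], A[j, j + 1:k + kb])
    return piv

def apply_swaps(A, k, piv, c0, c1):
    """Apply the row swaps of the panel starting at row k to columns c0:c1."""
    for i, p in enumerate(piv):
        j = k + i
        if p != j:
            A[[j, p], c0:c1] = A[[p, j], c0:c1]

def update(A, k, kb, c0, c1):
    """Update columns c0:c1 with panel k: triangular solve, then matrix product."""
    if c0 >= c1:
        return
    L11 = np.tril(A[k:k + kb, k:k + kb], -1) + np.eye(kb)
    A[k:k + kb, c0:c1] = np.linalg.solve(L11, A[k:k + kb, c0:c1])
    A[k + kb:, c0:c1] -= A[k + kb:, k:k + kb] @ A[k:k + kb, c0:c1]

def lu_lookahead(A, b):
    """Blocked LUpp with look-ahead of depth 1 (sequential illustration).
    Returns (LU, perm) such that A[perm] == L @ U up to rounding."""
    A = np.array(A, dtype=float)
    n = A.shape[0]
    perm = np.arange(n)
    piv = factor_panel(A, 0, min(b, n))             # first panel
    k = 0
    while k < n:
        kb = min(b, n - k)
        # Swaps of the current panel, deferred until now for all other columns.
        apply_swaps(A, k, piv, 0, k)
        apply_swaps(A, k, piv, k + kb, n)
        for i, p in enumerate(piv):
            perm[[k + i, p]] = perm[[p, k + i]]
        nxt = k + kb
        nb = min(b, n - nxt)
        next_piv = []
        # Branch 1 (critical path): update the next panel, then factorize it.
        if nb > 0:
            update(A, k, kb, nxt, nxt + nb)
            next_piv = factor_panel(A, nxt, nb)
        # Branch 2 (independent of branch 1): update the remaining columns.
        update(A, k, kb, nxt + nb, n)
        piv, k = next_piv, nxt
    return A, perm
```

A quick check of the sketch is to rebuild `L` and `U` from the returned array and compare `A[perm]` against `L @ U` with `np.allclose`. The block size `b` is a free parameter here; how to choose it, and how to split cores between the two branches, is exactly the kind of question the paper addresses.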