首页> 外文会议>International Conference on Parallel and Distributed Processing Techniques and Applications >Dynamic performance prediction for chunk-wise parallelization on heterogeneous CPU/GPU systems
【24h】

Dynamic performance prediction for chunk-wise parallelization on heterogeneous CPU/GPU systems

机译:异构CPU / GPU系统的块明行化动态性能预测

获取原文

摘要

Many aspects of heterogeneity in multicores such as performance variation may affect the overall execution time and cores efficiency. An effective mapping should support this variation. A complex challenge is cores load balancing to minimize the program makespan. In this context, we propose a predictive approach using iterations chunking at runtime allowing parallel code adaptation to heterogeneous systems containing GPU and heterogeneous CPUs. Our approach is based on thread pinning and performance detection at runtime. From a parallel program, we run a chunk on the GPU device. Another chunk is running on CPU using a first mapping assuming homogeneous cores. Then, performance assessment would correct mapping by speculating the future GPU and core's state. The new mapping would be then applied to a new chunk for further evaluation and prediction. This process would stop when the program is fully executed or when judging that chunking is no longer effective.
机译:多电线的异质性的许多方面,例如性能变化可能会影响整体执行时间和核效率。 有效的映射应该支持这种变化。 复杂的挑战是核心负载平衡,以最大限度地减少程序Makespan。 在这种情况下,我们提出了一种预测方法,使用运行时的迭代块散布允许并行代码适应包含GPU和异构CPU的异构系统。 我们的方法是基于线程固定和运行时的性能检测。 从一个并行程序,我们在GPU设备上运行块。 使用假设均匀核心的第一映射,另一个块在CPU上运行。 然后,通过推测未来的GPU和核心状态,性能评估将正确映射。 然后将新的映射应用于新块以进行进一步评估和预测。 当程序完全执行或判断该块不再有效时,此过程将停止。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号