首页> 外文期刊>International journal of parallel programming >Partition Scheduling on Heterogeneous Multicore Processors for Multi-dimensional Loops Applications
【24h】

Partition Scheduling on Heterogeneous Multicore Processors for Multi-dimensional Loops Applications

机译:面向多维循环应用的异构多核处理器上的分区调度

获取原文
获取原文并翻译 | 示例
           

摘要

This paper addresses the scheduling problem for multi-dimensional loops applications on heterogeneous multicore processors. In the multi-dimensional loops scheduling problem, a significant issue is how to hide memory latency to reduce the schedule length. With the increasing CPU speed, the gap between the processor and memory performance is an important bottleneck for modern high-performance computer systems. To solve the bottleneck problem, a variety of techniques have been studied to hide memory latency from intermediate fast memories (caches) to various prefetching and memory management techniques. Although there are a lot of algorithms in the literature to solve the scheduling with memory management problem for multiprocessor systems, they may not deliver good quality with high performance for heterogeneous multicore processors. In this paper, we first propose a scheduling algorithm Recom_Task_Assign to reduce the write activities to main memory. Then, in conjunction with the Recom_Task_Assign algorithm, we present a new partition scheduling algorithm called heterogeneous multiprocessor partition (HMP) based on the prefetching technique for heterogeneous multicore processors, which can hide memory latencies for applications with multi-dimensional loops. This technique takes advantage of memory access pattern information and fully considers the heterogeneity f processors to achieve high processor utilization. Our HMP algorithm selects the appropriate partition size and shape according to different processors, which increases processor utilization and reduces memory latency. Experiments on DSP benchmarks show that our algorithm can efficiently reduce memory latency and enhance parallelism compared with existing methods.
机译:本文解决了异构多核处理器上多维循环应用程序的调度问题。在多维循环调度问题中,一个重要的问题是如何隐藏内存延迟以减少调度长度。随着CPU速度的提高,处理器与内存性能之间的差距成为现代高性能计算机系统的重要瓶颈。为了解决瓶颈问题,已经研究了多种技术以将存储器等待时间从中间快速存储器(高速缓存)隐藏到各种预取和存储器管理技术。尽管文献中有很多算法可以解决多处理器系统的内存管理调度问题,但是对于异构多核处理器,它们可能无法提供高质量的高性能。在本文中,我们首先提出一种调度算法Recom_Task_Assign,以减少对主内存的写入活动。然后,结合Recom_Task_Assign算法,我们基于异构多核处理器的预取技术,提出了一种称为异构多处理器分区(HMP)的新分区调度算法,该算法可以隐藏具有多维循环的应用程序的内存延迟。该技术利用了存储器访问模式信息,并充分考虑了处理器的异构性以实现较高的处理器利用率。我们的HMP算法根据不同的处理器选择合适的分区大小和形状,从而提高了处理器利用率并减少了内存延迟。 DSP基准测试表明,与现有方法相比,我们的算法可以有效地减少内存延迟并增强并行性。

著录项

  • 来源
    《International journal of parallel programming》 |2017年第4期|827-852|共26页
  • 作者

    Yan Wang; Kenli Li; Keqin Li;

  • 作者单位

    School of Computer Science and Educational Software, Guangzhou University, Guangzhou,China,College of Information Science and Engineering, Hunan University, Changsha 410082, China;

    College of Information Science and Engineering, Hunan University, Changsha 410082, China;

    College of Information Science and Engineering, Hunan University, Changsha 410082, China;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    Heterogeneous multicore processor; Memory latency; Multi-dimensional loops; Scheduling;

    机译:异构多核处理器;内存延迟;多维循环;排程;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号