首页> 外文会议>International conference on high performance computing >TaskGenX: A Hardware-Software Proposal for Accelerating Task Parallelism
【24h】

TaskGenX: A Hardware-Software Proposal for Accelerating Task Parallelism

机译:TaskGenX:加速任务并行性的软硬件建议

获取原文

摘要

As chip multi-processors (CMPs) are becoming more and more complex, software solutions such as parallel programming models are attracting a lot of attention. Task-based parallel programming models offer an appealing approach to utilize complex CMPs. However, the increasing number of cores on modern CMPs is pushing research towards the use of fine grained parallelism. Task-based programming models need to be able to handle such workloads and offer performance and scalability. Using specialized hardware for boosting performance of task-based programming models is a common practice in the research community. Our paper makes the observation that task creation becomes a bottleneck when we execute fine grained parallel applications with many task-based programming models. As the number of cores increases the time spent generating the tasks of the application is becoming more critical to the entire execution. To overcome this issue, we propose TaskGenX. TaskGenX offers a solution for minimizing task creation overheads and relies both on the runtime system and a dedicated hardware. On the runtime system side, TaskGenX decouples the task creation from the other runtime activities. It then transfers this part of the runtime to a specialized hardware. We draw the requirements for this hardware in order to boost execution of highly parallel applications. From our evaluation using 11 parallel workloads on both symmetric and asymmetric multicore systems, we obtain performance improvements up to 15 ×, averaging to 3.1 × over the baseline.
机译:随着芯片多处理器(CMP)变得越来越复杂,诸如并行编程模型之类的软件解决方案引起了广泛的关注。基于任务的并行编程模型提供了一种吸引人的方法来利用复杂的CMP。但是,现代CMP上越来越多的内核正在推动研究朝着使用细粒度并行性的方向发展。基于任务的编程模型需要能够处理此类工作负载并提供性能和可伸缩性。使用专用硬件来提高基于任务的编程模型的性能是研究社区的一种常见做法。本文观察到,当我们使用许多基于任务的编程模型执行细粒度的并行应用程序时,任务创建将成为瓶颈。随着内核数量的增加,花费在生成应用程序任务上的时间对于整个执行变得越来越重要。为解决此问题,我们建议使用TaskGenX。 TaskGenX提供了一种用于最小化任务创建开销的解决方案,并依赖于运行时系统和专用硬件。在运行时系统方面,TaskGenX将任务创建与其他运行时活动分离。然后,它将运行时的这一部分传输到专用硬件。我们提出了对此硬件的要求,以促进高度并行应用程序的执行。根据我们在对称和非对称多核系统上使用11个并行工作负载的评估,我们获得的性能提升最多可达到15倍,平均比基准高3.1倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号