首页> 外文会议>2016 5th Workshop on Extreme-Scale Programming Tools >Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime
【24h】

Automatic Code Generation and Data Management for an Asynchronous Task-Based Runtime

机译:基于任务的异步运行时的自动代码生成和数据管理

获取原文
获取原文并翻译 | 示例

摘要

Hardware scaling and low-power considerations associated with the quest for exascale and extreme scale computing are driving system designers to consider new runtime and execution models such as the event-driven-task (EDT) models that enable more concurrency and reduce the amount of synchronization. Further, for performance, productivity, and code sustainability reasons, there is an increasing demand for auto-parallelizing compiler technologies to automatically produce code for EDT-based runtimes. However achieving scalable performance in extreme-scale systems with auto-generated codes is a non-trivial challenge. Some of the key requirements that are important for achieving good scalable performance across many EDT-based systems are: (1) scalable dynamic creation of task-dependence graph and spawning of tasks, (2) scalable creation and management of data and communications, and (3) dynamic scheduling of tasks and movement of data for scalable asynchronous execution. In this paper, we develop capabilities within R-Stream - an automatic source-to-source optimization compiler - for automatic generation and optimization of code and data management targeted towards Open Community Runtime (OCR) - an exascale-ready asynchronous task-based runtime. We demonstrate the effectiveness of our techniques through performance improvements on various benchmarks and proxy application kernels that are relevant to the extreme-scale computing community.
机译:与寻求百亿亿次规模和极限规模计算相关的硬件扩展和低功耗考虑,促使系统设计人员考虑使用新的运行时和执行模型,例如事件驱动任务(EDT)模型,以实现更高的并发性并减少同步量。此外,出于性能,生产力和代码可持续性的原因,对自动并行化编译器技术以自动为基于EDT的运行时生成代码的需求日益增长。然而,在具有自动生成的代码的极端规模系统中实现可伸缩的性能并非易事。对于在许多基于EDT的系统中实现良好的可伸缩性能而言,一些关键要求很重要:(1)任务相关图的可伸缩动态创建和任务的产生;(2)数据和通信的可伸缩创建和管理;以及(3)动态任务调度和数据移动,以实现可扩展的异步执行。在本文中,我们开发了R-Stream(一种自动的源到源优化编译器)内部的功能,用于针对开放社区运行时(OCR)(一种基于百亿字节就绪的基于异步任务的运行时)的代码和数据管理的自动生成和优化。 。我们通过在与超大规模计算社区相关的各种基准和代理应用程序内核上的性能改进来证明我们的技术的有效性。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号