首页> 外文期刊>Parallel Computing >Combining PREM compilation and static scheduling for high-performance and predictable MPSoC execution
【24h】

Combining PREM compilation and static scheduling for high-performance and predictable MPSoC execution

机译:用于高性能和可预测的MPSOC执行的预售汇编和静态调度结合

获取原文
获取原文并翻译 | 示例

摘要

Many applications require both high performance and predictable timing. High-performance can be provided by COTS Multi-Core System on Chips (MPSoC), however, as cores in these systems share main memory, they are susceptible to interference from each other, which is a problem for timing predictability. We achieve predictability on multi-cores by employing the predictable execution model (PREM), which splits execution into a sequence of memory and compute phases, and schedules these such that only a single core is executing a memory phase at a time.We present a toolchain consisting of a compiler and a scheduling tool. Our compiler uses region and loop based analysis and performs tiling to transform application code into PREM-compliant binaries. In addition to enabling predictable execution, the compiler transformation optimizes accesses to the shared main memory. The scheduling tool uses a state-of-the-art heuristic algorithm and is able to schedule industrial-size instances. For smaller instances, we compare the results of the algorithm with optimal solutions found by solving an integer linear programming model. Furthermore, we solve the problem of scheduling execution on multiple cores while preventing interference of memory phases.We evaluate our toolchain on Advanced Driver Assistance System (ADAS) application workloads running on an NVIDIA Tegra X1 embedded system-on-chip (SoC). The results show that our approach maintains similar average performance to the original (unmodified) program code and execution, while reducing variance of completion times by a factor of 9 with the identified optimal solutions and by a factor of 5 with schedules generated by our heuristic scheduler. (C) 2018 Published by Elsevier B.V.
机译:许多应用需要高性能和可预测的定时。 COTS多核系统可以在芯片(MPSOC)上提供高性能(MPSOC),然而,由于这些系统中的核心共享主存储器,它们易于彼此干扰,这是用于定时可预测性的问题。我们通过采用可预测的执行模型(PREM)来实现对多核的可预测性,该预测执行模型(PROM)将执行分成一系列存储器和计算阶段,并将其调度,使得只有单个核心一次执行存储阶段.we呈现a工具链包括编译器和调度工具。我们的编译器使用区域和循环基于循环的分析,并执行平铺以将应用程序代码转换为符合预倒的二进制文件。除了启用可预测的执行之外,编译器转换还优化了对共享主存储器的访问。调度工具使用最先进的启发式算法,并且能够安排工业大小的实例。对于较小的实例,我们通过求解整数线性编程模型来比较算法的结果。此外,我们解决了在阻止内存阶段的干扰的同时解决了多核在多个核上的调度问题问题.we评估我们在NVIDIA TEGRA X1嵌入式系统上运行的高级驱动程序辅助系统(ADAS)应用程序工作负载上的Toolchain。结果表明,我们的方法对原始(未修改)计划代码和执行保持了类似的平均性能,同时将完成时间的方差减少了9系数9,其中识别的最佳解决方案和由我们启发式调度器产生的时间表有5倍。 。 (c)2018由elestvier b.v出版。

著录项

  • 来源
    《Parallel Computing》 |2019年第7期|27-44|共18页
  • 作者单位

    Czech Tech Univ Fac Elect Engn Tech 2 Prague Czech Republic|Czech Tech Univ Czech Inst Informat Robot & Cybernet Prague 1580 3 Czech Republic;

    Swiss Fed Inst Technol Inst Integrierte Syst Gloriastr 35 Zurich Switzerland;

    Czech Tech Univ Czech Inst Informat Robot & Cybernet Prague 1580 3 Czech Republic;

    Czech Tech Univ Czech Inst Informat Robot & Cybernet Prague 1580 3 Czech Republic;

    Swiss Fed Inst Technol Inst Integrierte Syst Gloriastr 35 Zurich Switzerland;

    Univ Bologna Viale Risorgimento 2 Bologna Italy;

    Czech Tech Univ Czech Inst Informat Robot & Cybernet Prague 1580 3 Czech Republic;

  • 收录信息 美国《科学引文索引》(SCI);美国《工程索引》(EI);
  • 原文格式 PDF
  • 正文语种 eng
  • 中图分类
  • 关键词

    PREM; Predictability; LLVM; Static scheduling; Integer linear programming; NVIDIA TX1;

    机译:PREM;可预测性;LLVM;静态调度;整数线性编程;NVIDIA TX1;

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号