Symbolic parallelization of loop programs for massively parallel processor arrays


Abstract

In this paper, we present a first solution to the unsolved problem of jointly tiling and scheduling a given loop nest with uniform data dependencies symbolically. This problem arises for loop programs whose iterations are to be optimally scheduled on a processor array whose size is unknown at compile time. Still, we show that it is possible to derive parameterized latency-optimal schedules statically by proposing two new program transformations. In the first step, the iteration space is tiled symbolically into orthotopes of parameterized extensions. The resulting tiled program is subsequently scheduled symbolically. Here, we show that the maximal number of potential optimal schedules is upper bounded by 2^n n!, where n is the dimension of the loop nest. However, the actual number of optimal schedule candidates is much lower than this bound. At run time, once the size of the processor array becomes known, simple comparisons of latency-determining expressions determine which of these schedules is dynamically activated and which corresponding program configuration is executed on the resulting processor array, so as to avoid any further run-time optimization or expensive recompilation.
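The bound and the run-time selection step described in the abstract can be illustrated with a small Python sketch. Everything beyond the formula 2^n n! is an assumption made for illustration: the abstract does not say how the candidates are constructed, so `enumerate_candidates` simply pairs every ordering of the n loop dimensions with every choice of direction per dimension, which is one way to obtain a count of that form. The latency expressions `A` and `B` are made up and are not taken from the paper.

```python
from itertools import permutations, product
from math import factorial


def candidate_bound(n: int) -> int:
    """Upper bound 2^n * n! on the number of potential optimal schedules."""
    return 2 ** n * factorial(n)


def enumerate_candidates(n: int):
    """Hypothetical candidate set (an assumption, not the paper's construction):
    each ordering of the n dimensions paired with a direction (+1 or -1)
    for every dimension."""
    return [
        (order, signs)
        for order in permutations(range(n))
        for signs in product((1, -1), repeat=n)
    ]


def select_schedule(latency_exprs, array_size):
    """Run-time step: evaluate each candidate's latency expression for the
    now-known processor array size and return the name of the smallest."""
    return min(latency_exprs, key=lambda name: latency_exprs[name](*array_size))


# Made-up latency expressions for a p1 x p2 processor array (illustrative only).
latency_exprs = {
    "A": lambda p1, p2: 64 // p1 + 2 * p2,
    "B": lambda p1, p2: 64 // p2 + 2 * p1,
}

if __name__ == "__main__":
    print(candidate_bound(2), len(enumerate_candidates(2)))
    print(select_schedule(latency_exprs, (8, 2)))
    print(select_schedule(latency_exprs, (2, 8)))
```

In this sketch the expensive part (deriving the candidate expressions) would happen once at compile time, and only the cheap comparison in `select_schedule` runs after the array size is known, which mirrors the division of work the abstract describes.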

Bibliographic details

Source
IEEE International Conference on Application-specific Systems, Architectures and Processors | 2013 | 9 pages
Authors
Teich Jurgen; Tanase Alexandru; Hannig Frank
Original format: PDF
Chinese Library Classification: General structure, system architecture

Similar literature


1. Symbolic Multi-Level Loop Mapping of Loop Programs for Massively Parallel Processor Arrays [J]. Tanase Alexandru, Witterauf Michael, Teich Juergen. ACM Transactions on Embedded Computing Systems. 2018, Issue 2
2. Solar-Blind Focal Plane Array Photodetectors for Massive Parallel Processing Application Based on Optoelectronic Integrated Circuit and Field-Programmable Gate Array [J]. Hary Oktavianto, Keisuke Yamane, Hiroto Sekiguchi. Sensors and Materials. 2015, Issue 10
3. Industrial Strength Parallel Computing: Programming Massively Parallel Processors, by Alice E. Koniges [J]. Robert E. Filman. Scientific Programming. 2004, Issue 1
4. Symbolic parallelization of loop programs for massively parallel processor arrays [C]. Teich Jurgen, Tanase Alexandru, Hannig Frank. IEEE International Conference on Application-specific Systems, Architectures and Processors. 2013
5. IMAGE PROCESSING ON MPP-LIKE ARRAYS (MASSIVELY PARALLEL PROCESSOR) [D]. COLETTI, NEIL BOYD. 1983
6. Understanding GPU Programming for Statistical Computation: Studies in Massively Parallel Massive Mixtures [O]. Marc A. Suchard, Quanli Wang, Cliburn Chan
7. Symbolic Array Dataflow Analysis for Array Privatization and Program Parallelization [O]. Junjie Gu, Zhiyuan Li, Gyungho Lee. 1995
8. Comparison of the Cellular Logic Image Processor (CLIP) 4; Distributed Array Processor (DAP) and Massively Parallel Processor (MPP) Processor-Array Implementations [R]. Gerritsen, F. A. 1982
