首页> 外国专利> Compiler techniques for mapping program code to high-performance, power-efficient, programmable image processing hardware platforms

Compiler techniques for mapping program code to high-performance, power-efficient, programmable image processing hardware platforms

机译:用于将程序代码映射到高性能,高能效,可编程图像处理硬件平台的编译器技术

摘要

One method is described. The method comprises compiling targeted program code of an image processor having a programmable stencil processor consisting of respective two dimensional execution lanes and shift register circuitry. The program code implements a directed acyclic graph and consists of multiple kernels executed on each of the stencil processors, and compiling has a different number of kernels in the program code than the stencil processors in the image processor Recognize that at least one of the kernels is more computationally intensive than another of the kernels, and the program code has a resource requirement that exceeds the memory capacity of the image processor Recognize, including any. Compiling may further, in response to any of the above recognizing, horizontal fusion of kernels, vertical fusion of kernels, splitting of kernels into one, multiple kernels, multiple spatial divisions of kernels Including performing either a spatial partitioning into a kernel, or a partitioning of a directed acyclic graph into smaller graphs.
机译:描述了一种方法。该方法包括编译图像处理器的目标程序代码,该图像处理器具有由相应的二维执行通道和移位寄存器电路组成的可编程模板处理器。该程序代码实现了有向无环图,并且由在每个模板处理器上执行的多个内核组成,并且与图像处理器中的模板处理器相比,编译在程序代码中具有不同数量的内核认识到至少有一个内核是与另一个内核相比,它的计算强度更高,并且程序代码的资源要求超过了图像处理器Recognize的存储容量,包括任何一个。响应于上述识别中的任何一个,编译还可以进一步包括:内核的水平融合,内核的垂直融合,将内核分为一个,多个内核,内核的多个空间划分,包括将空间划分成一个内核,或者进行分区。将有向无环图分解成较小的图。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号