首页> 外国专利> Compiler technologies for mapping program code to high-performance, power-efficient programmable image processing hardware platforms

Compiler technologies for mapping program code to high-performance, power-efficient programmable image processing hardware platforms

机译:用于将程序代码映射到高性能,高能效的可编程图像处理硬件平台的编译器技术

摘要

The method is described. The method includes compiling program code that is the target of an image processor having programmable stencil processors configured with respective two-dimensional execution lanes and shift register circuit structures. The program code is for implementing a directional acyclic graph and is comprised of a plurality of kernels to run on each of the stencil processors of the stencil processors, wherein the compiling comprises: a different number of kernels than the stencil processors in the image processor, And recognizing that the user is present in the user terminal; Recognizing that at least one of the kernels is more computationally intensive than the other of the kernels; And recognizing that the program code has resource requirements that exceed the memory capacity of the image processor. The step of compiling may comprise, in response to any recognition of the above recognition: horizontal fusion of the kernels; Vertical fusion of kernels; Splitting one of the kernels into multiple kernels; Space partitioning the kernel into a number of spatially partitioned kernels; Further comprising performing any of dividing the directed acyclic graph into smaller graphs.
机译:描述了该方法。该方法包括编译程序代码,该程序代码是具有被配置有相应的二维执行通道和移位寄存器电路结构的可编程模板处理器的图像处理器的目标。程序代码用于实现有向非循环图,并且由多个内核构成,以在模板处理器的每个模板处理器上运行,其中,编译包括:与图像处理器中的模板处理器不同数量的内核,并识别出该用户存在于用户终端中;认识到至少一个内核比另一个内核的计算强度更大;并且认识到程序代码的资源要求超过了图像处理器的存储容量。响应于对上述识别的任何识别,编译步骤可以包括:内核的水平融合;以及内核的垂直融合;将一个内核拆分为多个内核;将内核空间划分为多个空间分区的内核;进一步包括执行将有向无环图划分成较小图的任一个。

著录项

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号