首页> 外文期刊>ACM Transactions on Graphics >Darkroom: Compiling High-Level Image Processing Code into Hardware Pipelines
【24h】

Darkroom: Compiling High-Level Image Processing Code into Hardware Pipelines

机译:暗室:将高级图像处理代码编译到硬件管道中

获取原文
获取原文并翻译 | 示例

摘要

Specialized image signal processors (ISPs) exploit the structure ofrnimage processing pipelines to minimize memory bandwidth usingrnthe architectural pattern of line-buffering, where all intermediate datarnbetween each stage is stored in small on-chip buffers. This providesrnhigh energy efficiency, allowing long pipelines with tera-op/sec. imagernprocessing in battery-powered devices, but traditionally requiresrnpainstaking manual design in hardware. Based on this pattern, wernpresent Darkroom, a language and compiler for image processing.rnThe semantics of the Darkroom language allow it to compile programsrndirectly into line-buffered pipelines, with all intermediaternvalues in local line-buffer storage, eliminating unnecessary communicationrnwith off-chip DRAM. We formulate the problem ofrnoptimally scheduling line-buffered pipelines to minimize bufferingrnas an integer linear program. Finally, given an optimally scheduledrnpipeline, Darkroom synthesizes hardware descriptions for ASIC orrnFPGA, or fast CPU code. We evaluate Darkroom implementationsrnof a range of applications, including a camera pipeline, low-level featurerndetection algorithms, and deblurring. For many applications, werndemonstrate gigapixel/sec. performance in under 0.5mm2 of ASICrnsilicon at 250 mW (simulated on a 45nm foundry process), realtimern1080p/60 video processing using a fraction of the resourcesrnof a modern FPGA, and tens of megapixels/sec. of throughput on arnquad-core x86 processor.
机译:专用图像信号处理器(ISP)利用行缓冲的架构模式利用图像处理流水线的结构来最大程度地减少内存带宽,其中每级之间的所有中间数据都存储在小型片上缓冲区中。这提供了很高的能源效率,从而允许以tera-op / sec的长距离管道。电池供电设备中的图像处理,但传统上需要在硬件中进行手动设计。基于这种模式,我们提出了Darkroom(一种用于图像处理的语言和编译器)。rnDarkroom语言的语义使它可以将程序直接编译为行缓冲的管道,所有中间值都存储在本地行缓冲存储器中,从而消除了与片外DRAM的不必要通信。 。我们提出了优化调度行缓冲流水线以最小化整数线性程序缓冲的问题。最后,给定最佳调度的管道,Darkroom会为ASIC orrnFPGA或快速CPU代码合成硬件描述。我们评估了Darkroom的实现方案,并提供了一系列应用程序,包括摄像头管道,低级特征检测算法和去模糊处理。对于许多应用程序,请每秒显示千兆像素。在250 mW的0.5平方毫米以下的ASIC硅上实现了高性能(在45nm铸造工艺上模拟),使用现代FPGA的一小部分资源进行实时1080p / 60视频处理以及数十百万像素/秒。 arnquad核心x86处理器的吞吐量。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号