首页> 外文期刊>Circuits, systems and signal processing >An Energy-Efficient and Approximate Accelerator Design for Real-Time Canny Edge Detection
【24h】

An Energy-Efficient and Approximate Accelerator Design for Real-Time Canny Edge Detection

机译:实时罐头边缘检测的节能近似加速器设计

获取原文
获取原文并翻译 | 示例
           

摘要

This paper proposes a dedicated hardware design approach focused on the adoption of state-of-the-art approximate adders (AAs) for the design of CMOS (complementary metal-oxide-semiconductor) Canny edge detection hardware accelerators. The proposed method leverages state-of-the-art AAs in the compute-intensive Gaussian and Gradient filter steps of the Canny edge detection algorithm. The key objectives of our accelerator architecture are: (1) to provide real-time Canny edge operation by proposing an energy-efficient ASIC (application specific integrated circuit) architecture and (2) to further reduce energy consumption when adopting the proposed design-time approach for approximate arithmetic operations. The proposed accelerator architecture considers two methods for the magnitude computation: (1) the square root operator and (2) the absolute operator. All proposed architectures herein developed were described in VHDL and synthesized in a 45 nm digital CMOS ASIC design. Results show that the baseline architecture takes only 0.42 ms to process an 8-bit 512 x 512 pixels image at a maximum VLSI operating frequency of 631 MHz. When considering all the approximate architecture versions and the methods for magnitude computation, the maximum energy reduction achieved is 44.3% when compared to the baseline architecture in an iso-performance analysis. This significant energy reduction is achieved when an averageFmeasure quality metric equal to 0.79 is obtained.
机译:本文提出了一种专注于采用最先进的近似加法器(AAS)的专用硬件设计方法,用于设计CMOS(互补金属氧化物半导体)罐头边缘检测硬件加速器。所提出的方法利用核心边缘检测算法的计算密集型高斯和梯度滤波器步骤中的最先进的AAS。我们的加速器架构的关键目标是:(1)通过提出节能ASIC(应用特定集成电路)架构和(2),以进一步降低采用所提出的设计时的能量消耗来提供实时罐头边缘操作近似算术运算的方法。所提出的加速器架构考虑了幅度计算的两种方法:(1)方形算子和(2)绝对操作员。在VHDL中描述了本文的所有提出架构,并以45nm数字CMOS ASIC设计合成。结果表明,基线架构仅需要0.42ms以处理8位512 x 512像素图像,以631 MHz的最大VLSI工作频率。在考虑所有近似架构版本和幅度计算方法时,与ISO性能分析中的基线架构相比,所实现的最大能量减少为44.3%。当获得等于0.79的平均频率质量度量时,实现了显着的能量降低。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号