首页> 外文会议>IEEE International Symposium on Smart Electronic Systems >Novel Bit-Sliced Near-Memory Computing Based VLSI Architecture for Fast Sobel Edge Detection in IoT Edge Devices
【24h】

Novel Bit-Sliced Near-Memory Computing Based VLSI Architecture for Fast Sobel Edge Detection in IoT Edge Devices

机译:基于新的位切割近记忆计算的基于VLSI架构,用于IOT边缘设备中的快速Sobel边缘检测

获取原文

摘要

For today’s Internet-of-Things (IoT) edge devices, there is an acute need for fast and power-efficient hardware for an image processing task. Traditional hardware solutions with sequential and/or pipelined architectures incur high latency and power. This motivates us to propose a novel near-memory computing architecture for rapid image processing. We propose a bit-sliced near-memory computing architecture for CMOS VLSI implementation for fast Sobel edge detection in IoT edge devices. To the best of our knowledge, this is the first work to propose near-memory computing based VLSI architecture for edge detection. The novelty of the proposed work is that one image can be processed in constant time irrespective of the image size. Binary images are used as input to the design. The Sobel operator equations are simplified by operator strength reduction, bit manipulation, and common term sharing across equations. The captured image is loaded into the design and all block-level operations are executed in parallel close to where the data resides. The architecture is highly modular and can be scaled for any image size. The block processing element (PE) is implemented at the gate-level with the Synopsys Design Compiler in 32 nm CMOS technology node using SAED 32/28 nm Process Design Kit. For processing one block frame (3x3 pixel block), the number of logic gates required is 22 with a worst-case delay of 1.5 fs and a total area of 111 n m2. The average power dissipation is 2.27 μW at 1.05 V supply voltage.
机译:对于当今的互联网(IOT)边缘设备,对于图像处理任务的快速和高功率硬件,急需需要。具有顺序和/或流水线架构的传统硬件解决方案承受了高延迟和功率。这使我们提出了一种用于快速图像处理的新型近记忆计算架构。我们为CMOS VLSI实现提出了一个有点切片的近记忆计算架构,用于IOT边缘设备中的快速Sobel边缘检测。据我们所知,这是第一个提出基于内存计算的VLSI架构的第一项工作,用于边缘检测。所提出的工作的新颖性是无论图像尺寸如何,都可以在恒定时间内处理一个图像。二进制图像用作设计的输入。通过操作员强度减小,比特操纵和跨等式共享共同共享简化了Sobel操作员方程。捕获的图像被加载到设计中,并且所有块级操作都在靠近数据所在的位置并行执行。架构高度模块化,可以缩放任何图像大小。块处理元件(PE)在栅极级实现,使用Synopsys Design Compiler在32 NM CMOS技术节点中使用SAF 32/28 NM工艺设计套件。用于处理一个块帧(3x3像素块),所需的逻辑门数是22,最坏情况延迟为1.5 fs,总面积为111 n m 2 。平均功耗为1.05 V电源电压为2.27μW。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号