首页> 美国卫生研究院文献>Sensors (Basel Switzerland) >An Approximate GEMM Unit for Energy-Efficient Object Detection
【2h】

An Approximate GEMM Unit for Energy-Efficient Object Detection

机译:用于节能对象检测的近似的GEMM单元

代理获取
本网站仅为用户提供外文OA文献查询和代理获取服务,本网站没有原文。下单后我们将采用程序或人工为您竭诚获取高质量的原文,但由于OA文献来源多样且变更频繁,仍可能出现获取不到、文献不完整或与标题不符等情况,如果获取不到我们将提供退款服务。请知悉。

摘要

Edge computing brings artificial intelligence algorithms and graphics processing units closer to data sources, making autonomy and energy-efficient processing vital for their design. Approximate computing has emerged as a popular strategy for energy-efficient circuit design, where the challenge is to achieve the best tradeoff between design efficiency and accuracy. The essential operation in artificial intelligence algorithms is the general matrix multiplication (GEMM) operation comprised of matrix multiplication and accumulation. This paper presents an approximate general matrix multiplication (AGEMM) unit that employs approximate multipliers to perform matrix–matrix operations on four-by-four matrices given in sixteen-bit signed fixed-point format. The synthesis of the proposed AGEMM unit to the 45 nm Nangate Open Cell Library revealed that it consumed only up to 36% of the area and 25% of the energy required by the exact general matrix multiplication unit. The AGEMM unit is ideally suited to convolutional neural networks, which can adapt to the error induced in the computation. We evaluated the AGEMM units’ usability for honeybee detection with the YOLOv4-tiny convolutional neural network. The results implied that we can deploy the AGEMM units in convolutional neural networks without noticeable performance degradation. Moreover, the AGEMM unit’s employment can lead to more area- and energy-efficient convolutional neural network processing, which in turn could prolong sensors’ and edge nodes’ autonomy.
机译:边缘计算将人工智能算法和图形处理单元更靠近数据源,为其设计进行自主和节能处理至关重要。近似计算已成为节能电路设计的流行策略,其中挑战是实现设计效率和准确性之间的最佳权衡。人工智能算法中的基本操作是由矩阵乘法和累积构成的一般矩阵乘法(Gemm)操作。本文呈现了一种近似的一般矩阵乘法(AGEMM)单元,其采用近似乘法器来执行以十六位符号定点格式给出的四×四个矩阵上的矩阵矩阵操作。所提出的AGEMM单元对45nm彩酸盐开放细胞库的合成透露,它仅消耗了45%的面积的36%和25%的精确通用矩阵乘法单元所需的能量。 AGEMM单元非常适合卷积神经网络,其可以适应计算中引起的错误。我们评估了随着Yolov4-Tiny卷积神经网络的蜜蜂检测的AGEMM单位的可用性。结果暗示我们可以在卷积神经网络中部署Agemm单位,而不会明显的性能下降。此外,AGEMM单位的就业可能导致更多的区域和节能卷积神经网络处理,这又可以延长传感器和边缘节点的自主权。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
代理获取

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号