【24h】

Efficient frequency domain CNN algorithm

机译:高效频域CNN算法

获取原文

摘要

Deep Learning techniques like Convolutional Neural Networks (CNN) are getting popular for image classification with broad usage spanning across automotive, industrial, medicine, robotics etc. Typical CNN network consists of multiple layers of 2D convolutions, non-linearity, spatial pooling and fully connected layer, with 2D convolutions constituting more than 90% of total computations. The Fast Fourier Transform (FFT) based approach for convolution is promising in theory, but not used in practice due to growth in memory sizing of coefficients storage. The paper proposes new frequency domain algorithm which avoids memory size growth compared to traditional FFT based approach for performing 2D convolution. The proposed algorithm performs Fourier Transform (FT) of coefficients On-The-Fly (OTF) instead of offline calculation on PC. The proposed algorithm consists of expands, OTF-FT and pruning blocks that do efficient 2D convolution in the frequency domain. The proposed algorithm is compared with the FFT-based algorithm for the coefficient transformation. As per simulations, assuming typical network configuration parameters, the proposed algorithm is 4-8X faster compared to FFT based approach for the co-efficient transform.
机译:诸如卷积神经网络(CNN)之类的深度学习技术已广泛用于图像分类,并广泛应用于汽车,工业,医学,机器人等领域。典型的CNN网络由多层2D卷积,非线性,空间池化和完全连接组成2D卷积构成总计算量的90%以上。基于快速傅里叶变换(FFT)的卷积方法在理论上是有前途的,但由于系数存储的内存大小增加,因此在实践中并未使用。本文提出了一种新的频域算法,与传统的基于FFT的2D卷积方法相比,该算法避免了内存大小的增长。所提出的算法可以实时执行系数(FTF)的傅里叶变换(FT),而不是在PC上进行离线计算。所提出的算法由扩展,OTF-FT和修剪块组成,它们在频域中进行有效的2D卷积。将所提出的算法与基于FFT的算法进行系数转换。根据仿真,假设采用典型的网络配置参数,则与基于FFT的系数转换方法相比,所提出的算法要快4-8倍。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号