...
首页> 外文期刊>Neural processing letters >A Fast Algorithm for Convolutional Neural Networks Using Tile-based Fast Fourier Transforms
【24h】

A Fast Algorithm for Convolutional Neural Networks Using Tile-based Fast Fourier Transforms

机译:基于瓷砖的快速傅里叶变换的卷积神经网络快速算法

获取原文
获取原文并翻译 | 示例
           

摘要

State-of-the-art convolution algorithms accelerate training of convolutional neural networks (CNNs) by decomposing convolutions in time or Fourier domain, these decomposition implementations are designed for small filters or large inputs, respectively. We take these two aspects into account, devote to a novel decomposition strategy in Fourier domain and propose a conceptually useful algorithm for accelerating CNNs. We extend the classical Fast Fourier Transform theory to meet the requirements of convolving large inputs with small filters in faster manner. The tile-based decomposition strategy is introduced into Fourier transforms to yield a fast convolution algorithm. The algorithm, called tFFT, is simple to program, implementing tile sized transformations in Fourier domain to minimize convolution time for modern CNNs. tFFT reduces the arithmetic complexity of CNNs by over a factor of 3 compared to FFT-based convolution algorithms. We evaluate the performance of tFFT by implementing it on a set of state-of-the-art CNNs, the experiments show good results at batch sizes from 1 to 128.
机译:最先进的卷积算法通过分解时间或傅里叶域或傅立叶域的卷积来加速卷积神经网络(CNN)的训练,这些分解实现分别为小型滤波器或大输入。我们考虑到这两个方面,投入到傅立叶域中的新型分解策略,并提出了一种用于加速CNN的概念上有用的算法。我们扩展了经典的快速傅里叶变换理论,以满足以更快的方式将大型滤波器与小型过滤器卷积的要求。将基于图块的分解策略引入傅里叶变换,以产生快速卷积算法。该算法称为TFFT,易于编程,实现傅立叶域中的瓦片大小的变换,以最小化现代CNN的卷积时间。与基于FFT的卷积算法相比,TFFT将CNN的算术复杂性降低了超过3倍。我们通过在一组最先进的CNN上实现TFFT的性能,该实验以1至128的批量尺寸显示出良好的结果。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号