首页> 外国专利> Systems and methods for vectorized FFT for multi-dimensional convolution operations

Systems and methods for vectorized FFT for multi-dimensional convolution operations

机译:用于多维卷积运算的矢量化FFT的系统和方法

摘要

A new approach is proposed to support efficient convolution for deep learning by vectorizing multi-dimensional input data for multi-dimensional fast Fourier transform (FFT) and direct memory access (DMA) for data transfer. Specifically, a deep learning processor (DLP) includes a plurality of tensor engines each configured to perform convolution operations by applying one or more kernels on multi-dimensional input data for pattern recognition and classification based on a neural network, wherein each tensor engine includes, among other components, one or more vector processing engines each configured to vectorize the multi-dimensional input data at each layer of the neural network to generate a plurality of vectors and to perform multi-dimensional FFT on the generated vectors and/or the kernels to create output for the convolution operations. Each tensor engine further includes a data engine configured to prefetch the multi-dimensional data and/or the kernels to both on-chip and external memories via DMA.
机译:提出了一种新方法来支持深度学习的有效卷积,方法是对多维输入数据进行矢量化,以进行多维快速傅里叶变换(FFT)和直接存储器访问(DMA)进行数据传输。具体而言,深度学习处理器(DLP)包括多个张量引擎,每个张量引擎都配置为通过将一个或多个内核应用于多维输入数据以基于神经网络进行模式识别和分类来执行卷积运算,其中每个张量引擎包括:除其他组件外,一个或多个矢量处理引擎分别配置为在神经网络的每一层矢量化多维输入数据,以生成多个矢量,并对生成的矢量和/或内核执行多维FFT,以为卷积运算创建输出。每个张量引擎还包括数据引擎,该数据引擎被配置为经由DMA将多维数据和/或内核预取到片上和外部存储器。

著录项

  • 公开/公告号US10796220B2

    专利类型

  • 公开/公告日2020-10-06

    原文格式PDF

  • 申请/专利权人 CAVIUM LLP;

    申请/专利号US201715593235

  • 发明设计人 MEHRAN NEKUII;

    申请日2017-05-11

  • 分类号G06N3/04;G06F17/14;G06F17/15;G06N20/10;G06N3/063;

  • 国家 US

  • 入库时间 2022-08-21 11:27:59

相似文献

  • 专利
  • 外文文献
  • 中文文献
获取专利

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号