首页> 外国专利> EFFICIENT HARDWARE ARCHITECTURE FOR ACCELERATING GROUPED CONVOLUTIONS

EFFICIENT HARDWARE ARCHITECTURE FOR ACCELERATING GROUPED CONVOLUTIONS

机译：高效的硬件架构，用于加速分组的卷积

页面导航

摘要
著录项
相似文献

摘要

Hardware accelerators for accelerated grouped convolution operations. A first buffer of a hardware accelerator may receive a first row of an input feature map (IFM) from a memory. A first group comprising a plurality of tiles may receive a first row of the IFM. A plurality of processing elements of the first group may compute a portion of a first row of an output feature map (OFM) based on the first row of the IFM and a kernel. A second buffer of the accelerator may receive a third row of the IFM from the memory. A second group comprising a plurality of tiles may receive the third row of the IFM. A plurality of processing elements of the second group may compute a portion of a third row of the OFM based on the third row of the IFM and the kernel as part of a grouped convolution operation.

机译：用于加速分组卷积操作的硬件加速器。硬件加速器的第一缓冲器可以从存储器接收第一行输入特征图（IFM）。包括多个瓦片的第一组可以接收IFM的第一行。第一组的多个处理元件可以基于IFM和内核的第一行计算输出特征映射（OFM）的第一行的一部分。加速器的第二缓冲器可以从存储器接收IFM的第三行。包括多个瓦片的第二组可以接收IFM的第三行。作为分组卷积操作的一部分，第二组的多个处理元件可以基于IFM和内核的第三行计算OFM的第三行的一部分。

著录项

公开/公告号EP3885997A1

专利类型
公开/公告日2021-09-29

原文格式PDF
申请/专利权人 INTEL CORPORATION;
展开▼

申请/专利号EP20200207096
发明设计人 VENGALLUR AMIBILI;DAGA BHARAT;JANEDULA PRADEEP;PAZHANIMALA BIJOY;SRINIVASAN ARAVID BABU;
展开▼

申请日2020-11-12
分类号G06N3/04;G06N3/063;G06N3/08;
国家 EP
入库时间 2022-08-24 21:20:21

相似文献

专利
外文文献
中文文献