首页> 外国专利> NEURAL NETWORK ACCELERATOR AND NEURAL NETWORK ACCELERATION METHOD BASED ON STRUCTURED PRUNING AND LOW-BIT QUANTIZATION

NEURAL NETWORK ACCELERATOR AND NEURAL NETWORK ACCELERATION METHOD BASED ON STRUCTURED PRUNING AND LOW-BIT QUANTIZATION

机译：基于结构性修剪和低比特量化的神经网络加速器与神经网络加速度方法

页面导航

摘要
著录项
相似文献

摘要

The present invention discloses a neural network accelerator and a neural network acceleration method based on structured pruning and low-bit quantization. The neural network accelerator includes a master controller, an activations selection unit, an extensible calculation array, a multifunctional processing element, a DMA, a DRAM and a buffer. The present invention makes full use of the data reusability during inference operation of a neural network, reduces the power consumption of selecting input activation and weights of effective calculations, and relieves the high transmission bandwidth pressure between the activations selection unit and the extensible calculation array through structured pruning and data sharing on the extensible calculation array, reduces the number of weight parameters and the storage bit width by combining the low-bit quantization technology, and further improves the throughput rate and energy efficiency of the convolutional neural network accelerator.

机译：本发明公开了一种基于结构化修剪和低比特量化的神经网络加速器和神经网络加速方法。神经网络加速器包括主控制器，激活选择单元，可扩展计算阵列，多功能处理元件，DMA，DMA和缓冲器。本发明在神经网络的推理操作期间充分利用数据可重用性，从而减少了选择输入激活和有效计算权重的功耗，并通过通过释放激活选择单元和可扩展计算阵列之间的高传输带宽压力通过组合低比特量化技术，结构化修剪和数据共享在可扩展计算阵列上，减少了重量参数和存储位宽的数量，并进一步提高了卷积神经网络加速器的吞吐率和能量效率。

著录项

公开/公告号US2022012593A1

专利类型
公开/公告日2022-01-13

原文格式PDF
申请/专利权人 ZHEJIANG UNIVERSITY;
展开▼

申请/专利号US202117485645
发明设计人 KEJIE HUANG;CHAOYANG ZHU;HAIBIN SHEN;
展开▼

申请日2021-09-27
分类号G06N3/08;G06N3/063;G06F9/50;
国家 US
入库时间 2022-08-24 23:20:33

相似文献

专利
外文文献
中文文献