IEEE International Conference on Acoustics, Speech and Signal Processing

EFFICIENT DEEP CONVOLUTIONAL NEURAL NETWORKS ACCELERATOR WITHOUT MULTIPLICATION AND RETRAINING


Abstract

Recently, low-precision weight methods have been considered a promising way to implement inference of deep convolutional neural networks (DCNNs) efficiently, but they suffer from expensive retraining costs and accuracy degradation. In this paper, a low-bit, retraining-free quantization method is proposed that enables DCNNs to perform inference using only shift and add operations. Its efficiency is demonstrated in terms of power consumption and chip area. Huffman coding is adopted for further compression. An efficient hardware accelerator tailored to the given quantization strategy is then introduced by exploiting a two-level systolic array. Experimental results show that our method achieves higher accuracy on ImageNet than other low-precision networks, without any retraining. Compared to full-precision counterparts, 5× to 8× compression is obtained on popular models. Furthermore, the hardware implementation shows a good reduction in slices while maintaining throughput.
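The abstract describes weights quantized so that every multiply reduces to a shift (and accumulation to adds). Below is a minimal Python sketch of this general idea; the exponent range, bit budget, and rounding rule are illustrative assumptions, not the paper's exact scheme.

    import numpy as np

    def quantize_pow2(w, n_bits=4):
        # Round each nonzero weight to the nearest signed power of two,
        # so multiplying by the weight reduces to a bit shift in hardware.
        # The |w| <= 1 range and bit budget are assumptions for this sketch.
        sign = np.sign(w)
        mag = np.abs(w)
        e = np.round(np.log2(np.maximum(mag, 1e-12)))   # nearest exponent
        e_max = 0                                       # assume |w| <= 1
        e_min = e_max - (2 ** (n_bits - 1) - 1)         # smallest representable exponent
        e = np.clip(e, e_min, e_max)
        q = sign * np.exp2(e)
        return np.where(mag > 0, q, 0.0)                # zeros stay zero

    w = np.array([0.30, -0.07, 0.001, 0.9])
    print(quantize_pow2(w))  # -> [ 0.25  -0.0625  0.0078125  1. ]

With weights of this form, each multiply-accumulate y += w * x in a convolution becomes y += (x << e) or y -= (x >> -e) in fixed-point logic, which is what allows an accelerator to operate without hardware multipliers.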
