Quantization-based Optimization of CNN Inference

Honyi PAN; Akram BEN AHMED; Tsutomu IKEGAMIKazuki TOMINAGATomohiro KUDOH

首页> 外文期刊>電子情報通信学会技術研究報告. VLSI設計技術. VLSI Design Technologies >Quantization-based Optimization of CNN Inference

【24h】

Quantization-based Optimization of CNN Inference

机译：Quantization-based Optimization of CNN Inference

获取原文

获取原文并翻译 | 示例

掌桥外文数据库（机构版） >>

开具论文收录证明 >>

文献代查 >>

页面导航

摘要
著录项
相关主题

摘要

With specifically designed hardware, FPGA is a promising candidate for neural network inference acceleration. The main challenge FPGA-based accelerator designs are faced is the deficiency of on-chip resources. We consider using multi-FPGA to conquer this problem. However, even for multi-FPGA, insufficient resources and communication delays are still non-negligible problems. In this paper, we use the quantization method based on LQ-NET proposed by the Microsoft group to reduce resource usage and communication traffic. At the same time, the tradeoff of the accuracy can be achieved by increasing the bit width. The synthesis results of the first two layers of Alexnet indicate that both the BRAM usage and the performance are improved.

著录项

来源
《電子情報通信学会技術研究報告. VLSI設計技術. VLSI Design Technologies》 |2020年第337期|63-68|共6页
作者
Honyi PAN; Akram BEN AHMED; Tsutomu IKEGAMIKazuki TOMINAGATomohiro KUDOH;
展开▼
作者单位

National Institute of Advanced Industrial Science and Technology (AIST);

The University of Tokyo;

展开▼
收录信息
原文格式 PDF
正文语种英语
中图分类大规模集成电路、超大规模集成电路;
关键词
FPGA; Data quantization; LQ-Net;

Quantization-based Optimization of CNN Inference

摘要

著录项

相关主题

期刊订阅