IEEE International Conference on Acoustics, Speech and Signal Processing

Lance: Efficient Low-Precision Quantized Winograd Convolution for Neural Networks Based on Graphics Processing Units

Abstract

Accelerating deep convolutional neural networks has become an active topic and has sparked interest in both academia and industry. In this paper, we propose an efficient low-precision quantized Winograd convolution algorithm, called LANCE, which combines the advantages of fast convolution and quantization techniques. By embedding linear quantization operations into the Winograd domain, the fast convolution can be performed efficiently with low-precision computation on graphics processing units. We test neural network models with LANCE on representative image classification datasets, including SVHN, CIFAR, and ImageNet. The experimental results show that our 8-bit quantized Winograd convolution improves performance by up to 2.40× over full-precision convolution with trivial accuracy loss.
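
The abstract describes embedding linear quantization into the Winograd domain so that the element-wise product, the dominant cost of Winograd convolution, runs in low precision. The following is a minimal NumPy sketch of that idea for the 1-D F(2,3) Winograd case; it is an illustration only, not the paper's GPU implementation, and the quantize helper with its symmetric per-tensor scaling is an assumption made for the example.

import numpy as np

# Winograd F(2,3) transform matrices: 2 outputs, 3-tap kernel, 4-element input tile.
BT = np.array([[1, 0, -1, 0],
               [0, 1, 1, 0],
               [0, -1, 1, 0],
               [0, 1, 0, -1]], dtype=np.float32)
G = np.array([[1.0, 0.0, 0.0],
              [0.5, 0.5, 0.5],
              [0.5, -0.5, 0.5],
              [0.0, 0.0, 1.0]], dtype=np.float32)
AT = np.array([[1, 1, 1, 0],
               [0, 1, -1, -1]], dtype=np.float32)

def quantize(x, bits=8):
    # Symmetric linear quantization to a signed integer grid; returns (q, scale).
    qmax = 2 ** (bits - 1) - 1
    amax = float(np.max(np.abs(x)))
    scale = amax / qmax if amax > 0 else 1.0
    return np.round(x / scale).astype(np.int32), scale

def quantized_winograd_f23(d, g, bits=8):
    # Transform the input tile d (length 4) and kernel g (length 3) into the Winograd domain.
    V = BT @ d
    U = G @ g
    # Quantize in the Winograd domain, so the element-wise product runs in low precision.
    Vq, sv = quantize(V, bits)
    Uq, su = quantize(U, bits)
    M = Uq * Vq                      # integer element-wise multiply (the hot loop on a GPU)
    return AT @ (M * (su * sv))      # dequantize and apply the inverse transform

# Sanity check against direct "valid" convolution on one tile.
d = np.random.randn(4).astype(np.float32)
g = np.random.randn(3).astype(np.float32)
print(quantized_winograd_f23(d, g))          # quantized Winograd result
print(np.array([d[0:3] @ g, d[1:4] @ g]))    # full-precision reference

In a full implementation the scales would be chosen per channel or per tile and the integer products accumulated in 32-bit registers before dequantization; this sketch only shows where the quantization step sits relative to the Winograd transforms.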
