IEEE Conference on Computer Vision and Pattern Recognition Workshops

Binarized Convolutional Neural Networks with Separable Filters for Efficient Hardware Acceleration



Abstract

State-of-the-art convolutional neural networks are enormously costly in both compute and memory, demanding massively parallel GPUs for execution. Such networks strain the computational capabilities and energy available to embedded and mobile processing platforms, restricting their use in many important applications. In this paper, we propose Binarized CNNs with Separable Filters (BCNNw/SF), which apply Singular Value Decomposition (SVD) to BCNN kernels to further reduce computational and storage complexity. We provide a closed form of the gradient over SVD to calculate the exact gradient with respect to every binarized weight in backward propagation. We verify BCNNw/SF on the MNIST, CIFAR-10, and SVHN datasets, and implement an accelerator for CIFAR-10 on FPGA hardware. Our BCNNw/SF accelerator realizes memory savings of 17% and execution-time reduction of 31.3% compared to BCNN, with only minor accuracy sacrifices.
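To make the separable-filter idea concrete, the NumPy sketch below approximates a single binarized kernel by the rank-1 term of its SVD and applies it as two 1-D convolutions. This is only an illustration of the approximation the paper builds on; it omits BCNNw/SF's binarization of the separated components and the closed-form SVD gradient used in backward propagation, and all variable names (W, col, row, etc.) are hypothetical.

import numpy as np
from scipy.signal import convolve2d

rng = np.random.default_rng(0)

# Hypothetical 3x3 binarized kernel with weights in {-1, +1}, as in a BCNN layer.
W = np.sign(rng.standard_normal((3, 3)))

# SVD: W = U @ diag(S) @ Vt.  Keeping only the largest singular value gives a
# rank-1, i.e. separable, approximation W ≈ s1 * u1 v1^T = col @ row.
U, S, Vt = np.linalg.svd(W)
col = (U[:, 0] * np.sqrt(S[0]))[:, None]   # vertical 1-D filter, shape (3, 1)
row = (Vt[0, :] * np.sqrt(S[0]))[None, :]  # horizontal 1-D filter, shape (1, 3)
W_sep = col @ row

# Convolving with the separable kernel equals two 1-D convolutions, cutting the
# multiplies per output pixel from k*k to 2k (here 9 -> 6).
x = rng.standard_normal((8, 8))
y_full = convolve2d(x, W_sep, mode="valid")
y_sep  = convolve2d(convolve2d(x, col, mode="valid"), row, mode="valid")

print(np.allclose(y_full, y_sep))          # True: both paths give the same output
print(np.abs(W - W_sep).mean())            # rank-1 approximation error vs. W

In general, a k×k kernel in rank-1 separable form needs 2k stored weights and 2k multiply-accumulates per output instead of k², which is the kind of reduction a hardware accelerator can exploit.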
