Asia and South Pacific Design Automation Conference

BHNN: A memory-efficient accelerator for compressing deep neural networks with blocked hashing techniques



Abstract

In this paper, we propose a novel algorithm that compresses neural networks with blocked hashing techniques to reduce their memory requirements. By adding block constraints on top of the conventional hashing technique, the test error rate is maintained while the spatial locality of the computations is preserved. Using this scheme, the synaptic connections are compressed by at least an order of magnitude (10×) compared with the plain neural network, with virtually no loss of prediction accuracy. Compared with other compression techniques, the proposed algorithm achieves the best performance in the heavy-compression regime. The blocked hashing techniques are also hardware friendly, allowing the memory hierarchy of the hardware architecture to be implemented efficiently. To demonstrate this hardware efficiency, we implement a hardware architecture for deep neural networks using the proposed blocked hashing techniques on a Xilinx Virtex-7 FPGA board. With a hardware parallelism of 32, the accelerator achieves a speed-up of 22× over a CPU and 3-5× over a GPU in the inference phase.
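The abstract's core idea, HashedNets-style weight sharing with an added block constraint, can be sketched as follows. This is a minimal illustration, not the paper's implementation: the hash function, block geometry, and bank sizes below are all assumed for the example. Each block of the virtual weight matrix is hashed into its own small parameter bank, so every weight needed for one block sits in one contiguous bank (the spatial locality the paper exploits in the hardware memory hierarchy).

```python
import numpy as np

def block_index(i, j, rows_per_block, cols_per_block, blocks_per_row):
    """Map a virtual weight coordinate (i, j) to its block id."""
    return (i // rows_per_block) * blocks_per_row + (j // cols_per_block)

def hashed_slot(i, j, bank_size, seed=0):
    """Deterministic toy hash of (i, j) into a per-block parameter bank
    (stand-in for the hash family used in the paper)."""
    return ((i * 1000003) ^ (j * 8191) ^ seed) % bank_size

def virtual_weight(banks, i, j, rows_per_block, cols_per_block, blocks_per_row):
    """Reconstruct the virtual weight W[i, j] from the compressed banks.
    All lookups for one block touch only that block's bank."""
    b = block_index(i, j, rows_per_block, cols_per_block, blocks_per_row)
    return banks[b, hashed_slot(i, j, banks.shape[1])]

# Example: a 64x64 virtual weight matrix (4096 entries) stored in
# 16 banks of 32 real parameters each (512 total -> 8x compression).
rng = np.random.default_rng(0)
rows_per_block = cols_per_block = 16      # 4x4 grid of blocks
blocks_per_row = 64 // cols_per_block
banks = rng.standard_normal((16, 32))
W = np.array([[virtual_weight(banks, i, j,
                              rows_per_block, cols_per_block, blocks_per_row)
               for j in range(64)] for i in range(64)])
print(W.shape, banks.size / W.size)       # → (64, 64) 0.125
```

Without the block constraint, every (i, j) would hash into one global parameter array, so a single block's weights would scatter across the whole store; the per-block banks are what make the scheme cache- and hardware-friendly.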


