Design, Automation and Test in Europe Conference and Exhibition (DATE)

Deeper Weight Pruning without Accuracy Loss in Deep Neural Networks


Abstract

This work overcomes an inherent limitation of bit-level weight pruning, namely that the maximal computation speedup is bounded by the total number of non-zero bits of the weights, a bound invariably treated as "uncontrollable" (i.e., constant) for the network to be pruned. Specifically, building on canonical signed digit (CSD) encoding, this work (1) proposes a transformation technique that converts the two's complement representation of every weight into a set of CSD representations with a minimal or near-minimal number of essential (i.e., non-zero) bits, (2) formulates the selection of CSD representations that maximizes the parallelism of bit-level multiplication on the weights as a multi-objective shortest path problem and solves it efficiently with an approximation algorithm, and (3) proposes a novel supporting acceleration architecture that requires no additional non-trivial hardware. Experiments show that the proposed approach reduces the number of essential bits by 69% on AlexNet and 74% on VGG-16, with which our accelerator reduces inference computation time by 47% on AlexNet and 50% on VGG-16 compared with conventional bit-level weight pruning.
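To make the key idea concrete, below is a minimal Python sketch (not from the paper) of standard CSD recoding, the encoding the approach builds on. It shows how recoding a weight from two's complement into CSD reduces its number of essential (non-zero) bits; the function names and the example 8-bit weight are illustrative assumptions. The paper goes further, producing a set of minimal or near-minimal signed-digit representations per weight and choosing among them for parallelism, rather than using only the single canonical form shown here.

    def csd_recode(n: int) -> list[int]:
        """Recode integer n into CSD digits (LSB first), each in {-1, 0, +1}.

        CSD has no two adjacent non-zero digits and attains the minimal
        number of non-zero digits among signed-digit representations.
        """
        digits = []
        while n != 0:
            if n % 2 == 0:
                digits.append(0)
            else:
                # Pick +1 or -1 so the remainder becomes divisible by 4,
                # forcing the next digit to be 0 and keeping digits sparse.
                d = 2 - (n % 4)   # n % 4 == 1 -> +1, n % 4 == 3 -> -1
                digits.append(d)
                n -= d
            n //= 2
        return digits

    def essential_bits_twos_complement(n: int, bits: int = 8) -> int:
        """Number of non-zero bits in the two's complement encoding of n."""
        return bin(n & ((1 << bits) - 1)).count("1")

    if __name__ == "__main__":
        w = 55  # hypothetical 8-bit weight: 0b00110111, five non-zero bits
        digits = csd_recode(w)
        assert sum(d << i for i, d in enumerate(digits)) == w  # round trip
        print("two's complement essential bits:", essential_bits_twos_complement(w))  # 5
        print("CSD essential bits:", sum(d != 0 for d in digits))                     # 3

In a bit-level multiplier, a -1 digit simply selects a subtraction instead of an addition, so each non-zero digit costs one operation regardless of sign; fewer essential bits therefore translate directly into fewer bit-level operations, which is the quantity the paper's pruning and scheduling minimize.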
