Convolutional neural networks for computer vision and other applications have surpassed human-level accuracy on high-end systems. Demand is growing to run these applications with the same accuracy on small, mobile hardware, which has considerably smaller memory and power budgets. Prior work on model compression for inference on such edge devices sacrifices some accuracy to compress the models. We propose a novel model compression approach that shares the exponents of weights stored in IEEE floating-point format and requires no fine-tuning after compression. We demonstrate the technique on several trained models, achieving nearly 10% storage compression while requiring less than 1.5 times the original execution time.
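To make the idea concrete, the fragment below is a minimal sketch of what exponent sharing across a group of float32 weights can look like. It assumes a block-floating-point-style grouping, and the names `pack_group` and `unpack_group` are illustrative, not the paper's actual scheme or API: the exponent field of each IEEE 754 float32 is separated from the sign and mantissa, one exponent is kept for the whole group, and each weight retains only a small offset from it.

```python
import numpy as np

def pack_group(weights):
    """Share one exponent across a group of float32 weights.

    Hypothetical illustration: keep the group's maximum biased exponent
    once, and for each weight store only its sign bit, 23-bit mantissa,
    and the offset from the shared exponent.
    """
    bits = np.asarray(weights, dtype=np.float32).view(np.uint32)
    sign = bits >> 31                  # 1-bit signs
    exp = (bits >> 23) & 0xFF          # biased 8-bit exponents
    mant = bits & 0x7FFFFF             # 23-bit mantissas
    shared_exp = exp.max()             # one exponent for the whole group
    offset = shared_exp - exp          # per-weight distance to shared exponent
    return shared_exp, sign, offset, mant

def unpack_group(shared_exp, sign, offset, mant):
    """Reassemble the original float32 bit patterns exactly."""
    exp = (shared_exp - offset).astype(np.uint32)
    bits = (sign << 31) | (exp << 23) | mant
    return bits.view(np.float32)

w = np.array([0.15, -0.031, 0.27, 0.008], dtype=np.float32)
assert np.array_equal(unpack_group(*pack_group(w)), w)  # exact round trip
```

The round trip is exact because the bits are only rearranged, which is consistent with needing no fine-tuning; any actual storage savings would come from encoding the shared exponent once and the remaining per-weight fields more compactly than the original 32 bits.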