IEEE International Conference on Multimedia and Expo

Lightweight Compression Of Neural Network Feature Tensors For Collaborative Intelligence



Abstract

In collaborative intelligence applications, part of a deep neural network (DNN) is deployed on a relatively low-complexity device such as a mobile phone or edge device, and the remainder of the DNN is processed where more computing resources are available, such as in the cloud. This paper presents a novel lightweight compression technique designed specifically to code the activations of a split DNN layer, while having a low complexity suitable for edge devices and not requiring any retraining. We also present a modified entropy-constrained quantizer design algorithm optimized for clipped activations. When applied to popular object-detection and classification DNNs, we were able to compress the 32-bit floating point activations down to 0.6 to 0.8 bits, while keeping the loss in accuracy to less than 1%. When compared to HEVC, we found that the lightweight codec consistently provided better inference accuracy, by up to 1.3%. The performance and simplicity of this lightweight compression technique makes it an attractive option for coding a layer’s activations in split neural networks for edge/cloud applications.
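The core idea of coding a split layer's activations — clip the floating-point feature tensor to a bounded range, then quantize it to very few bits before transmission to the cloud — can be sketched as below. This is a minimal illustration, not the paper's method: the paper uses a modified entropy-constrained quantizer design, whereas this sketch uses plain uniform clip-and-quantize; the clipping bound and bit depth are illustrative assumptions.

```python
import numpy as np

def quantize_activations(x, clip_max=6.0, bits=2):
    """Clip activations to [0, clip_max], then uniformly quantize
    to 2**bits - 1 levels. Returns integer level indices (the data
    that would be entropy-coded and sent to the cloud side)."""
    levels = 2 ** bits - 1
    xc = np.clip(x, 0.0, clip_max)          # clipping, as in the paper
    return np.round(xc / clip_max * levels).astype(np.uint8)

def dequantize_activations(q, clip_max=6.0, bits=2):
    """Reconstruct approximate activations on the cloud side."""
    levels = 2 ** bits - 1
    return q.astype(np.float32) / levels * clip_max

# Example: a tiny activation tensor quantized to 2 bits
x = np.array([-1.0, 0.5, 3.0, 10.0], dtype=np.float32)
q = quantize_activations(x)                 # → array([0, 0, 2, 3], dtype=uint8)
x_hat = dequantize_activations(q)           # → array([0., 0., 4., 6.], dtype=float32)
```

With subsequent entropy coding of the (highly non-uniform) index distribution, average rates well below the nominal bit depth are possible, which is how sub-1-bit-per-activation rates like the reported 0.6 to 0.8 bits arise.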
