Asian Conference on Computer Vision

Hardware-Aware Softmax Approximation for Deep Neural Networks

Abstract

There has been rapid development of custom hardware for accelerating the inference of deep neural networks (DNNs) by explicitly incorporating hardware metrics (e.g., area and energy) as additional constraints alongside application accuracy. Recent efforts have mainly focused on the linear functions (matrix multiplications) in convolutional (Conv) and fully connected (FC) layers, while there is no publicly available study on optimizing the inference of non-linear functions in DNNs under hardware constraints. In this paper, we address the problem of cost-efficient inference for Softmax, a popular non-linear function in DNNs. We introduce a hardware-aware linear approximation framework based on algorithm and hardware co-optimization, with the goal of minimizing cost in terms of area and energy without incurring significant loss in application accuracy. This is achieved by simultaneously reducing the operand bit-width and approximating the cost-intensive operations in Softmax (e.g., exponentiation and division) with cost-effective operations (e.g., addition and bit shifts). We designed and synthesized a hardware unit for our approximation approach to estimate its area and energy consumption. In addition, we introduce a training method that further saves area and energy cost through reduced precision. Compared to a 19-bit baseline, our approach reduces area cost by 13x and energy consumption by 2x at an 11-bit operand width on the VOC2007 dataset with Faster R-CNN.
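The abstract does not give the exact approximation, but the general shift-and-add idea it describes can be sketched in a few lines. The following is a minimal illustration, not the authors' algorithm: it rewrites e^x as 2^(x·log2 e), rounds the exponents to integers so each power of two becomes a bit shift, and replaces the division by the sum with a right shift by floor(log2(sum)). The function name shift_softmax, the fixed-point format, and all constants here are illustrative assumptions.

```python
import numpy as np

def shift_softmax(logits, frac_bits=16):
    """Illustrative shift-and-add softmax (not the paper's exact method)."""
    # e**x == 2**(x * log2(e)): fold the constant into the logits, then
    # round the exponents to integers so each 2**k is a pure bit shift.
    k = np.floor(np.asarray(logits, dtype=np.float64) * np.log2(np.e)).astype(int)
    k = k - k.max()                      # max subtraction: all k <= 0
    one = 1 << frac_bits                 # fixed-point representation of 1.0
    # For k <= 0, 2**k is a right shift of the fixed-point one.
    pow2 = np.array([one >> min(-ki, 31) for ki in k], dtype=np.int64)
    # Replace division by the sum with a right shift by floor(log2(sum)).
    total = int(pow2.sum())
    shift = total.bit_length() - 1
    return pow2 / float(1 << shift)

logits = [2.0, 1.0, 0.1]
print(shift_softmax(logits))                    # shift-based approximation
print(np.exp(logits) / np.sum(np.exp(logits)))  # exact softmax reference
```

Because the normalizer is rounded down to a power of two, the outputs only approximately sum to one; the class ranking is preserved, however, since every entry is divided by the same constant. The paper's co-optimized design additionally reduces the operand bit-width (to 11 bits in the reported Faster R-CNN result), which this float-assisted sketch does not model.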