首页> 外文会议>IEEE International Conference on Anti-counterfeiting, Security, and Identification >Efficient FPGA Implementation of Softmax Function for DNN Applications
【24h】

Efficient FPGA Implementation of Softmax Function for DNN Applications

机译:用于DNN应用的Softmax函数的高效FPGA实现

获取原文

摘要

Deep learning, as a significant class of machine learning, is a rapidly expanding research field with immense potential. Particularly, deep neural networks (DNN), spawning millions of applications including image and speech recognition, natural language processing, have gained global research attention. A DNN is an artificial neural network with multiple layers between input and output layers. The Softmax function is often used in the final layer of DNN-based classifier. Softmax function contains massive exponential and division operations, resulting in high resource usage when implemented as hardware. In this paper we present an efficient hardware implementation of Softmax function with 16-bit fixed-point input and output. During Softmax calculation, we exploit the combination of lookup table and multi-segment linear fitting to handle the exponential operations of integer and fractional parts, respectively. Furthermore, we adopt radix-4 Booth-Wallace-based 6-stage pipeline multiplier and modified shift-compare divider for high efficiency. The overall architecture features a 13-stage pipelined design, in order to improve the operating frequency. Our proposed FPGA implementation attains the precision of magnitude of 10-5, and the frequencies of FPGA and ASIC implementation (45nm technology) reach 396.040MHz and 3.3GHz, respectively.
机译:深度学习作为机器学习中的重要一类,是一个具有巨大潜力的快速发展的研究领域。特别是,深度神经网络(DNN)产生了数百万种应用,包括图像和语音识别,自然语言处理,已引起了全球研究的关注。 DNN是在输入和输出层之间具有多层的人工神经网络。 Softmax函数通常用于基于DNN的分类器的最后一层。 Softmax函数包含大量的指数和除法运算,因此,将其实现为硬件时会占用大量资源。在本文中,我们介绍了具有16位定点输入和输出的Softmax函数的高效硬件实现。在Softmax计算期间,我们利用查找表和多段线性拟合的组合分别处理整数和小数部分的指数运算。此外,我们采用基于radix-4 Booth-Wallace的6级流水线乘法器和改进的移位比较分频器来提高效率。总体架构采用13级流水线设计,以提高工作频率。我们提出的FPGA实现可达到10级的精度 -5 ,FPGA和ASIC实现(45nm技术)的频率分别达到396.040MHz和3.3GHz。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号