首页> 外文会议>IEEE International Solid- State Circuits Conference >7.1 An 11.5TOPS/W 1024-MAC Butterfly Structure Dual-Core Sparsity-Aware Neural Processing Unit in 8nm Flagship Mobile SoC
【24h】

7.1 An 11.5TOPS/W 1024-MAC Butterfly Structure Dual-Core Sparsity-Aware Neural Processing Unit in 8nm Flagship Mobile SoC

机译:7.1 8nm旗舰级移动SoC中的11.5TOPS / W 1024-MAC蝴蝶结构双核稀疏感知神经处理单元

获取原文

摘要

Deep learning has been widely applied for image and speech recognition. Response time, connectivity, privacy and security drive applications towards mobile platforms rather than cloud. For mobile systems-on-a-chip (SoCs), energy-efficient neural processing units (NPU) have been studied for performing the convolutional layers (CLs) and fully-connected layers (FCLs) [2-5] in deep neural networks. Moreover, considering that neural networks are getting deeper, the NPU needs to integrate 1K or even more multiply/accumulate (MAC) units. For energy efficiency, compression of neural networks has been studied by pruning neural connections and quantizing weights and features with 8b or even lower fixed-point precision without accuracy loss [1]. A hardware accelerator exploited network sparsity for high utilization of MAC units [3]. However, since it is challenging to predict where pruning is possible, the accelerator needed complex circuitry for selecting an array of features corresponding to an array of non-zero weights. For reducing the power of MAC operations, bit-serial multipliers have been applied [5]. Generally, extremely low- or variable-bit-precision neural networks need to be carefully trained.
机译:深度学习已广泛应用于图像和语音识别。响应时间,连接性,隐私和安全性将应用程序推向了移动平台而非云平台。对于片上移动系统(SoC),已经研究了高能效神经处理单元(NPU),以执行深度神经网络中的卷积层(CL)和全连接层(FCL)[2-5]。 。此外,考虑到神经网络越来越深入,NPU需要集成1K甚至更多的乘法/累加(MAC)单元。为了提高能源效率,已经通过修剪神经连接并以8b甚至更低的定点精度对权重和特征进行量化来研究神经网络的压缩[1]。硬件加速器利用网络稀疏性来提高MAC单元的利用率[3]。然而,由于难以预测可能在何处进行修剪,因此加速器需要复杂的电路来选择与非零权重的数组相对应的特征数组。为了降低MAC操作的功率,已应用了位串行乘法器[5]。通常,极低位或可变位精度的神经网络需要仔细训练。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号