首页> 外文会议>IEEE International Solid- State Circuits Conference >7.1 An 11.5TOPS/W 1024-MAC Butterfly Structure Dual-Core Sparsity-Aware Neural Processing Unit in 8nm Flagship Mobile SoC
【24h】

7.1 An 11.5TOPS/W 1024-MAC Butterfly Structure Dual-Core Sparsity-Aware Neural Processing Unit in 8nm Flagship Mobile SoC

机译:7.1 11.5TOP / W 1024-MAC蝴蝶结构双核稀疏感知神经处理单元8NM旗舰移动SOC

获取原文

摘要

Deep learning has been widely applied for image and speech recognition. Response time, connectivity, privacy and security drive applications towards mobile platforms rather than cloud. For mobile systems-on-a-chip (SoCs), energy-efficient neural processing units (NPU) have been studied for performing the convolutional layers (CLs) and fully-connected layers (FCLs) [2-5] in deep neural networks. Moreover, considering that neural networks are getting deeper, the NPU needs to integrate 1K or even more multiply/accumulate (MAC) units. For energy efficiency, compression of neural networks has been studied by pruning neural connections and quantizing weights and features with 8b or even lower fixed-point precision without accuracy loss [1]. A hardware accelerator exploited network sparsity for high utilization of MAC units [3]. However, since it is challenging to predict where pruning is possible, the accelerator needed complex circuitry for selecting an array of features corresponding to an array of non-zero weights. For reducing the power of MAC operations, bit-serial multipliers have been applied [5]. Generally, extremely low- or variable-bit-precision neural networks need to be carefully trained.
机译:深度学习已被广​​泛应用于图像和语音识别。响应时间,连接,隐私和安全驱动器应用于移动平台而不是云。对于芯片(SOC)的移动系统,已经研究了节能的神经处理单元(NPU),用于在深神经网络中执行卷积层(CLS)和完全连接的层(FCLS)[2-5] 。此外,考虑到神经网络正在变得更深,NPU需要集成1K甚至更加乘以/累积(MAC)单位。为了能效,通过修剪神经连接和量化重量和具有8B或甚至更低的定点精度而无需精度丢失来研究神经网络的压缩已经研究了神经网络,而且没有精确丢失[1]。硬件加速器利用MAC单元高利用网络稀疏性[3]。然而,由于预测修剪是具有挑战性的,因此加速器需要复杂的电路,用于选择对应于非零权重的阵列的特征阵列。为了减少MAC操作的力量,已经应用了比特串行乘法器[5]。通常,需要仔细培训极低或可变比特精度的神经网络。

著录项

相似文献

  • 外文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号