首页> 外文会议>Asia and South Pacific Design Automation Conference >Lightweight Run-Time Working Memory Compression for Deployment of Deep Neural Networks on Resource-Constrained MCUs
【24h】

Lightweight Run-Time Working Memory Compression for Deployment of Deep Neural Networks on Resource-Constrained MCUs

机译:轻量级运行时工作内存压缩,用于部署资源约束MCU上的深神经网络

获取原文

摘要

This work aims to achieve intelligence on embedded devices by deploying deep neural networks (DNNs) onto resource-constrained microcontroller units (MCUs). Apart from the low frequency (e.g., 1-16 MHz) and limited storage (e.g., 16KB to 256KB ROM), one of the largest challenges is the limited RAM (e.g., 2KB to 64KB), which is needed to save the intermediate feature maps of a DNN. Most existing neural network compression algorithms aim to reduce the model size of DNNs so that they can fit into limited storage. However, they do not reduce the size of intermediate feature maps significantly, which is referred to as working memory and might exceed the capacity of RAM. Therefore, it is possible that DNNs cannot run in MCUs even after compression. To address this problem, this work proposes a technique to dynamically prune the activation values of the intermediate output feature maps in the runtime to ensure that they can fit into limited RAM. The results of our experiments show that this method could significantly reduce the working memory of DNNs to satisfy the hard constraint of RAM size, while maintaining satisfactory accuracy with relatively low overhead on memory and run-time latency.
机译:这项工作旨在通过将深神经网络(DNN)部署到资源受限的微控制器单元(MCU)上通过将深度神经网络(DNN)部署到嵌入式设备上实现智能。除了低频(例如,1-16MHz)和有限的存储(例如,16KB至256KB ROM)之外,最大的挑战之一是有限的RAM(例如,2KB至64KB),以节省中间特征DNN的地图。最现有的神经网络压缩算法旨在减少DNN的模型大小,以便它们可以符合有限的存储空间。但是,它们不会显着降低中间特征图的大小,这被称为工作存储器,并且可能超过RAM的容量。因此,即使在压缩之后,DNN也不能在MCU中运行。为了解决这个问题,这项工作提出了一种动态修剪运行时中间输出特征映射的激活值的技术,以确保它们可以适合有限的RAM。我们的实验结果表明,该方法可以显着降低DNN的工作记忆,以满足RAM尺寸的难度约束,同时在存储器和运行时延迟上保持令人满意的精度。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号