首页> 外文会议>ACM/IEEE Design Automation Conference >ReForm: Static and Dynamic Resource-Aware DNN Reconfiguration Framework for Mobile Device
【24h】

ReForm: Static and Dynamic Resource-Aware DNN Reconfiguration Framework for Mobile Device

机译:改革:移动设备的静态和动态资源感知DNN重新配置框架

获取原文

摘要

Although the Deep Neural Network (DNN) technique has been widely applied in various applications, the DNN-based applications are still too computationally intensive for the resource-constrained mobile devices. Many works have been proposed to optimize the DNN computation performance, but most of them are limited in an algorithmic perspective, ignoring certain computing issues in practical deployment. To achieve the comprehensive DNN performance enhancement in practice, the expected DNN optimization works should closely cooperate with specific hardware and system constraints (i.e. computation capacity, energy cost, memory occupancy, and inference latency). Therefore, in this work, we propose ReForm-a resource-aware DNN optimization framework. Through thorough mobile DNN computing analysis and innovative model reconfiguration schemes (i.e. ADMM based static model fine-tuning, dynamically selective computing), ReForm can efficiently and effectively recon Figure a pre-trained DNN model for practical mobile deployment with regards to various static and dynamic computation resource constraints. Experiments show that ReForm has $sim 3.5imes$ faster optimization speed than state-of-the-art resource-aware optimization method. Also, ReForm can effective recon Figure a DNN model to different mobile devices with distinct resource constraints. Moreover, ReForm achieves satisfying computation cost reduction with ignorable accuracy drop in both static and dynamic computing scenarios (at most 18% workload, 16.23% latency, 48.63% memory, and 21.5% energy enhancement).
机译:尽管深度神经网络(DNN)技术已广泛应用于各种应用中,但是基于DNN的应用仍然过于计算到资源受限的移动设备的密集。已经提出了许多作品来优化DNN计算性能,但大多数是算法视角的有限,忽略了实际部署中的某些计算问题。为了实现实践中全面的DNN性能增强,预期的DNN优化工作应与特定硬件和系统约束密切合作(即计算能力,能源成本,记忆占用和推理延迟)。因此,在这项工作中,我们建议改革 - 一种资源感知的DNN优化框架。通过彻底移动DNN计算分析和创新的模型重新配置方案(即基于ADMM的静态模型微调,动态选择性计算),改革可以有效地能够有效地侦察用于各种静态和动态的实用移动部署的预先训练的DNN模型计算资源约束。实验表明,改革具有$ SIM 3.5 倍$更快的优化速度,而不是最先进的资源感知优化方法。此外,改革可以有效地将DNN模型与不同的资源约束的不同移动设备进行有效。此外,改革实现了静态和动态计算场景中无知的精度降低的计算成本降低(最多18%的工作量,16.23%延迟,48.63%内存和21.5%的能量增强)。

著录项

相似文献

  • 外文文献
  • 中文文献
  • 专利
获取原文

客服邮箱:kefu@zhangqiaokeyan.com

京公网安备:11010802029741号 ICP备案号:京ICP备15016152号-6 六维联合信息科技 (北京) 有限公司©版权所有
  • 客服微信

  • 服务号